Building a System to Prevent Silent Regressions When Swapping LLMs
How a model upgrade silently broke our extraction prompt (and how we caught it)
5 Open-Source Tools for Testing AI Agents Before They Break Production
I Built Four Tools with Claude Code. None of Them Had Tests. So I Fixed That