JSON Rubric 기반 Self-Eval 루프를 통한 Agent 출력 정밀도 제어
Claude Result Loops + Rubrics: 5 Self-Eval Patterns for Production Agents
Claude Result Loops + Rubrics: 5 Self-Eval Patterns for Production Agents
GPT-5.5 Pro의 고비용-고추론 Trade-off와 CritPt 30.6% 달성 분석
A11 and AGI: A Structural Approach for Models
A beginner’s guide to Instructor: Get Structured Outputs from LLMs
Kiwi-chan's Slow & Steady Progress - Devlog #7
Kiwi-chan Progress Report: Steady Mining!
Kiwi-chan Progress Report: Steady Mining!