AI System Design That Overcomes the Limits of Stateless LLMs with an Orchestration Layer
Generation 1 — Standalone Models (2018–2022)
Understanding Decoder-Only Transformers Part 1: Masked Self-Attention
Your AI Is Doing the Wrong Job. That's On You.
Building a Claude Stack for a Regulated Vertical (What I Learned Shipping for Law Firms)
When NOT to use RAG (lessons from building a Claude-powered support bot)
Locked, stocked, and losing budget: AI vendor lock-in bites back
DeepSeek-V4 Changes the Context Game for Agents — And Your Memory Architecture Should Adapt
Kimi K2.6 vs Claude Opus 4.7: The 88% Cost Advantage
Extending Reasoning Depth and Maximizing Token Efficiency with a Recursive Transformer Architecture
How I Cut My AI Chatbot Costs by 55% With One Architecture Change
How to Build an AI Co-Founder: The Exact Architecture That Actually Works
Same churn signal, two different right answers
Generating text with diffusion (and ROI with LLMs)
Visual Salamandra: Pushing the Boundaries of Multimodal Understanding