Designing a 10-Layer Deterministic Trust Stack for LLM Reliability
The LLM Is Not the Final Authority: Building Trust Infrastructure for AI Agents
Maximizing LLM Inference Efficiency Through Agent State Externalization and Weight Compilation
BAGEN: LLM Agents Waste 44% of Tokens on Tasks They'll Fail
Cloudflare Adds Support for Claude Managed Agents
LLM Agents Are Now Finding Zero-Days: How AI is Autonomously Rewriting the Rules of Vulnerability Research
What 1,281 agent runs reveal about coding agent failure in large codebases
Building a general-purpose accessibility agent—and what we learned in the process
A Markdown-Based Skills Architecture and Waterfall Process for Building Expertise in LLM Agents
Designing Reliable Tool Schemas with Zod for LLM Agents
PIIGhost: a Python library for PII anonymization in LLM agents
AI supply chain attacks don’t even require malware…just post poisoned documentation
How Dash uses context engineering for smarter AI
Introducing smolagents: simple agents that write actions in code.
Our Transformers Code Agent beats the GAIA benchmark 🏅