Claude Sonnet 5 Released
Claude Sonnet 5 Launches, Delivering Opus-Class Agentic Performance at Sonnet Cost
Token Counting Done Right: Stop Using tiktoken for Claude
Why Your Word Counter Gives Different Results Than Others (And How They All Work)
Why SQLite FTS5's default tokenizer drops your Japanese substrings (and the one-line fix)
GPT-5.5 may burn fewer tokens, but it always burns more cash
My Claude API Bill Jumped 47% and I Didn't Change a Single Prompt — Here's Why
One Open Source Project a Day (No.51): VibeVoice - Microsoft's Speech AI That Processes 90 Minutes of Audio in a Single Pass
VibeVoice Released: High-Efficiency Speech AI Built on an Ultra-Low 7.5Hz Frame Rate
The Hidden Challenge of Multi-LLM Context Management
Opus 4.7: Output Token Optimization Cuts Inference Cost by 11% and Raises Intelligence Index
Opus 4.7 Uses 35% More Tokens Than 4.6. Here's What I'm Doing About It.
Claude Opus 4.7
Self-Verification Boosts Coding Performance by 13% and Triples Production Resolution Rate
Claude Opus 4.7: What the release notes don't tell you about token costs
Analyzing How LLMs Work Internally with the 9M-Parameter GuppyLM
I built the algorithm behind ChatGPT from scratch — here's what I learned
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
Falcon-Arabic: A Breakthrough in Arabic Language Models
Universal Assisted Generation: Faster Decoding with Any Assistant Model
Welcome Llama 3 - Meta's new open LLM