Handling 64K Context and Achieving 0.5M TPS by Adopting bfloat16
Is Brain Float (bf16) Worth it?
SubQ Model: Can Subquadratic Make Long-Context AI More Efficient?
Flux Attention halves inference cost on long contexts
1,000x Claim, No Independent Proof: Subquadratic Architecture
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents
DeepSeek V4: Million-Token Context That Actually Works
I Canceled Claude: I Measured the Quality Decline with My Own Benchmarks Before Leaving
Kimi K2.6 Has Arrived: An Open-Weight Powerhouse for Agentic Work
Claude Code's Edge: Why Sonnet 4.5 Beats GPT-4o for Multi-File Projects
Claude Opus 4.7 Debuts, Qwen 3.6-35B Open-Source, & Claude Code Workflow
Claude vs GPT-4o for Autonomous Agent Work: 30 Days of Real Data
Intelligence-per-Token: Why AI's Cost Problem Is Forcing a Reckoning in 2026
Claude Wrote a Cosmology Solver in Days — Patterns a Game Dev Wants to Steal
TextQuests: How Good are LLMs at Text-Based Video Games?
SmolLM3: smol, multilingual, long-context reasoner