Achieving 72,000 tok/s on an 80B model by removing HBM and using 330 GB of M3D-based on-die DRAM
Sophon PFG-1: a monolithic-3D AI ASIC with 330 GB of on-die DRAM and no HBM
Why Attention Becomes the Bottleneck — And How Efficient Attention Fixes It
Speculative decoding: when and why it actually speeds up inference
Vertically integrated AI infrastructure built on a 2x power-efficiency gain and separate dedicated chips for training and inference