1.3B 파라미터 MiniCPM-V 4.6 기반의 On-Device Multimodal Tiered Architecture
A 1.3B model just shipped that runs on your phone, and the labs obsessed with frontier scores won't see this story coming
A 1.3B model just shipped that runs on your phone, and the labs obsessed with frontier scores won't see this story coming
Your Latency Problem Isn't Model Size (It’s Your Routing)