#sparse-moe article collection

Hugging Face Blog

EMO MoE architecture that achieves full-model performance with only a 12.5% subset of experts

EMO: Pretraining mixture of experts for emergent modularity

AI/ML · advanced · 22 min read · 5 days ago

Dev.to

llama.cpp supports Sparse MoE, new Qwen3.6 GGUF, & WebWorld for local agents

AI/ML · intermediate · 8 min read · 6 days ago