#model-sharding 아티클 모음

Dev.to

QLoRA 기반 7B LLM 튜닝 및 14GB 모델 배포의 인프라 제약 분석

Fine-tuned 7B LLM as a broke student. And Can't even use it 😭.

AI/MLintermediate8 분 소요2026년 6월 6일

Dev.to

Designing GenAI Infrastructure: How to Scale Video Generation

Infrastructureadvanced12 분 소요2026년 4월 12일

Hugging Face Blog

How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs

AI/MLintermediate30 분 소요2024년 12월 5일