The Technology Innovation Institute (TII) developed Falcon Mamba 7B, a pure State Space Model that achieves Transformer-level performance without any attention mechanism.
Welcome Falcon Mamba: The first strong attention-free 7B model
Large-scale Near-deduplication Behind BigCode
Parameter-Efficient Fine-Tuning using 🤗 PEFT
How 🤗 Accelerate runs very large models thanks to PyTorch
The Technology Behind BLOOM Training
Accelerate Large Model Training using DeepSpeed
Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API