Snowflake AI Research achieves a 3.7× throughput increase at 64K tokens by distributing attention heads across GPUs with Ulysses Sequence Parallelism
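The core idea behind Ulysses-style sequence parallelism can be sketched in a few lines: each rank starts with a shard of the sequence for all attention heads, and an all-to-all swaps the sharding axis so each rank holds the full sequence for a subset of heads. The following is a minimal single-process NumPy simulation of that redistribution (assumed mechanics for illustration, not the DeepSpeed implementation; all sizes are made up):

```python
import numpy as np

# Hypothetical sizes: P ranks, sequence length S, H heads, head dim D.
P, S, H, D = 4, 8, 8, 16
rng = np.random.default_rng(0)
q = rng.normal(size=(S, H, D))    # full query tensor, kept for reference

# Before the all-to-all: rank r holds a sequence shard of ALL heads,
# i.e. q[r*S//P:(r+1)*S//P, :, :].
seq_shards = np.split(q, P, axis=0)

# Simulated all-to-all: each rank r sends head-block h of its sequence
# shard to rank h. Afterwards rank h holds the FULL sequence for heads
# h*H//P:(h+1)*H//P, so attention for those heads runs locally with no
# further communication.
head_shards = [
    np.concatenate([np.split(s, P, axis=1)[h] for s in seq_shards], axis=0)
    for h in range(P)
]

# Each head shard now covers the whole sequence for H//P heads.
assert head_shards[0].shape == (S, H // P, D)
# Concatenating the head shards back along the head axis recovers q.
assert np.allclose(np.concatenate(head_shards, axis=1), q)
```

After attention, a second all-to-all restores the original sequence sharding; communication volume stays constant per rank as the sequence length grows, which is what makes million-token contexts feasible.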
Ulysses Sequence Parallelism: Training with Million-Token Contexts
From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate
Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator
Optimum+ONNX Runtime - Easier, Faster training for your Hugging Face models
Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate
The Technology Behind BLOOM Training
Accelerate Large Model Training using DeepSpeed
Fit More and Train Faster With ZeRO via DeepSpeed and FairScale