© 2026 DevPick

#pipeline-parallelism

Dev.to

1.69x Training Speedup via Local Gradient Accumulation

Local Gradient Accumulation Speeds Training 1.7

AI/ML · advanced · 6 min read · June 21, 2026

InfoQ

Optimizing LLM Infrastructure with Disaggregated Prefill and the Infire Engine

Cloudflare Builds High-Performance Infrastructure for Running LLMs

Infrastructure · advanced · 8 min read · May 3, 2026

The Register

Galaxy Blackhole Launches: A High-Efficiency AI Server Built on a 100Tbps Ethernet Mesh

Tenstorrent’s Galaxy Blackhole AI servers escape the event horizon

AI/ML · advanced · 7 min read · April 28, 2026

Cloudflare Blog

3x Token Latency Improvement via PD Disaggregation and the Infire Engine

Building the foundation for running extra-large language models

AI/ML · advanced · 24 min read · April 16, 2026

Hugging Face Blog

The Hugging Face team ported a model trained with Megatron-DeepSpeed to Transformers, then used Pipeline Parallelism, Accelerate, and CUDA kernel optimizations to cut BLOOM inference latency by 5x and raise throughput by 50x

Optimization story: Bloom inference

Backend · advanced · 58 min read · October 12, 2022