전체 피드 소스 목록

카테고리

Frontend Backend DevOps AI/ML Mobile Database Security Career Infrastructure

© 2026 DevPick

#on-policy-distillation

피드 검색 북마크 설정

Dev.to

GRPO 기반 RL 및 OPD 증류를 통한 Qwen-Image-2.0 성능 최적화

The Interesting Part of Qwen-Image-2.0-RL Is Not the Image Score

AI/MLadvanced16 분 소요3일 전

Hacker News

KV cache 90% 절감 및 1M 토큰 컨텍스트 구현한 MoE 아키텍처

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

AI/MLadvanced16 분 소요2026년 4월 24일