#knowledge-distillation 아티클 모음

Hacker News

Proxy-KD 도입을 통한 Black-Box LLM의 지식 전이 효율 극대화

Knowledge Distillation of Black-Box Large Language Models

AI/MLadvanced3 분 소요4일 전

GeekNews

Ford, AI 품질검사 차질로 ‘gray beard’ 검사관 재고용

AI 품질검사의 한계를 베테랑 엔지니어 350명 재고용으로 극복하여 JD Power 1위 달성

AI/MLintermediate12 분 소요6일 전

The Register

Multi-Agent Swarm 기반 취약점 분석으로 10년 된 제로데이 탐색 성공

Chinese cybersecurity company claims it’s built a better-than-Mythos bug finder

Securityadvanced7 분 소요6일 전

GeekNews

Moebius: 0.2B 이미지 인페인팅 모델로 10B급 성능 달성

0.22B 파라미터로 10B급 성능 구현 및 추론 속도 15배 가속

AI/MLadvanced13 분 소요2026년 6월 24일

Dev.to

12B Diffusion Transformer 기반의 Raw-Turbo 이원화 워크플로우를 통한 2초 내 고해상도 생성 구현

Enterprise AI Image Generation: The Custom Edge in 2026

AI/MLadvanced47 분 소요2026년 6월 23일

Dev.to

0.2B 경량 모델로 10B급 성능 구현한 Moebius 및 멀티모달 최적화 전략

Top AI Papers on Hugging Face - 2026-06-22

AI/MLadvanced27 분 소요2026년 6월 22일

Dev.to

Sigmoid Gate 기반 가중치 제어로 GRPO 학습 안정성 및 증류 효율 극대화

The Whole Paper Fits in One Sigmoid: Implementing the SDAR Gate

AI/MLadvanced16 분 소요2026년 6월 14일

GeekNews

Show GN: Claude Code에 Hermes Agent식 자기개선 루프를 붙이는 플러그인

Claude Code에 SKILL.md 기반 자기개선 루프를 구현한 Hermes Agent식 플러그인

AI/MLintermediate1 분 소요2026년 6월 9일

Dev.to

Single-tenant memory 탈피를 통한 Org-level 지식 복리 및 Agent fleet 최적화

Single-tenant memory is the wrong default for agents

AI/MLintermediate18 분 소요2026년 6월 8일

Hugging Face Blog

Knowledge Distillation과 Dual-LoRA 기반 맞춤형 채용 매칭 시스템 구축

Job Searcher

AI/MLintermediate9 분 소요2026년 6월 6일

Hacker News

Qwen3.5-122B 모델을 48GiB GGUF로 압축한 Edge AI 최적화 기법

Launch HN: General Instinct (YC P26) – Frontier models on edge devices

AI/MLadvanced3 분 소요2026년 6월 5일

Dev.to

35B Active Params MoE 구조로 Claude Opus급 성능 구현한 MAI-Thinking-1

MAI-Thinking-1: Microsoft's New Reasoning Model and What It Means for Developers

AI/MLadvanced21 분 소요2026년 6월 5일

Dev.to

Petaflop급 Edge Compute 기반 Scaling Out 아키텍처로의 AI 비용 패러다임 전환

NVIDIA Put Petaflop Compute on Your Desk — And It Changes the AI Cost Equation

AI/MLintermediate25 분 소요2026년 6월 3일

Dev.to

7B VLM을 2B로 Distillation하여 속도 2.4배 개선 및 ROUGE-L 성능 향상

I distilled a 7B vision model into a 2B one for screenshots — and the 7B teacher scored worse

AI/MLadvanced28 분 소요2026년 6월 2일

Dev.to

Flux DiT 도입을 통한 텍스트 렌더링 정밀도 향상 및 VRAM-속도 Trade-off 분석

Flux vs SDXL vs SD 1.5: Real Cost-per-Image Across GPUs (2026)

AI/MLintermediate14 분 소요2026년 6월 2일

Dev.to

모델 경량화 및 Kubernetes 도입을 통한 서빙 비용 최적화와 15GB→300MB 용량 절감

Serving AI Models: Balancing Cost and Performance

AI/MLintermediate20 분 소요2026년 6월 2일

GeekNews

TabPFN - 테이블 데이터를 위한 파운데이션 모델

전처리 없는 Zero-shot 추론으로 정형 데이터 분석 파이프라인 최적화

AI/MLintermediate2 분 소요2026년 5월 21일

Dev.to

트레이더의 정성적 경험을 정량적 Capability Factor로 변환한 Consensus 전략 설계

From 99 Traders to One Signal: Implementing a Distilled KOL Consensus Strategy on FMZ

AI/MLadvanced38 분 소요2026년 5월 18일

Dev.to

26M 파라미터 기반 초경량 Tool Calling 전용 모델 Needle의 효율적 아키텍처

Needle and the Return of the Tiny Specialist Model

AI/MLadvanced10 분 소요2026년 5월 18일

GeekNews

Needle - Gemini 도구 호출을 증류한 2600만 파라미터 모델

Gemini 증류 기반 26M 파라미터의 초경량 Tool Use 모델 Needle 분석

AI/MLadvanced7 분 소요2026년 5월 13일