전체 피드 소스 목록

카테고리

Frontend Backend DevOps AI/ML Mobile Database Security Career Infrastructure

© 2026 DevPick

#llm-alignment

피드 검색 북마크 설정

Dev.to

데이터 구조와 리소스 기반의 LLM Alignment 최적 전략 분석

RLHF vs DPO vs IPO vs KTO: which alignment method should you use

AI/MLadvanced26 분 소요2026년 6월 16일

Dev.to

LLM의 회피적 답변을 억제하는 Functional Self 프로토콜 설계

Subjectivation: A protocol to give LLMs a functional, responsible self

AI/MLintermediate9 분 소요2026년 6월 5일

Dev.to

200K Context Window와 Constitutional AI 기반의 고신뢰성 LLM 설계

Claude AI: Features, Capabilities & Why It Stands Out in 2026

AI/MLintermediate6 분 소요2026년 5월 13일

Dev.to

Sycophancy 해결을 위한 Mandatory Adversarial Search 아키텍처 설계

Why I Built an AI That Tries to Destroy Your Legal Argument

AI/MLintermediate37 분 소요2026년 4월 29일

Hugging Face Blog

Hugging Face 팀이 Constitutional AI 기법을 오픈소스 LLM에 적용해 사용자 정의 원칙에 따른 자동 정렬 데이터셋 생성 및 안전성 평가 방법론 제시

Constitutional AI with Open LLMs

AI/MLintermediate50 분 소요2024년 2월 1일

Hugging Face Blog

Hugging Face TRL 라이브러리의 IPO 구현 버그(손실 함수 평균화 누락)를 수정해 DPO와 동등한 성능 달성

Preference Tuning LLMs with Direct Preference Optimization Methods

AI/MLintermediate23 분 소요2024년 1월 18일