ํ”ผ๋“œ๋กœ ๋Œ์•„๊ฐ€๊ธฐ
๐Ÿค—ย PEFT welcomes new merging methods
Hugging Face BlogHugging Face Blog
AI/ML

Hugging Face PEFT๊ฐ€ LoRA ์–ด๋Œ‘ํ„ฐ ๋ณ‘ํ•ฉ์„ ์œ„ํ•ด Concatenation, Linear/Task Arithmetic, DARE, TIES 4๊ฐ€์ง€ ๋ฐฉ์‹์„ ์ถ”๊ฐ€ํ•ด ๋ฉ”๋ชจ๋ฆฌ ํšจ์œจ์ ์ธ ์˜จ๋”ํ”Œ๋ผ์ด ๋ณ‘ํ•ฉ ์ง€์›

๐Ÿค—ย PEFT welcomes new merging methods

2024๋…„ 2์›” 19์ผ10๋ถ„intermediate

Context

๊ธฐ์กด ๋ชจ๋ธ ๋ณ‘ํ•ฉ์€ ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๋‹ค์šด๋กœ๋“œํ•œ ํ›„ ๋ณ‘ํ•ฉํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ๋ฉ”๋ชจ๋ฆฌ ์ง‘์•ฝ์ ์ด์—ˆ๋‹ค. ๊ฐ™์€ ๊ธฐ๋ณธ ๋ชจ๋ธ์—์„œ ์–ป์€ ์—ฌ๋Ÿฌ LoRA ์–ด๋Œ‘ํ„ฐ๋ฅผ ์‹คํ—˜์ ์œผ๋กœ ์กฐํ•ฉํ•  ๋•Œ, ๋ฉ”๋ชจ๋ฆฌ ์ œ์•ฝ ์†์—์„œ ๋‹ค์–‘ํ•œ ๋ณ‘ํ•ฉ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ฆ‰์‹œ ์‹œํ—˜ํ•˜๊ธฐ ์–ด๋ ค์› ๋‹ค.

Technical Solution

  • Concatenation (cat) ๋ฐฉ์‹: LoRA ํ–‰๋ ฌ(A, B)์„ ๊ฐ€์ค‘์น˜์™€ ์Šค์ผ€์ผ๋ง ๊ณ„์ˆ˜๋ฅผ ๊ณฑํ•œ ํ›„ ์ฐจ์› 0๊ณผ 1์— ๋”ฐ๋ผ ์—ฐ๊ฒฐํ•˜์—ฌ ์›๋ณธ ์–ด๋Œ‘ํ„ฐ๋“ค์˜ ๊ฐ€์ค‘ ๋ณ‘ํ•ฉ๊ณผ ๋™์ผํ•œ ์ถœ๋ ฅ ์ƒ์„ฑ
  • Linear/Task Arithmetic (linear) ๋ฐฉ์‹: A์™€ B ํ–‰๋ ฌ์˜ ๊ฐ€์ค‘ํ•ฉ์„ ๊ณ„์‚ฐํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ๊ตฌํ˜„ํ•˜๋˜, ๋ชจ๋“  ์ฐธ์—ฌ LoRA ์–ด๋Œ‘ํ„ฐ๊ฐ€ ๋™์ผํ•œ rank๋ฅผ ๊ฐ€์ ธ์•ผ ํ•จ
  • DARE ๋ฐฉ์‹: ๊ณ ๋ฐ€๋„(high density) ์„ค์ •์œผ๋กœ ์Šคํƒ€์ผ ์–ด๋Œ‘ํ„ฐ ์กฐํ•ฉ ์‹œ ์„ฑ๋Šฅ ํ–ฅ์ƒ
  • TIES ๋ฐฉ์‹: majority_sign_method="frequency" ์˜ต์…˜์œผ๋กœ ์Šคํƒ€์ผ ์–ด๋Œ‘ํ„ฐ ์กฐํ•ฉ ์‹œ ์ตœ์  ์„ฑ๋Šฅ ์ œ๊ณต
  • ์˜จ๋”ํ”Œ๋ผ์ด ๋ณ‘ํ•ฉ: ์–ด๋Œ‘ํ„ฐ ํ™œ์„ฑํ™”/๋น„ํ™œ์„ฑํ™” ์‹œ ๋ฉ”๋ชจ๋ฆฌ ๋‚ด์—์„œ ์‹ค์‹œ๊ฐ„์œผ๋กœ ๋ณ‘ํ•ฉ ์ˆ˜ํ–‰

Key Takeaway

๋‹ค์–‘ํ•œ ํŒŒ๋ผ๋ฏธํ„ฐ ํšจ์œจ ๊ธฐ๋ฒ•(LoRA, IA3 ๋“ฑ) ๊ฐ„ ๋ณ‘ํ•ฉ ์š”๊ตฌ์‚ฌํ•ญ์ด ๋‹ค๋ฅด๋ฏ€๋กœ, ๊ฐ ์–ด๋Œ‘ํ„ฐ ํƒ€์ž…์— ์ตœ์ ํ™”๋œ ๋ณ„๋„์˜ ๋ณ‘ํ•ฉ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ํ•„์š”ํ•˜๋‹ค. ๋ฉ”๋ชจ๋ฆฌ ์ œ์•ฝ ํ™˜๊ฒฝ์—์„œ๋Š” ์ฒดํฌํฌ์ธํŠธ ๋‹ค์šด๋กœ๋“œ ๊ธฐ๋ฐ˜ ๋ณ‘ํ•ฉ ๋Œ€์‹  ์˜จ๋”ํ”Œ๋ผ์ด ๋ฐฉ์‹์ด ์‚ฌ์šฉ์ž ๊ฒฝํ—˜์„ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค.


LoRA ์–ด๋Œ‘ํ„ฐ๋ฅผ ์—ฌ๋Ÿฌ ๊ฐœ ํ•™์Šตํ•ด์„œ ์กฐํ•ฉ ์‹คํ—˜์„ ํ•˜๋Š” ํŒ€์—์„œ๋Š” PEFT์˜ ์ƒˆ๋กœ์šด ๋ณ‘ํ•ฉ ๋ฉ”์„œ๋“œ๋ฅผ ์‚ฌ์šฉํ•˜๋ฉด ๊ฐ ์–ด๋Œ‘ํ„ฐ๋ฅผ ๋ฉ”๋ชจ๋ฆฌ์— ๋™์‹œ ๋กœ๋“œํ•  ํ•„์š” ์—†์ด ์ฆ‰์‹œ ๋‹ค์–‘ํ•œ ๋ณ‘ํ•ฉ ๋ฐฉ์‹(cat, linear, DARE, TIES)์„ ํ…Œ์ŠคํŠธํ•˜๊ณ  ์ตœ์ƒ์˜ ๊ฒฐ๊ณผ๋ฅผ ์ œ๊ณตํ•˜๋Š” ๋ณ‘ํ•ฉ ๋ฐฉ์‹์„ ์„ ํƒํ•  ์ˆ˜ ์žˆ๋‹ค.


์›๋ฌธ ์ฝ๊ธฐ