Defluffer - reduce token usage 📉 by 45% using this one simple trick! [Earthday challenge]
Dev.to
AI/ML

Cutting LLM prompt tokens by 45% with regex-based text compression

GrahamTheDev · April 18, 2026 · 7 min read · beginner

Context

Because of how an LLM's context window works, input tokens accumulate as a conversation grows, so cost and latency rise with every turn. This is a structural limitation. Using an LLM to optimize the prompt is self-defeating, because the optimization pass itself consumes additional tokens.
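
To make the accumulation concrete, here is a minimal sketch of the arithmetic. It assumes a chat API that resends the full history on every request and, for simplicity, that every turn adds the same number of tokens; the function name and the numbers are illustrative and do not come from the article.

```python
def cumulative_input_tokens(turns: int, tokens_per_turn: int) -> int:
    """Total input tokens sent over a conversation when each request
    resends the whole history (assumption: equal-sized turns)."""
    # Request k carries k turns of history, so the total is
    # tokens_per_turn * (1 + 2 + ... + turns).
    return sum(k * tokens_per_turn for k in range(1, turns + 1))


baseline = cumulative_input_tokens(turns=10, tokens_per_turn=100)
compressed = cumulative_input_tokens(turns=10, tokens_per_turn=55)  # 45% smaller turns
print(baseline, compressed)
```

Under these assumptions the total grows roughly with the square of the number of turns, so shrinking each turn pays off on every later request that resends it.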

Technical Solution

  • Code Block protection logic: a regex moves code regions into separate storage so that text compression cannot alter them, preserving code integrity
  • Phrase Collapsing: replaces phrases using a predefined dictionary to minimize payload size
  • Blacklist Filtering: uses a hash set to strip unnecessary filler words quickly
  • Symbolic Logic Mapping: converts natural-language negation and logical constructs into symbols (e.g. not → !) to raise token density
  • Stemming & Synonym Replacement: reduces words to their base form and swaps in shorter synonyms to shorten the string
  • Multi-pass Cleanup: strips whitespace and normalizes special characters to produce the final optimized text
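
The steps above can be sketched roughly as follows. This is not Defluffer's actual code: the `PHRASES`, `FILLERS`, and `SYMBOLS` tables, the `@@CODE0@@` placeholder format, and the `defluff` function name are all assumptions made for illustration, and stemming and synonym replacement are left out to keep the sketch short.

```python
import re

# Hypothetical tables; the article's real dictionaries are not shown here.
PHRASES = {"in order to": "to", "due to the fact that": "because"}
FILLERS = {"please", "just", "really", "very", "basically"}
SYMBOLS = {"not": "!"}

CODE_FENCE = re.compile(r"```.*?```", re.DOTALL)


def defluff(text: str) -> str:
    # 1. Code block protection: stash fenced code behind placeholders.
    blocks = []

    def stash(match):
        blocks.append(match.group(0))
        return f" @@CODE{len(blocks) - 1}@@ "

    text = CODE_FENCE.sub(stash, text)

    # 2. Phrase collapsing: longest phrases first so they win over substrings.
    for phrase in sorted(PHRASES, key=len, reverse=True):
        pattern = rf"\b{re.escape(phrase)}\b"
        text = re.sub(pattern, PHRASES[phrase], text, flags=re.IGNORECASE)

    # 3. Blacklist filtering and 4. symbolic mapping, one word at a time.
    kept = []
    for word in text.split():
        key = word.strip(".,!?;:").lower()
        if key in FILLERS:  # set lookup, so this stays cheap
            continue
        kept.append(SYMBOLS.get(key, word))

    # 5. Cleanup: split/join has already collapsed runs of whitespace.
    text = " ".join(kept)

    # 6. Restore the protected code blocks untouched.
    for i, block in enumerate(blocks):
        text = text.replace(f"@@CODE{i}@@", block)
    return text


prompt = (
    "Please just rewrite this in order to make it faster, "
    "do not change the API.\n```python\nx  =  1\n```"
)
print(defluff(prompt))
```

Note that the double spaces inside the fenced block survive, while the prose around it is compressed. A real implementation would also need to handle punctuation attached to dropped words and inline code, which this sketch ignores.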

Impact

  • Prompt token usage reduced by 45% on average
  • Near-zero compute cost, since processing relies only on regexes and dictionaries
  • Projects a possible saving of 60 GW of electricity per year if 40 million developers each use it 30 times a day
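
For checking a reduction figure like this on your own prompts, a minimal sketch might look like the following. Both helper names are hypothetical, and the whitespace split is only a crude stand-in for a real tokenizer, so the percentages it yields will differ from what a model actually bills.

```python
def rough_token_count(text: str) -> int:
    """Crude proxy: count whitespace-separated words, not real model tokens."""
    return len(text.split())


def reduction_percent(tokens_before: int, tokens_after: int) -> float:
    """Percentage of tokens saved relative to the original count."""
    if tokens_before <= 0:
        raise ValueError("tokens_before must be positive")
    return 100.0 * (tokens_before - tokens_after) / tokens_before


# Illustrative numbers only: 200 tokens compressed to 110 is a 45% saving.
print(reduction_percent(200, 110))
```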

Key Takeaway

Meaningful cost savings and performance gains are achievable with deterministic, rule-based preprocessing alone, without running expensive LLM inference.


  • Consider adding a regex step that strips unnecessary whitespace and filler words before text is sent to the LLM
  • Build a domain-specific term dictionary and apply a mapping table that replaces long phrases with short tokens
  • Design protection logic that excludes code, JSON, and other structured data from compression
