ํ”ผ๋“œ๋กœ ๋Œ์•„๊ฐ€๊ธฐ
My AI Experience in Russia as a European๐Ÿคฏ
Dev.toDev.to
AI/ML

VPN ์ „๋ฉด ์ฐจ๋‹จ ํ™˜๊ฒฝ์„ ๊ทน๋ณตํ•œ Local LLM ๊ธฐ๋ฐ˜ Full-Stack ๊ฐœ๋ฐœ ํ™˜๊ฒฝ ๊ตฌ์ถ•

My AI Experience in Russia as a European๐Ÿคฏ

Ross Peili2026๋…„ 4์›” 29์ผ4๋ถ„intermediate

Context

๋Ÿฌ์‹œ์•„ ๋‚ด ์ƒ์šฉ VPN 99% ์ฐจ๋‹จ์œผ๋กœ ์ธํ•œ GCP, Gemini, Claude ๋“ฑ ์™ธ๋ถ€ AI API ์ ‘๊ทผ ๋ถˆ๊ฐ€ ์ƒํ™ฉ ๋ฐœ์ƒ. ๊ธฐ์กด Cloud-native ๊ฐœ๋ฐœ ์›Œํฌํ”Œ๋กœ์šฐ๊ฐ€ ์™„์ „ํžˆ ๋งˆ๋น„๋œ ํ™˜๊ฒฝ์—์„œ Enterprise ์ˆ˜์ค€์˜ ์ฝ”๋“œ ์ƒ์„ฑ ๋ฐ ๋ถ„์„ ๋Šฅ๋ ฅ ํ™•๋ณด๊ฐ€ ์‹œ๊ธ‰ํ•œ ์ƒํƒœ.

Technical Solution

  • GGUF ํฌ๋งท์˜ Open-weight ๋ชจ๋ธ(Gemma 4, Qwen 2.5 Coder, DeepSeek Coder)์„ SSD์— ์‚ฌ์ „ ํ™•๋ณดํ•˜์—ฌ ๋„คํŠธ์›Œํฌ ์˜์กด์„ฑ ์ œ๊ฑฐ
  • Ollama๋ฅผ ํ†ตํ•œ Local Inference ์„œ๋ฒ„ ๊ตฌ์ถ•์œผ๋กœ ์™ธ๋ถ€ API ํ˜ธ์ถœ ์—†์ด ๋กœ์ปฌ ๋ฆฌ์†Œ์Šค ๋‚ด์—์„œ LLM ๊ตฌ๋™
  • VSCode์™€ Continue ํ”Œ๋Ÿฌ๊ทธ์ธ์„ ์—ฐ๋™ํ•˜์—ฌ Autocomplete, Chat, Code Generation ๋“ฑ ํƒœ์Šคํฌ๋ณ„ ์ „์šฉ ๋ชจ๋ธ์„ ๋งคํ•‘ํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ถ”๋ก  ๊ตฌ์กฐ ์„ค๊ณ„
  • VRAM ๋ถ€์กฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด CPU ๋ฐ RAM ๊ธฐ๋ฐ˜ ์ถ”๋ก  ์„ค์ •๊ณผ config.yaml ์ตœ์ ํ™”๋ฅผ ํ†ตํ•œ ์‹œ์Šคํ…œ ์•ˆ์ •์„ฑ ํ™•๋ณด
  • Skillware์˜ Prompt Rewriter๋ฅผ ๋„์ž…ํ•˜์—ฌ ํ† ํฐ ์‚ฌ์šฉ๋Ÿ‰์„ ์••์ถ•ํ•จ์œผ๋กœ์จ ์ œํ•œ๋œ ๋กœ์ปฌ ๋ฆฌ์†Œ์Šค ๋‚ด ์ปจํ…์ŠคํŠธ ์ฒ˜๋ฆฌ ํšจ์œจ ๊ทน๋Œ€ํ™”
  • Repo-level ์ปจํ…์ŠคํŠธ ์ฃผ์ž… ๋ฐ ๋‹จ๊ณ„์  Task Planning ์„ค์ •์„ ํ†ตํ•œ ๋ณต์žกํ•œ Multi-step ์—”์ง€๋‹ˆ์–ด๋ง ํƒœ์Šคํฌ ์ˆ˜ํ–‰ ๋Šฅ๋ ฅ ๊ตฌํ˜„

- ๋„คํŠธ์›Œํฌ ๋‹จ์ ˆ ์ƒํ™ฉ์— ๋Œ€๋น„ํ•œ Open-weight ๋ชจ๋ธ(GGUF) ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ๊ตฌ์ถ• - Ollama + Continue ํ”Œ๋Ÿฌ๊ทธ์ธ ์กฐํ•ฉ์„ ํ†ตํ•œ Local AI ๊ฐœ๋ฐœ ํ™˜๊ฒฝ ์…‹์—… ๊ฒ€ํ†  - ํƒœ์Šคํฌ ์„ฑ๊ฒฉ(์ž๋™์™„์„ฑ vs ์ฑ„ํŒ…)์— ๋”ฐ๋ฅธ ๋ชจ๋ธ ๋ถ„๋ฆฌ ๋ฐ Temperature/Context ํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹ ์ ์šฉ - ๋กœ์ปฌ ์ถ”๋ก  ์‹œ ํ† ํฐ ์ตœ์ ํ™”๋ฅผ ์œ„ํ•œ Prompt Compression ๊ธฐ๋ฒ• ๋„์ž…

์›๋ฌธ ์ฝ๊ธฐ