ํ”ผ๋“œ๋กœ ๋Œ์•„๊ฐ€๊ธฐ
OVHcloud on Hugging Face Inference Providers ๐Ÿ”ฅ
Hugging Face BlogHugging Face Blog
Backend

Hugging Face๊ฐ€ OVHcloud๋ฅผ Inference Provider๋กœ ํ†ตํ•ฉํ•˜์—ฌ ๋ชจ๋ธ ํŽ˜์ด์ง€์—์„œ ์ง์ ‘ ์„œ๋ฒ„๋ฆฌ์Šค ์ถ”๋ก  ํ˜ธ์ถœ ๊ฐ€๋Šฅ

OVHcloud on Hugging Face Inference Providers ๐Ÿ”ฅ

2025๋…„ 11์›” 24์ผ6๋ถ„beginner

Context

Hugging Face Hub ์‚ฌ์šฉ์ž๋“ค์ด ๋ชจ๋ธ์„ ํ™œ์šฉํ•  ๋•Œ ๋‹จ์ผ ์ถ”๋ก  ์ œ๊ณต์ž๋งŒ ์„ ํƒํ•  ์ˆ˜ ์žˆ์–ด ์ง€์—ญ์„ฑ, ๊ฐ€๊ฒฉ, ๊ธฐ๋Šฅ ์ธก๋ฉด์—์„œ ์ œํ•œ์ ์ด์—ˆ๋‹ค. ์œ ๋Ÿฝ ์‚ฌ์šฉ์ž๋“ค์€ ๋ฐ์ดํ„ฐ ์ฃผ๊ถŒ๊ณผ ์ง€์—ฐ์‹œ๊ฐ„ ์š”๊ตฌ์‚ฌํ•ญ์„ ์ถฉ์กฑํ•˜๋Š” ์ถ”๋ก  ์ธํ”„๋ผ์˜ ๋ถ€์กฑ์œผ๋กœ ์–ด๋ ค์›€์„ ๊ฒช๊ณ  ์žˆ์—ˆ๋‹ค.

Technical Solution

  • OVHcloud๋ฅผ Inference Provider๋กœ Hugging Face Hub์— ๋“ฑ๋ก: ๋ชจ๋ธ ํŽ˜์ด์ง€ UI์—์„œ OVHcloud ์„ ํƒ ๊ฐ€๋Šฅํ•˜๋„๋ก ๊ตฌํ˜„
  • ์–‘๋ฐฉํ–ฅ ์ธ์ฆ ์ง€์›: ์‚ฌ์šฉ์ž API ํ‚ค ์ง์ ‘ ์‚ฌ์šฉ(Custom Key ๋ชจ๋“œ) ๋˜๋Š” Hugging Face ํ† ํฐ์œผ๋กœ ์ž๋™ ๋ผ์šฐํŒ…(Routed by HF ๋ชจ๋“œ) ์˜ต์…˜ ์ œ๊ณต
  • Python(huggingface_hub >= 1.1.5) ๋ฐ JavaScript(@huggingface/inference) ํด๋ผ์ด์–ธํŠธ SDK ํ†ตํ•ฉ: ๋ชจ๋ธ๋ช…์— ":ovhcloud" ์ ‘๋ฏธ์‚ฌ ์ถ”๊ฐ€๋กœ ์ œ๊ณต์ž ์ง€์ •
  • ์‚ฌ์šฉ์ž ๊ณ„์ • ์„ค์ •์—์„œ ์ œ๊ณต์ž API ํ‚ค ์ €์žฅ ๋ฐ ์šฐ์„ ์ˆœ์œ„ ๊ด€๋ฆฌ ๊ธฐ๋Šฅ ์ถ”๊ฐ€
  • ๊ตฌ์กฐํ™”๋œ ์ถœ๋ ฅ, ํ•จ์ˆ˜ ํ˜ธ์ถœ, ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ(ํ…์ŠคํŠธ/์ด๋ฏธ์ง€) ๊ธฐ๋Šฅ ์ง€์›ํ•˜๋Š” OVHcloud AI Endpoints ํ”Œ๋žซํผ ํ†ตํ•ฉ

Impact

OVHcloud AI Endpoints๋Š” ์ฒซ ํ† ํฐ ์‘๋‹ต์‹œ๊ฐ„ 200ms ๋ฏธ๋งŒ ์ œ๊ณต, โ‚ฌ0.04/๋ฐฑ๋งŒ ํ† ํฐ ๊ฐ€๊ฒฉ๋Œ€ ์ง€์›, Hugging Face PRO ์‚ฌ์šฉ์ž์—๊ฒŒ ๋งค์›” $2 ์ถ”๋ก  ํฌ๋ ˆ๋”ง ์ œ๊ณต


Hugging Face Hub๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฐœ๋ฐœํŒ€์—์„œ ๋‹ค์ค‘ ์ถ”๋ก  ์ œ๊ณต์ž(OVHcloud, OpenAI ๋“ฑ)๋ฅผ SDK๋กœ ํ†ตํ•ฉํ•  ๋•Œ, ๋ชจ๋ธ๋ช…์— ์ œ๊ณต์ž๋ช…์„ ์ ‘๋ฏธ์‚ฌ๋กœ ์ถ”๊ฐ€ํ•˜๋Š” ๋‹จ์ˆœํ•œ ๋ฌธ๋ฒ• ๊ทœ์น™์œผ๋กœ ์ œ๊ณต์ž ์ „ํ™˜์ด ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์ฒญ๊ตฌ ๋ฐฉ์‹(์ง์ ‘ ๋˜๋Š” Hub ๋ผ์šฐํŒ…)์„ ์„ ํƒํ•ด ๋น„์šฉ ์ตœ์ ํ™”์™€ ๋ฐ์ดํ„ฐ ์ฃผ๊ถŒ์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค.

์›๋ฌธ ์ฝ๊ธฐ