ํ”ผ๋“œ๋กœ ๋Œ์•„๊ฐ€๊ธฐ
I Thought โ€œData Analystโ€ Was the Whole Gameโ€ฆ Then I Entered the Data Avengers Office ๐Ÿ‘€
Dev.toDev.to
Infrastructure

๋ฐ์ดํ„ฐ ์ƒํƒœ๊ณ„ ์ „ ๊ณผ์ •์˜ ์—ญํ•  ๋ถ„๋ฆฌ๋ฅผ ํ†ตํ•œ End-to-End ํŒŒ์ดํ”„๋ผ์ธ ์ตœ์ ํ™”

I Thought โ€œData Analystโ€ Was the Whole Gameโ€ฆ Then I Entered the Data Avengers Office ๐Ÿ‘€

Yukti Sahu2026๋…„ 5์›” 25์ผ7๋ถ„beginner

Context

๋‹จ์ผ ์—ญํ• ์˜ ๋ฐ์ดํ„ฐ ๋ถ„์„๊ฐ€ ์ค‘์‹ฌ ๊ตฌ์กฐ๋กœ๋Š” ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์˜ ์ˆ˜์ง‘, ์ •์ œ, ๋ฐฐํฌ ๋ฐ ์˜ˆ์ธก ๋ชจ๋ธ์˜ ์„œ๋น„์Šคํ™”๋ผ๋Š” ๋ณต์žกํ•œ ์š”๊ตฌ์‚ฌํ•ญ ์ถฉ์กฑ์— ํ•œ๊ณ„ ๋ฐœ์ƒ. ์›์‹œ ๋ฐ์ดํ„ฐ์˜ ๋ฌด๊ฒฐ์„ฑ ๊ฒฐ์—ฌ์™€ ์‹œ์Šคํ…œ ํ™•์žฅ์„ฑ ๋ถ€์กฑ์œผ๋กœ ์ธํ•œ ๋น„ํšจ์œจ์  ์šด์˜ ํ™˜๊ฒฝ์„ ํ•ด๊ฒฐํ•ด์•ผ ํ•˜๋Š” ์ƒํ™ฉ.

Technical Solution

  • Data Architecture ์„ค๊ณ„๋ฅผ ํ†ตํ•œ ์ „์‚ฌ์  ๋ฐ์ดํ„ฐ ์ €์žฅ์†Œ ๋ฐ ์‹œ์Šคํ…œ ์—ฐ๊ฒฐ ๊ตฌ์กฐ ์ •์˜๋กœ ํ™•์žฅ์„ฑ ํ™•๋ณด
  • Data Engineering ํŒŒ์ดํ”„๋ผ์ธ ๊ตฌ์ถ•์„ ํ†ตํ•œ Raw Data์˜ ์ •์ œ ๋ฐ ์ž๋™ํ™”๋œ ์ˆ˜์ง‘/์ด๋™ ํ”„๋กœ์„ธ์Šค ๊ตฌํ˜„
  • Data Analysis์™€ BI Development์˜ ๋ถ„๋ฆฌ๋ฅผ ํ†ตํ•œ ๋น„์ฆˆ๋‹ˆ์Šค ํ†ต์ฐฐ๋ ฅ ๋„์ถœ ๋ฐ ์‹œ๊ฐํ™” KPI ํŠธ๋ž˜ํ‚น ์ตœ์ ํ™”
  • Data Science์˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง๊ณผ ML Engineering์˜ Production ๋ฐฐํฌ ๋ถ„๋ฆฌ๋ฅผ ํ†ตํ•œ ๋ชจ๋ธ ์„ฑ๋Šฅ์˜ ์‹ค์„œ๋น„์Šค ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ ๊ทน๋Œ€ํ™”
  • ์—ญํ• ๋ณ„ ์ „์šฉ ์Šคํƒ(Spark, Airflow, Kafka, Kubernetes ๋“ฑ) ๋„์ž…์œผ๋กœ ๊ฐ ๋‹จ๊ณ„์˜ ์ฒ˜๋ฆฌ ๋ณ‘๋ชฉ ์ง€์  ํ•ด์†Œ

- ๋ฐ์ดํ„ฐ ํŒŒ์ดํ”„๋ผ์ธ ์„ค๊ณ„ ์‹œ ์ˆ˜์ง‘(Engineer)๊ณผ ๋ถ„์„(Analyst)์˜ ์ฑ…์ž„ ์˜์—ญ์„ ๋ช…ํ™•ํžˆ ๋ถ„๋ฆฌํ–ˆ๋Š”๊ฐ€ - ์‹คํ—˜์  ๋ชจ๋ธ(Scientist)์„ ์‹ค์ œ ํŠธ๋ž˜ํ”ฝ ํ™˜๊ฒฝ์— ๋ฐฐํฌํ•˜๊ธฐ ์œ„ํ•œ ML Ops ํ”„๋กœ์„ธ์Šค๊ฐ€ ์ •์˜๋˜์–ด ์žˆ๋Š”๊ฐ€ - ๋น„์ฆˆ๋‹ˆ์Šค ์š”๊ตฌ์‚ฌํ•ญ์— ๋”ฐ๋ฅธ ๋ฐ์ดํ„ฐ ๋ชจ๋ธ๋ง๊ณผ ์‹œ์Šคํ…œ ์•„ํ‚คํ…์ฒ˜ ์„ค๊ณ„๊ฐ€ ์„ ํ–‰๋˜์—ˆ๋Š”๊ฐ€ - ๋ฐ์ดํ„ฐ์˜ ์ƒ๋ช…์ฃผ๊ธฐ(Lifecycle)์— ๋”ฐ๋ผ ์ ์ ˆํ•œ ์ €์žฅ์†Œ์™€ ์ฒ˜๋ฆฌ ๋„๊ตฌ๊ฐ€ ์„ ํƒ๋˜์—ˆ๋Š”๊ฐ€

์›๋ฌธ ์ฝ๊ธฐ