[GitOps-Based E2E ML Platform - Controlling Promotion to Production]

What this post covers: the overall structure of the E2E pipeline, DAG-level error handling (SLA/on_failure_callback), the SSOT pattern (ids.py/policy.py), Promotion/Shadow branching, and the Rollback structure. Prerequisites: GitOps-Based E2E ML Platform - Toggle Structure. E2E DAG: designed so that a successful training run does not automatically mean promotion to production. Introduction: Many machine learning pipelines take the following form: Train → Register → Deploy. This structure works well enough in an experimental environment. In production, however, several important problems arise. For example, consider the following situation. ...

March 6, 2026 · 7 min

[GitOps-Based E2E ML Platform - Toggle Structure]

What this post covers: verifying that the Core system runs independently while the Optional layer (Feature Store) is actually switched ON and OFF. Prerequisites: GitOps-Based E2E ML Platform - GitOps Structure. Optional ON/OFF: a structure for attaching and detaching extensions non-destructively. Introduction: When you design an ML platform, the following questions come up naturally. Should we attach a Feature Store? Should we add a data pipeline system? Should we bring in an additional experimentation platform? Most of these systems are useful. At the same time, they can bring the following problems: higher platform complexity, more system dependencies, and a wider failure propagation range. For example, if the Feature Store is tightly coupled to the Core system ...

March 6, 2026 · 4 min

[GitOps-Based E2E ML Platform - GitOps Structure]

What this post covers: how to use ArgoCD AppProject and ApplicationSet to structurally enforce the boundaries between the dev/prod environments and the Core/Baseline/Optional layers. Prerequisites: GitOps-Based E2E ML Platform - Operational Boundaries. ArgoCD AppProject / ApplicationSet: enforcing dev/prod and layer boundaries. Introduction: The reasons for using GitOps are usually explained as follows. Git is the Single Source of Truth. Deployment is automated. Change history can be tracked. This explanation is correct, but from an operations perspective the most important role of GitOps lies elsewhere: enforcing deployment boundaries through structure ...

March 6, 2026 · 4 min

[GitOps-Based E2E ML Platform - Operational Boundaries]

What this post covers: the design criteria for the three layers (Core / Baseline / Optional) and how they are implemented in the actual GitOps deployment structure. Prerequisites: GitOps-Based E2E ML Platform - Design Intent. Core / Baseline / Optional: designing operable boundaries first. Introduction: A common approach to designing an ML platform is to bolt on the necessary tools one at a time. For example: install Airflow, add MLflow, add Triton, add Monitoring, add a Feature Store. At first, this approach lets you build a system quickly. ...

March 6, 2026 · 5 min

[GitOps-Based E2E ML Platform - Design Intent]

What this post covers: the four design principles of the GitOps-based E2E ML Platform and the overall structure of the series. Prerequisites: none, you can start here (first post in the series); familiarity with basic Kubernetes and ArgoCD concepts helps. GitOps-Based E2E ML Platform: Design Intent. Introduction: When you first encounter a machine learning project, it usually starts with this flow: prepare the data, train the model, deploy the model. This flow is sufficient for learning or experimentation. In production, however, the story changes somewhat. In production, the following questions come first. ...

March 6, 2026 · 4 min

[Feature Store & Feast - Feast]

What this post covers: layering Feast on top of Feature Store-lite and, with an S3 Offline + Redis Online + Feature Server setup, extending a "pipeline that stores" into a "feature platform that can be queried". Prerequisites: Feature Store & Feast - Feature Store-lite. The problem this stage solves: The previous post (Feature Store-lite) fixed the contract (schema/metadata), versioned storage, and reproducibility. In practice, however, it does not end at "storing". What ultimately matters is a queryable (Serving-ready) state. This post layers Feast on top of Feature Store-lite to complete the following. Offline Source: latest/features.parquet on S3 (the fixed pointer that Feast reads). Registry: registry.pb stored on S3 (separated per environment). Online Store: loading into Redis (materialize) to enable online lookups. Feature Server: an always-on service that runs feast apply at startup. In short, this is the stage that extends a "pipeline that stores" into a "feature platform that can be queried". ...

January 15, 2026 · 4 min

[Feature Store & Feast - Feature Store-lite]

What this post covers: the design of Feature Store-lite, a stage before full Feast adoption that uses GitOps + Airflow to fix the minimum requirements (contract/metadata/versioning) for feature generation, versioning, and reproducibility. Prerequisites: Triton Serving Platform - dynamic_batching + instance_group. The problem this stage solves: A Feature Store succeeds or fails first on operational stability and reproducibility, not on "ML performance". Rather than "the feature.csv made today", the following must always be recorded for operations to work: when it was generated (generated_at), which schema (contract) it was generated with (schema + schema_hash), which source it was generated from (source), which version it was stored as (version), and where the output lives (feature_uri). When considering a Feature Store, the first question is often "Should we start with Feast?", but before any tool, the way features are generated, versioned, and reproduced has to be fixed. ...

January 14, 2026 · 4 min

[Triton Production Serving Platform (GitOps · Validation · Alerting) - Alerting Operations Standard Manual]

What this post covers: a GitOps-based Alerting operations standard for the Triton serving environment that fully separates dev/prod alerts and fixes PrometheusRule/Alertmanager/Grafana into a single decision flow. Prerequisites: Triton Serving Platform - MLflow → Triton Automated Deployment Pipeline. The problem this stage solves: Observability is not a dashboard; it is an operational policy that prevents incidents. This document describes a GitOps-based Alerting operation designed to fully separate dev/prod alerts, structurally block even cross-delivery caused by labeling mistakes, and detect Triton serving quality from the perspective of model execution. ...

January 2, 2026 · 7 min

[Triton Production Serving Platform (GitOps · Validation · Alerting) - Building the MLflow → Triton Automated Deployment Pipeline]

What this post covers: building a pipeline that treats the MLflow Registry as the single source, automatically deploys models to Triton from an Airflow DAG, confirms the production model only after the validation chain (load/ready/infer) passes, and rolls back automatically on failure. Prerequisites: Triton Serving Platform - Building Triton. The problem this stage solves: In a real-world environment, model deployment is less "the task of uploading a new model" and more "a State Transition that safely updates the current production state". In this stage, with the MLflow Registry as the single source, models are automatically deployed to Triton Inference Server; the production model is confirmed (commit) only when loading, the health check, and an actual inference validation all pass; and a minimal rollback structure automatically restores the previous production state if any intermediate step fails. ...

December 29, 2025 · 4 min

[Triton Production Serving Platform (GitOps · Validation · Alerting) - Building Triton]

What this post covers: deploying Triton Inference Server in a CPU-only GitOps structure and building the skeleton of a serving platform, from load/infer validation of a single ONNX model through Prometheus/Grafana observation. Prerequisites: Observability Step 8: Advancing the Data Pipeline. The problem this stage solves: In practice, the serving layer is the front line that directly faces traffic and SLAs. However good the model is, if serving is unstable it collapses as soon as it goes into production. For this first Triton build, GPU and pipeline integration were deliberately left out; Triton itself was brought up stably through GitOps, and the serving platform skeleton was built through model load → infer → metrics observation. ...

December 26, 2025 · 4 min