[GitOps ๊ธฐ๋ฐ˜ E2E ML Platform - ์šด์˜ ๊ฒฝ๊ณ„]

์ด ๊ธ€์—์„œ ๋‹ค๋ฃจ๋Š” ๊ฒƒ Core / Baseline / Optional ์„ธ ๋ ˆ์ด์–ด์˜ ์„ค๊ณ„ ๊ธฐ์ค€๊ณผ ์‹ค์ œ GitOps ๋ฐฐํฌ ๊ตฌ์กฐ์—์„œ์˜ ๊ตฌํ˜„ ์„ ์ˆ˜์ง€์‹ GitOps ๊ธฐ๋ฐ˜ E2E ML Platform - ์„ค๊ณ„ ์˜๋„ Core / Baseline / Optional ์šด์˜ ๊ฐ€๋Šฅํ•œ ๊ฒฝ๊ณ„๋ฅผ ๋จผ์ € ์„ค๊ณ„ ๋“ค์–ด๊ฐ€๋ฉฐ ML ํ”Œ๋žซํผ์„ ์„ค๊ณ„ํ•  ๋•Œ ํ”ํžˆ ํ•˜๋Š” ์ ‘๊ทผ์€ ํ•„์š”ํ•œ ๋„๊ตฌ๋ฅผ ํ•˜๋‚˜์”ฉ ๋ถ™์—ฌ ๋‚˜๊ฐ€๋Š” ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ๋‹ค์Œ๊ณผ ๊ฐ™์€ ์‹์ž…๋‹ˆ๋‹ค. Airflow ์„ค์น˜ MLflow ์ถ”๊ฐ€ Triton ์ถ”๊ฐ€ Monitoring ์ถ”๊ฐ€ Feature Store ์ถ”๊ฐ€ ์ด ๋ฐฉ์‹์€ ์ฒ˜์Œ์—๋Š” ๋น ๋ฅด๊ฒŒ ์‹œ์Šคํ…œ์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ...

March 6, 2026 ยท 5 min

[GitOps ๊ธฐ๋ฐ˜ E2E ML Platform - ์„ค๊ณ„ ์˜๋„]

์ด ๊ธ€์—์„œ ๋‹ค๋ฃจ๋Š” ๊ฒƒ GitOps ๊ธฐ๋ฐ˜ E2E ML Platform์˜ 4๊ฐ€์ง€ ์„ค๊ณ„ ์›์น™๊ณผ ์‹œ๋ฆฌ์ฆˆ ์ „์ฒด ๊ตฌ์กฐ ์„ ์ˆ˜์ง€์‹ ์ด ๊ธ€๋ถ€ํ„ฐ ์‹œ์ž‘ ๊ฐ€๋Šฅ (์‹œ๋ฆฌ์ฆˆ ์ฒซ ๊ธ€) Kubernetes, ArgoCD ๊ธฐ๋ณธ ๊ฐœ๋…์„ ์•Œ๋ฉด ๋” ์ข‹์Œ GitOps ๊ธฐ๋ฐ˜ E2E ML Platform ์„ค๊ณ„ ์˜๋„ ๋“ค์–ด๊ฐ€๋ฉฐ ๋จธ์‹ ๋Ÿฌ๋‹ ํ”„๋กœ์ ํŠธ๋ฅผ ์ฒ˜์Œ ์ ‘ํ•˜๋ฉด ๋ณดํ†ต ์ด๋Ÿฐ ํ๋ฆ„์œผ๋กœ ์‹œ์ž‘ํ•ฉ๋‹ˆ๋‹ค. ๋ฐ์ดํ„ฐ๋ฅผ ์ค€๋น„ํ•œ๋‹ค ๋ชจ๋ธ์„ ํ•™์Šตํ•œ๋‹ค ๋ชจ๋ธ์„ ๋ฐฐํฌํ•œ๋‹ค ์ด ํ๋ฆ„์€ ํ•™์Šต์ด๋‚˜ ์‹คํ—˜์—๋Š” ์ถฉ๋ถ„ํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ์šด์˜ ํ™˜๊ฒฝ์—์„œ๋Š” ์ด์•ผ๊ธฐ๊ฐ€ ์กฐ๊ธˆ ๋‹ฌ๋ผ์ง‘๋‹ˆ๋‹ค. ์šด์˜ ํ™˜๊ฒฝ์—์„œ๋Š” ๋‹ค์Œ ์งˆ๋ฌธ์ด ๋จผ์ € ๋“ฑ์žฅํ•ฉ๋‹ˆ๋‹ค. ...

March 6, 2026 ยท 4 min

[Feature Store & Feast - Feast]

์ด ๊ธ€์—์„œ ๋‹ค๋ฃจ๋Š” ๊ฒƒ Feature Store-lite ์œ„์— Feast๋ฅผ ์–น์–ด, S3 Offline + Redis Online + Feature Server ๊ตฌ์„ฑ์œผ๋กœ โ€œ์ €์žฅํ•˜๋Š” ํŒŒ์ดํ”„๋ผ์ธ"์„ โ€œ์กฐํšŒ ๊ฐ€๋Šฅํ•œ ํ”ผ์ฒ˜ ํ”Œ๋žซํผ"์œผ๋กœ ํ™•์žฅํ•˜๋Š” ๊ณผ์ • ์„ ์ˆ˜์ง€์‹ Feature Store & Feast - Feature Store-lite ์ด ๋‹จ๊ณ„์—์„œ ํ•ด๊ฒฐํ•˜๋ ค๋Š” ๋ฌธ์ œ ์ด์ „ ๊ธ€(Feature Store-lite)์—์„œ ๊ณ„์•ฝ(์Šคํ‚ค๋งˆ/๋ฉ”ํƒ€) + ๋ฒ„์ „ํ™” ์ €์žฅ + ์žฌํ˜„์„ฑ๊นŒ์ง€ ๊ณ ์ •ํ–ˆ์Šต๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ์‹ค๋ฌด์—์„œ๋Š” โ€œ์ €์žฅ"์—์„œ ๋๋‚˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. ๊ฒฐ๊ตญ ์ค‘์š”ํ•œ ๊ฑด ์กฐํšŒ ๊ฐ€๋Šฅ(Serving-ready) ์ƒํƒœ์ž…๋‹ˆ๋‹ค. ์ด๋ฒˆ ๊ธ€์€ Feature Store-lite ์œ„์— Feast๋ฅผ ์–น์–ด ์•„๋ž˜๋ฅผ ์™„์„ฑํ•ฉ๋‹ˆ๋‹ค. Offline Source: S3์˜ latest/features.parquet (Feast๊ฐ€ ์ฝ๋Š” ๊ณ ์ • ํฌ์ธํ„ฐ) Registry: S3์— registry.pb ์ €์žฅ(ํ™˜๊ฒฝ๋ณ„ ๋ถ„๋ฆฌ) Online Store: Redis ์ ์žฌ(materialize)๋กœ ์˜จ๋ผ์ธ ์กฐํšŒ ๊ฐ€๋Šฅ Feature Server: ์ƒ์‹œ ์„œ๋น„์Šค + startup ์‹œ feast apply ์ฆ‰, โ€œ์ €์žฅํ•˜๋Š” ํŒŒ์ดํ”„๋ผ์ธโ€ -> โ€œ์กฐํšŒ ๊ฐ€๋Šฅํ•œ ํ”ผ์ฒ˜ ํ”Œ๋žซํผ"์œผ๋กœ ํ™•์žฅํ•˜๋Š” ๋‹จ๊ณ„์ž…๋‹ˆ๋‹ค. ...

January 15, 2026 ยท 4 min

[Triton ์šด์˜ํ˜• ์„œ๋น™ ํ”Œ๋žซํผ (GitOps ยท ๊ฒ€์ฆ ยท Alerting) - ๊ฒ€์ฆ]

๐Ÿงญ ๋ชฉ์ฐจ ๊ตฌ๋ถ„ ์ฆ๋ช… ํ•ต์‹ฌ A. GitOps ๋ถ„๋ฆฌ Triton dev/prod ๋…๋ฆฝ ๋ฐฐํฌ ๋ฐ ์ƒํƒœ ๊ณ ์ • B. ๋ชจ๋ธ ์ œ์–ด NFS model-repo ๋ถ„๋ฆฌ + explicit load ํ†ต์ œ C. ์„œ๋น™ ๊ฒ€์ฆ load โ†’ ready โ†’ infer E2E ์„ฑ๊ณต D. ๊ด€์ธก ๊ฐ€๋Šฅ์„ฑ /metrics โ†’ Prometheus โ†’ Grafana ์—ฐ๊ณ„ E. ๋ฐฐํฌ ํ†ต์ œ MLflowโ†’Airflow ๊ฒ€์ฆ ์ฒด์ธ + commit/rollback F. ์•Œ๋Ÿฟ ๋ถ„๋ฆฌ Alertmanager null default ๊ธฐ๋ฐ˜ dev/prod ๋ถ„๋ฆฌ G. ์•Œ๋Ÿฟ ์‹ค์ฆ Triton latency ์•Œ๋Ÿฟ E2E ๋™์ž‘ A. Triton GitOps & Dev/Prod ๋ถ„๋ฆฌ 1๏ธโƒฃ ArgoCD Applications (GitOps ๊ธฐ์ค€) โœ” dev/prod ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜ ์ƒํƒœ ์ฆ๋ช… ...

January 7, 2026 ยท 4 min

[Triton ์šด์˜ํ˜• ์„œ๋น™ ํ”Œ๋žซํผ (GitOps ยท ๊ฒ€์ฆ ยท Alerting) - ์—ํ•„๋กœ๊ทธ]

์—ํ•„๋กœ๊ทธ โ€” โ€œGitOps ๊ธฐ๋ฐ˜ Triton ์„œ๋น™์ด โ€˜๋ฐฐํฌโ†’๊ฒ€์ฆโ†’๊ด€์ธกโ†’์•Œ๋ฆผโ€™ ๋ฃจํ”„๋กœ ๊ณ ์ •โ€ ๐Ÿ“Œ ์ „์ฒด ๊ฒฝ๋กœ ์š”์•ฝ ์ˆœ์„œ ์ฃผ์ œ 1 ๐Ÿ”— Triton (CPU-only) GitOps ํ†ตํ•ฉ: ONNX 1๊ฐœ ์„œ๋น™ + Prometheus/Grafana ๊ด€์ธก 2 ๐Ÿ”— MLflow โ†’ Triton ์ž๋™ ๋ฐฐํฌ ํŒŒ์ดํ”„๋ผ์ธ ๊ตฌ์ถ• (Airflow ยท ๊ฒ€์ฆ ์ฒด์ธยท ์ตœ์†Œ ๋กค๋ฐฑ) 3 ๐Ÿ”— Alerting ์šด์˜ ํ‘œ์ค€ ๋งค๋‰ด์–ผ (Dev/Prod ๋ถ„๋ฆฌ + Triton Serving Alerts) 4 ๐Ÿ”— Triton ์šด์˜ํ˜• ์„œ๋น™ ํ”Œ๋žซํผ (GitOps ยท ๊ฒ€์ฆ ยท Alerting) - ๊ฒ€์ฆ ๐ŸŽฏ ์ „์ฒด ํšŒ๊ณ  ์š”์•ฝ ๋‹จ๊ณ„ ํ•ต์‹ฌ ๋ชฉํ‘œ ์ฃผ์š” ๊ฐœ์„ ์  1 Triton ์„œ๋น™ ๊ธฐ๋ฐ˜ GitOps ๋ถ„๋ฆฌ ยท explicit load ยท ๊ด€์ธก 2 ๋ฐฐํฌ ์ž๋™ํ™” MLflow ๋‹จ์ผ ์†Œ์Šค ยท ๊ฒ€์ฆ ๊ธฐ๋ฐ˜ commit/rollback 3 ์•Œ๋žŒ ์šด์˜ null default ยท namespace ๋ผ์šฐํŒ… ยท latency ์•Œ๋Ÿฟ ๐Ÿ”„ ํ•ต์‹ฌ ๋ฌธ์žฅ: ...

January 5, 2026 ยท 3 min

[Triton ์šด์˜ํ˜• ์„œ๋น™ ํ”Œ๋žซํผ (GitOps ยท ๊ฒ€์ฆ ยท Alerting) - Alerting ์šด์˜ ํ‘œ์ค€ ๋งค๋‰ด์–ผ]

์ด ๊ธ€์—์„œ ๋‹ค๋ฃจ๋Š” ๊ฒƒ Triton ์„œ๋น™ ํ™˜๊ฒฝ์—์„œ dev/prod ์•Œ๋Ÿฟ์„ ์™„์ „ํžˆ ๋ถ„๋ฆฌํ•˜๊ณ , PrometheusRule/Alertmanager/Grafana๋ฅผ ํ•˜๋‚˜์˜ ํŒ๋‹จ ํ๋ฆ„์œผ๋กœ ๊ณ ์ •ํ•˜๋Š” GitOps ๊ธฐ๋ฐ˜ Alerting ์šด์˜ ํ‘œ์ค€ ์„ค๊ณ„ ์„ ์ˆ˜์ง€์‹ Triton ์„œ๋น™ ํ”Œ๋žซํผ - MLflow โ†’ Triton ์ž๋™ ๋ฐฐํฌ ํŒŒ์ดํ”„๋ผ์ธ ์ด ๋‹จ๊ณ„์—์„œ ํ•ด๊ฒฐํ•˜๋ ค๋Š” ๋ฌธ์ œ Observability๋Š” ๋Œ€์‹œ๋ณด๋“œ๊ฐ€ ์•„๋‹ˆ๋ผ, ์‚ฌ๊ณ ๋ฅผ ๋ง‰๋Š” ์šด์˜ ์ •์ฑ…์ž…๋‹ˆ๋‹ค. ์ด ๋ฌธ์„œ๋Š” dev/prod ์•Œ๋Ÿฟ์„ ์™„์ „ํžˆ ๋ถ„๋ฆฌํ•˜๊ณ , ๋ผ๋ฒจ ์‹ค์ˆ˜๋กœ ์ธํ•œ ๊ต์ฐจ ์ „์†ก๊นŒ์ง€ ๊ตฌ์กฐ์ ์œผ๋กœ ์ฐจ๋‹จํ•˜๋ฉฐ, Triton ์„œ๋น™ ํ’ˆ์งˆ์„ ๋ชจ๋ธ ์‹คํ–‰ ๊ด€์ ์—์„œ ๊ฐ์ง€ํ•˜๋„๋ก ์„ค๊ณ„๋œ GitOps ๊ธฐ๋ฐ˜ Alerting ์šด์˜์ž…๋‹ˆ๋‹ค. ...

January 2, 2026 ยท 7 min

[Triton ์šด์˜ํ˜• ์„œ๋น™ ํ”Œ๋žซํผ (GitOps ยท ๊ฒ€์ฆ ยท Alerting) - MLflow โ†’ Triton ์ž๋™ ๋ฐฐํฌ ํŒŒ์ดํ”„๋ผ์ธ ๊ตฌ์ถ•]

์ด ๊ธ€์—์„œ ๋‹ค๋ฃจ๋Š” ๊ฒƒ MLflow Registry๋ฅผ ๋‹จ์ผ ์†Œ์Šค๋กœ ์‚ผ์•„ Airflow DAG์—์„œ Triton์— ๋ชจ๋ธ์„ ์ž๋™ ๋ฐฐํฌํ•˜๊ณ , ๊ฒ€์ฆ ์ฒด์ธ(load/ready/infer) ํ†ต๊ณผ ํ›„์—๋งŒ ์šด์˜ ํ™•์ •ํ•˜๋ฉฐ, ์‹คํŒจ ์‹œ ์ž๋™ ๋กค๋ฐฑํ•˜๋Š” ํŒŒ์ดํ”„๋ผ์ธ์„ ๊ตฌ์ถ•ํ•œ ๊ณผ์ • ์„ ์ˆ˜์ง€์‹ Triton ์„œ๋น™ ํ”Œ๋žซํผ - Triton ๊ตฌ์ถ• ์ด ๋‹จ๊ณ„์—์„œ ํ•ด๊ฒฐํ•˜๋ ค๋Š” ๋ฌธ์ œ ์‹ค๋ฌด ํ™˜๊ฒฝ์—์„œ ๋ชจ๋ธ ๋ฐฐํฌ๋Š” โ€œ์ƒˆ ๋ชจ๋ธ์„ ์˜ฌ๋ฆฌ๋Š” ์ž‘์—…"์ด ์•„๋‹ˆ๋ผ โ€œํ˜„์žฌ ์šด์˜ ์ƒํƒœ๋ฅผ ์•ˆ์ „ํ•˜๊ฒŒ ๊ฐฑ์‹ ํ•˜๋Š” ์ƒํƒœ ์ „์ด(State Transition)โ€œ์— ๊ฐ€๊น๋‹ค. ์ด๋ฒˆ ๋‹จ๊ณ„์—์„œ๋Š” MLflow Registry๋ฅผ ๋‹จ์ผ ์†Œ์Šค๋กœ ์‚ผ์•„ Triton Inference Server์— ๋ชจ๋ธ์„ ์ž๋™ ๋ฐฐํฌํ•˜๊ณ , ๋กœ๋”ฉ/ํ—ฌ์Šค ์ฒดํฌ/์‹ค์ œ ์ถ”๋ก  ๊ฒ€์ฆ์„ ๋ชจ๋‘ ํ†ต๊ณผํ•œ ๊ฒฝ์šฐ์—๋งŒ ์šด์˜ ๋ชจ๋ธ์„ ํ™•์ •(commit)ํ•˜๋ฉฐ, ์ค‘๊ฐ„ ๋‹จ๊ณ„์—์„œ ํ•˜๋‚˜๋ผ๋„ ์‹คํŒจํ•˜๋ฉด ์ด์ „ ์šด์˜ ์ƒํƒœ๋กœ ์ž๋™ ๋ณต๊ตฌ๋˜๋Š” ์ตœ์†Œ ๋กค๋ฐฑ ๊ตฌ์กฐ๋ฅผ ๊ตฌํ˜„ํ–ˆ๋‹ค. ...

December 29, 2025 ยท 4 min

[Triton ์šด์˜ํ˜• ์„œ๋น™ ํ”Œ๋žซํผ (GitOps ยท ๊ฒ€์ฆ ยท Alerting) - Triton ๊ตฌ์ถ•]

์ด ๊ธ€์—์„œ ๋‹ค๋ฃจ๋Š” ๊ฒƒ Triton Inference Server๋ฅผ CPU-only GitOps ๊ตฌ์กฐ๋กœ ๋ฐฐํฌํ•˜๊ณ , ONNX ๋ชจ๋ธ 1๊ฐœ์˜ load/infer ๊ฒ€์ฆ ๋ฐ Prometheus/Grafana ๊ด€์ธก๊นŒ์ง€ ์„œ๋น™ ํ”Œ๋žซํผ ๋ผˆ๋Œ€๋ฅผ ๊ตฌ์ถ•ํ•œ ๊ณผ์ • ์„ ์ˆ˜์ง€์‹ Observability 8๋‹จ๊ณ„: Data Pipeline ๊ณ ๋„ํ™” ์ด ๋‹จ๊ณ„์—์„œ ํ•ด๊ฒฐํ•˜๋ ค๋Š” ๋ฌธ์ œ ์‹ค๋ฌด์—์„œ ์„œ๋น™ ๊ณ„์ธต์€ ๊ณง๋ฐ”๋กœ ํŠธ๋ž˜ํ”ฝ๊ณผ SLA๋ฅผ ๋งž๋Š” ์ตœ์ „์„ ์ด๋‹ค. ๋ชจ๋ธ์ด ์•„๋ฌด๋ฆฌ ์ข‹์•„๋„ ์„œ๋น™์ด ๋ถˆ์•ˆ์ •ํ•˜๋ฉด ์šด์˜ ์‹œ ๋ฐ”๋กœ ๋ฌด๋„ˆ์ง„๋‹ค. ์ด๋ฒˆ์—๋Š” Triton ์ฒซ ๊ตฌ์ถ•์œผ๋กœ GPU/ํŒŒ์ดํ”„๋ผ์ธ ์—ฐ๋™์„ ์ผ๋ถ€๋Ÿฌ ๋นผ๊ณ , Triton ์ž์ฒด๋ฅผ GitOps๋กœ ์•ˆ์ •์ ์œผ๋กœ ๋„์šฐ๊ณ , ๋ชจ๋ธ load โ†’ infer โ†’ metrics ๊ด€์ธก๊นŒ์ง€ ์„œ๋น™ ํ”Œ๋žซํผ ๋ผˆ๋Œ€ ๊ตฌ์ถ•์„ ์ง„ํ–‰ํ–ˆ๋‹ค. ...

December 26, 2025 ยท 4 min

[MLOps ํ”Œ๋žซํผ Observability & Data Pipeline - ๊ฒ€์ฆ]

๐Ÿงญ ๋ชฉ์ฐจ ๊ตฌ๋ถ„ ์ฆ๋ช… ํฌ์ธํŠธ A. Observability ๋ฉ”ํŠธ๋ฆญยท๋กœ๊ทธยท์•Œ๋žŒ dev/prod ์™„์ „ ๋ถ„๋ฆฌ B. FastAPI & Platform Observability FastAPI + Platform ๋Œ€์‹œ๋ณด๋“œ ์ •์ƒ ๋™์ž‘ C. Data Pipeline Rawโ†’Feature ETL ์ž๋™ ์‹คํ–‰ ์„ฑ๊ณต D. Data Pipeline Advanced ๋ฒ„์ „ยท์Šคํ‚ค๋งˆยท๋ฉ”ํƒ€๋ฐ์ดํ„ฐยท๊ด€์ธก ๋“ฑ ์šด์˜ํ˜• ๊ตฌ์กฐ A. Observability ๊ณ„์ธต Observability ๊ณ„์ธต(๋ชจ๋‹ˆํ„ฐ๋ง + ๋กœ๊ทธ ์ˆ˜์ง‘ + ์•Œ๋žŒ)์ด GitOps ๊ธฐ๋ฐ˜์œผ๋กœ dev/prod ์™„์ „ ๋ถ„๋ฆฌ + ์ž๋™ํ™” ๋˜์–ด ์žˆ์Œ์„ ์ฆ๋ช…ํ•ฉ๋‹ˆ๋‹ค. 1๏ธโƒฃ ArgoCD Applications (GitOps ๊ธฐ๋ฐ˜ ๊ตฌ์„ฑ) โœ” 1-1. CLI๋กœ ์ „์ฒด Application ์ƒํƒœ ํ™•์ธ kubectl -n argocd get applications ...

November 30, 2025 ยท 11 min

[MLOps ํ”Œ๋žซํผ Observability & Data Pipeline - ์—ํ•„๋กœ๊ทธ]

์—ํ•„๋กœ๊ทธ โ€” โ€œ๊ด€์ธกยท์•Œ๋ฆผยท๋ฐ์ดํ„ฐ๊นŒ์ง€ ํ•œ ๋ฒˆ์— ์ด์–ด์ง€๋Š” MLOps ์šด์˜ ํ”Œ๋žซํผโ€ ๐Ÿ“Œ ์ „์ฒด ๊ฒฝ๋กœ ์š”์•ฝ ์ˆœ์„œ ์ฃผ์ œ 1 ๐Ÿ”— kube-prometheus-stack + GitOps ๊ตฌ์ถ• 2 ๐Ÿ”— Alertmanager Slack & ํŠธ๋Ÿฌ๋ธ”์ŠˆํŒ… 3 ๐Ÿ”— Prometheus/KSM/Kubelet ์™„์ „ ๋ถ„๋ฆฌ ๊ตฌ์กฐ 4 ๐Ÿ”— Loki/Promtail ๋กœ๊ทธ ํŒŒ์ดํ”„๋ผ์ธ ๊ตฌ์ถ• 5 ๐Ÿ”— ์šด์˜ ์ค‘ ๋ฐœ์ƒํ•œ ์‹ค์ œ ์ด์Šˆ & ํ•ด๊ฒฐ ๊ณผ์ • 6 ๐Ÿ”— FastAPI Observability Dashboard & Alert Library 7 ๐Ÿ”— Data Pipeline ๊ตฌ์ถ• 8 ๐Ÿ”— Data Pipeline ๊ณ ๋„ํ™” 9 ๐Ÿ”— ๊ฒ€์ฆ ๐ŸŽฏ ์ „์ฒด ํšŒ๊ณ  ์š”์•ฝ ๋‹จ๊ณ„ ํ•ต์‹ฌ ๋ชฉํ‘œ ์ฃผ์š” ๊ฐœ์„ ์  1 ๋ชจ๋‹ˆํ„ฐ๋ง ๋ผˆ๋Œ€ ๊ตฌ์ถ• kube-prometheus-stack dev/prod ์™„์ „ ๋ถ„๋ฆฌ 2 Slack ์•Œ๋žŒ ํ†ตํ•ฉ Alertmanager configSecret ํ‘œ์ค€ํ™” + ์•Œ๋žŒ ๋ผ์šฐํŒ… 3 ๋ฉ”ํŠธ๋ฆญ ๊ต์ฐจ ์ˆ˜์ง‘ ์ œ๊ฑฐ KSM/Kubelet ๋ผ๋ฒจ ํ†ต์ผ, Prometheus TSDB local-path 4 ๋กœ๊ทธ ํŒŒ์ดํ”„๋ผ์ธ ์™„์„ฑ Loki/Promtail dev/prod ๋ถ„๋ฆฌ + LogQL Range Query 5 ์‹ค์ œ ์šด์˜ ์ด์Šˆ ํ•ด๊ฒฐ ๋ผ๋ฒจ/Secret/NFS ๊ถŒํ•œ/NFS unmount ๋ฌธ์ œ ์ผ๊ด„ ํ•ด๊ฒฐ 6 FastAPI/Platform ๊ด€์ธก ๋Œ€์‹œ๋ณด๋“œ FastAPI ์„œ๋น„์Šค + Kubernetes/Node ํ”Œ๋žซํผ ๊ด€์ธก ์ฒด๊ณ„ ์™„์„ฑ 7 Data Pipeline ๊ตฌ์ถ• (v1) S3 Raw โ†’ Feature ETL ์ž๋™ํ™” (Airflow DAG) 8 Data Pipeline ๊ณ ๋„ํ™” (v2) ๋ฒ„์ „ยท์Šคํ‚ค๋งˆยท๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ์ž๋™ํ™” + ๊ด€์ธก ๋Œ€์‹œ๋ณด๋“œ 9 ๊ฒ€์ฆ Observability ๊ณ„์ธต, Grafana ๋Œ€์‹œ๋ณด๋“œ, Datapipeline ๐Ÿ”„ ํ•ต์‹ฌ ๋ฌธ์žฅ: ...

November 20, 2025 ยท 3 min