[GitOps-based E2E ML Platform - Load Test] Triton + FastAPI Serving Performance Verified with k6: 136 RPS, p95 553ms

What this post covers: measuring Triton + FastAPI serving performance with a k6 load test: 136 RPS, p95 553ms, 0% error rate (CPU-only 3-node cluster).

Prerequisites: GitOps-based E2E ML Platform - Operations Documentation

Load Test: Verifying Serving Performance. How much can it actually withstand?

Introduction

So far in this series, we have confirmed the following:

Triton READY
Model Repository Loaded
FastAPI Health OK
Reload API Success
Metrics Exported

But one more important question remains: when real traffic arrives, does this system hold up? ...

March 18, 2026 · 5 min