Kubernetes is quickly gaining popularity for managing ML workflows with its load balancing, automatic scaling, and performance optimization features. However, when deploying at an enterprise level, multi-region operations become a crucial consideration for disaster tolerance and performance optimization. This talk will explore the challenges of managing Kubernetes clusters across multiple continents, and showcase real-world examples of MLOps at scale.

Technical level: Advanced