Tech Week Singapore 2025

Loading

Scaling AI and Autonomous Agents: A Production Platform on Kubernetes

08 Oct 2025
DevOps & Platform Engineering Theatre

As enterprises increasingly integrate AI into their applications and develop autonomous agents, deploying Large Language Models demands a robust, scalable infrastructure. This talk explores a production-grade AI platform built on Amazon EKS, demonstrating how Kubernetes orchestrates complex AI workloads while ensuring operational excellence. We'll showcase an innovative architecture that leverages vLLM for high-performance inference, LiteLLM for unified access management, and Langfuse for end-to-end observability. The platform effectively addresses hybrid deployment requirements, resolving critical challenges in privacy, GPU optimization, and latency. Through EKS Auto Mode and intelligent scaling mechanisms, organizations can shift their focus from infrastructure management to innovation. Whether you're deploying basic language models or sophisticated AI agents with advanced reasoning capabilities, this architecture provides a solid foundation for your AI journey on Kubernetes.

Speaker(s)
Eng-Hwa Tan, Principal Solution Architect (AppMod) - Amazon Web Services

2025 Sponsors

Platinum Sponsors



 

Silver Sponsor



 

Bronze Sponsor



 

VIP Lunch Partner




 

2025 Partners

Association Partner


 

Association Partner


 

Association Partner


 

Association Partner


 

Association Partner


 

Media Partner


 

Media Partner


 

Media Partner


 

Media Partner


 

Media Partner