Tech Week Singapore 2025

Loading

From GPU Waste to Smart Scaling: Building Cost-Effective Private AI Infrastructure

08 Oct 2025
Productivity Optimisation & AI Adoption Theatre
From GPU Waste to Smart Scaling: Building Cost-Effective Private AI Infrastructure

While GPU hardware dominates AI infrastructure costs, most private deployments suffer from chronically low utilization rates due to static resource allocation. This session demonstrates how open-source elastic inference technology transforms GPU pools to serve multiple models dynamically, significantly reducing infrastructure costs while maintaining production-grade performance.

Speaker(s)
Yanzhen Yu, R&D Manager - Arcfra

2025 Sponsors

Platinum Sponsors



 

Silver Sponsor



 

Bronze Sponsor



 

VIP Lunch Partner




 

2025 Partners

Association Partner


 

Association Partner


 

Association Partner


 

Association Partner


 

Association Partner


 

Media Partner


 

Media Partner


 

Media Partner


 

Media Partner


 

Media Partner