Tag

Inference Platform

Baidu Geek Talk
Mar 29, 2023 · Cloud Native

Punica: A Cloud‑Native Platform for Content Understanding Inference Services

Punica provides a cloud‑native, one‑stop platform that unifies Baidu’s content‑understanding inference services, automates testing, resource provisioning, and monitoring, and enables unattended, self‑healing operations with dynamic scaling and GPU scheduling, cutting onboarding time by half and reclaiming hundreds of GPUs.

AI inference · Inference Platform · Resource Scheduling
14 min read

58 Tech
Dec 22, 2022 · Artificial Intelligence

Implementing a Cloud-Native Istio Gateway for 58.com Deep Learning Inference Platform

This article details the evolution of 58.com’s deep learning inference platform from the original SCF‑based architecture to a cloud‑native Istio gateway (architecture 2.0), and explains the design choices, traffic management, adaptive rate limiting, observability, model pre‑warming, and performance improvements.

AI · Inference Platform · cloud-native
22 min read

DataFunSummit
Jun 30, 2022 · Artificial Intelligence

MLOps Practices on the Beike Inference Platform: Architecture, Evolution, and Future Plans

This article presents a comprehensive overview of Beike's machine learning platform and its inference service, detailing the platform's architecture, GPU virtualization, cloud‑native migration, MLOps implementation, and future roadmap to achieve cost‑effective, automated AI model deployment at scale.

AI · GPU virtualization · Inference Platform
13 min read