Tag

AI serving

1 views collected around this technical thread.

Youzan Coder
Youzan Coder
Jan 17, 2022 · Artificial Intelligence

Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution

The team replaced the cumbersome ABox deployment stack with Seldon‑based cloud‑native serving on Kubernetes, unifying TensorFlow and other framework models, adding GPU sharing, automated CRUD, per‑model ingress, monitoring, and log collection, achieving scalable, fault‑tolerant, zero‑downtime model deployment.

AI servingCloud NativeGPU
0 likes · 11 min read
Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution
Tencent Cloud Developer
Tencent Cloud Developer
Apr 15, 2019 · Artificial Intelligence

Serverless AI Inference with TensorFlow Serving on Tencent Cloud SCF

This tutorial shows how to package a TensorFlow SavedModel for MNIST, upload it to Tencent Cloud Object Storage, create a Python 2.7 SCF function that loads the model with TensorFlow Serving, and expose it via API Gateway as a scalable, server‑less AI inference endpoint.

AI servingAPI gatewayCloud Functions
0 likes · 14 min read
Serverless AI Inference with TensorFlow Serving on Tencent Cloud SCF