Tag

AI workloads

1 view collected around this technical thread.

ByteDance Cloud Native
Aug 12, 2024 · Cloud Native

How mGPU Enables Efficient GPU Sharing for AI Workloads in Cloud‑Native Environments

The article explains the mGPU solution from Volcano Engine, detailing its kernel‑level GPU virtualization, container runtime hooks, and scheduling mechanisms that allow multiple containers to share a single NVIDIA GPU with isolated compute and memory resources, achieving near‑lossless performance and up to 50% higher utilization for AI tasks.

AI workloads · GPU sharing · Resource Isolation
9 min read
Baidu Geek Talk
Aug 2, 2023 · Cloud Native

Baidu Intelligent Cloud GPU Container Virtualization 2.0: Advancements and Full-Scenario Practices

Baidu Intelligent Cloud’s GPU Container Virtualization 2.0 combines user‑mode and kernel‑mode isolation in a dual‑engine design that unifies scheduling of AI compute, rendering and encoding, supports mixed deployment and multi‑scheduler integration, and boosts GPU utilization across inference, offline tasks, autonomous‑driving simulation, and cloud‑gaming workloads.

AI workloads · Container Orchestration · GPU virtualization
14 min read