Baidu Geek Talk
Author

Baidu Geek Talk

Follow us to discover more Baidu tech insights.

515
Articles
0
Likes
1.1k
Views
0
Comments
Recent Articles

Latest from Baidu Geek Talk

100 recent articles max
Baidu Geek Talk
Baidu Geek Talk
Mar 5, 2025 · Cloud Computing

Inside GPU Cloud Servers: Architecture, Interconnects, and Performance Secrets

This article provides a comprehensive technical overview of GPU cloud server design, covering data‑processing pipelines, hardware topology, NUMA considerations, PCIe and proprietary interconnects, multi‑GPU communication strategies, virtualization approaches (BCC and BBC), DPU acceleration, and future trends for scaling up and out.

Cloud ComputingGPUPerformance Optimization
0 likes · 27 min read
Inside GPU Cloud Servers: Architecture, Interconnects, and Performance Secrets
Baidu Geek Talk
Baidu Geek Talk
Mar 3, 2025 · Big Data

Real-Time Anti-Cheat Streaming System Based on Flink: Architecture, Challenges, and Solutions

The article details a Flink‑based real‑time anti‑cheat streaming architecture that combines tumbling, sliding and session windows with early triggers, batch state updates cached in memory, coarse‑grained key reduction, and YAML‑driven strategy configuration to deliver millisecond‑level detection, seamless integration with ClickHouse, Hive, Redis and message queues, and self‑service analytics, achieving high throughput, low latency, and robust stability for large‑scale risk control.

Configuration ManagementFlinkPerformance Optimization
0 likes · 25 min read
Real-Time Anti-Cheat Streaming System Based on Flink: Architecture, Challenges, and Solutions
Baidu Geek Talk
Baidu Geek Talk
Feb 26, 2025 · Artificial Intelligence

Baidu Cloud's DeepSeek Integrated Solutions for Enterprise AI Transformation

Baidu Cloud’s new DeepSeek integrated machine suite—Bai Ge, Qian Fan and Yi Jian—combines domestic Kunlun P800 chips with one‑click private deployment of full DeepSeek models, delivering high cost‑performance, security compliance and end‑to‑end AI capabilities for diverse industry digital transformation.

AI transformationDeepSeek ModelEnterprise Solutions
0 likes · 8 min read
Baidu Cloud's DeepSeek Integrated Solutions for Enterprise AI Transformation
Baidu Geek Talk
Baidu Geek Talk
Feb 24, 2025 · Artificial Intelligence

Using a Graph Engine to Drive Workflow for Intelligent Agents

By leveraging mature graph‑engine technology, the article shows how visual, low‑code workflow orchestration can give intelligent LLM‑based agents fine‑grained path control, reusable functions, hierarchical sub‑flows, and robust error handling, turning complex business tasks into modular, scalable processes adopted by hundreds of thousands of developers.

AI agentsLLMLow‑code
0 likes · 18 min read
Using a Graph Engine to Drive Workflow for Intelligent Agents
Baidu Geek Talk
Baidu Geek Talk
Feb 19, 2025 · Frontend Development

Technical Practice of Baidu Live‑Streaming Interactive Framework: Performance and Stability Optimization

Baidu live streaming interactive framework optimized performance and stability for music+red‑packet activities, using component reuse, page pre‑static generation, SSR, ISR, prefetching, view prerender, fallback mechanisms, and animation downgrade, achieving first‑screen load reductions to 0.5 s and delivering a reusable solution for large‑scale live events.

Front-end ArchitecturePerformance OptimizationSSR
0 likes · 16 min read
Technical Practice of Baidu Live‑Streaming Interactive Framework: Performance and Stability Optimization
Baidu Geek Talk
Baidu Geek Talk
Feb 17, 2025 · Operations

How Baidu Netdisk Prevents Service Avalanches: Dynamic Circuit Breaking & Queue Control

This article analyzes Baidu Netdisk's anti‑avalanche architecture, explaining how avalanche cascades occur in high‑concurrency services and detailing practical prevention, blocking, and mitigation techniques such as dynamic circuit breaking, traffic isolation, request‑validity checks, and socket‑level detection to maintain system reliability.

Backend ArchitectureCircuit BreakingDynamic Throttling
0 likes · 18 min read
How Baidu Netdisk Prevents Service Avalanches: Dynamic Circuit Breaking & Queue Control
Baidu Geek Talk
Baidu Geek Talk
Feb 12, 2025 · Artificial Intelligence

Deploy DeepSeek, Llama, Qwen Models Fast on Baidu Baige AI Heterogeneous Platform

This guide walks you through creating a lightweight compute instance, adding it to Baidu Baige AI heterogeneous computing platform, deploying the vLLM tool, loading and serving small‑scale dense models such as DeepSeek, Llama and Qwen, and provides recommended configuration lists to achieve low‑cost, high‑performance inference.

AI model deploymentBaidu BaigeCloud AI
0 likes · 3 min read
Deploy DeepSeek, Llama, Qwen Models Fast on Baidu Baige AI Heterogeneous Platform
Baidu Geek Talk
Baidu Geek Talk
Feb 10, 2025 · Artificial Intelligence

How Baidu Cloud Slashes Inference Costs: DeepSeek Model Optimizations Unveiled

Baidu Cloud's Qianfan platform launched DeepSeek‑R1 and DeepSeek‑V3 with ultra‑low inference pricing, leveraging advanced engine performance tweaks, a split Prefill/Decode architecture, and comprehensive security measures that together boost throughput, cut costs, and ensure enterprise‑grade reliability.

AI inferenceBaidu CloudModel Serving
0 likes · 5 min read
How Baidu Cloud Slashes Inference Costs: DeepSeek Model Optimizations Unveiled
Baidu Geek Talk
Baidu Geek Talk
Feb 5, 2025 · Artificial Intelligence

How to Unlock Full GPU Efficiency for Enterprise AI Platforms

This article analyzes common GPU efficiency problems in enterprise AI compute platforms—such as low utilization, long fault‑resolution times, and limited performance gains—and presents three practical solutions: dynamic resource allocation, systematic fault‑tolerance, and system‑level tuning, illustrated with real‑world case studies.

AI PlatformGPU utilizationlarge model training
0 likes · 11 min read
How to Unlock Full GPU Efficiency for Enterprise AI Platforms
Baidu Geek Talk
Baidu Geek Talk
Jan 22, 2025 · Mobile Development

iOS Sandbox Disk Management and Cleaning Strategies

The article explains iOS sandbox storage by detailing the four main directories, their backup rules, naming conventions, and retrieval APIs, then outlines how to calculate physical file size and implements both automatic quota‑based and manual user‑driven cleaning methods, including system cache removal for tmp, WKWebView, and dyld caches.

Cache CleaningObjective‑Cdisk-management
0 likes · 22 min read
iOS Sandbox Disk Management and Cleaning Strategies