Baidu Geek Talk
Author

Baidu Geek Talk

Follow us to discover more Baidu tech insights.

515
Articles
0
Likes
1.1k
Views
0
Comments
Recent Articles

Latest from Baidu Geek Talk

100 recent articles max
Baidu Geek Talk
Baidu Geek Talk
Dec 11, 2024 · Artificial Intelligence

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

This article examines how an AI‑powered essay‑grading platform combines PaddleOCR and Baidu's Wenxin large model to automate scoring, generate personalized feedback, and reduce teachers' grading workload by over six times, while improving student learning outcomes across hundreds of Chinese schools.

AIEducation TechnologyPaddleOCR
0 likes · 11 min read
How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform
Baidu Geek Talk
Baidu Geek Talk
Dec 4, 2024 · Artificial Intelligence

AI-Driven Microservice Governance Platform Based on Multi-Agent Architecture

The article introduces Jarvis, an AI-driven microservice governance platform that uses a multi-agent architecture and natural-language dialogue to automate full-process operations such as deployments, rate limiting, and circuit-breaker configuration, while leveraging large language model reasoning for root-cause diagnosis and a data-flywheel that continuously trains lightweight expert models.

AI DevOpsData FlywheelIntelligent Fault Diagnosis
0 likes · 10 min read
AI-Driven Microservice Governance Platform Based on Multi-Agent Architecture
Baidu Geek Talk
Baidu Geek Talk
Nov 25, 2024 · Artificial Intelligence

PP-ShiTuV2: A General Image Recognition Pipeline in PaddleX

PP‑ShiTuV2, a PaddleX pipeline that integrates subject detection, deep feature encoding, and vector retrieval, delivers 91 % recall@1 on AliProducts, surpasses earlier models by over 20 points, runs efficiently on GPU and CPU, and offers simple installation, quick‑start code, and full fine‑tuning support.

Model DeploymentPP-ShiTuV2PaddleX
0 likes · 8 min read
PP-ShiTuV2: A General Image Recognition Pipeline in PaddleX
Baidu Geek Talk
Baidu Geek Talk
Nov 20, 2024 · Artificial Intelligence

Boosting ANN Search with GPU: Inside RAFT’s IVF_INT8 Implementation

This article examines how Baidu and NVIDIA leveraged the open‑source RAFT library to build a GPU‑accelerated approximate nearest neighbor (ANN) retrieval system, detailing algorithm choices, offline indexing, online batch processing, performance results, and practical guidelines for deploying ANN on GPUs.

ANNGPUIVF_INT8
0 likes · 20 min read
Boosting ANN Search with GPU: Inside RAFT’s IVF_INT8 Implementation
Baidu Geek Talk
Baidu Geek Talk
Nov 18, 2024 · Big Data

Optimizing Multi-Dimensional User Count Statistics in Big Data Computing: A Data Tagging Approach

By replacing exponential row expansion with a data‑tagging strategy that encodes dimension combinations and aggregates at the user level, the authors cut Baidu Feed’s multi‑dimensional user‑count computation time from 49 to 14 minutes and shuffle size from 16 TB to 800 GB, enabling scalable analysis across dozens of dimensions for billions of daily users.

Big Data OptimizationHive SQLdata tagging
0 likes · 12 min read
Optimizing Multi-Dimensional User Count Statistics in Big Data Computing: A Data Tagging Approach
Baidu Geek Talk
Baidu Geek Talk
Nov 13, 2024 · Industry Insights

Why Cloud‑Native Data Lakes Are the New Standard for Storage Acceleration

This article analyzes the evolution of data‑lake storage acceleration, compares traditional parallel file systems, object‑storage‑based solutions and modern cache‑enabled architectures, and explains how cloud‑native data lakes address scalability, cost, and performance challenges for AI and big‑data workloads.

AIBig DataCloud Native
0 likes · 24 min read
Why Cloud‑Native Data Lakes Are the New Standard for Storage Acceleration
Baidu Geek Talk
Baidu Geek Talk
Nov 6, 2024 · Cloud Computing

Baidu Canghai Storage Unified Technology Base: Architecture and Evolution of Metadata, Namespace, and Data Layers

Baidu’s Canghai Storage unifies metadata, hierarchical namespace, and data layers into a Meta‑Aware, three‑generation architecture that scales to trillions of metadata items and zettabyte‑scale data, using a distributed transactional KV store, single‑machine‑distributed namespace, and online erasure‑coding micro‑services to deliver high performance, low cost, and seamless scalability.

Big DataDistributed SystemsNewSQL
0 likes · 18 min read
Baidu Canghai Storage Unified Technology Base: Architecture and Evolution of Metadata, Namespace, and Data Layers
Baidu Geek Talk
Baidu Geek Talk
Nov 4, 2024 · Big Data

Why Object Storage Is Replacing HDFS for Modern Data Lakes: Baidu’s 2.0 Acceleration

Data lakes have evolved from HDFS to object storage, addressing resource inefficiency, scalability limits, and operational burdens; Baidu’s Data Lake Storage Acceleration 2.0 introduces hierarchical Namespace 2.0, a streaming storage engine, RapidFS caching, and a fully HDFS‑compatible BOS‑HDFS layer to boost performance and support massive AI workloads.

AIBaiduBig Data
0 likes · 12 min read
Why Object Storage Is Replacing HDFS for Modern Data Lakes: Baidu’s 2.0 Acceleration
Baidu Geek Talk
Baidu Geek Talk
Oct 30, 2024 · Cloud Computing

Baidu Cloud Infrastructure for AI-Native Era

Baidu Intelligent Cloud outlines how its evolving, high-performance infrastructure—featuring rapid 3-minute instance provisioning, over 200 GB bandwidth, elastic computing, specialized storage, and AI-driven MLOps tools—enables AI-native model training and deployment across booming sectors such as automotive and finance, supporting the industry’s shift to AI-centric cloud services.

Cloud ComputingDistributed SystemsMLOps
0 likes · 9 min read
Baidu Cloud Infrastructure for AI-Native Era
Baidu Geek Talk
Baidu Geek Talk
Oct 28, 2024 · Artificial Intelligence

Baidu Intelligent Cloud Qianfan AppBuilder: Enterprise-Level Large Model Application Development Platform

Baidu Intelligent Cloud’s Qianfan AppBuilder 3.0 offers an enterprise‑grade platform that simplifies large‑model application development by providing high‑accuracy RAG, robust agent scheduling, extensive integration, secure private‑or‑hybrid deployment, and a guided methodology, enabling industries to transform processes, add AI copilots, and create novel capabilities.

AI integrationBaidu Intelligent CloudQianfan AppBuilder
0 likes · 12 min read
Baidu Intelligent Cloud Qianfan AppBuilder: Enterprise-Level Large Model Application Development Platform