Baidu Geek Talk

Latest from Baidu Geek Talk

Jan 20, 2025 · Industry Insights

How Baidu’s Qianfan AppBuilder Is Redefining AI‑Native App Development

This interview explores how Baidu Cloud's Qianfan AppBuilder platform is moving application development from traditional coding to AI‑native low‑code development. It covers large‑model agents, Retrieval‑Augmented Generation, security, multimodal support, and the platform roadmap, and discusses their effect on enterprise productivity and digital transformation.

AI agents · AI native apps · Large Models
0 likes · 18 min read
Jan 15, 2025 · Artificial Intelligence

Understanding Large Model Inference Engines and Reducing Token Interval (TPOT)

Large‑model inference engines turn a prompt into a response in two stages: a prefill stage that processes the prompt, followed by autoregressive decoding that emits one token at a time. The two stages are measured by TTFT (time to first token) and TPOT (time per output token). Baidu’s AIAK suite improves TPOT by separating out tokenization, using static slot scheduling, and executing asynchronously, which cuts token‑interval latency from ~35 ms to ~14 ms and raises GPU utilization to about 75 %. It also uses quantization and speculative execution for higher throughput.
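To make the two metrics concrete, here is a minimal sketch of how TTFT and TPOT could be computed from per-token arrival timestamps. The `RequestTiming` type and its field names are hypothetical and are not taken from AIAK; the sketch assumes TPOT is the mean gap between consecutive output tokens after the first.

```python
from dataclasses import dataclass


@dataclass
class RequestTiming:
    """Timestamps in seconds for one generation request (hypothetical structure)."""
    sent_at: float            # when the prompt was submitted
    token_times: list[float]  # arrival time of each output token, in order


def ttft(t: RequestTiming) -> float:
    """Time to first token: delay between submitting the prompt and the first output token."""
    return t.token_times[0] - t.sent_at


def tpot(t: RequestTiming) -> float:
    """Time per output token: mean gap between consecutive tokens after the first."""
    if len(t.token_times) < 2:
        raise ValueError("need at least two output tokens to measure TPOT")
    return (t.token_times[-1] - t.token_times[0]) / (len(t.token_times) - 1)
```

For example, a request sent at `0.0` whose tokens arrive at `0.5`, `0.535`, `0.570`, and `0.605` seconds has a TTFT of 0.5 s and a TPOT of roughly 35 ms, which is the order of magnitude the summary quotes before optimization.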

AI acceleration · GPU utilization · TPOT
0 likes · 10 min read
Jan 13, 2025 · Industry Insights

Top 12 Must-Read Baidu Tech Articles of 2024: Insights & Innovations

This roundup highlights twelve standout Baidu Geek Talk articles from 2024, covering search personalization, high‑performance Go services, transaction reconciliation, login system evolution, AI‑native applications, microservice governance, caching algorithms, RLHF optimization, ClickHouse deployment, and more. Each entry comes with a concise reason for the recommendation.

2024 · AI · Baidu
0 likes · 8 min read
Jan 8, 2025 · Artificial Intelligence

Evolution of Video Search Ranking Architecture Towards an End‑to‑End Large‑Model Framework

The article outlines how video search ranking has shifted from a tightly‑coupled multi‑stage cascade to an extensible, end‑to‑end, model‑centric framework called Rankflow. Rankflow relies on large‑model inference, decoupled recall, fine‑grained parallelism, and elastic compute allocation to improve performance, flexibility, and maintainability, and it prepares the ground for future retrieval‑augmented generation integration.

AI · Large Models · elastic resources
0 likes · 11 min read
Jan 6, 2025 · Information Security

MarkupLM-based Detection of Malicious Content Scraping

The article presents a MarkupLM‑based approach that enriches BERT with XPath embeddings to jointly model webpage text and structure, enabling site‑level detection of malicious content‑scraping pages that bypass traditional rule‑based filters and demonstrating the critical role of structural cues in improving spam classification accuracy.

Machine Learning · MarkupLM · XPath embedding
0 likes · 16 min read
Dec 30, 2024 · Industry Insights

How Baidu’s HTAP Table Storage Achieves Massive IO Gains and Faster Development

Baidu’s Search Content Storage team built an HTAP table storage system and a serverless compute‑scheduling architecture that separates OLTP and OLAP workloads, delivering up to 200 GB/s peak IO, reducing storage cost by 75 %, and enabling SQL‑style task development with native FaaS functions.

Big Data · Compute Scheduling · HTAP
0 likes · 20 min read
Dec 25, 2024 · Industry Insights

How to Build a Multimodal Web Page Model for the LLM Era

This article examines the multimodal and multi‑granular nature of web pages, compares fusion strategies, proposes a cross‑modal attention approach, outlines fine‑ and coarse‑grained pre‑training tasks, and explores low‑cost adapter methods for applying large multimodal models to web‑page modeling in the LLM era.

AI · HTML · LLM adaptation
0 likes · 10 min read
Dec 23, 2024 · Industry Insights

How Baidu’s One‑Stop Search Platform Cuts Development Costs by 80%

This article analyzes the one‑stop development platform built by Baidu’s vertical‑search architecture team. It details the background challenges, the FaaS and SaaS mechanisms introduced, design decisions, performance optimizations, dynamic forms and DAG visualization, and the resulting cost reductions and productivity gains.

Cloud Computing · FaaS · Industry Insights
0 likes · 17 min read
Dec 18, 2024 · Artificial Intelligence

GEE Graph Embedding Algorithm for Business Security Anomaly Detection

The article presents the GEE (Graph Encoder Embedding) algorithm for business security anomaly detection. It explains the algorithm's label‑propagation foundation, evaluates it on real data with ten million edges, identifies inefficiencies in the original implementation, and demonstrates that vectorized NumPy/Pandas optimizations reduce runtime from 55 seconds to about 4 seconds while preserving meaningful embeddings, as visualized with t‑SNE.
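As a rough illustration of what a vectorized graph encoder embedding can look like, the sketch below accumulates, for each node, the class-normalized edge weight it shares with each class, using NumPy scatter-adds instead of a Python loop over edges. It is not the article's implementation: the function name `gee_embed`, the `(src, dst, weight)` edge layout, the `-1` marker for unlabeled nodes, and the assumption of an undirected graph without self-loops are all choices made for this example.

```python
import numpy as np


def gee_embed(edges: np.ndarray, labels: np.ndarray, n_classes: int) -> np.ndarray:
    """Embed each node as an n_classes-dimensional vector (hypothetical helper).

    edges:  shape (m, 3), rows of (src, dst, weight); treated as undirected
    labels: shape (n,), integer class id in [0, n_classes) per node, or -1 if unlabeled
    """
    n = labels.shape[0]
    src = edges[:, 0].astype(np.int64)
    dst = edges[:, 1].astype(np.int64)
    weight = edges[:, 2].astype(np.float64)

    known = labels >= 0
    counts = np.bincount(labels[known], minlength=n_classes).astype(np.float64)

    # Each labeled node contributes 1 / (size of its class); unlabeled nodes contribute 0.
    scale = np.zeros(n)
    scale[known] = 1.0 / counts[labels[known]]

    z = np.zeros((n, n_classes))
    # For every edge (a, b): z[a, label[b]] += weight * scale[b], in both directions.
    for a, b in ((src, dst), (dst, src)):
        mask = known[b]
        np.add.at(z, (a[mask], labels[b[mask]]), weight[mask] * scale[b[mask]])
    return z
```

`np.add.at` is used rather than fancy-index assignment because several edges can target the same `(node, class)` cell, and plain `z[rows, cols] += values` would drop the repeated contributions.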

Anomaly Detection · GEE algorithm · anti-fraud
0 likes · 21 min read
Dec 16, 2024 · Artificial Intelligence

AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications

AIAPI, Baidu’s AI‑native retrieval platform for large language models, tackles hallucination, slow domain updates, and output opacity by delivering authoritative, timely, full‑content data. Its dual‑channel architecture combines traditional search with RAG and is built on reusable ranking, graph‑enhanced data layers, dynamic caching that cuts storage by 70 %, and QueryPlan‑based QoS. The result is markedly higher retrieval quality and a 34 % speed gain with Wenxin 4.0.

AI-Native Systems · AIAPI · Query Planning
0 likes · 12 min read