Tag

document rendering

0 views collected around this technical thread.

Architect
Architect
Nov 1, 2021 · Fundamentals

Document Rendering and Structured Data Extraction in Baidu Wenku: From Layout Data to Flow Data and Chart Metadata

The article explains Baidu Wenku's document conversion pipeline, detailing how various office formats are transformed into PDF layout data, then into adaptive flow data for mobile devices, and describes the technical methods for extracting structured content and chart metadata from PDFs and OOXML documents.

Baidu WenkuData ExtractionOOXML
0 likes · 11 min read
Document Rendering and Structured Data Extraction in Baidu Wenku: From Layout Data to Flow Data and Chart Metadata
Baidu Geek Talk
Baidu Geek Talk
Sep 15, 2021 · Frontend Development

Technical Overview of Baidu Reading/Wenku Cross‑Platform Layout Engine

Baidu Reading/Wenku’s cross‑platform layout engine delivers professional book‑level typesetting on Android and iOS by using a single native engine that parses diverse formats into an abstract DOM, generates a memory‑resident layout description, and supports relative, multi‑directional, and reusable layout techniques for high‑quality, adaptive document rendering.

Frontend DevelopmentTypographycross‑platform
0 likes · 11 min read
Technical Overview of Baidu Reading/Wenku Cross‑Platform Layout Engine
Baidu Geek Talk
Baidu Geek Talk
Jul 26, 2021 · Artificial Intelligence

Document Rendering and Structured Extraction Techniques in Baidu Wenku

Baidu Wenku converts all document types to PDF, parses the PDF into a proprietary format, uses absolute‑position layout for PC rendering, and transforms this into flow‑type structural data for mobile devices by re‑typing layout, extracting OOXML structures, and detecting charts, thereby enabling adaptive layouts, accurate formula rendering, and interactive chart extraction.

Mobile OptimizationOOXML parsingPDF conversion
0 likes · 12 min read
Document Rendering and Structured Extraction Techniques in Baidu Wenku