Tagged articles
9 articles
Page 1 of 1
Architect's Guide
Architect's Guide
Feb 25, 2026 · Backend Development

Convert Word (.docx) to PDF in Spring Boot with docx4j – A Complete Guide

Learn how to seamlessly convert uploaded .docx files to PDF in a Spring Boot application using the pure‑Java docx4j library, covering solution comparison, Maven dependencies, a utility class, controller implementation, and Windows/Linux font‑encoding fixes for reliable, high‑fidelity document rendering.

JavaMavenPDF conversion
0 likes · 11 min read
Convert Word (.docx) to PDF in Spring Boot with docx4j – A Complete Guide
Java Architect Handbook
Java Architect Handbook
Feb 11, 2026 · Backend Development

Convert Word (.docx) to PDF in Spring Boot with docx4j

This guide walks you through a fully open‑source, pure‑Java solution for converting uploaded .docx files to PDF in a Spring Boot application, covering library selection, Maven dependencies, a reusable utility class, controller implementation, and handling Chinese font issues on Windows and Linux.

JavaMavenPDF conversion
0 likes · 12 min read
Convert Word (.docx) to PDF in Spring Boot with docx4j
Baidu Geek Talk
Baidu Geek Talk
Jul 26, 2021 · Artificial Intelligence

Document Rendering and Structured Extraction Techniques in Baidu Wenku

Baidu Wenku converts all document types to PDF, parses the PDF into a proprietary format, uses absolute‑position layout for PC rendering, and transforms this into flow‑type structural data for mobile devices by re‑typing layout, extracting OOXML structures, and detecting charts, thereby enabling adaptive layouts, accurate formula rendering, and interactive chart extraction.

Mobile OptimizationOOXML parsingPDF conversion
0 likes · 12 min read
Document Rendering and Structured Extraction Techniques in Baidu Wenku
Python Programming Learning Circle
Python Programming Learning Circle
Jul 3, 2021 · Artificial Intelligence

Automatic PDF Slide Transcription Using Deep Learning OCR

This article demonstrates how to automatically convert PDF slide decks into editable markdown text by first converting each page to images, then applying a deep‑learning OCR pipeline (CTPN for detection and CRNN for recognition) with Python code examples, achieving high transcription accuracy.

OCRPDF conversiondeep learning
0 likes · 6 min read
Automatic PDF Slide Transcription Using Deep Learning OCR