Tag

text-extraction


Python Programming Learning Circle
May 20, 2025 · Fundamentals

Python Script for Extracting Text from PDF Files Using PyPDF2

This article introduces a Python utility built with PyPDF2 that extracts text from PDF files and saves it as a TXT file through an interactive command‑line interface with error handling. It also covers usage instructions and code examples for document processing.
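
The workflow summarized above (extract text, save it as TXT, prompt interactively, handle errors) can be sketched roughly as follows. This is a minimal illustration and not the article's own script: the function names `txt_path_for`, `extract_pdf_text`, `convert`, and `main` are hypothetical, and it assumes PyPDF2 2.x or later, where `PdfReader`, `reader.pages`, and `page.extract_text()` are available.

```python
from pathlib import Path


def txt_path_for(pdf_path: str) -> Path:
    """Return the output path: same folder and name, with a .txt suffix."""
    return Path(pdf_path).with_suffix(".txt")


def extract_pdf_text(pdf_path: str) -> str:
    """Concatenate the text of every page in the PDF."""
    if not Path(pdf_path).is_file():
        raise FileNotFoundError(f"No such PDF: {pdf_path}")
    # Imported lazily so the path helper works without PyPDF2 installed.
    from PyPDF2 import PdfReader  # assumes PyPDF2 2.x or later

    reader = PdfReader(pdf_path)
    # Fall back to "" in case a page (e.g. a scanned image) yields no text.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def convert(pdf_path: str) -> Path:
    """Extract text from pdf_path and save it next to the PDF as a TXT file."""
    out = txt_path_for(pdf_path)
    out.write_text(extract_pdf_text(pdf_path), encoding="utf-8")
    return out


def main() -> None:
    """Hypothetical interactive entry point with basic error handling."""
    path = input("PDF path: ").strip()
    try:
        print(f"Saved text to {convert(path)}")
    except Exception as exc:  # report the problem instead of crashing
        print(f"Could not process {path!r}: {exc}")
```

Calling `main()` from a terminal prompts for a path and writes the TXT file beside the PDF; `convert()` can also be called directly from other code.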

Command Line, PDF, PyPDF2
9 min read
Code Ape Tech Column
Mar 18, 2024 · Backend Development

Implementing File Upload and Text Extraction with Elasticsearch Ingest Attachment Plugin in Spring Boot

This tutorial explains how to let users upload PDF, Word, or TXT files, install the Elasticsearch Ingest Attachment Processor Plugin, create an ingest pipeline and index mapping, convert files to Base64, and perform fuzzy searches with highlighted results using Spring Boot and Java code examples.
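
The request bodies involved in those steps can be sketched as plain data. The tutorial itself uses Spring Boot and Java; the sketch below uses Python only to show the JSON shapes, and it is an illustration under assumptions: the source field name `data`, the sample document, and the use of `fuzziness` on the match query are choices made here, not details taken from the tutorial. The `attachment` processor and its default `attachment.content` output field come from the Elasticsearch ingest attachment documentation.

```python
import base64
import json


def attachment_pipeline(field: str = "data") -> dict:
    """Ingest pipeline body that runs the attachment processor on `field`."""
    return {
        "description": "Extract text from uploaded files",
        "processors": [{"attachment": {"field": field}}],
    }


def encode_file(raw: bytes) -> str:
    """Base64-encode file bytes, the input format the processor expects."""
    return base64.b64encode(raw).decode("ascii")


def search_body(keyword: str) -> dict:
    """Match query on the extracted text, with highlighting enabled."""
    return {
        "query": {
            "match": {
                # fuzziness is an assumption about what "fuzzy search" means here
                "attachment.content": {"query": keyword, "fuzziness": "AUTO"}
            }
        },
        "highlight": {"fields": {"attachment.content": {}}},
    }


# Hypothetical document: file name plus Base64 payload in the `data` field.
doc = {"filename": "notes.txt", "data": encode_file(b"hello elasticsearch")}

print(json.dumps(attachment_pipeline(), indent=2))
print(json.dumps(doc))
print(json.dumps(search_body("elasticsearch")))
```

In a real setup the pipeline body would be sent to `PUT _ingest/pipeline/<name>`, and documents would be indexed with `?pipeline=<name>` so the processor populates `attachment.content` before the search runs.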

Elasticsearch, File Upload, Ingest Attachment
21 min read
php中文网 Courses
Jun 29, 2023 · Backend Development

How to Extract Text from Images Using PHP and Tesseract OCR

This tutorial demonstrates how to install a Tesseract OCR library for PHP via Composer, set up a PHP script to load an image, create a TesseractOCR instance, run the OCR process, and output the extracted text, with complete sample code for each step.

OCR, PHP, Tesseract
3 min read
Sohu Tech Products
Sep 28, 2022 · Fundamentals

PyMuPDF (Python bindings for MuPDF) – Introduction, Features, Installation and Usage Guide

This article provides a comprehensive overview of PyMuPDF, the Python binding for the lightweight MuPDF library, covering its purpose, supported document formats, key features such as rendering, text extraction and PDF manipulation, installation methods, and detailed code examples for common operations.

MuPDF, PDF, PyMuPDF
13 min read
Python Programming Learning Circle
May 9, 2022 · Fundamentals

Introduction and Usage Guide for PyMuPDF (Python Bindings for MuPDF)

This article provides a comprehensive overview of PyMuPDF, the Python binding for the lightweight MuPDF library, covering its installation, core features such as page rendering, text and image extraction, PDF manipulation, and detailed code examples for common document‑processing tasks.

MuPDF, PDF, PyMuPDF
12 min read