Tag

text detection

0 views collected around this technical thread.

Model Perspective
Model Perspective
Sep 1, 2023 · Fundamentals

Unlocking the Power of Finite State Transducers: From Theory to Python Implementation

This article introduces finite‑state transducers, explains their mathematical definition, illustrates state‑transition examples such as binary counters, word detection, and parentheses matching, explores key applications in speech synthesis, spell‑checking, lemmatization, transliteration, and lexical analysis, and provides a concise Python implementation.

Natural Language ProcessingPythonautomata theory
0 likes · 9 min read
Unlocking the Power of Finite State Transducers: From Theory to Python Implementation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 12, 2023 · Artificial Intelligence

An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow

This article provides a comprehensive overview of Optical Character Recognition (OCR), covering its definition, historical development, classification, real‑world applications, technical pipeline, common challenges, mitigation strategies, popular datasets, model performance comparisons, and leading open‑source platforms.

Deep LearningOCROptical Character Recognition
0 likes · 16 min read
An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow
Kuaishou Tech
Kuaishou Tech
Dec 26, 2022 · Artificial Intelligence

ICDAR 2023-DSText Video Text Reading Competition Overview

The ICDAR 2023-DSText competition, launching on February 15, 2023, focuses on dense and small text detection and recognition in video, providing a YouTube‑sourced dataset of 100 videos, two challenge tasks, a detailed timeline, eligibility rules, and a list of international sponsoring institutions.

ICDARcompetitioncomputer vision
0 likes · 6 min read
ICDAR 2023-DSText Video Text Reading Competition Overview
Shopee Tech Team
Shopee Tech Team
Nov 10, 2022 · Artificial Intelligence

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

ShopeeVideo OCR is a multi‑language text‑recognition system for Southeast Asian e‑commerce videos that unifies detection, Transformer‑based recognition, layout analysis, and large‑scale synthetic data generation to handle Indonesian, Filipino, English, Vietnamese, Thai and Chinese scripts, delivering industry‑leading accuracy and winning thirteen ICDAR first‑place awards.

Deep LearningMulti-language OCROCR
0 likes · 15 min read
ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video
DataFunTalk
DataFunTalk
Nov 10, 2022 · Artificial Intelligence

A Comprehensive Overview of OCR Technology Development and Engineering Practices

This article reviews the 40‑year evolution of Optical Character Recognition, discusses its integration with Intelligent Document Processing, outlines recent research hotspots such as scene text recognition and domain‑specific symbol detection, and shares practical engineering experiences and future directions from Datagrand.

Deep LearningIntelligent Document ProcessingOCR
0 likes · 24 min read
A Comprehensive Overview of OCR Technology Development and Engineering Practices
DataFunTalk
DataFunTalk
Sep 21, 2021 · Artificial Intelligence

Text Recognition Techniques for Content Safety: Risks, Workflow, Algorithms, and Deployment Optimization

This article explains how OCR-based text recognition is applied to content safety, detailing common risk categories, a step‑by‑step detection and recognition pipeline, mainstream detection and recognition algorithms such as regression‑based and segmentation‑based methods, and practical deployment and performance optimization strategies.

AIDeep LearningOCR
0 likes · 15 min read
Text Recognition Techniques for Content Safety: Risks, Workflow, Algorithms, and Deployment Optimization
Baidu Geek Talk
Baidu Geek Talk
Aug 4, 2021 · Artificial Intelligence

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

PaddleOCR v2.2 launches PP‑Structure, a Python‑installable toolkit that combines PP‑YOLO v2 layout analysis (classifying text, title, table, image, list) with RARE‑based table recognition to extract structured content and export editable Excel files, while supporting custom training and simple command‑line use.

AIDeep LearningExcel Export
0 likes · 8 min read
PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

AIDeep LearningOCR
0 likes · 21 min read
Insights into OCR Technology at iQIYI: Development, Challenges, and Applications
Tencent Cloud Developer
Tencent Cloud Developer
Mar 4, 2021 · Artificial Intelligence

WeChat OCR: Implementation of Image Text Extraction Feature

WeChat’s 8.0 update introduced an OCR pipeline that first quickly detects text in images, classifies the image type, applies a lightweight multi‑language detection network and a MobileNetV3‑based DBNet recognizer with a multi‑task CTC/Attention model, then merges results via a rule‑based layout analyzer to deliver accurate, well‑formatted extracted text across diverse languages and document types.

DBNetDeep LearningOCR
0 likes · 13 min read
WeChat OCR: Implementation of Image Text Extraction Feature
HaoDF Tech Team
HaoDF Tech Team
Feb 2, 2021 · Artificial Intelligence

AI‑Based Structuring of Medical Examination Reports: OCR, Text Detection, Classification, and NER

This article describes how a Chinese online medical platform tackled the large‑scale extraction and structuring of hospital report images by combining OCR, deep‑learning text‑region detection, fast text classification, and advanced NER techniques, detailing challenges, algorithm choices, performance results, and remaining issues.

AINERNLP
0 likes · 19 min read
AI‑Based Structuring of Medical Examination Reports: OCR, Text Detection, Classification, and NER
Ctrip Technology
Ctrip Technology
Feb 28, 2019 · Artificial Intelligence

OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition

This article presents an overview of computer‑vision based OCR in Ctrip's operations, detailing deep‑learning text detection methods for controlled and uncontrolled scenarios, sequence‑based recognition models, training strategies with synthetic data, and performance results, while discussing current challenges and future improvements.

AICtripDeep Learning
0 likes · 11 min read
OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition