Text Detection and Recognition

Detect and recognize text using image feature detection and description, deep learning, and OCR

Detecting and recognizing text in images is a common task performed in computer vision applications. For example, you can capture video of a road scene from a moving vehicle, recognize signposts in the captured scene, and alert the driver about the signs.

You can combine detection and recognition combined into a two-step process, where the first step finds regions that contain text, and then the second step recognizes the text within the regions.

Input image showing an accessible parking sign, connected to a detector, which outputs an image with predicted bounding boxes overlaid on the sign text, connected to a recognizer that outputs a list of the words recognized on the sign.

Text detection algorithms use local image features, machine learning or deep learning, to locate or segment text within an image. The examples in the Computer Vision Toolbox™ demonstrate how to use blob analysis, the maximally stable extremal regions (MSER) feature detector, and the character region awareness for text detection (CRAFT) deep learning model for text detection.

Once you have detected the text, text recognition models, based on machine learning or deep learning, process the text regions to return the predicted text. The ocr function uses pretrained language models to recognize text in multiple languages. You can also train a custom language model using the trainOCR function. For more information, see Getting Started with OCR.

Apps

Image Labeler

Label images for computer vision applications

Functions

expand all

Text Recognition

`ocr`	Recognize text using optical character recognition
`ocrText`	Store OCR results
`visionSupportPackages`	Start Installer to download, install, or uninstall Computer Vision Toolbox data

Training and Evaluation

`trainOCR`	Train OCR model to recognize text in image (Since R2023a)
`evaluateOCR`	Evaluate OCR results against ground truth (Since R2023a)
`ocrMetrics`	Store OCR quality metrics (Since R2023a)
`ocrTrainingOptions`	Options for training OCR model (Since R2023a)
`ocrTrainingData`	Create training data for OCR from ground truth (Since R2023a)

Quantization

quantizeOCR Quantize OCR model (Since R2023a)

Text Detection

`detectTextCRAFT`	Detect texts in images by using CRAFT deep learning model (Since R2022a)
`detectMSERFeatures`	Detect MSER features
`vision.BlobAnalysis`	Properties of connected regions
`extractHOGFeatures`	Extract histogram of oriented gradients (HOG) features

Topics

Get Started

Getting Started with OCR
Detect and recognize text in multiple languages, train OCR models to recognize custom text.
Train Custom OCR Model
Train an optical character recognition (OCR) model to recognize custom text.
Install OCR Language Data Files
Support files for optical character recognition (OCR) languages.
Local Feature Detection and Extraction
Learn the benefits and applications of local feature detection and extraction.
Point Feature Types
Choose functions that return and accept points objects for several types of features.

Featured Examples

Recognize Text Using Optical Character Recognition (OCR)

Recognize text in images using optical character recognition.

Open Live Script

Recognize Seven-Segment Digits Using OCR

Use OCR to recognize seven-segmented digits in text detected by CRAFT and region properties.

Open Live Script

Automatically Detect and Recognize Text Using MSER and OCR

Automatically detect and recognize text in images using MSER and OCR.

Open Live Script

Automatically Detect and Recognize Text Using Pretrained CRAFT Network and OCR

Perform text recognition by using a deep learning based text detector and OCR. In the example, you use a pretrained CRAFT (character region awareness for text) deep learning network to detect the text regions in the input image. You can modify the region threshold and the affinity threshold values of the CRAFT model to localise an entire paragraph, a sentence, or a word. Then, you use OCR to recognize the characters in the detected text regions.

Open Live Script

Automate Ground Truth Labeling for OCR

Automate the labeling of text for OCR training and evaluation.

Open Live Script

Train an OCR Model to Recognize Seven-Segment Digits

Train an OCR model that can recognize seven-segment numerals.

Open Live Script

Digit Classification Using HOG Features

Classify digits using HOG features and a multiclass SVM classifier.

Open Live Script