Wolfram Language & System Documentation Center

TextRecognize

TextRecognize[image]

recognizes text in image and returns it as a string.

TextRecognize[image,level]

returns a list of strings at the specified structural level.

TextRecognize[image,level,prop]

returns prop for text at the given level.

TextRecognize[video,…]

recognizes text in frames of video.

Details and Options

Text recognition, also known as OCR, is the process of detecting text in an image and converting it to text. It is typically used to extract text from scanned books, pictures of documents and more.
TextRecognize works with arbitrary grayscale and multichannel images, operating on the intensity value of each pixel.
TextRecognize[{image₁,image₂,…}] returns recognition for all image_i.
By default, the recognized text is returned as a single string for the whole image. Recognized text can be split into levels.
Structural elements specified in level include:

	Automatic	text found in the whole image as a single string (default)
	"Block"	a list of results for each block of text
	"Line"	a list of results for each line
	"Word"	a list of results for each word
	"Character"	a list of results for each character

TextRecognize[image,level,prop] computes prop at the given level and returns the result as a list {val₁,val₂,…}.
Possible settings for prop include:

	"BoundingBox"	bounding box around the text as a Rectangle
	"Confidence"	strength of the recognized text
	"Image"	cropped image containing the recognized text
	"Text"	recognized text (default)
	{prop₁,prop₂,…}	a list of properties

The following options can be specified:

Language	$Language	the language to recognize
Masking	All	the region of interest that includes text
Method	Automatic	the method to use
RecognitionPrior	Automatic	assumption about text in each masked area

TextRecognize accepts a Language option. By default, Language:>$Language is used. Using Language->{lang₁,lang₂,…} can be used to perform multi-language recognition.
The following Language settings can be used:
By default, the text found anywhere in the image is returned. Use Masking to specify the region of interest. Possible settings include:
All all of the image

mask a single region of interest

{mask₁,mask₂,…} multiple regions of interest
Possible settings for Method, specifying the type of document, include:

	Automatic	automatic choice of the method
	"Document"	optimized for detection in scanned documents
	"NaturalScene"	optimized for detection in natural scene images

Possible settings for Method, specifying the engine, include:

	Automatic	automatic choice of the method
	"EasyOCR"	optimized for scanned documents
	"OperatingSystem"	using the operating system (macOS only)
	"Tesseract"	optimized for scanned documents

RecognitionPrior makes an assumption about the kind of text present in the whole image or in each masked area. Possible settings include:

	Automatic	automatic structure recognition (default)
	"Column"	a single column of text
	"Line"	a single line of text
	"Word"	a single word
	"Character"	a single character
	"SparseText"	text in no particular structure

TextRecognize uses machine learning. Its methods, training sets and biases included therein may change and yield varied results in different versions of the Wolfram Language.
TextRecognize may download resources that will be stored in your local object store at $LocalBase, and that can be listed using LocalObjects[] and removed using ResourceRemove.

Examples

open all close all

Basic Examples (2)

Recognize text in an image:

Recognize lines of text and their corresponding bounding boxes:

Highlight the bounding box of each recognized line:

Scope (14)

Basic Uses (5)

Recognize dark text on a bright background:

Bright text on a dark background:

Color image:

Recognize a two-column text:

OCR of a list of images:

Levels (5)

The default recognition is performed on the whole image:

Recognize the text on the image per block:

Recognize the text on the image per line:

Recognize the text on the image per word:

Recognize the text on the image per character:

Properties (4)

By default, the recognized text is returned as a string:

The bounding box containing the recognized text:

The strength of the recognition:

The image containing the recognized text:

The subimages containing each word:

Get the recognized text and its strength for each word:

Get the images of the recognized text per line:

Construct a dataset from multiple properties of the recognized text per word:

Options (11)

Language (4)

The default recognition language is $Language:

Specify the language of the text to recognize:

Use an entity to specify the language:

Recognize text containing multiple languages:

Masking (1)

By default, using Masking->All, text in the whole image is recognized:

Specify the region of interest:

Use an image to specify the mask:

Use a mask specifying multiple regions of interest:

Use separate masks for each region of interest:

Method (3)

By default, the most suitable method is used:

Specify the document type:

Specify the engine to use:

RecognitionPrior (3)

Specify that the image contains a single character:

Without the prior, multiple characters may be returned:

Without the correct prior, the text may be unrecognized:

Specify the prior:

Recognize text formatted in two columns:

Replace double new lines with a single new line:

Applications (7)

Perform a frequency analysis on the recognized text:

Visualize the result, highlighting the less frequent words:

Use LanguageIdentify to identify the language of some recognized text:

Use the identified language to improve the OCR result:

Format the recognized text to interactively show the definition of each word:

Split the text to have all the words separated:

Keep the text content structure, including spaces and punctuation signs, for further reconstruction:

Write a function to look up a word's definition in the dictionary:

Apply the function to the words present in the extracted text:

Recombine the text in a single cell:

Recognize characters in a geometrical diagram:

Specify the regions that contain text:

Extract some properties and highlight them on the image:

Highlight each character with the recognized text and strength:

Recognize lines of text and their corresponding bounding boxes:

Highlight the bounding box of each recognized line using a tooltip for each box:

Recognize, group and highlight all occurrences of a single word in an image:

Synthesize audio corresponding to the text in an image:

Read from a book cover:

Properties & Relations (2)

Recognize warped text:

By default, the whole image is assumed to be the region of interest:

Remove all the new lines:

Replace extra new lines with a single new line:

Possible Issues (8)

Rotated text may not be recognized correctly:

Text of different colors may not be recognized:

The quality of recognized text may improve by processing the image:

Fonts smaller than a certain size cannot be recognized:

The quality of recognized text may improve by enlarging the image:

When recognizing formatted numbers, whitespace may cause OCR errors:

Text of different colors may not be fully recognized:

Some color adjustments and upsampling might help improve the recognition result:

Text given as outlines cannot be recognized properly:

Fill the background to improve the result:

Text surrounded by some non-textual content may not be properly recognized:

Specifying the content type might improve the recognition result:

Sometimes small text in a large image is not recognized:

Specify the regions that contain text to improve recognition:

Top

More Learning

Tech Support

Wolfram Solutions

Wolfram Solutions For Education

Get Started

Grow Your Skills

Work with Us

Educational Programs for Adults

Educational Programs for Youth

Read

TextRecognize

Details and Options

Examples

Basic Examples (2)

Scope (14)

Basic Uses (5)

Levels (5)

Properties (4)

Options (11)

Language (4)

Masking (1)

Method (3)

RecognitionPrior (3)

Applications (7)

Properties & Relations (2)

Possible Issues (8)

Text

CMS

APA

BibTeX

BibLaTeX

TextRecognize

Details and Options

Examples

Basic Examples (2)

Scope (14)

Basic Uses (5)

Levels (5)

Properties (4)

Options (11)

Language (4)

Masking (1)

Method (3)

RecognitionPrior (3)

Applications (7)

Properties & Relations (2)

Possible Issues (8)

See Also

Related Guides

Related Links

History

Text

CMS

APA

BibTeX

BibLaTeX