Wolfram Language & System Documentation Center

SpeechCases

SpeechCases[audio,form]

gives a list of cases of text identified as being of type form that appear in the transcription of audio.

SpeechCases[audio,{form₁,form₂,…}]

gives an association of results for all the types form_i.

SpeechCases[audio,formspecprop]

gives the specified property for each result found.

SpeechCases[audio,formspec{prop₁,prop₂,…}]

gives a list of properties for each result found.

SpeechCases[audio,spec,n]

gives the first n cases found.

SpeechCases

SpeechCases[audio,form]

gives a list of cases of text identified as being of type form that appear in the transcription of audio.

SpeechCases[audio,{form₁,form₂,…}]

gives an association of results for all the types form_i.

SpeechCases[audio,formspecprop]

gives the specified property for each result found.

SpeechCases[audio,formspec{prop₁,prop₂,…}]

gives a list of properties for each result found.

SpeechCases[audio,spec,n]

gives the first n cases found.

Details and Options

SpeechCases[{audio₁,audio₂,…},…] gives identified cases for each audio_i.
Identification type form can be:

	"type"	any text content type (e.g. "Noun", "City")
	Entity[…,…]	a specific entity of a text content type
	form₁\|form₂\|…	form matching any of the form_i
	Containing[outer,inner]	forms of type outer containing type inner
	Verbatim["string"]	a specific string to be matched exactly
	pattern	a string pattern to be matched

Possible choices for the property prop are:

	"String"	string of the identified text (default)
	"Position"	start and end position of the string in text
	"Probability"	estimated probability that the identification is correct
	"Interpretation"	standard interpretation of the identified string
	"Snippet"	a snippet around the identified string
	"HighlightedSnippet"	a snippet with the identified string highlighted
	f	apply f to the association containing all properties
	{prop₁,prop₂,…}	a list of property specifications

The following options can be given:

AcceptanceThreshold	Automatic	minimum probability to accept identification
Masking	All	interval of interest
PerformanceGoal	Automatic	favor algorithms with specific advantages
TargetDevice	"CPU"	whether CPU or GPU computation should be used for entity detection
VerifyInterpretation	False	whether interpretability should be verified

SpeechCases uses machine learning. Its methods, training sets and biases included therein may change and yield varied results in different versions of the Wolfram Language.
SpeechCases may download resources that will be stored in your local object store at $LocalBase, and that can be listed using LocalObjects[] and removed using ResourceRemove.

Examples

open all close all

Basic Examples (2)

Find the cities in a speech recording:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-vphqd)"]$, "City"]

Find the cities and get interpretations:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-vphqd)"]$, "City" -> "Interpretation"]

Scope (13)

Basic Uses (3)

Find all cities:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]$, "City"]

Find and interpret all instances of cities:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]$, "City" -> "Interpretation"]

Specify the maximum number of identifications to return for each type:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]$, "City" , 1]

Form Specification (4)

Find the spoken nouns:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-91z7d)"]$, "Noun"]

Find the spoken words:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-91z7d)"]$, "Word"]

Find cities and countries in a recording:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-vphqd)"]$, {"City", "Country"}]

Use Alternatives to match multiple types:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-91z7d)"]$, "Noun" | "Verb"]

Find all sentences in a string that contain currency amounts:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-wl9a6)"]\), Containing["Sentence", "CurrencyAmount"]]

Properties (6)

Find currency amounts and get interpretations:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-wl9a6)"]$, "CurrencyAmount" -> "Interpretation"]

Obtain probabilities and interpretations for detected cities and countries:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-vphqd)"]\), {"City", "Country"} -> {"String", "Interpretation", "Probability"}]

Specify multiple return types:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]\), "CurrencyAmount"  -> {"String", "Interpretation"}]

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]\), {"CurrencyAmount", "City", "Date"}  -> {"String", "Interpretation"}]

Show all available properties in an Association:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]$, "Date" -> Identity]

Create a dataset with the properties of several types of entities:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]\), {"CurrencyAmount", "City", "Date"} -> Identity]//Dataset

Get the geodetic positions of the locations occurring in a spoken text:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]\), "Location" -> (#String -> #Interpretation&)]

Options (3)

AcceptanceThreshold (1)

By default, an automatic acceptance threshold is used:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]\), "Location" -> {"String", "Probability"}]

Specify the minimum identification probability:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-il8lf)"]$, "Location", AcceptanceThreshold -> .9]

Masking (1)

By default, the whole audio signal is processed for interpretable results:

Wolfram Language code: SpeechCases[\!$\*AudioBox["![Embedded Audio Player](audio://content-91z7d)"]$, "Noun"]

Search for nouns only in the first half of the signal:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-91z7d)"]\), "Noun", Masking -> {Quantity[0, "Seconds"], Quantity[1.3, "Seconds"]}]

VerifyInterpretation (1)

By default, the interpretability of a result is not verified and a string is returned instead of an interpretation:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-8qyu8)"]\), {"City", "Country"} -> "Interpretation"]

Filter out the entities that cannot be interpreted:

Wolfram Language code:

SpeechCases[\!\(\*AudioBox["![Embedded Audio Player](audio://content-8qyu8)"]\), {"City", "Country"} -> "Interpretation", VerifyInterpretation -> True]

Properties & Relations (2)

SpeechCases is effectively calling TextCases on the result of SpeechRecognize:

Wolfram Language code:

a = \!\(\*AudioBox["![Embedded Audio Player](audio://content-bajse)"]\);
TextCases[SpeechRecognize[a], "ProperNoun"]

Wolfram Language code: SpeechCases[a, "ProperNoun"]

SpeechCases supports the same identification types as TextCases:

Wolfram Language code:

a = \!\(\*AudioBox["![Embedded Audio Player](audio://content-bajse)"]\);
speech = SpeechRecognize[a]

Wolfram Language code: SpeechCases[a, "Word"]//TextElement

They identify the same substrings for a given type and transcription:

Wolfram Language code: TextCases[speech, "Word"]//TextElement

Top

More Learning

Tech Support

Wolfram Solutions

Wolfram Solutions For Education

Get Started

Grow Your Skills

Work with Us

Educational Programs for Adults

Educational Programs for Youth

Read

SpeechCases

Details and Options

Examples

Basic Examples (2)

Scope (13)

Basic Uses (3)

Form Specification (4)

Properties (6)

Options (3)

AcceptanceThreshold (1)

Masking (1)

VerifyInterpretation (1)

Properties & Relations (2)

Text

CMS

APA

BibTeX

BibLaTeX

SpeechCases

Details and Options

Examples

Basic Examples (2)

Scope (13)

Basic Uses (3)

Form Specification (4)

Properties (6)

Options (3)

AcceptanceThreshold (1)

Masking (1)

VerifyInterpretation (1)

Properties & Relations (2)

See Also

Related Guides

History

Text

CMS

APA

BibTeX

BibLaTeX