Wolfram Language & System Documentation Center

FeatureExtractor

is an option for functions such as Classify that specifies how features should be extracted.

Details

Possible settings for FeatureExtractor include:

	FeatureExtractorFunction[…]	apply the given extractor function
	extractor	apply the specified feature extractor method
	{extractor₁,extractor₂,…}	apply the sequence of extractor methods in turn
	specext	apply extractor ext to data parts specified by spec
	{spec₁ext₁,spec₂ext₂,…}	apply extractors ext_i to data parts specified by the spec_i

Possible feature extractor methods include:

	Automatic	automatic extraction
	Identity	give data unchanged
	"ConformedData"	conformed images, colors, dates, etc.
	"NumericVector"	numeric vector from any data
	f	applies function f to each example
	{extractor₁,extractor₂,…}	use a sequence of extractors in turn

Additional feature extractor methods can also be used for each data type.
Numeric data:

	"DiscretizedVector"	discretized numerical data
	"DimensionReducedVector"	reduced-dimension numeric vectors
	"MissingImputed"	data with missing values imputed
	"StandardizedVector"	numeric data processed with Standardize

Nominal data:
"IndicatorVector" nominal data "one-hot encoded" with indicator vectors

"IntegerVector" nominal data encoded with integers
Text:

	"LowerCasedText"	text with each character lowercase
	"SegmentedCharacters"	text segmented into characters
	"SegmentedWords"	text segmented into words
	"SentenceVector"	semantic embedding vector from a text
	"TFIDF"	term frequency-inverse document frequency vector
	"WordVectors"	semantic vectors sequence from a text (English only)

Images:

	"FaceFeatures"	semantic vector from an image of a human face
	"ImageFeatures"	semantic vector from an image
	"PixelVector"	vector of pixel values from an image

Audio objects:

	"AudioFeatures"	sequence of semantic vectors from an audio object
	"AudioFeatureVector"	semantic vector from an audio object
	"LPC"	audio linear prediction coefficients
	"MelSpectrogram"	audio spectrogram with logarithmic frequency bins
	"MFCC"	audio mel-frequency cepstral coefficient vectors sequence
	"SpeakerFeatures"	sequence of semantic speaker vectors
	"SpeakerFeatureVector"	semantic vector for a speaker
	"Spectrogram"	audio spectrogram

Video objects:
"VideoFeatures" sequence of semantic vectors from a video object

"VideoFeatureVector" semantic vector from a video object
Graphs:
"GraphFeatures" numeric vector summarizing graph properties
Molecules:

	"AtomPairs"	Boolean vector from pairs of atoms and the path lengths between them
	"MoleculeExtendedConnectivity"	Boolean vector from enumerated molecule subgraphs
	"MoleculeFeatures"	numeric vector summarizing molecule properties
	"MoleculeTopologicalFeatures"	Boolean vector from circular atom neighborhoods

By default, FeatureExtractorIdentity.
Typically, the value of FeatureExtractor is interpreted as a preprocessing step: it will not replace the other feature extractors used by the function.
When the feature extractor method is not a FeatureExtractorFunction[…] or a custom function, the feature extraction will be learned from the data.
With the settings specext or {spec₁ext₁,…}, possible forms for spec and the spec_i include:

	All	all parts of each example
	i	i part of each example
	{i₁,i₂,…}	parts i₁, i₂, … of each example
	"name"	part with the specified name in each example
	{"name₁","name₂",…}	parts with names "name_i" in each example

Parts not mentioned in spec or the spec_i are dropped for the purpose of extracting features.
In functions such as Classify, Predict, DimensionReduction or ClusterClassify, FeatureExtractor"Minimal" indicates that the internal preprocessing should be as simple as possible.

Examples

open all close all

Basic Examples (3)

Train a FeatureExtractorFunction on a simple dataset:

Use the feature extractor function as a preprocessing step in Classify:

Train a classifier using the extractor method "ImageFeatures" as a preprocessing step:

Classify a new image:

Generate a predictor function using FeatureExtractor to preprocess the data using a custom function:

Add the "StandardizedVector" method to the preprocessing pipeline:

Use the predictor on new data:

Scope (1)

Train a classifier on texts preprocessed by custom functions and an extractor method:

Top

More Learning

Tech Support

Wolfram Solutions

Wolfram Solutions For Education

Get Started

Grow Your Skills

Work with Us

Educational Programs for Adults

Educational Programs for Youth

Read

FeatureExtractor

Details

Examples

Basic Examples (3)

Scope (1)

Text

CMS

APA

BibTeX

BibLaTeX

	"IndicatorVector"	nominal data "one-hot encoded" with indicator vectors
	"IntegerVector"	nominal data encoded with integers

FeatureExtractor

Details

Examples

Basic Examples (3)

Scope (1)

See Also

Related Guides

History

Text

CMS

APA

BibTeX

BibLaTeX