Computer Vision

The Wolfram Language provides built-in functions for image identification, object detection and recognition, and feature extraction, all based on state-of-the-art methods. It supports specific geometric features such as edges and corners, as well as general keypoints that can be used to register and compare images.

Image Classification

ImageIdentify, ImageInstanceQ identify what an image is of (out of 10,000+ objects)

Classify  ▪  "CountryFlag"  ▪  "NotablePerson"  ▪  ...
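The classification functions above can be sketched as follows. This is a minimal example, assuming the built-in "Mandrill" test image from ExampleData as a stand-in input, and assuming the identification models can be downloaded on first use.

```wolfram
(* Stand-in input: a built-in test image; any Image object works here. *)
img = ExampleData[{"TestImage", "Mandrill"}];

(* Most likely identification of what the image shows. *)
ImageIdentify[img]

(* Three most likely identifications from any category, with probabilities. *)
ImageIdentify[img, All, 3, "Probability"]
```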

Object Detection & Recognition

ImageCases detect and recognize subimages of object categories

ImageContents  ▪  ImagePosition  ▪  ImageBoundingBoxes  ▪  ImageContainsQ  ▪  TrainImageContentDetector
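A short sketch of how the detection functions above might be combined. The file path is hypothetical; substitute any photograph that contains recognizable objects.

```wolfram
(* Hypothetical input path; replace with a real image file. *)
img = Import["path/to/photo.jpg"];

(* Summary of the detected objects. *)
ImageContents[img]

(* Bounding boxes, returned as an association of category -> rectangles. *)
boxes = ImageBoundingBoxes[img];

(* Draw every detected box on top of the image. *)
HighlightImage[img, Flatten[Values[boxes]]]

(* Extract the subimages of the detected objects. *)
ImageCases[img]
```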

Image Synthesis & Effects

ImageSynthesize generate an image from a textual or visual prompt

ImageRestyle image style transfer

Face Analysis

FindFaces find human faces in an image

FacialFeatures  ▪  FaceRecognize  ▪  FaceAlign
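As an illustration of the face analysis functions, the following sketch assumes a hypothetical portrait file; the results depend on the faces that are actually detected.

```wolfram
(* Hypothetical input path; replace with a photo containing faces. *)
img = Import["path/to/portrait.jpg"];

(* Bounding boxes of the detected faces, as a list of rectangles. *)
faces = FindFaces[img];

(* Mark the detected faces on the image. *)
HighlightImage[img, faces]

(* Estimated age for each detected face. *)
FacialFeatures[img, "Age"]
```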

Optical Character Recognition

TextRecognize extract characters from an image

BarcodeRecognize  ▪  FindImageText
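A minimal sketch of text recognition, assuming a synthetic input created by rasterizing a styled string so that the example needs no external file:

```wolfram
(* Synthetic test input: render a string as a raster image. *)
textImage = Rasterize[Style["Computer Vision", 40]];

(* Recognize the text in the image and return it as a string. *)
TextRecognize[textImage]

(* Locate the regions of the image that contain text. *)
FindImageText[textImage]
```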

Machine Learning

Classify perform classification on a collection of images

FeatureSpacePlot  ▪  FeatureSpacePlot3D  ▪  FeatureExtractor  ▪  Nearest  ▪  ...
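One way to explore a collection of images with these functions is sketched below. The choice of the first eight built-in test images is an arbitrary assumption made for illustration.

```wolfram
(* Illustrative collection: the first eight built-in test images. *)
imgs = ExampleData /@ Take[ExampleData["TestImage"], 8];

(* Place the images in a 2D feature space learned from their content. *)
FeatureSpacePlot[imgs]
```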

Neural Networks

NetModel use pretrained nets for image analysis

NetEncoder  ▪  "Image"  ▪  "Image3D"  ▪  "VideoFrames"

NetChain  ▪  NetGraph  ▪  NetTrain  ▪  ConvolutionLayer  ▪  ...
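A sketch of applying a pretrained net to an image. The model name is an assumption about what is available in the Wolfram Neural Net Repository, and the net is downloaded on first use.

```wolfram
(* Assumed repository model name; fetched from the repository on first use. *)
net = NetModel["ResNet-50 Trained on ImageNet Competition Data"];

(* Stand-in input image. *)
img = ExampleData[{"TestImage", "Mandrill"}];

(* Inspect the image encoder attached to the input port of the net. *)
NetExtract[net, "Input"]

(* Most likely class for the image. *)
net[img]

(* Three most likely classes with their probabilities. *)
net[img, {"TopProbabilities", 3}]
```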

Feature Detection

ImageGraphics convert an image to scalable graphics

RemoveBackground separate the foreground from the background and remove the background

ImageKeypoints  ▪  ImageCorners  ▪  ImageLines  ▪  ImageMesh  ▪  ImageSaliencyFilter  ▪  ...
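The keypoint and corner detectors can be tried as in the following sketch, which assumes the built-in "Mandrill" test image and an arbitrary limit of 50 features per detector.

```wolfram
(* Stand-in input image. *)
img = ExampleData[{"TestImage", "Mandrill"}];

(* Positions of up to 50 keypoints. *)
keypoints = ImageKeypoints[img, MaxFeatures -> 50];
HighlightImage[img, keypoints]

(* Positions of up to 50 corners. *)
corners = ImageCorners[img, MaxFeatures -> 50];
HighlightImage[img, corners]
```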

Feature Tracking

ImageCorrespondingPoints find corresponding keypoints in pairs of images

ImageFeatureTrack  ▪  ImageDisplacements

Image Registration

ImageAlign align or register two images

ImageTransformation  ▪  FindGeometricTransform  ▪  ...
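Feature matching and registration can be illustrated together. In this sketch the second image is simply a rotated copy of the first (an assumption made so the example is self-contained), and enough matching keypoints are assumed to be found.

```wolfram
(* Reference image and a rotated copy standing in for a second view. *)
ref = ExampleData[{"TestImage", "Mandrill"}];
moved = ImageRotate[ref, 10 Degree];

(* Matching keypoint positions in the two images. *)
{refPts, movedPts} = ImageCorrespondingPoints[ref, moved];

(* Estimate the transformation between the matched points, with its error. *)
{error, transform} = FindGeometricTransform[refPts, movedPts];

(* Register the rotated copy against the reference directly. *)
aligned = ImageAlign[ref, moved]
```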

Image Transforms

Radon, InverseRadon Radon and inverse Radon transforms

Fourier  ▪  ImagePeriodogram  ▪  DiscreteWaveletTransform
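A brief sketch of the transforms above, assuming a grayscale version of the built-in "Mandrill" test image as input:

```wolfram
(* Grayscale stand-in input. *)
gray = ColorConvert[ExampleData[{"TestImage", "Mandrill"}], "Grayscale"];

(* Radon transform (sinogram) of the image. *)
sinogram = Radon[gray];

(* Approximate reconstruction from the sinogram. *)
InverseRadon[sinogram]

(* Squared magnitude of the discrete Fourier transform, as an image. *)
ImagePeriodogram[gray]
```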