Computer Vision
Using a variety of state-of-the-art methods, the Wolfram Language provides built-in functions for image identification, object detection and recognition, and feature extraction. It supports specific geometric features such as edges and corners, as well as general keypoints that can be used to register and compare images.
Image Classification
ImageIdentify, ImageInstanceQ — identify what an image is of (out of 10,000+ objects)
Classify ▪ "CountryFlag" ▪ "NotablePerson" ▪ ...
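As a minimal sketch of how these functions might be combined, the following identifies a bundled test image and then checks the image against the returned category. The choice of the "Mandrill" test image is an assumption made for illustration; any Image can be substituted. The identification functions may need to download a pretrained model on first use.

```wolfram
(* A standard test image; any Image works here *)
img = ExampleData[{"TestImage", "Mandrill"}];

(* Most likely category, returned as an Entity *)
top = ImageIdentify[img]

(* The three most likely categories *)
ImageIdentify[img, All, 3]

(* Check the image against the category just returned *)
ImageInstanceQ[img, top]
```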
Object Detection & Recognition
ImageCases — detect and recognize subimages of object categories
ImageContents ▪ ImagePosition ▪ ImageBoundingBoxes ▪ ImageContainsQ ▪ TrainImageContentDetector
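The detection functions can be called with only an image, in which case they look for any object category they know about. The sketch below assumes the "Peppers" test image purely as sample input; what is detected depends on the image and on the pretrained detector.

```wolfram
(* Sample input; substitute any photograph *)
img = ExampleData[{"TestImage", "Peppers"}];

(* Tabular summary of detected objects *)
ImageContents[img]

(* Bounding boxes grouped by detected category *)
boxes = ImageBoundingBoxes[img]

(* Detected subimages together with their categories *)
ImageCases[img]
```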
Image Synthesis & Effects
ImageSynthesize — generate an image from a textual or visual prompt
ImageRestyle — image style transfer
Face Analysis
FindFaces — find human faces in an image
FacialFeatures ▪ FaceRecognize ▪ FaceAlign
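A short sketch of face detection follows. The test image name "Lena" is an assumption about what ExampleData provides in a given version; any photograph containing faces, for instance one loaded with Import, can be used instead.

```wolfram
(* Assumed sample image; replace with any photo that contains faces *)
img = ExampleData[{"TestImage", "Lena"}];

(* Bounding boxes of detected faces, as a list of Rectangle objects *)
faces = FindFaces[img]

(* Overlay the detected boxes on the image *)
HighlightImage[img, faces]

(* Estimated attributes of each detected face *)
FacialFeatures[img]
```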
Optical Character Recognition
TextRecognize — extract characters from an image
BarcodeRecognize ▪ FindImageText
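The recognition functions can be tried without external files by generating the input images first. In this sketch the sample strings "Computer Vision" and "Wolfram" are arbitrary choices, and Rasterize and BarcodeImage are used only to produce test input.

```wolfram
(* Render some text to an image, then read it back *)
textImage = Rasterize[Style["Computer Vision", 48]];
TextRecognize[textImage]

(* Generate a QR code, then decode it *)
code = BarcodeImage["Wolfram", "QR"];
decoded = BarcodeRecognize[code]
```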
Machine Learning »
Classify — perform classification on a collection of images
FeatureSpacePlot ▪ FeatureSpacePlot3D ▪ FeatureExtractor ▪ Nearest ▪ ...
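As an illustration of the machine learning functions applied to images, this sketch plots a handful of bundled test images in a learned feature space. Taking the first eight test images is an arbitrary choice.

```wolfram
(* ExampleData["TestImage"] lists the available test image specifications *)
specs = Take[ExampleData["TestImage"], 8];
imgs = ExampleData /@ specs;

(* Place visually similar images near each other in the plane *)
FeatureSpacePlot[imgs]
```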
Neural Networks »
NetModel — use pretrained nets for image analysis
NetEncoder ▪ "Image" ▪ "Image3D" ▪ "VideoFrames"
NetChain ▪ NetGraph ▪ NetTrain ▪ ConvolutionLayer ▪ ...
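The following sketch shows how an image encoder might be attached to a small convolutional net. The layer sizes, the 28×28 grayscale input, and the 10 output classes are illustrative assumptions; the net is only randomly initialized, so its output carries no meaning until it is trained with NetTrain.

```wolfram
(* A small, untrained convolutional classifier with an image encoder *)
net = NetInitialize @ NetChain[
    {ConvolutionLayer[8, 3], Ramp, PoolingLayer[2], FlattenLayer[],
     LinearLayer[10], SoftmaxLayer[]},
    "Input" -> NetEncoder[{"Image", {28, 28}, "ColorSpace" -> "Grayscale"}]];

(* Apply the net directly to an image; the encoder handles conversion *)
probs = net[RandomImage[1, {28, 28}]]
```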
Feature Detection »
ImageGraphics — convert an image to scalable graphics
RemoveBackground — separate the foreground from the background and remove the background
ImageKeypoints ▪ ImageCorners ▪ ImageLines ▪ ImageMesh ▪ ImageSaliencyFilter ▪ ...
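A minimal sketch of feature detection, assuming the "Peppers" test image as input and an arbitrary cap of 50 features per detector:

```wolfram
img = ExampleData[{"TestImage", "Peppers"}];

(* Positions of the strongest corners and keypoints *)
corners = ImageCorners[img, MaxFeatures -> 50];
keypoints = ImageKeypoints[img, MaxFeatures -> 50];

(* Overlay the detected positions on the image *)
HighlightImage[img, corners]
HighlightImage[img, keypoints]
```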
Feature Tracking
ImageCorrespondingPoints — find corresponding keypoints in pairs of images
ImageFeatureTrack ▪ ImageDisplacements
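To try keypoint matching without a second photograph, one option is to match an image against a rotated copy of itself. The 10-degree rotation here is an arbitrary choice.

```wolfram
img1 = ExampleData[{"TestImage", "Mandrill"}];
img2 = ImageRotate[img1, 10 Degree];

(* Two lists of matching positions, one list per image *)
{pts1, pts2} = ImageCorrespondingPoints[img1, img2];

(* Number of matched pairs *)
Length[pts1]
```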
Image Registration »
ImageAlign — align or register two images
ImageTransformation ▪ FindGeometricTransform ▪ ...
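Registration can be sketched in two ways: directly with ImageAlign, or by estimating a transformation from matched keypoints. The rotated copy used as the second image is an assumption made so that the example is self-contained.

```wolfram
ref = ExampleData[{"TestImage", "Mandrill"}];
moved = ImageRotate[ref, 5 Degree];

(* Register the second image to the reference *)
aligned = ImageAlign[ref, moved];

(* Alternatively, estimate the transformation from matched points *)
{pts1, pts2} = ImageCorrespondingPoints[ref, moved];
{error, transform} = FindGeometricTransform[pts1, pts2];
```

FindGeometricTransform returns the alignment error together with a transformation function that maps the second set of points onto the first.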
Image Transforms
Radon, InverseRadon — Radon and inverse Radon transforms
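A brief round-trip sketch, assuming a grayscale version of the "Mandrill" test image as input. The reconstruction is approximate, so it should not be expected to match the original pixel for pixel.

```wolfram
img = ColorConvert[ExampleData[{"TestImage", "Mandrill"}], "Grayscale"];

(* Forward transform: the sinogram of the image *)
sinogram = Radon[img];

(* Approximate reconstruction from the sinogram *)
reconstruction = InverseRadon[sinogram];
```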