Recognition, Object Detection, and Semantic Segmentation

Recognition, classification, semantic image segmentation, instance segmentation, object detection using features, and deep learning object detection using CNNs, YOLO, and SSD

Computer Vision Toolbox™ supports several approaches for image classification, object detection, semantic segmentation, instance segmentation, and recognition, including:

Deep learning and convolutional neural networks (CNNs)
Bag of features
Template matching
Blob analysis
Viola-Jones algorithm

A CNN is a popular deep learning architecture that automatically learns useful feature representations directly from image data. Bag of features encodes image features into a compact representation suitable for image classification and image retrieval. Template matching uses a small image, or template, to find matching regions in a larger image. Blob analysis uses segmentation and blob properties to identify objects of interest. The Viola-Jones algorithm uses Haar-like features and a cascade of classifiers to identify objects, including faces, noses, and eyes. You can train this classifier to recognize other objects.

Highlighted Topics

Featured Examples

Multiclass Object Detection Using YOLO v2 Deep Learning

Train a YOLO v2 multiclass object detector and evaluate object detector performance across selected classes and overlap thresholds.

Since R2024b
Open Live Script

Semantic Segmentation Using Deep Learning

Segment an image using a semantic segmentation network.

Open Live Script

Perform Instance Segmentation Using Mask R-CNN

Segment individual instances of people and cars using a multiclass mask region-based convolutional neural network (R-CNN).

Open Live Script

Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning

Perform six degrees-of-freedom (6-DoF) pose estimation by estimating the 3-D position and orientation of machine parts in a bin using RGB-D images and a deep learning network.