Vision
Vision
Vision
-Messar Aya
-Chouchaoui Med El Bachir
-Benamara Ichrak
Introduction
2
Object Detection
it identifies and locates objects within an image. It not only
recognizes what the objects are but also determines their
positions by drawing bounding boxes around them.
Basic Process
4
Manual Feature Extraction Automatic Feature Extraction
5
Image Segmentation
a method of processing and analyzing a digital image by dividing it into multiple
parts or segments, so you can process only the important elements instead of
processing the entire picture.
Pixel-Level Definition
6
Types
Semantic segmentation
labels each pixel of an
image with a class
instance segmentation
distinguishes between
individual instances of
objects within those
classes
panoptic segmentation
combines both by
labeling all pixels while
also identifying individual
object instances.
7
Object Detection vs. Classification & Segmentation
8
Pixel-wise
Out Single label or Bounding boxes around
segmentation
put category detected objects
masks
9
situations for segmentation
Medical Imaging: Precise identification of Autonomous Driving: Recognizing and
tissues, organs, or abnormalities within distinguishing between lanes, pedestrians, and
scans. It’s essential for accurate diagnosis obstacles at a pixel level. Critical for safe
and treatment planning. navigation.
10
situations for object detection
Retail and Inventory Management:
Automated systems to count and identify
products
11
Models of Object Detection
R-CNN (Region-based Convolutional Neural Networks)
R-CNN: Main Modules
1. Selective Search
Generates 2,000 region proposals
2. Resizing and Feature Extraction
Regions resized to a fixed pre-defined size
Extracts a feature vector from each region proposal
3. Classification with Pre-trained SVM
Classifies region proposals as background or specific object classes
13
YOLO (You Only Look Once)
it performs real-time detection ,It is famous because it revolutionized object detection
by achieving high-speed, real-time performance without sacrificing much accuracy
key concepts :
Grid cells
The concept of breaking down the images to grid
cells is unique in YOLO
14
Models of Segmentations
a convolutional network designed for biomedical image
U-NET segmentation
15
16
18
Video Surveillance (Security):
Identifying multiple people or objects in real-time video streams for anomaly detection, facial
recognition, or crowd monitoring.
19
Medical Imaging (Healthcare):
Segmenting organs or identifying anomalies (e.g., tumors) within medical images to assist in diagnosis.
20
Challenges of Detecting Multiple Objects in a Single Image
21
Challenges Related to Multiple Object Detection
Occlusion: Objects can overlap or be partially hidden, making their detection difficult.
Object Variability: Objects can vary in size, shape, color, and texture, complicating their identification.
Lighting Conditions: Variations in lighting can affect object visibility, making detection less reliable.
Processing Speed: In real-time applications like autonomous driving, detection speed is crucial.
Balanced Data: Models may be biased if training data does not fairly represent all object classes.
4o
22
23