Introduction to
Face Processing with Computer Vision
Gabriel Bianconi
Founder, Scalar Research
AI & Data Science Consulting Firm
Previously at the Stanford AI Lab
Agenda
• Theory
• Detection
• Recognition
• Other Tasks
• Practice
• Rapid Prototyping
• Scaling
Theory
Face Detection
Haar-Like Features
• Summarize image based on simple color patterns
• Manually determined feature extractors (kernels)
• Leveraged for first real-time face detector (2001)
Ref: Viola & Jones (2001). Image: Wikimedia
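The rectangle sums behind Haar-like features can be computed in constant time with an integral image. A minimal numpy sketch of a two-rectangle feature (all names here are illustrative, not from the original paper's code):

```python
import numpy as np

def integral_image(img):
    """Cumulative sum over both axes, so any rectangle sum is O(1)."""
    return img.cumsum(axis=0).cumsum(axis=1)

def rect_sum(ii, r0, c0, r1, c1):
    """Sum of img[r0:r1, c0:c1], read off the integral image ii."""
    total = ii[r1 - 1, c1 - 1]
    if r0 > 0:
        total -= ii[r0 - 1, c1 - 1]
    if c0 > 0:
        total -= ii[r1 - 1, c0 - 1]
    if r0 > 0 and c0 > 0:
        total += ii[r0 - 1, c0 - 1]
    return total

def two_rect_feature(img, r0, c0, h, w):
    """Left-minus-right two-rectangle Haar-like feature."""
    ii = integral_image(img.astype(np.int64))
    left = rect_sum(ii, r0, c0, r0 + h, c0 + w // 2)
    right = rect_sum(ii, r0, c0 + w // 2, r0 + h, c0 + w)
    return left - right

# A patch that is dark on the left, bright on the right
patch = np.zeros((4, 4), dtype=np.uint8)
patch[:, 2:] = 255
print(two_rect_feature(patch, 0, 0, 4, 4))  # -> -2040
```

A detector like Viola-Jones evaluates thousands of such features per window; the integral image is what makes that real-time.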
Histogram of Oriented Gradients (HOG)
• Summarize image by distribution of color gradients
• Gradient intensities and orientations represent edges, etc.
• Captures more information than simple Haar-like features
Ref: Shu et al. (2011)
Ref: Rojas et al. (2011)
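The core of HOG, a magnitude-weighted histogram of gradient orientations for one cell, can be sketched in a few lines of numpy (a simplification: full HOG also normalizes over blocks of neighboring cells):

```python
import numpy as np

def hog_cell_histogram(cell, n_bins=9):
    """Gradient-orientation histogram for one cell, weighted by magnitude."""
    gy, gx = np.gradient(cell.astype(np.float64))
    magnitude = np.hypot(gx, gy)
    # Unsigned orientation in [0, 180) degrees, as in Dalal & Triggs
    orientation = np.rad2deg(np.arctan2(gy, gx)) % 180.0
    hist, _ = np.histogram(orientation, bins=n_bins,
                           range=(0, 180), weights=magnitude)
    return hist

# A vertical edge: gradients point horizontally, so mass lands in the 0-degree bin
cell = np.tile([0, 0, 0, 0, 255, 255, 255, 255], (8, 1))
hist = hog_cell_histogram(cell)
print(hist.argmax())  # -> 0
```

Concatenating such histograms over a grid of cells yields the HOG descriptor used by classic sliding-window face detectors.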
R-CNN
• Introduces CNNs for object detection
• CNNs learn how to extract features from data
• Breakthrough in performance
• Beats previous SOTA methods by huge margin
• However, detection is extremely slow
Ref: Girshick et al. (2014).
CNN Features
Ref: Lee et al. (2009).
R-CNN
Ref: Girshick et al. (2014).
Fast R-CNN
• Improvement to R-CNN that leverages CNN for
classification and regression
• Aside from region proposals, the system is now end-to-end, vs. three components trained greedily
• Predictions are 200x+ faster, with better performance
• Region proposals are still a bottleneck; total inference time is ~2s
Ref: Girshick (2015).
Fast R-CNN
Ref: Girshick (2015).
Faster R-CNN
• Leverages CNN for region proposals as well
• “Region Proposal Network”
• Finally an end-to-end system with deep learning
• About 10x faster than Fast R-CNN, with better performance
• Total inference time is ~0.2s
Ref: Ren et al. (2016).
Faster R-CNN
Ref: Ren et al. (2016).
MTCNN
• Many models for face detection draw heavily from generalized object detection methods
• MTCNN, for example, trains a multi-task system for joint detection and alignment
Ref: Zhang et al. (2016).
MTCNN
Ref: Zhang et al. (2016).
RetinaFace
• The current SOTA method combines many
techniques such as multi-task learning
• R-CNN family uses a two-stage approach
(proposals → refinement)
• RetinaFace uses a single-stage approach (faster,
higher recall, more false positives)
Ref: Deng et al. (2019).
RetinaFace
Ref: Deng et al. (2019).
Are we there yet?
• WIDER Face (Easy): ~97% AP
• WIDER Face (Medium): ~96% AP
• WIDER Face (Hard): ~92% AP
Ref: Yang et al. (2016).
Facial Recognition
Facial Recognition
• Facial recognition actually corresponds to a group of different tasks
• Verification vs. Identification vs. Grouping vs. …
• Closed-Set vs. Open-Set
Closed-Set Recognition
• Every identity appears in training set
• Example: recognizing celebrities
• Effectively a classification problem
• Model aims to learn separable features
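A minimal numpy sketch of the closed-set view: a hypothetical linear classifier over face features produces one softmax confidence per known identity (the weights and features here are random stand-ins for a trained model):

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

rng = np.random.default_rng(0)
features = rng.normal(size=4)       # stand-in for CNN face features
W = rng.normal(size=(3, 4))         # one row per known identity (closed set)
confidences = softmax(W @ features) # one confidence per identity

print(round(confidences.sum(), 6))  # -> 1.0
print(confidences.argmax())         # index of the predicted identity
```

Because every identity is in the training set, the model only needs separable features, not a general-purpose similarity metric.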
Closed-Set Identification
[Diagram: Test Sample → Model → Label Confidences (Label 0, Label 1, …)]
Images: Wikimedia
Closed-Set Verification
[Diagram: Test Sample A, Test Sample B → Model → Label Confidences for each]
Images: Wikimedia
Open-Set Recognition
• Not every identity appears in training set
• Example: Facebook Photos
• Effectively a metric learning problem
• Model aims to learn large-margin features (embeddings)
Embeddings
• Map each sample to a vector (coordinate system)
• Used for words, graphs, faces, etc.
• Embeddings preserve similarity
• Similar samples close to each other
• Dissimilar samples far from each other
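A toy numpy sketch of that idea: identify a query by its nearest embedding. The embeddings and names are made up for illustration; real face embeddings are typically 128-D or larger.

```python
import numpy as np

# Toy 4-D embeddings (real face embeddings are typically 128-512-D)
known = {
    "alice": np.array([0.9, 0.1, 0.0, 0.2]),
    "bob":   np.array([0.0, 0.8, 0.7, 0.1]),
}

query = np.array([0.85, 0.15, 0.05, 0.25])  # another photo of "alice"

# Similar samples are close: identify by smallest L2 distance
distances = {name: np.linalg.norm(query - emb) for name, emb in known.items()}
best = min(distances, key=distances.get)
print(best)  # -> alice
```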
Images: Wikimedia
Embeddings
• “Similar” depends on the training data
• Same person, physical characteristic, etc.
• Embeddings represent latent information
• High-dimensional embeddings trained on large datasets
learn to represent latent information about the person (e.g.
physical characteristics)
Open-Set Identification
[Diagram: Test Sample → Model → Embedding, compared by distance to known embeddings (Emb. 0, Emb. 1, Emb. 2, …)]
Images: Wikimedia
Open-Set Verification
[Diagram: Test Sample A → Model → Embedding A; Test Sample B → Model → Embedding B; distance compared vs. threshold]
Images: Wikimedia
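A minimal sketch of open-set verification, assuming L2 distance and a fixed threshold (0.6 happens to be the default tolerance in the face_recognition library; the embeddings here are toy values):

```python
import numpy as np

def verify(emb_a, emb_b, threshold=0.6):
    """Same identity iff the embedding distance is below the threshold."""
    dist = np.linalg.norm(np.asarray(emb_a) - np.asarray(emb_b))
    return bool(dist < threshold)

same = verify([0.1, 0.9, 0.2], [0.15, 0.85, 0.25])  # close pair
diff = verify([0.1, 0.9, 0.2], [0.9, 0.1, 0.8])     # far pair
print(same, diff)  # -> True False
```

In practice the threshold is tuned on a validation set to trade off false accepts against false rejects.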
Metric Learning
Ref: Liu et al. (2018)
Are we there yet?
LFW (Labeled Faces in the Wild)
Verification
99.85%+ accuracy
Ref: Yan et al. (2019); Learned-Miller et al. (2016)
Cross-Factor
Facial Recognition
42
Cross-Age
Ref: Zheng et al. (2017)
Cross-Pose
Ref: Li et al. (2011)
Cross-Makeup
Ref: Chen et al. (2013)
Further Research
Security
• How do we deal with adversarial users?
• Real face goes undetected or misclassified
• Fake face gets recognized
• Private data is extracted from model
•…
Security
Ref: Grigory Bakunov (2017)
Biometrics & Multi-Modal Data
• How do we deal with…
• Identical twins?
• Plastic surgery?
• ...
Ref: Singh et al. (2010)
Biometrics & Multi-Modal Data
• Combine with other biometric data
• Biometric traits (e.g. hand)
• Multiple sensors (e.g. 2D + 3D)
• Multiple pictures (e.g. viewpoints, sequences)
•…
Ref: Singh et al. (2010); Ross & Jain (2004); Ross & Govindarajan (2005)
Ref: Apple
Privacy
• How do we deal with…
• Models that can predict gender, race, …?
• Models that leak the data?
• Predictions without sharing the raw data?
•…
Ref: Singh et al. (2010)
Other Tasks
Alignment & Pose Estimation
Ref: Ruiz et al. (2018)
Face Landmarks
Classification
[Example images labeled: Neutral, Happy, Happy]
3D Reconstruction
Ref: Sela et al. (2017)
Practice
Rapid Prototyping
Dozens of Tools
[Chart: face-processing tools plotted by simplicity vs. accuracy, e.g. RetinaFace, InsightFace, MTCNN, FaceNet, OpenCV, face_recognition, …]
APIs
• There are dozens of APIs providing low-cost face
processing at scale
• Most services charge less than $1 per 1000 images
• Depending on the use case, might be cheaper than provisioning GPUs
and deploying your own models (esp. if considering developer time)
• Often these APIs can achieve performance that’s
close to state-of-the-art
APIs – Example: Azure
• Detection
• Classification
• Gender, age, emotion, hair, smile, eyes, glasses, makeup, …
• Landmarks
• Pose Estimation
• Recognition
• Verification, identification, grouping, similarity search, …
Embeddings
• Face embeddings are typically used for open-set
recognition systems
• They can be leveraged to quickly train models for
downstream tasks (e.g. classification)
• Tools
• face_recognition (GitHub): extremely fast, reliable for frontal faces
• FaceNet: based on deep learning, strong across the board
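As a sketch of such a downstream task, a nearest-centroid classifier over toy embeddings (the embeddings and labels are illustrative; in practice the inputs would be 128-D+ face encodings):

```python
import numpy as np

def nearest_centroid(train_embs, train_labels, query):
    """Classify a query embedding by its nearest per-label centroid."""
    labels = sorted(set(train_labels))
    centroids = {
        l: np.mean([e for e, y in zip(train_embs, train_labels) if y == l], axis=0)
        for l in labels
    }
    return min(labels, key=lambda l: np.linalg.norm(query - centroids[l]))

# Toy 2-D embeddings standing in for face encodings
embs = [np.array([0.1, 0.9]), np.array([0.2, 0.8]),
        np.array([0.9, 0.1]), np.array([0.8, 0.2])]
labels = ["smiling", "smiling", "neutral", "neutral"]
print(nearest_centroid(embs, labels, np.array([0.15, 0.85])))  # -> smiling
```

Because the embedding already encodes most of the useful structure, even very simple downstream models like this can work with little labeled data.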
Example – Facebook Photos
• Task: open-set face identification
• Strategy:
1. Detect faces and compute embeddings for known photos
of users; store for future use.
2. Whenever a photo is uploaded, do the same and compare
against known set.
Example – Detection
import face_recognition as fr
image = fr.load_image_file("file.jpg")
face_locations = fr.face_locations(image)
Ref: github.com/ageitgey/face_recognition
Example – Embedding
image = fr.load_image_file("file.jpg")
face_embedding = fr.face_encodings(image)[0]
Ref: github.com/ageitgey/face_recognition
Example – L2 Distance
Pairwise L2 distances between the four face images on the slide (labeled A-D here):

      A     B     C     D
 A    -   0.31  0.59  0.69
 B  0.31    -   0.52  0.63
 C  0.59  0.52    -   0.50
 D  0.69  0.63  0.50    -

Images: Wikimedia
Face Landmarks
• Face landmarks can also be quickly extracted with
pretrained models and used for a number of
downstream tasks.
Example – Face Landmarks
face_landmarks = fr.face_landmarks(image)[0]
print(face_landmarks.keys())
# left_eyebrow, right_eyebrow, bottom_lip, top_lip, …
Ref: github.com/ageitgey/face_recognition
Example – Snapchat Filters
• Task: face manipulation
• Strategy:
1. Detect face and localize landmarks in image
2. Add objects, reshape image, etc. based on landmarks
Example – Snapchat Filters
from PIL import Image, ImageDraw
…
pil_image = Image.fromarray(image)
d = ImageDraw.Draw(pil_image, 'RGBA')

lip_fill = (150, 0, 0, 128)  # shade of red, 50% alpha
d.polygon(face_landmarks['top_lip'], fill=lip_fill)
d.polygon(face_landmarks['bottom_lip'], fill=lip_fill)
Scaling
Bias
• People & Demographics
• Is your training set… Coworkers? Single location?
• Environment
• Does it cover… Day and night? Seasons? Lighting
conditions? Backgrounds?
• Sensors
• Did you consider… Diverse hardware? Calibration?
Viewpoint (angle)? Resolution? Occlusion?
Optimizations
• It is often easier to simplify the real-world task than to drastically improve ML models
Optimizations
Multiple model optimizations ($$$ in developer time, etc.)
[Chart: performance vs. time in weeks]
Optimizations
Install a new light ($)
[Chart: performance vs. time in weeks]
Risks
• What happens when your model makes a mistake?
• How can you deal with adversarial users?
• What is your threat model?
Other Considerations
• How do you handle…
• Model getting stale over time?
• Growing search space?
• Large amounts of real-time data?
• Detecting or tracking people vs. faces?
• Speed vs. cost vs. performance trade-offs?
Thank you.
gabriel@scalarresearch.com