VGG Object Category Detection
The goal of object category detection is to identify and localize objects of a given type in an image. Example
applications include detecting pedestrians, cars, or traffic signs in street scenes, objects of interest such as tools
or animals in web images, or particular features in medical images. Given a target class, such as people, a
detector receives as input an image and produces as output zero, one, or more bounding boxes around each
occurrence of the object class in the image. The key challenge is that the detector must find objects
regardless of their location and scale in the image, as well as of their pose and other factors of variation, such as
clothing, illumination, and occlusion.
This practical explores basic techniques in visual object detection, focusing on image based models. The
appearance of image patches containing objects is learned using statistical analysis. Then, in order to detect
objects in an image, the statistical model is applied to image windows extracted at all possible scales and
locations, in order to identify which ones, if any, contain the object.
In more detail, the practical explores the following topics: (i) using HOG features to describe image regions; (ii)
building a HOG-based sliding-window detector to localize objects in images; (iii) working with multiple scales and
multiple object occurrences; (iv) using a linear support vector machine to learn the appearance of objects; (v)
evaluating an object detector in terms of average precision; (vi) learning an object detector using hard negative
mining.
Getting started
Read and understand the requirements and installation instructions. The download links for this practical are:
After the installation is complete, open and edit the script exercise1.m in the MATLAB editor. The script
contains commented code and a description for all steps of this exercise, relative to Part I of this document. You
can cut and paste this code into the MATLAB window to run it, and will need to modify it as you go through the
session. Other files exercise2.m, exercise3.m, and exercise4.m are given for Parts II, III, and IV.
Each part contains several Questions and Tasks to be answered/completed before proceeding further in the
practical.
In this part we will build a basic sliding-window object detector based on HOG features. Follow the steps below:
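The data-loading code is not reproduced on this page; a minimal sketch, assuming a setup script and a loadData function that populate variables such as trainImages, trainBoxes, and trainBoxPatches (and their test counterparts), might look like this:

setup ;                          % add VLFeat and the practical code to the MATLAB path
targetClass = 'mandatory' ;      % an example class (the same one used in Part II below)
loadData(targetClass) ;          % populates trainImages, trainBoxes, trainBoxPatches, ...

% Visualize the average of the positive training patches
figure(1) ; clf ;
imagesc(mean(trainBoxPatches, 4)) ;
axis equal ; axis off ;
title('average of positive patches') ;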
An analogous set of variables testImages, testBoxes, and so on is provided for the test data. Familiarise
yourself with the contents of these variables.
Question: what can you deduce about the object variability from the average image?
Question: most boxes extend slightly around the object extent. Why do you think this may be valuable in
learning a detector?
hogCellSize = 8 ;
trainHog = {} ;
for i = 1:size(trainBoxPatches,4)
trainHog{i} = vl_hog(trainBoxPatches(:,:,:,i), hogCellSize) ;
end
trainHog = cat(4, trainHog{:}) ;
HOG is computed by the VLFeat function vl_hog (doc). This function takes as a parameter the size in pixels of
each HOG cell hogCellSize. It also takes an RGB image, represented in MATLAB as a w × h × 3 array
(extracted as a slice of trainBoxPatches). The output is a w/hogCellSize × h/hogCellSize × 31
dimensional array. One such array is extracted for each example image, and eventually these are concatenated
into a 4D array along the fourth dimension.
A basic (naive) model of the object can be obtained by simply averaging the HOG arrays of the positive examples:

w = mean(trainHog, 4) ;

The model can be visualized by rendering w as if it were a HOG feature array. This can be done using the
render option of vl_hog:
figure(2) ; clf ;
imagesc(vl_hog('render', w)) ;
Spend some time to study this plot and make sure you understand what is visualized.
im = imread('data/signs-sample-image.jpg') ;
im = im2single(im) ;
hog = vl_hog(im, hogCellSize) ;
scores = vl_nnconv(hog, w, []) ;
The first two lines read a sample image and convert it to single format. The third line computes the HOG features
of the image using the vl_hog function seen above. The fourth line convolves the HOG map hog with the model w.
It uses the function vl_nnconv [1] and returns a map of scores.
Task: Work out the dimensions of the scores array. Then check your result against the dimensions of the
array computed by MATLAB.
Question: Visualize the image im and the scores array using the provided example code. Does the
result match your expectations?
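The example code itself is not shown on this page; one possible visualization, together with the computation of the bestIndex referred to next (an assumed sketch), is:

% Show the input image and the map of detector scores side by side
figure(3) ; clf ;
subplot(1,2,1) ; imagesc(im) ; axis equal ; axis off ; title('image') ;
subplot(1,2,2) ; imagesc(scores) ; axis equal ; axis off ; title('detector scores') ;

% Find the highest-scoring location in the score map
[best, bestIndex] = max(scores(:)) ;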
Note that bestIndex is a linear index in the range [1, M], where M is the number of possible filter locations.
We convert this into a subscript (hx, hy) using MATLAB's ind2sub function:
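The conversion is a single line (a sketch; recall that the first output of ind2sub indexes rows, i.e. the vertical direction):

[hy, hx] = ind2sub(size(scores), bestIndex) ;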
(hx , hy ) are in units of HOG cells. We convert this into pixel coordinates as follows:
x = (hx - 1) * hogCellSize + 1 ;
y = (hy - 1) * hogCellSize + 1 ;
Question: Why do we subtract 1 and then add 1? Which pixel (x, y) of the HOG cell (hx, hy) is
found?
The size of the model template in HOG cells can be computed in several ways; one is simply:
modelWidth = size(trainHog, 2) ;
modelHeight = size(trainHog, 1) ;
detection = [
x - 0.5 ;
y - 0.5 ;
x + hogCellSize * modelWidth - 0.5 ;
y + hogCellSize * modelHeight - 0.5 ;] ;
Note: the bounding box encloses exactly all the pixels of the HOG template. In MATLAB, pixel centers have
integer coordinates and pixel borders are at a distance of ±1/2.
Question: Use the example code to plot the image and overlay the bounding box of the detected object. Did
it work as expected?
setup ;
targetClass = 'mandatory' ;
loadData(targetClass) ;
The mandatory target class is simply the union of all mandatory traffic signs.
Given the model w , as determined in Part I, we use the function detectAtMultipleScales in order to
search for the object at multiple scales:
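The exact call is given in the example script; an assumed sketch of its form (the signature of detectAtMultipleScales may differ):

% scales is a vector of image rescaling factors defined in the example script
figure(1) ; clf ;
detection = detectAtMultipleScales(im, w, hogCellSize, scales) ;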
Note that the function generates a figure as it runs, so prepare a new figure before running it using the figure
command if you do not want your current figure to be deleted.
Question: Open and study the detectAtMultipleScales function. Convince yourself that it is the same
code as before, but applied after rescaling the image a number of times.
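As a rough illustration only (not the actual implementation in detectAtMultipleScales.m), the multi-scale search amounts to something like:

bestScore = -inf ;
for s = scales
  ims = imresize(im, 1/s) ;           % rescale the image by the factor 1/s
  hog = vl_hog(ims, hogCellSize) ;    % recompute HOG features at this scale
  scores = vl_nnconv(hog, w, []) ;    % evaluate the template as in Part I
  [score, index] = max(scores(:)) ;
  if score > bestScore                % keep the best response across scales;
    bestScore = score ;               % its box coordinates must be multiplied
    bestScale = s ;                   % by s to map them back to the
    bestIndex = index ;               % original image
  end
end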
Question: Visualize the resulting detection using the supplied example code. Did it work? If not, can you
make sense of the errors?
Question: Look at the array of score maps generated by detectAtMultipleScales using the
example code. Do they make sense? Is there anything wrong?
In order to collect negative examples (features extracted from non-object patches), we loop through a number
of training images and sample patches uniformly:
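The sampling code is in the example script (see the task below); a rough sketch of the idea, assuming trainImages holds the training image file names and sampling ten windows per image, is:

neg = {} ;
modelWidth = size(trainHog, 2) ;
modelHeight = size(trainHog, 1) ;
for t = 1:numel(trainImages)
  t_im = im2single(imread(trainImages{t})) ;   % may require prepending the data directory
  t_hog = vl_hog(t_im, hogCellSize) ;
  % sample a few windows of the same size as the model at random positions
  for j = 1:10
    hx = randi(size(t_hog,2) - modelWidth + 1) ;
    hy = randi(size(t_hog,1) - modelHeight + 1) ;
    neg{end+1} = t_hog(hy:hy+modelHeight-1, hx:hx+modelWidth-1, :) ;
  end
end
neg = cat(4, neg{:}) ;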
Task: Identify the code that extracts these patches in example2.m and make sure you understand it.
% Pack the data into a matrix with one datum per column
x = cat(4, pos, neg) ;
x = reshape(x, [], numPos + numNeg) ;
We also need a vector of binary labels, +1 for positive points and -1 for negative ones:
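A minimal sketch of this label vector:

y = [ones(1, size(pos,4)) -ones(1, size(neg,4))] ;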
Finally, we need to set the parameter λ of the SVM solver. For reasons that will become clearer later, we use
instead the equivalent C parameter:
numPos = size(pos,4) ;
numNeg = size(neg,4) ;
C = 10 ;
lambda = 1 / (C * (numPos + numNeg)) ;
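The training call is in the example script; a sketch using VLFeat's vl_svmtrain, with the packed data x and labels y defined above, might be:

% Learn the SVM and reshape the weight vector back into a HOG template
[w, bias] = vl_svmtrain(x, y, lambda) ;
w = single(reshape(w, modelHeight, modelWidth, [])) ;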
Question: Visualize the learned model w using the supplied code. Does it differ from the naive model
learned before? How?
Question: Does the learned model perform better than the naive average?
Task: Try different images. Does this detector work all the times? If not, what types of mistakes do you see?
Are these mistakes reasonable?
% Compute detections
[detections, scores] = detect(im, w, hogCellSize, scales) ;
Task: Open and study detect.m . Make sure that you understand how it works.
Question: Why do we want to return so many responses? In practice, it is unlikely that any given image
contains more than a handful of object occurrences...
A single object occurrence generates multiple detector responses at nearby image locations and scales. In order
to eliminate these redundant detections, we use a non-maximum suppression algorithm. This is implemented by
the boxsuppress.m MATLAB m-file. The algorithm is simple: start from the highest-scoring detection, then
remove any other detection whose overlap with it (as defined below) is greater than a threshold. The function
returns a boolean vector keep of detections to preserve:
% Non-maximum suppression
keep = boxsuppress(detections, scores, 0.25) ;
For efficiency, after non-maximum suppression we keep just ten responses (as we do not expect more than a
few objects in any image):
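The corresponding code is in the example script; a sketch of this step:

% Discard suppressed detections, then retain at most the ten best
detections = detections(:, keep) ;
scores = scores(keep) ;
[~, order] = sort(scores, 'descend') ;
order = order(1:min(10, numel(order))) ;
detections = detections(:, order) ;
scores = scores(order) ;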
Detections are assessed in a manner similar to the PASCAL VOC challenge, by matching candidate detections to ground truth objects as follows:

1. Assign each candidate detection (bᵢ, sᵢ) a true or false label yᵢ ∈ {+1, −1}. To do so:
   1. The candidate detections (bᵢ, sᵢ) are sorted by decreasing score sᵢ.
   2. For each candidate detection in order:
      a. If there is a matching ground truth detection gⱼ (overlap(bᵢ, gⱼ) larger than 50%), the candidate detection is considered positive (yᵢ = +1). Furthermore, the ground truth detection is removed from the list and not considered further.
      b. Otherwise, the candidate detection is negative (yᵢ = −1).
2. Add each ground truth object gⱼ that is still unassigned to the list of candidates as the pair (gⱼ, −∞) with label yⱼ = +1.
The overlap metric used to compare a candidate detection to a ground truth bounding box is defined as the ratio
of the area of the intersection to the area of the union of the two bounding boxes:

overlap(A, B) = |A ∩ B| / |A ∪ B|.
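As an illustration, the overlap of two boxes A and B stored in the [xmin ymin xmax ymax] format used above can be computed as:

% Intersection-over-union of two axis-aligned boxes A and B
iw = max(0, min(A(3), B(3)) - max(A(1), B(1))) ;   % width of the intersection
ih = max(0, min(A(4), B(4)) - max(A(2), B(2))) ;   % height of the intersection
interArea = iw * ih ;
unionArea = (A(3)-A(1)) * (A(4)-A(2)) + (B(3)-B(1)) * (B(4)-B(2)) - interArea ;
overlap = interArea / unionArea ;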
Questions:
In order to apply this algorithm, we first need to find the ground truth bounding boxes in the test image:
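A sketch of this step, assuming the first test image is being evaluated and that testBoxImages stores, for each box, the image it belongs to:

% Collect the ground truth boxes belonging to this test image
s = find(strcmp(testImages{1}, testBoxImages)) ;
gtBoxes = testBoxes(:, s) ;
gtDifficult = false(1, numel(s)) ;   % no occurrence is marked as difficult here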
% PASCAL-like evaluation
matches = evalDetections(...
gtBoxes, gtDifficult, ...
detections, scores) ;
The gtDifficult flags can be used to mark some ground truth object occurrences as difficult and hence
ignored in the evaluation. This is used in the PASCAL VOC challenge, but not here (i.e. no object occurrence is
considered difficult).
% Visualization
figure(1) ; clf ;
imagesc(im) ; axis equal ; hold on ;
vl_plotbox(detections(:, matches.detBoxFlags==+1), 'g', 'linewidth', 2) ;
vl_plotbox(detections(:, matches.detBoxFlags==-1), 'r', 'linewidth', 2) ;
vl_plotbox(gtBoxes, 'b', 'linewidth', 1) ;
axis off ;
Task: Use the supplied example code to evaluate the detector on one image. Look carefully at the output and
convince yourself that it makes sense.
figure(2) ; clf ;
vl_pr(matches.labels, matches.scores) ;
Question: There are a large number of errors in each image. Should you worry? In what manner is the PR
curve affected? How would you eliminate the vast majority of these errors in practice?
Task: Open evaluateModel.m and make sure you understand the main steps of the evaluation procedure.
Use the supplied example code to run the evaluation on the entire test set:
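The call has the following assumed form (a sketch; the exact arguments are in the example script):

evaluateModel(...
  testImages, testBoxes, testBoxImages, ...
  w, hogCellSize, scales) ;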
Note: The function processes one image at a time, visualizing the results as it progresses. The PR curve reflects
the accumulation of the detections obtained so far.
Task: Open the evaluateModel.m file in MATLAB and add a breakpoint right at the end of the for loop.
Now run the evaluation code again and look at each image individually (use dbcont to go to the next
image). Examine the correct and incorrect matches in each image and their ranking, and the effect of this on
the cumulative precision-recall curve.
Hard negative mining is a simple technique for finding a small set of key negative examples. The idea is
simple: we start by training a model without any negatives at all (in this case the solver learns a one-class SVM),
and we then alternate between evaluating the model on the training data to find erroneous responses and adding
the corresponding examples to the training set.
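The mining call is in the example script; a sketch of its assumed form, where the second output moreNeg returning the hard negative features is an assumption about evaluateModel (the actual code may also subsample the training images):

% Evaluate the current model on the training images and collect the
% highest-scoring false positives as additional negative examples
[matches, moreNeg] = evaluateModel(...
  trainImages, trainBoxes, trainBoxImages, ...
  w, hogCellSize, scales) ;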
Here moreNeg contains the HOG features of the top (highest scoring and hence most confusing) image
patches in the supplied training images.
Task: Examine evaluateModel.m again to understand how hard negatives are extracted.
The next step is to fuse the new negative set with the old one:
% Add negatives
neg = cat(4, neg, moreNeg) ;
Note that hard negative mining could select the same negatives at different iterations; the following code
squashes these duplicates:
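One possible way to do this (a sketch) is to compare the flattened HOG vectors and keep only unique columns:

% Remove duplicated negative examples by comparing their flattened HOG vectors
z = reshape(neg, [], size(neg, 4)) ;
[~, keep] = unique(z', 'rows', 'stable') ;
neg = neg(:, :, :, keep) ;

The SVM can then be retrained on the expanded negative set and the updated model evaluated again on the test data: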
evaluateModel(...
testImages, testBoxes, testBoxImages, ...
w, hogCellSize, scales) ;
In this last part, you will learn your own object detector. To this end, open and look at exercise5.m . You will
need to prepare the following data:
Run the code in example5.m to check that your training data looks right.
Task: Understand the limitations of this simple detector and choose a target object that has a good chance of
being learnable.
Hint: Note in particular that object instances must be similar and roughly aligned. If your object is not symmetric,
consider choosing instances that face a particular direction (e.g. left-facing horse head).
Task: Make sure you get sensible results. Go back to step 5.1 if needed and adjust your data.
Hint: For debugging purposes, try using one of your training images as test. Does it work at least in this case?
In particular, many objects in nature are symmetric and, as such, their images appear flipped when the objects
are seen from the left or from the right (consider for example a face). This can be handled by a pair of
symmetric HOG templates. In this part we will explore this option.
Task: Using the procedure above, train a HOG template w for a symmetric object facing in one specific
direction. For example, train a left-facing horse head detector.
Task: Collect test images containing the object facing in both directions. Run your detector and convince
yourself that it works well only for the direction it was trained for.
HOG features have a well-defined structure that makes it possible to predict how the features transform when the
underlying image is flipped. The transformation is in fact a simple permutation of the HOG elements. For a given
spatial cell, HOG has 31 dimensions. The following code permutes the dimensions to flip the cell around the
vertical axis:
perm = vl_hog('permutation') ;
hog_flipped = hog(perm) ;
Note that this permutation applies to a single HOG cell. However, the template is an H × W × 31 dimensional
array of HOG cells.
Task: Given a hog array of dimension H × W × 31 , write MATLAB code to obtain the flipped feature
array hog_flipped .
Hint: Recall that the first dimension spans the vertical axis, the second dimension the horizontal axis, and the
third dimension feature channels. perm should be applied to the last dimension. Do you need to permute
anything else?
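One possible solution (a sketch): reverse the horizontal (second) dimension of the array and apply perm to the feature channels of every cell.

perm = vl_hog('permutation') ;
hog_flipped = hog(:, end:-1:1, perm) ;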
Task: Let w be the model you trained before. Use the procedure to flip HOG to generate w_flipped .
Then visualize both w and w_flipped as done in Sect. 1.3. Convince yourself that flipping was
successful.
We now have two models, w and w_flipped, one for each view of the object.
Task: Run both models in turn on the same image, obtaining two lists of bounding boxes. Find a way to merge
the two lists and visualise the top detections. Convince yourself that you can now detect objects facing either
way.
Hint: Recall how redundant detections can be removed using non-maximum suppression.
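A sketch of one way to do this, reusing the detect and boxsuppress functions from above (assuming boxes are returned as columns and scores as row vectors):

% Run both templates, pool their detections, and suppress redundant ones
[detsL, scoresL] = detect(im, w, hogCellSize, scales) ;
[detsR, scoresR] = detect(im, w_flipped, hogCellSize, scales) ;
detections = [detsL, detsR] ;
scores = [scoresL, scoresR] ;
keep = boxsuppress(detections, scores, 0.25) ;
detections = detections(:, keep) ;
scores = scores(keep) ;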
History
Used in the Oxford AIMS CDT, 2014-18
1. This is part of the MatConvNet toolbox for convolutional neural networks. Nevertheless, there is no neural
network discussed here. ↩