Seeing the Unseen: Errors and Bias in Visual Datasets

Jin, Hongrui

arXiv:2211.01847 (cs)

[Submitted on 3 Nov 2022]

Abstract: From face recognition in smartphones to automatic routing in self-driving cars, machine vision algorithms lie at the core of these features. These systems solve image-based tasks by identifying and understanding objects, then making decisions based on this information. However, errors in datasets are usually reproduced or even magnified by algorithms, at times resulting in issues such as recognising Black people as gorillas and misrepresenting ethnicities in search results. This paper tracks the errors in datasets and their impacts, revealing that a flawed dataset can result from limited categories, incomplete sourcing and poor classification.

Comments:	13 pages, 2 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
ACM classes:	I.2.10
Cite as:	arXiv:2211.01847 [cs.CV]
	(or arXiv:2211.01847v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.01847

Submission history

From: Hongrui Jin
[v1] Thu, 3 Nov 2022 14:34:28 UTC (1,039 KB)
