Shape-Aware Monocular 3D Object Detection

Chen, Wei; Zhao, Jie; Zhao, Wan-Lei; Wu, Song-Yuan


arXiv:2204.08717 (cs)

[Submitted on 19 Apr 2022 (v1), last revised 24 Apr 2022 (this version, v2)]


Abstract: Detecting 3D objects from a single perspective camera is a challenging problem. Anchor-free and keypoint-based models have received increasing attention recently due to their effectiveness and simplicity. However, most of these methods are vulnerable to occluded and truncated objects. In this paper, a single-stage monocular 3D object detection model is proposed. An instance-segmentation head is integrated into model training, which makes the model aware of the visible shape of a target object. As a result, detection largely avoids interference from irrelevant regions surrounding the target objects. In addition, we reveal that the popular IoU-based evaluation metrics, which were originally designed for evaluating stereo or LiDAR-based detection methods, are insensitive to improvements in monocular 3D object detection algorithms. A novel evaluation metric, average depth similarity (ADS), is proposed for monocular 3D object detection models. Our method outperforms the baseline on both the popular and the proposed evaluation metrics while maintaining real-time efficiency.
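The claim that IoU-thresholded metrics can be insensitive to monocular improvements can be illustrated with a toy calculation. The sketch below is not the paper's analysis and does not implement ADS, whose definition is not given in the abstract. It assumes two identical axis-aligned boxes that differ only by an offset along the depth axis, a box length of 4 m, and an IoU threshold of 0.7 (a value commonly used for cars on KITTI); the function names are hypothetical.

```python
IOU_THRESHOLD = 0.7  # assumed matching threshold, illustrative only
CAR_LENGTH_M = 4.0   # assumed box extent along the depth axis, in metres


def shifted_box_iou(length: float, depth_error: float) -> float:
    """3D IoU of two identical axis-aligned boxes offset only in depth.

    Width and height overlap fully, so the volume ratio reduces to a
    one-dimensional ratio of overlap length to union length.
    """
    shift = abs(depth_error)
    overlap = max(0.0, length - shift)
    union = 2.0 * length - overlap
    return overlap / union


def is_true_positive(depth_error: float) -> bool:
    """Whether the shifted prediction would count as a match at the threshold."""
    return shifted_box_iou(CAR_LENGTH_M, depth_error) >= IOU_THRESHOLD


if __name__ == "__main__":
    # Halving the depth error from 3.0 m to 1.5 m raises the IoU,
    # but the match decision only changes once the threshold is crossed.
    for err in (3.0, 1.5, 0.5):
        iou = shifted_box_iou(CAR_LENGTH_M, err)
        print(f"depth error {err:.1f} m -> IoU {iou:.3f}, match: {is_true_positive(err)}")
```

Under these assumptions, a prediction whose depth error is halved from 3.0 m to 1.5 m is a clear improvement in localization, yet both predictions fall below the threshold and are scored identically as misses. This is one plausible reading of why a thresholded IoU metric may fail to reflect progress in depth estimation.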

Comments: 8 pages; 6 figures. typo fixed; reference changed
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2204.08717 [cs.CV] (or arXiv:2204.08717v2 [cs.CV] for this version)
DOI: https://doi.org/10.48550/arXiv.2204.08717

Submission history

From: Wan-Lei Zhao
[v1] Tue, 19 Apr 2022 07:43:56 UTC (20,988 KB)
[v2] Sun, 24 Apr 2022 07:29:21 UTC (25,556 KB)
