I am the Head of Engineering at SpreeAI, a high-tech virtual try-on
startup. Besides overseeing all products R&D, I also lead and grow a world-class team of passionate Machine Learning researchers and engineers to develop and productionize
our photorealistic avatar technology.
Previously, I was a Research Scientist at Meta Reality Labs
Research, where I tech-led a group of researchers to develop 3D perception and human sensing algorithms
for Meta Aria glasses.
Before that, I was a Ph.D. student at The Robotics Institute, Carnegie Mellon University where I worked with Prof. Srinivasa Narasimhan and Prof. Yaser Sheikh on novel methods to capture dense and accurate 3D
shape of human bodies.
I also worked with Prof. Zhaoyang Wang at the Catholic University of
America, where I got B.E. degree in Electrical Engineering, on camera calibration, structured light system, and
tracking algorithms.
Research
I am very interested in various aspects of 3D vision, physics-based vision, and generative models for
photorealistic digitial avatar creation and human scene understanding. The goal is to develop holistic and
end-to-end machine learning systems that understand and recreate virtual environments that are perceptually
indistinguishable from reality.
Jobs opportuninty: I am hiring full time CV&ML&Graphics researchers. I strike to balance between core
and applied research with patents, papers, and product as outputs. Send me an email if you are interested in
working with me.
Award
Patent
Publication
|
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen Mac, Minh Do, Minh Vo
IEEE Trans. Image Process. 2023
PDF
|
|
EgoHumans: An Egocentric 3D Multi-Human Benchmark
Rawal Khirodkar, Aayush Bansal, Lingni Ma, Richard Newcombe, Minh Vo, Kris Kitani
ICCV 2023 (Oral and distingished egocentric papers)
Acceptance ratio: 152/8260 = 1.8%
PDF Project Page
|
|
Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and
Forecasting on a Video Snippet
Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Chen, Minh Vo
IEEE Trans. on Circuits and Systems for Video Technology, 2023
PDF Project
Page
|
|
IDEO: Large Scale Egocentric 3D Object Dataset and Benchmark Challenges
Tien Do, Lance Lemke, Jingfan Guo, Khiem Vuong, Minh Vo, Hyun Soo Park
arxiv 2022
PDF Project Page
|
|
TAVA: Template-free Animatable Volumetric Actors
Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhoefer, Jurgen Gall, Angjoo Kanazawa,
Christoph Lassner
ECCV 2022
PDF Project Page
|
|
LISA: Learning Implicit Shape and Appearance of Hands
Enric Corona, Tomas Hodan, Minh Vo, Francesc Moreno-Noguer, Chris Sweeney, Richard Newcombe,
and Lingni Ma
CVPR 2022
PDF Project Page
|
|
BANMo: Building Animatable 3D Neural Models from Many Casual Videos
Gengshan Yang, Minh Vo Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo
CVPR 2022 (Oral) Acceptance ratio: 344/8161 = 4.2%
PDF Project
Page
|
|
Ego4D: Around the World in 3,000 Hours of Egocentric Video
K. Grauman et al.
CVPR 2022 (Oral - Best paper finalist and distingished egocentric papers) Acceptance ratio: 344/8161 = 4.2%
PDF Project Page
|
|
ODAM: Object Detection, Association, and Mapping using Posed RGB Video
Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian
Straub, Richard Newcombe
ICCV 2021
(Oral)
Acceptance ratio: 210/6152 = 3.3%
PDF Project Page
|
|
ContactOpt: Optimizing Contact to Improve Grasps
Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C.
Kemp
CVPR 2021
(Oral)
Acceptance ratio: 210/6152 = 3.3%
PDF Project Page
|
|
ANR: Articulated Neural Rendering for Virtual Avatars
Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, and Christoph Lassner
CVPR 2021
PDF Project Page
|
|
TexMesh: Reconstructing Detailed Human Texture and Geometry from Monocular Video
Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa Narasimhan, and Minh Vo
ECCV 2020
PDF Project
Page
|
|
Long-term Human Motion Prediction with Scene Context
Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, and Jitendra Malik
ECCV 2020
(Oral)
Acceptance ratio: 104/5025 = 2.0%
PDF Project Page
|
|
4D Visualization of Dynamic Events from Unconstrained Multi-View Videos
Aayush Bansal, Minh Vo, Yaser Sheikh, Deva Ramanan, and Srinivasa Narasimhan
CVPR 2020
PDF Project Page
Press Coverage:
CMU,
ACM,
TechXplore,
ScienceMag,
and many others.
|
|
Spatiotemporal Bundle Adjustment for Dynamic 3D Human Reconstruction in the Wild
Minh Vo, Srinivasa Narasimhan, and Yaser Sheikh
TPAMI 2020 and CVPR 2016
PDF Project Page
|
|
Self-supervised Multi-view Person Association and Its Applications
Minh Vo, Ersin Yumer, Kalyan Sunkavalli, Sunil Hadap, Yaser Sheikh, and Srinivasa
Narasimhan
TPAMI 2020
PDF Project Page
|
|
Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks
Dinesh Reddy, Minh Vo, and Srinivasa Narasimhan,
CVPR 2019
PDF Project
Page
|
|
CarFusion: Combining Part Detection and Point Tracking for Dynamic 3D Reconstruction of
Vehicles
Dinesh Reddy, Minh Vo, and Srinivasa Narasimhan,
CVPR 2018
PDF Project
Page
|
|
Texture Illumination Separation for Single-shot Structured Light Reconstruction
Minh Vo, Srinivasa Narasimhan, and Yaser Sheikh
CCD 2014 and TPAMI 2015
PDF
Project Page
|
|
Passive Tomography of Turbulance Strength
Marina Alterman, Yoav Schechner, Minh Vo, and Srinivasa Narasimhan
ECCV 2014
PDF
Project Page
|
|
Automated fast initial guess in digital image correlation
Zhaoyang Wang, Minh Vo, Hien Kieu, Tongyan Pan
Strain 2014
PDF
|
|
Hyper-accurate flexible calibration technique for fringe-projection-based
three-dimensional imaging
Minh Vo, Zhaoyang Wang, Bing Pan, and Tongyan Pan
Optics Express 2012
PDF
Supplementary videos
|
|
Three-dimensional phantoms for curvature correction in spatial frequency domain
imaging
Thu Nguyen, Hanh Le, Minh Vo, Zhaoyang Wang, Long Luu, and Jessica
Ramella-Roman
Biomedical Optics Express 2012
PDF
|
|
Advanced geometric camera calibration for machine vision
Minh Vo, Zhaoyang Wang, Long Luu, and Jun Ma
Optical Engineering 2011
PDF
Software
|
|
Accuracy enhancement of digital image correlation with B-spline interpolation
Long Luu, Zhaoyang Wang, Minh Vo, Thang Hoang, and Jun Ma
Optics Letters 2011
PDF
|
|
Phase extraction from optical interferograms in presence of intensity nonlinearity and
arbitrary phase shifts
Thang Hoang, Zhaoyang Wang, Minh Vo, Jun Ma, Long Luu, and Bing Pan
Applied Physics Letters 2011
PDF
|
|
Flexible calibration technique for fringe-projection-based three-dimensional
imaging
Minh Vo, Zhaoyang Wang, Thang Hoang, and Dung Nguyen
Optics Letters 2010
PDF
|
Others
|
Exploiting Point Motion, Shape Deformation, and Semantic Priors for Dynamic 3D
Reconstruction in the Wild
Minh Vo
Ph.D. Thesis
PDF
|
External Coverage
|