


default search action
Peihao Chen
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j4]Weidong Chen
, Xiaofen Xing
, Peihao Chen
, Xiangmin Xu
:
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition. IEEE Trans. Affect. Comput. 15(3): 1711-1724 (2024) - [c16]Zeyuan Yang, Jiageng Lin, Peihao Chen, Anoop Cherian, Tim K. Marks, Jonathan Le Roux, Chuang Gan:
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation. CVPR 2024: 16251-16261 - [c15]Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan:
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. CVPR 2024: 26396-26406 - [c14]Junyan Li
, Delin Chen
, Tianle Cai
, Peihao Chen
, Yining Hong
, Zhenfang Chen
, Yikang Shen
, Chuang Gan
:
FlexAttention for Efficient High-Resolution Vision-Language Models. ECCV (25) 2024: 286-302 - [c13]Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan:
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding. ICLR 2024 - [c12]Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan:
3D-VLA: A 3D Vision-Language-Action Generative World Model. ICML 2024 - [i25]Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan:
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. CoRR abs/2401.08577 (2024) - [i24]Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan:
3D-VLA: A 3D Vision-Language-Action Generative World Model. CoRR abs/2403.09631 (2024) - [i23]Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan:
MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling. CoRR abs/2405.13860 (2024) - [i22]Changhao Li, Xinyu Sun, Peihao Chen, Jugang Fan, Zixu Wang, Yanxia Liu, Jin-Hui Zhu, Chuang Gan, Mingkui Tan:
CoNav: A Benchmark for Human-Centered Collaborative Navigation. CoRR abs/2406.02425 (2024) - [i21]Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan:
FlexAttention for Efficient High-Resolution Vision-Language Models. CoRR abs/2407.20228 (2024) - [i20]Yuncong Yang, Han Yang, Jiachen Zhou, Peihao Chen, Hongxin Zhang, Yilun Du, Chuang Gan:
SnapMem: Snapshot-based 3D Scene Memory for Embodied Exploration and Reasoning. CoRR abs/2411.17735 (2024) - [i19]Hongyan Zhi, Peihao Chen, Junyan Li, Shuailei Ma, Xinyu Sun, Tianhang Xiang, Yinjie Lei, Mingkui Tan, Chuang Gan:
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences. CoRR abs/2412.01292 (2024) - 2023
- [c11]Xinyu Sun, Peihao Chen, Liangwei Chen, Changhao Li, Thomas H. Li, Mingkui Tan, Chuang Gan:
Masked Motion Encoding for Self-Supervised Video Representation Learning. CVPR 2023: 2235-2245 - [c10]Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Vision-and-Language Navigation from YouTube Videos. ICCV 2023: 8283-8292 - [c9]Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan:
3D-LLM: Injecting the 3D World into Large Language Models. NeurIPS 2023 - [c8]Xinyu Sun, Peihao Chen, Jugang Fan, Jian Chen, Thomas H. Li, Mingkui Tan:
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation. NeurIPS 2023 - [i18]Shuailei Ma, Yuefeng Wang, Ying Wei, Peihao Chen, Zhixiang Ye, Jiaqi Fan, Enming Zhang, Thomas H. Li:
Detecting the open-world objects with the help of the Brain. CoRR abs/2303.11623 (2023) - [i17]Weidong Chen, Xiaofen Xing, Peihao Chen, Xiangmin Xu:
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition. CoRR abs/2307.10757 (2023) - [i16]Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Vision-and-Language Navigation from YouTube Videos. CoRR abs/2307.11984 (2023) - [i15]Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan:
3D-LLM: Injecting the 3D World into Large Language Models. CoRR abs/2307.12981 (2023) - [i14]Peihao Chen, Xinyu Sun, Hongyan Zhi, Runhao Zeng, Thomas H. Li, Gaowen Liu, Mingkui Tan, Chuang Gan:
A2Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models. CoRR abs/2308.07997 (2023) - [i13]Xinyu Sun, Peihao Chen, Jugang Fan, Thomas H. Li, Jian Chen, Mingkui Tan:
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation. CoRR abs/2310.07473 (2023) - [i12]Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan:
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding. CoRR abs/2311.03354 (2023) - [i11]Kunyang Lin, Yufeng Wang, Peihao Chen, Runhao Zeng, Siyuan Zhou, Mingkui Tan, Chuang Gan:
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning. CoRR abs/2312.05783 (2023) - [i10]Shuailei Ma, Yuefeng Wang, Ying Wei, Jiaqi Fan, Xinyu Sun, Peihao Chen, Enming Zhang:
A Simple Knowledge Distillation Framework for Open-world Object Detection. CoRR abs/2312.08653 (2023) - 2022
- [c7]Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Active Camera for Multi-Object Navigation. NeurIPS 2022 - [c6]Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan:
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation. NeurIPS 2022 - [i9]Xinyu Sun, Peihao Chen, Liangwei Chen, Thomas H. Li, Mingkui Tan, Chuang Gan:
M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning. CoRR abs/2210.06096 (2022) - [i8]Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Active Camera for Multi-Object Navigation. CoRR abs/2210.07505 (2022) - [i7]Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan:
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation. CoRR abs/2210.07506 (2022) - 2021
- [c5]Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan:
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning. AAAI 2021: 1045-1053 - 2020
- [j3]Peihao Chen
, Yang Zhang, Mingkui Tan
, Hongdong Xiao, Deng Huang, Chuang Gan
:
Generating Visually Aligned Sound From Videos. IEEE Trans. Image Process. 29: 8292-8302 (2020) - [j2]Peihao Chen
, Chuang Gan
, Guangyao Shen
, Wenbing Huang
, Runhao Zeng
, Mingkui Tan
:
Relation Attention for Temporal Action Localization. IEEE Trans. Multim. 22(10): 2723-2733 (2020) - [c4]Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan:
Location-Aware Graph Convolutional Networks for Video Question Answering. AAAI 2020: 11021-11028 - [c3]Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan:
Dense Regression Network for Video Grounding. CVPR 2020: 10284-10293 - [c2]Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba:
Foley Music: Learning to Generate Music from Videos. ECCV (11) 2020: 758-775 - [i6]Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan:
Dense Regression Network for Video Grounding. CoRR abs/2004.03545 (2020) - [i5]Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba:
Foley Music: Learning to Generate Music from Videos. CoRR abs/2007.10984 (2020) - [i4]Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, Chuang Gan:
Generating Visually Aligned Sound from Videos. CoRR abs/2008.00820 (2020) - [i3]Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan:
Location-aware Graph Convolutional Networks for Video Question Answering. CoRR abs/2008.09105 (2020) - [i2]Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan:
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning. CoRR abs/2011.07949 (2020)
2010 – 2019
- 2019
- [j1]Runhao Zeng
, Chuang Gan
, Peihao Chen
, Wenbing Huang
, Qingyao Wu
, Mingkui Tan
:
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization. IEEE Trans. Image Process. 28(12): 5797-5808 (2019) - [c1]Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, Antonio Torralba:
Self-Supervised Moving Vehicle Tracking With Stereo Sound. ICCV 2019: 7052-7061 - [i1]Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, Antonio Torralba:
Self-supervised Moving Vehicle Tracking with Stereo Sound. CoRR abs/1910.11760 (2019)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-15 20:42 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint