Peihao Chen

2020 – today

2024
[j4]
Weidong Chen, Xiaofen Xing, Peihao Chen, Xiangmin Xu:
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition. IEEE Trans. Affect. Comput. 15(3): 1711-1724 (2024)
[c16]
Zeyuan Yang, Jiageng Lin, Peihao Chen, Anoop Cherian, Tim K. Marks, Jonathan Le Roux, Chuang Gan:
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation. CVPR 2024: 16251-16261
[c15]
Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan:
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. CVPR 2024: 26396-26406
[c14]
Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan:
FlexAttention for Efficient High-Resolution Vision-Language Models. ECCV (25) 2024: 286-302
[c13]
Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan:
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding. ICLR 2024
[c12]
Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan:
3D-VLA: A 3D Vision-Language-Action Generative World Model. ICML 2024
[i25]
Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan:
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. CoRR abs/2401.08577 (2024)
[i24]
Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan:
3D-VLA: A 3D Vision-Language-Action Generative World Model. CoRR abs/2403.09631 (2024)
[i23]
Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan:
MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling. CoRR abs/2405.13860 (2024)
[i22]
Changhao Li, Xinyu Sun, Peihao Chen, Jugang Fan, Zixu Wang, Yanxia Liu, Jin-Hui Zhu, Chuang Gan, Mingkui Tan:
CoNav: A Benchmark for Human-Centered Collaborative Navigation. CoRR abs/2406.02425 (2024)
[i21]
Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan:
FlexAttention for Efficient High-Resolution Vision-Language Models. CoRR abs/2407.20228 (2024)
[i20]
Yuncong Yang, Han Yang, Jiachen Zhou, Peihao Chen, Hongxin Zhang, Yilun Du, Chuang Gan:
SnapMem: Snapshot-based 3D Scene Memory for Embodied Exploration and Reasoning. CoRR abs/2411.17735 (2024)
[i19]
Hongyan Zhi, Peihao Chen, Junyan Li, Shuailei Ma, Xinyu Sun, Tianhang Xiang, Yinjie Lei, Mingkui Tan, Chuang Gan:
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences. CoRR abs/2412.01292 (2024)
2023
[c11]
Xinyu Sun, Peihao Chen, Liangwei Chen, Changhao Li, Thomas H. Li, Mingkui Tan, Chuang Gan:
Masked Motion Encoding for Self-Supervised Video Representation Learning. CVPR 2023: 2235-2245
[c10]
Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Vision-and-Language Navigation from YouTube Videos. ICCV 2023: 8283-8292
[c9]
Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan:
3D-LLM: Injecting the 3D World into Large Language Models. NeurIPS 2023
[c8]
Xinyu Sun, Peihao Chen, Jugang Fan, Jian Chen, Thomas H. Li, Mingkui Tan:
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation. NeurIPS 2023
[i18]
Shuailei Ma, Yuefeng Wang, Ying Wei, Peihao Chen, Zhixiang Ye, Jiaqi Fan, Enming Zhang, Thomas H. Li:
Detecting the open-world objects with the help of the Brain. CoRR abs/2303.11623 (2023)
[i17]
Weidong Chen, Xiaofen Xing, Peihao Chen, Xiangmin Xu:
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition. CoRR abs/2307.10757 (2023)
[i16]
Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Vision-and-Language Navigation from YouTube Videos. CoRR abs/2307.11984 (2023)
[i15]
Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan:
3D-LLM: Injecting the 3D World into Large Language Models. CoRR abs/2307.12981 (2023)
[i14]
Peihao Chen, Xinyu Sun, Hongyan Zhi, Runhao Zeng, Thomas H. Li, Gaowen Liu, Mingkui Tan, Chuang Gan:
A²Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models. CoRR abs/2308.07997 (2023)
[i13]
Xinyu Sun, Peihao Chen, Jugang Fan, Thomas H. Li, Jian Chen, Mingkui Tan:
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation. CoRR abs/2310.07473 (2023)
[i12]
Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan:
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding. CoRR abs/2311.03354 (2023)
[i11]
Kunyang Lin, Yufeng Wang, Peihao Chen, Runhao Zeng, Siyuan Zhou, Mingkui Tan, Chuang Gan:
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning. CoRR abs/2312.05783 (2023)
[i10]
Shuailei Ma, Yuefeng Wang, Ying Wei, Jiaqi Fan, Xinyu Sun, Peihao Chen, Enming Zhang:
A Simple Knowledge Distillation Framework for Open-world Object Detection. CoRR abs/2312.08653 (2023)
2022
[c7]
Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Active Camera for Multi-Object Navigation. NeurIPS 2022
[c6]
Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan:
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation. NeurIPS 2022
[i9]
Xinyu Sun, Peihao Chen, Liangwei Chen, Thomas H. Li, Mingkui Tan, Chuang Gan:
M³Video: Masked Motion Modeling for Self-Supervised Video Representation Learning. CoRR abs/2210.06096 (2022)
[i8]
Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Active Camera for Multi-Object Navigation. CoRR abs/2210.07505 (2022)
[i7]
Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan:
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation. CoRR abs/2210.07506 (2022)
2021
[c5]
Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan:
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning. AAAI 2021: 1045-1053
2020
[j3]
Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, Chuang Gan:
Generating Visually Aligned Sound From Videos. IEEE Trans. Image Process. 29: 8292-8302 (2020)
[j2]
Peihao Chen, Chuang Gan, Guangyao Shen, Wenbing Huang, Runhao Zeng, Mingkui Tan:
Relation Attention for Temporal Action Localization. IEEE Trans. Multim. 22(10): 2723-2733 (2020)
[c4]
Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan:
Location-Aware Graph Convolutional Networks for Video Question Answering. AAAI 2020: 11021-11028
[c3]
Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan:
Dense Regression Network for Video Grounding. CVPR 2020: 10284-10293
[c2]
Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba:
Foley Music: Learning to Generate Music from Videos. ECCV (11) 2020: 758-775
[i6]
Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan:
Dense Regression Network for Video Grounding. CoRR abs/2004.03545 (2020)
[i5]
Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba:
Foley Music: Learning to Generate Music from Videos. CoRR abs/2007.10984 (2020)
[i4]
Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, Chuang Gan:
Generating Visually Aligned Sound from Videos. CoRR abs/2008.00820 (2020)
[i3]
Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan:
Location-aware Graph Convolutional Networks for Video Question Answering. CoRR abs/2008.09105 (2020)
[i2]
Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan:
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning. CoRR abs/2011.07949 (2020)

2010 – 2019

2019
[j1]
Runhao Zeng, Chuang Gan, Peihao Chen, Wenbing Huang, Qingyao Wu, Mingkui Tan:
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization. IEEE Trans. Image Process. 28(12): 5797-5808 (2019)
[c1]
Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, Antonio Torralba:
Self-Supervised Moving Vehicle Tracking With Stereo Sound. ICCV 2019: 7052-7061
[i1]
Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, Antonio Torralba:
Self-supervised Moving Vehicle Tracking with Stereo Sound. CoRR abs/1910.11760 (2019)
