


Gen Luo
2020 – today
- 2025
- [i25]Zhaokai Wang, Xizhou Zhu, Xue Yang, Gen Luo, Hao Li, Changyao Tian, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao, Jifeng Dai:
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding. CoRR abs/2501.07783 (2025)
- 2024
- [j5]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji:
Towards Language-Guided Visual Recognition via Dynamic Convolutions. Int. J. Comput. Vis. 132(1): 1-19 (2024)
- [j4]Gen Luo, Yiyi Zhou, Jiamu Sun, Xiaoshuai Sun, Rongrong Ji:
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension. IEEE Trans. Multim. 26: 3689-3700 (2024)
- [c20]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. AAAI 2024: 5940-5948
- [c19]Yaxin Luo, Jiayi Ji, Xiaofu Chen, Yuxin Zhang, Tianhe Ren, Gen Luo:
APL: Anchor-Based Prompt Learning for One-Stage Weakly Supervised Referring Expression Comprehension. ECCV (13) 2024: 198-215
- [c18]Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. ICME 2024: 1-6
- [c17]Yuxin Zhang, Yuxuan Du, Gen Luo, Yunshan Zhong, Zhenyu Zhang, Shiwei Liu, Rongrong Ji:
CaM: Cache Merging for Memory-efficient LLMs Inference. ICML 2024
- [c16]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. ICML 2024
- [c15]Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. ACM Multimedia 2024: 905-914
- [c14]Shengxin Chen, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Rongrong Ji:
QueryMatch: A Query-based Contrastive Learning Framework for Weakly Supervised Visual Grounding. ACM Multimedia 2024: 4177-4186
- [c13]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. ACM Multimedia 2024: 7852-7861
- [c12]Changli Wu, Qi Chen, Jiayi Ji, Haowei Wang, Yiwei Ma, You Huang, Gen Luo, Hao Fei, Xiaoshuai Sun, Rongrong Ji:
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation. NeurIPS 2024
- [c11]Mingrui Wu, Xinyue Cai, Jiayi Ji, Jiale Li, Oucheng Huang, Gen Luo, Hao Fei, Guannan Jiang, Xiaoshuai Sun, Rongrong Ji:
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models. NeurIPS 2024
- [i24]Gen Luo, Yiyi Zhou, Yuxin Zhang, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models. CoRR abs/2403.03003 (2024)
- [i23]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. CoRR abs/2403.06702 (2024)
- [i22]Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. CoRR abs/2404.00650 (2024)
- [i21]Qiong Wu, Zhaoxi Ke, Yiyi Zhou, Gen Luo, Xiaoshuai Sun, Rongrong Ji:
Routing Experts: Learning to Route Dynamic Experts in Multi-modal Large Language Models. CoRR abs/2407.14093 (2024)
- [i20]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. CoRR abs/2407.20664 (2024)
- [i19]Mingrui Wu, Xinyue Cai, Jiayi Ji, Jiale Li, Oucheng Huang, Gen Luo, Hao Fei, Xiaoshuai Sun, Rongrong Ji:
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models. CoRR abs/2407.21534 (2024)
- [i18]Gen Luo, Xue Yang, Wenhan Dou, Zhaokai Wang, Jifeng Dai, Yu Qiao, Xizhou Zhu:
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training. CoRR abs/2410.08202 (2024)
- [i17]Yaxin Luo, Gen Luo, Jiayi Ji, Yiyi Zhou, Xiaoshuai Sun, Zhiqiang Shen, Rongrong Ji:
γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models. CoRR abs/2410.13859 (2024)
- [i16]Qing Jiang, Gen Luo, Yuqin Yang, Yuda Xiong, Yihao Chen, Zhaoyang Zeng, Tianhe Ren, Lei Zhang:
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding. CoRR abs/2411.18363 (2024)
- [i15]Changli Wu, Qi Chen, Jiayi Ji, Haowei Wang, Yiwei Ma, You Huang, Gen Luo, Hao Fei, Xiaoshuai Sun, Rongrong Ji:
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation. CoRR abs/2412.02402 (2024)
- [i14]Bo Tong, Bokai Lai, Yiyi Zhou, Gen Luo, Yunhang Shen, Ke Li, Xiaoshuai Sun, Rongrong Ji:
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression. CoRR abs/2412.04317 (2024)
- 2023
- [j3]Jiayi Ji, Xiaoyang Huang, Xiaoshuai Sun, Yiyi Zhou, Gen Luo, Liujuan Cao, Jianzhuang Liu, Ling Shao, Rongrong Ji:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. IEEE Trans. Multim. 25: 3962-3974 (2023)
- [j2]Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-Wen Lin, Qi Tian:
A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension. IEEE Trans. Neural Networks Learn. Syst. 34(1): 134-143 (2023)
- [c10]Lei Jin, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension. CVPR 2023: 1-10
- [c9]Jiamu Sun, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji:
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension. CVPR 2023: 19144-19154
- [c8]Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, Rongrong Ji:
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models. NeurIPS 2023
- [i13]Gen Luo, Minglang Huang, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji:
Towards Efficient Visual Adaption via Structural Re-parameterization. CoRR abs/2302.08106 (2023)
- [i12]Gen Luo, Yiyi Zhou, Lei Jin, Xiaoshuai Sun, Rongrong Ji:
Towards End-to-end Semi-supervised Learning for One-stage Object Detection. CoRR abs/2302.11299 (2023)
- [i11]Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji:
Active Teacher for Semi-Supervised Object Detection. CoRR abs/2303.08348 (2023)
- [i10]Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, Rongrong Ji:
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models. CoRR abs/2305.15023 (2023)
- [i9]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. CoRR abs/2308.16632 (2023)
- [i8]Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. CoRR abs/2311.00397 (2023)
- 2022
- [j1]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks. IEEE Trans. Image Process. 31: 3386-3398 (2022)
- [c7]Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji:
Active Teacher for Semi-Supervised Object Detection. CVPR 2022: 14462-14471
- [c6]Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji:
SeqTR: A Simple Yet Universal Network for Visual Grounding. ECCV (35) 2022: 598-615
- [i7]Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji:
SeqTR: A Simple yet Universal Network for Visual Grounding. CoRR abs/2203.16265 (2022)
- [i6]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks. CoRR abs/2204.07780 (2022)
- [i5]Gen Luo, Yiyi Zhou, Jiamu Sun, Shubin Huang, Xiaoshuai Sun, Qixiang Ye, Yongjian Wu, Rongrong Ji:
What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study. CoRR abs/2204.07913 (2022)
- 2021
- [c5]Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji:
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network. AAAI 2021: 1655-1663
- [i4]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Xinghao Ding, Yongjian Wu, Feiyue Huang, Yue Gao, Rongrong Ji:
Towards Language-guided Visual Recognition via Dynamic Convolutions. CoRR abs/2110.08797 (2021)
- 2020
- [c4]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji:
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation. CVPR 2020: 10031-10040
- [c3]Yiyi Zhou, Rongrong Ji, Xiaoshuai Sun, Gen Luo, Xiaopeng Hong, Jinsong Su, Xinghao Ding, Ling Shao:
K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering. ACM Multimedia 2020: 1245-1254
- [c2]Gen Luo, Yiyi Zhou, Rongrong Ji, Xiaoshuai Sun, Jinsong Su, Chia-Wen Lin, Qi Tian:
Cascade Grouped Attention Network for Referring Expression Segmentation. ACM Multimedia 2020: 1274-1282
- [i3]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji:
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation. CoRR abs/2003.08813 (2020)
- [i2]Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji:
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network. CoRR abs/2012.07061 (2020)
2010 – 2019
- 2019
- [i1]Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-Wen Lin, Qi Tian:
A Real-time Global Inference Network for One-stage Referring Expression Comprehension. CoRR abs/1912.03478 (2019)
- 2016
- [c1]Jun Ni, Gen Luo, Tao Yu, NingChuan Li:
No-reference image sharpness Algorithm based on gradient shape. CISP-BMEI 2016: 786-790
last updated on 2025-02-21 19:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license