Intelligent Science and Technology

Yousong Zhu

Ph.D, Associate Researcher

Personal profile
Talent Training
Research Projects
Representative Achievements
Awards and Honors

Yousong Zhu, Ph.D, Associate Researcher

Email: 202521@cumtb.edu.cn

Research Interests:Computer Vision, Object Detection and Recognition, Self-supervised Learning, Vision-Language Models

Education:1. 2014.09-2019.06: Ph.D. in Pattern Recognition and Intelligent Systems, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing, China.

2. 2010.09-2014.06: B.Eng. in Automation, School of Information Science and Engineering, Central South University (CSU), Changsha, China.

Work Experience:

1. 2025.07-Present: Associate Researcher, School of Artificial Intelligence, China University of Mining and Technology-Beijing (CUMTB), Beijing, China.

2. 2022.07-2025.05: Associate Researcher, Foundation Model Research Center, Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing, China.

3. 2019.07-2022.06: Assistant Researcher, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing, China.




To Be Updated

1. National Natural Science Foundation (Youth Program),

Practical Object Detection Based on Neural Network Self-learning,

Principal Investigator, 2021.01-2023.12

2. Ministry of Science and Technology (Key Project),

Development of Visual Cognitive Model Porting and Algorithm Toolkit,

Principal Investigator, 2022.01-2024.12

3. National Key R&D Program (Sub-project),

Collaborative R&D and Demonstration of AI Key Technologies for Biodiversity Conservation,

Sub-project Leader, 2025.01-2027.12

4. Beijing Joint Fund (Frontier Project),

Multimodal Foundation Model-based Mobile Intelligent IVI UI Interaction,

Principal Investigator, 2024.10–2027.09

5. Guangxi Tobacco Company,

Cigarette Retail Storefront Data Collection and Smart Analysis Based on Image Recognition,

Principal Investigator, 2021.01–2023.10

6. SenseTime,

Self-supervised Visual Representation Learning and Structure Design for Industrial Scenarios,

Principal Investigator, 2021.11–2022.12

[1] Yufei Zhan, Shurong Zheng, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang. Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring. ICCV2025.

[2] Yufei Zhan, Yousong Zhu*, Zhiyang Chen, Fan Yang, Ming Tang, Jinqiao Wang. Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models. ECCV2024.

[3] Zhaowen Li, Yousong Zhu*, Zhiyang Chen, Wei Li, Rui Zhao, Chaoyang Zhao, Ming Tang, Jinqiao Wang. Efficient Masked Autoencoders with Self-Consistency. TPAMI2024.

[4] Zhaowen Li, Yousong Zhu*, Zhiyang Chen, Zongxin Gao, Rui Zhao, Chaoyang Zhao, Ming Tang, Jinqiao Wang. Self-supervised Representation Learning from Arbitrary Scenarios. CVPR2024.

[5] Zhiyang Chen, Yousong Zhu, Zhaowen Li, Fan Yang, Chaoyang Zhao, Liwei Wu, Jinqiao Wang, Ming Tang. The Devil is in Details: Delving into Lite FFN Design for Vision Transformers. ICASSP2024 (Oral).

[6] Zhiyang Chen, Yousong Zhu, Zhaowen Li, Fan Yang, Wei Li, Haixin Wang, Chaoyang Zhao, Liwei Wu, Rui Zhao, Jinqiao Wang, Ming Tang. Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks. NeurIPS2022 (Spotlight).

[7] Tong Wang, Yousong Zhu, Yingying Chen, Chaoyang Zhao, Bin Yu, Jinqiao Wang, Ming Tang. C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection. CVPR2022.

[8] Zhaowen Li, Yousong Zhu, Fan Yang, Wei Li, Chaoyang Zhao, Yingying Chen, Zhiyang Chen, Jiahao Xie, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang. UniVIP: A Unified Framework for Self-Supervised Visual Pre-training. CVPR2022.

[9] Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang. Adaptive class suppression loss for long-tail object detection. CVPR2021.

[10] Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang. Dpt: Deformable patch-based transformer for visual recognition. ACM MM2021 (Oral).

[11] Li Wang, Dong Li, Yousong Zhu, Lu Tian, Yi Shan. Dual super-resolution learning for semantic segmentation. CVPR2020 (Oral).

[12] Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Yaowei Wang, Jinqiao Wang, Ming Tang. Large Batch Optimization for Object Detection: Training COCO in 12 minutes. ECCV2020.

2024: Zuchongzhi Award for Frontier Innovation in Artificial Intelligence

2023: Second Prize, Beijing Natural Science Award

2023: Distinguished Research Backbone, Chinese Academy of Sciences

2023: Runner-up, Open World Object Detection Challenge, China Society of Image and Graphics

2022: SAIL Award (Supreme AI Leader), World Artificial Intelligence Conference

2022: Gold Medal, 8th China "Internet+" Innovation and Entrepreneurship Competition (Advisor)

2019: Outstanding Graduate, UCAS & Beijing City

2018: Champion, Global AI Challenge - Autonomous Driving Visual Perception Competition (Team Leader)