[1] Yufei Zhan, Shurong Zheng, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang. Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring. ICCV2025.
[2] Yufei Zhan, Yousong Zhu*, Zhiyang Chen, Fan Yang, Ming Tang, Jinqiao Wang. Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models. ECCV2024.
[3] Zhaowen Li, Yousong Zhu*, Zhiyang Chen, Wei Li, Rui Zhao, Chaoyang Zhao, Ming Tang, Jinqiao Wang. Efficient Masked Autoencoders with Self-Consistency. TPAMI2024.
[4] Zhaowen Li, Yousong Zhu*, Zhiyang Chen, Zongxin Gao, Rui Zhao, Chaoyang Zhao, Ming Tang, Jinqiao Wang. Self-supervised Representation Learning from Arbitrary Scenarios. CVPR2024.
[5] Zhiyang Chen, Yousong Zhu, Zhaowen Li, Fan Yang, Chaoyang Zhao, Liwei Wu, Jinqiao Wang, Ming Tang. The Devil is in Details: Delving into Lite FFN Design for Vision Transformers. ICASSP2024 (Oral).
[6] Zhiyang Chen, Yousong Zhu, Zhaowen Li, Fan Yang, Wei Li, Haixin Wang, Chaoyang Zhao, Liwei Wu, Rui Zhao, Jinqiao Wang, Ming Tang. Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks. NeurIPS2022 (Spotlight).
[7] Tong Wang, Yousong Zhu, Yingying Chen, Chaoyang Zhao, Bin Yu, Jinqiao Wang, Ming Tang. C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection. CVPR2022.
[8] Zhaowen Li, Yousong Zhu, Fan Yang, Wei Li, Chaoyang Zhao, Yingying Chen, Zhiyang Chen, Jiahao Xie, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang. UniVIP: A Unified Framework for Self-Supervised Visual Pre-training. CVPR2022.
[9] Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang. Adaptive Class Suppression Loss for Long-Tail Object Detection. CVPR2021.
[10] Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang. DPT: Deformable Patch-based Transformer for Visual Recognition. ACM MM2021 (Oral).
[11] Li Wang, Dong Li, Yousong Zhu, Lu Tian, Yi Shan. Dual Super-Resolution Learning for Semantic Segmentation. CVPR2020 (Oral).
[12] Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Yaowei Wang, Jinqiao Wang, Ming Tang. Large Batch Optimization for Object Detection: Training COCO in 12 Minutes. ECCV2020.