Qibin Hou received his Ph.D. degree from Nankai University in 06/2019 under the supervision of Prof. Ming-Ming Cheng. From 08/2019 to 08/2021, I spent two wonderful years working with Dr. Jiashi Feng and Dr. Shuicheng Yan as a research fellow at National University of Singapore. Now, I am an associate professor at School of Computer Science, Nankai University, Tianjin, China. I am also with Nankai International Advanced Research Institute (Shenzhen Futian).

Research Interests:

My research covers a range of Computer Vision and Deep Learning. To be specific, my research concentrates on building AI models to help agents better see and understand the complex world.

Specific directions: Vision Foundation Models Visual Scene Understanding Visual Content Generation

See my recent publications for more details.

For prospective students

I am always looking for self-motivated Ph.D./master students working with me on the above research fields. If you are interested in the research topics in my group, welcome to drop an email. Note that before contacting me, you are supposed to have read [the enrollment information] already.

News

  • Among the list of Highly Cited Chinese Researchers of 2024
  • Four papers accepted by CVPR’25
  • Two papers accepted by ICLR’25
  • Two papers accepted by TPAMI’2025
  • Will serve as an area chair for ICCV’2025
  • Three papers accepted by NeurIPS’2024
  • Five papers accepted by TPAMI’2024
  • Among the list of Highly Cited Chinese Researchers of 2023
  • Six papers accepted by CVPR’24 and ECCV’24
  • Top 2% of Scientists on Stanford List for multiple times

Group

Ph.D. Students

Master Students

  • Yuqi Yang (Co-supervise with Prof. Ming-Ming Cheng, 2022-)
  • Xinbin Yuan (2023-)
  • Yuhao Wan (2024-)
  • Guohong Mu (2024-)

Undergraduate Students

  • Ziheng Ouyang (Sophomore)
  • Jiadong Hou (Sophomore)
  • Modi Jin (Sophomore)

Preprint

“*” means authors contributed equally and “#” means corresponding author.

Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection

Xinbin Yuan, ZhaoHui Zheng, Yuxuan Li, Xialei Liu, Li Liu, Xiang Li, Qibin Hou#, Ming-Ming Cheng#

Arxiv, 2025

[Arxiv] [Code] [Zhihu] [PaperWithCode]

DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction

Yunheng Li, Yuxuan Li, Quansheng Zeng, Wenhai Wang, Qibin Hou#, Ming-Ming Cheng

Arxiv, 2024

[Arxiv] [Code]

Selected Journal Publications (Google Scholar)

Yolo-ms: rethinking multi-scale representation learning for real-time object detection

Yuming Chen, Xinbin Yuan, Ruiqi Wu, Jiabao Wang, Qibin Hou#, Ming-Ming Cheng

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

[Arxiv] [Code]

Conv2former: A simple transformer-style convnet for visual recognition

Qibin Hou, Cheng-Ze Lu, Ming-Ming Cheng#, Jiashi Feng

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

[Arxiv] [Code]

Camoformer: Masked separable attention for camouflaged object detection

Bowen Yin*, Xuying Zhang*, Deng-Ping Fan, Shaohui Jiao, Ming-Ming Cheng, Luc Van Gool, Qibin Hou#

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

[Arxiv] [Code]

Vision permutator: A permutable mlp-like architecture for visual recognition

Qibin Hou, Zihang Jiang, Li Yuan, Ming-Ming Cheng, Shuicheng Yan, Jiashi Feng

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

[Arxiv] [Code]

Volo: Vision outlooker for visual recognition

Li Yuan*, Qibin Hou*, Zihang Jiang*, Jiashi Feng, Shuicheng Yan

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

[Arxiv] [Code]

Localization distillation for object detection

Zhaohui Zheng, Rongguang Ye, Qibin Hou#, Dongwei Ren, Ping Wang, Wangmeng Zuo, Ming-Ming Cheng

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

[Arxiv] [Code]

Deeply Supervised Salient Object Detection with Short Connections

Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip Torr

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019

[Arxiv] [Code]

Selected Conference Publications (Google Scholar)

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Yupeng Zhou, Daquan Zhou#, Ming-Ming Cheng, Jiashi Feng, Qibin Hou#

Neural Information Processing Systems (NeurIPS), 2024

[Arxiv] [Project] [Code]

OPUS: Occupancy Prediction Using a Sparse Set

Jiabao Wang*, Zhaojiang Liu*, Qiang Meng, Liujiang Yan, Ke Wang, Jie Yang, Wei Liu, Qibin Hou#, Ming-Ming Cheng

Neural Information Processing Systems (NeurIPS), 2024

[Arxiv] [Code]

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Yunheng Li, ZhongYu Li, Quansheng Zeng, Qibin Hou#, Ming-Ming Cheng

International Conference on Machine Learning (ICML), 2024

[Arxiv] [Code]

CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection

Jiabao Wang*, Yuming Chen*, Zhaohui Zheng, Xiang Li, Ming-Ming Cheng, Qibin Hou#

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

[Arxiv] [Code]

Dformer: Rethinking rgbd representation learning for semantic segmentation

Bowen Yin, Xuying Zhang, Zhongyu Li, Li Liu, Ming-Ming Cheng, Qibin Hou#

International Conference on Learning Representations (ICLR), 2024

[Arxiv] [Code]

SRFormer: Permuted Self-Attention for Single Image Super-Resolution

Yupeng Zhou, Zhen Li, Chun-Le Guo, Song Bai, Ming-Ming Cheng, Qibin Hou#

IEEE International Conference on Computer Vision (ICCV), 2023

[Arxiv] [Code]

Coordinate attention for efficient mobile network design

Qibin Hou, Daquan Zhou, Jiashi Feng

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

[Arxiv] [Code]

Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

Qibin Hou, Li Zhang, Ming-Ming Cheng, Jiashi Feng

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

[Arxiv] [Code]

Academic Services

  • VALSE 2022 Expo Chair
  • VALSE 2023 Expo Chair
  • Reviewers for TPAMI/TIP/CVPR/ICCV/NeurIPS/ICML/ICLR etc.
  • Area chair for ICCV’25

Honors and Awards

  • First prize in natural science, Ministry of Education, 2022.
  • Second prize in nature science, CAAI, 2020.