I am an Assistant Professor and Master’s Supervisor at the Intelligent Media Research Center (智能媒体研究中心, iLearn, iLearn-Lab
), Shandong University. I received my B.S., M.S., and Ph.D. degrees from Harbin Institute of Technology under the supervision of Prof. Ping Fu. I have received honors including the ACM Jinan Chapter Rising Star Award, Huawei Outstanding Technical Collaboration Achievement Award, and the Shandong University Young Scholars Future Program.
My research focuses on multimodal visual content understanding, particularly multi-source visual salient object segmentation and cross-modal referring image segmentation, forming a systematic research framework spanning visual saliency segmentation, cross-modal semantic understanding, and referring image segmentation/grounding. I have led multiple competitive projects, including grants from the National Natural Science Foundation of China (General and Young Scientists Programs), the Shandong Provincial Natural Science Foundation, key R&D programs, and the Huawei MindSpore Academic Award Fund, and have participated in over 10 national and industry-funded projects. I have published more than 30 papers in top-tier IEEE/ACM Trans, CCF-A venues.
I also serves as a committee member of several CSIG/CAA technical committees and as reviewers or guest editors for leading international journals and conferences such as IEEE TPAMI and CVPR.
🔥 News
-
2026.04: 🎉🎉🎉 One paper on Salient Object Detection has been accepted by IEEE Transactions on Instrumentation and Measurement (IEEE TIM), and one paper on Egocentric Action Recognition has been accepted by ICMR’26.
-
2026.01: 🎉🎉🎉 One paper on Composed video retrieval has been accepted by ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMM).
-
2025.10: We are organizing a Special Issue titled “Advances in Deep Learning for Open-World Computer Vision and Pattern Recognition” in Electronics (SCI, IF = 2.6). Submissions are welcome link.
-
2025.06: Our lab has made progress in end-to-end superpixel image segmentation, with the related work accepted by IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT). Congratulations to all collaborators!
-
2025.05: Our lab has achieved research progress in salient object detection for remote sensing images and infrared small target detection. The corresponding works have been accepted by IEEE Transactions on Cybernetics (IEEE TCYB) and IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), respectively. Congratulations to all collaborators!
-
2025.04: Our lab has made progress in data distillation, heterogeneous semantic segmentation model distillation, and few-shot compositional image retrieval. The related works have been accepted by CVPR 2025, ICMR 2025, and IJCNN 2025. Congratulations to all collaborators!
-
2024.11: Our lab achieved progress in cross-modal referring image segmentation. The paper “CMIRNet: Cross-Modal Interactive Reasoning Network for Referring Image Segmentation” has been accepted by IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT) (SCI, IF = 11.1, CAS Zone 1, Top journal). Congratulations to collaborators Tianxiang Xiao (junior undergraduate) and Yutong Liu (junior undergraduate), among others.
-
2024.08: Congratulations to our undergraduate research assistants Yutong Liu (junior undergraduate) and Tianxiang Xiao (junior undergraduate) for their progress in salient object detection for optical remote sensing images. Their work, “Heterogeneous Feature Collaboration Network for Salient Object Detection in Optical Remote Sensing Images,” has been accepted by the top journal IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS) (SCI, IF = 8.6, CAS Zone 1).
-
2023.07: Congratulations to our undergraduate research assistants Xiangyu Zeng (junior undergraduate) and Yijun Hu (sophomore undergraduate) for their progress in salient object detection for optical remote sensing images. Their work, “Adaptive Edge-aware Semantic Interaction Network for Salient Object Detection in Optical Remote Sensing Images,” has been accepted by the top journal IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS) (SCI, IF = 8.6, CAS Zone 1).
📝 Publications
🎙 Referring Expression Segmentation/Grounding (RES/REG)

CMIRNet: Cross-Modal Interactive Reasoning Network for Referring Image Segmentation
Mingzhu Xu, Tianxiang Xiao, Yutong Liu, Haoyu Tang, Yupeng Hu, and Liqiang Nie.
ICME 2024Two-Stage Information Bottleneck For Temporal Language Grounding, Haoyu Tang, Shuaike Zhang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie.
🎙 Multimedia Information Retrieval (MIR)

A comprehensive survey on composed image retrieval
Xuemeng Song, Haoqiang Lin, Haokun Wen, Bohan Hou, Mingzhu Xu*, Liqiang Nie

Refine: Composed video retrieval via shared and differential semantics enhancement
Yupeng Hu, Zixu Li, Zhiwei Chen, Qinlei Huang, Zhiheng Fu, Mingzhu Xu*, Liqiang Nie
IJCNN 2025Pseudo Triplet Guided Few-shot Composed Image Retrieval, Bohan Hou, Haoqiang Lin, Haokun Wen, Meng Liu, Mingzhu Xu, Xuemeng SongESWA 2025Dual-space relation-aware entity representation learning for personalized compatibility modeling, Jinhuan Liu, Xu Cui, Xuemeng Song, Yanwei Yu, Mingzhu Xu, Junwei Du
🎙 Optical Remote Sensing Image Salient Object Detection (ORSI-SOD) & InfRared Small Target Detection (IRSTD)

Cross-Model Nested Fusion Network for Salient Object Detection in Optical Remote Sensing Images
Mingzhu Xu, Sen Wang, Yupeng Hu, Haoyu Tang, Runmin Cong, Liqiang Nie.

HDNet: A Hybrid Domain Network with Multi-Scale High-Frequency Information Enhancement for Infrared Small Target Detection
Mingzhu Xu, Chenglong Yu, Zexuan Li, Haoyu Tang, Yupeng Hu, Liqiang Nie.

Heterogeneous Feature Collaboration Network for Salient Object Detection in Optical Remote Sensing Images
Yutong Liu, Mingzhu Xu*, Tianxiang Xiao, Haoyu Tang, Yupeng Hu, and Liqiang Nie.

Adaptive Edge-aware Semantic Interaction Network for Salient Object Detection in Optical Remote Sensing Images
Xiangyu Zeng, Mingzhu Xu*, Yijun Hu, Haoyu Tang, Yupeng Hu, and Liqiang Nie.

Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images
Lina Gao, Bing Liu, Ping Fu, Mingzhu Xu.
🎙 Superpixel Segmentation & Semantic Segmentation (SS)

Superpixel Segmentation With Edge Guided Local-Global Attention Network
Mingzhu Xu, Zhengyu Sun, Yijun Hu, Haoyu Tang, Yupeng Hu, Xuemeng Song, Liqiang Nie.

Heterogeneous Model Knowledge Distillation via Dual Alignment for Semantic Segmentation
Mingzhu Xu, Jing Wang, Mingcai Wang, Yiping Li, Yupeng Hu, Xuemeng Song, Weili Guan.
🎙 Natural Scenes – Single-Model / Multimodal Salient Object Detection (SOD)

UMINet: a unified multi-modality interaction network for RGB-D and RGB-T salient object detection
Lina Gao, Ping Fu, Mingzhu Xu, Tiantian Wang, Bing Liu.

Multi-Stream Attention-Aware Graph Convolution Network for Video Salient Object Detection
Mingzhu Xu, Ping Fu, Bing Liu, Junbao Li.
TCSVT 2020Video Salient Object Detection via Robust Seeds Extraction and Multi-graphs Manifold Propagation, Mingzhu Xu, Bing Liu, Ping Fu, Junbao Li, Yu Hen Hu, Shou Feng.TMM 2019Video Saliency Detection via Graph Clustering With Motion Energy and Spatiotemporal Objectness, Mingzhu Xu, Bing Liu, Ping Fu, Junbao Li, Yu Hen Hu.APIN 2022A Novel Dynamic Graph Evolution Network for Salient Object Detection, Mingzhu Xu, Ping Fu, Bing Liu, Hongtao Yin, Junbao Li.TIM 2025Self-Supervised Pre-training with Multi-modality Representation Enhancement for Salient Object Detection in RGB-D Images, Lina Gao, Bing Liu, Ping Fu, Mingzhu Xu, Yonggang Zhang, Yulong Huang.PR 2024TSVT: Token Sparsification Vision Transformer for robust RGB-D salient object detection, Lina Gao, Bing Liu, Ping Fu, Mingzhu Xu.NP 2023Depth-aware inverted refinement network for RGB-D salient object detection, Lina Gao, Bing Liu, Ping Fu, Mingzhu Xu.电子与信息学报 2023Salient Object Detection Based on Multiple Graph Neural Networks Collaborative Learning, Bing LIU, Tiantian Wang, Lina Gao, Mingzhu Xu*, Ping Fu.APIN 2022A novel spatiotemporal attention enhanced discriminative network for video salient object detection, Bing Liu, Kezhou Mu, Mingzhu Xu, Fangyuan Wang, Lei Feng.ICIP 2021Co-saliency detection via unified hierarchical graph neural network with geometric attention, Jiaqing Qiao, Shaowei Sun, Mingzhu Xu, Yongqiang Li, Bing Liu.
🎙 Others
CVPR 2025Towards stable and storage-efficient dataset distillation: Matching convexified trajectory, Wenliang Zhong, Haoyu Tang, Qinghai Zheng, Mingzhu Xu, Yupeng Hu, Weili Guan.MM 2024Revisiting unsupervised temporal action localization: The primacy of high-quality actionness and pseudolabels, Han Jiang, Haoyu Tang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Jihua Zhu, Liqiang Nie.AAAI 2024Exploiting the social-like prior in transformer for visual reasoning, Yudong Han, Yupeng Hu, Xuemeng Song, Haoyu Tang, Mingzhu Xu, Liqiang Nie.InFu 2024Listen as you wish: Fusion of audio and text for cross-modal event detection in smart cities, Haoyu Tang, Yupeng Hu, Yunxiao Wang, Shuaike Zhang, Mingzhu Xu, Jihua Zhu, Qinghai Zheng.IJCAI 2024Breaking Barriers of System Heterogeneity: Straggler-Tolerant Multimodal Federated Learning via Knowledge Distillation., Jinqian Chen, Haoyu Tang, Junhao Cheng, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie.TITS 2023Saliency-induced moving object detection for robust RGB-D vision navigation under complex dynamic environments, Chao Sun, Xing Wu, Jia Sun, Changyin Sun, Mingzhu Xu, Quanbo Ge.NCAA 2023CoGCN: co-occurring item-aware GCN for recommendation, Xinxiao Zhao, Fan Liu, Hao Liu, Mingzhu Xu, Haoyu Tang, Xueqing Li, Yupeng Hu.APIN 2022Visual tracking via dynamic saliency discriminative correlation filter, Lina Gao, Bing Liu, Ping Fu, Mingzhu Xu, Junbao Li.Electronics 2019Research and Implementation of ε-SVR Training Method Based on FPGA, Ruidong Wu, Bing Liu, Jiafeng Fu, Mingzhu Xu, Ping Fu, Junbao Li.
🎖 Honors and Awards
- 2025.09 ACM Jinan Chapter Rising Star Award
- 2024.12 7th National Undergraduate Embedded Chip and System Design Competition, Third Prize (National Level)
- 2024.09 Outstanding Technical Collaboration Award (Approximate Nearest Neighbor Search Research), Huawei
- 2024.07 “Outstanding Undergraduate Thesis”, Shandong University (Supervised two undergraduate students)
- 2024.06 Mathematical Contest in Modeling, Honorable Mention
- 2023.12 Young Faculty Teaching Competition, Shandong University, Second Prize (University Level)
- 2023.11 National Undergraduate Mathematical Contest in Modeling (Supervised Undergraduate Students), First Prize (Shandong Division)
- 2023.11 Postdoctoral Haihe Academic Exchange Activity on Information Technology Innovation and Digital Economy, Third Prize
📖 Educations
- 2015.09 - 2021.01, Harbin Institute of Technology, Ph.D. in Engineering.
- 2013.09 - 2015.07, Harbin Institute of Technology, M.S. in Engineering.
- 2009.09 - 2013.07, Harbin Institute of Technology, B.S. in Engineering.
🎓 Academic Service
- ⚖️ Guest Editors: Electronics, …
- 📝 Reviewer: IEEE TPAMI, TIP, TCYB, TMM, TKDE, TCSVT, TGRS, TITS, TIM, …
- ✍️ PC Member: CVPR, ICCV, ECCV, ICML, ICLR, ACM MM, AAAI, IJCAI, …
- 🏛️ Conference Service: Session Chair, Area Chair, …
