Haiyang Mei (梅海洋) | Email: haiyang.mei@outlook.com | WeChat: mhy845879017
I am currently a Research Fellow at Show Lab, National University of Singapore (NUS), under the supervision of Prof. Mike Zheng Shou. Before that, I received my Ph.D. from Dalian University of Technology, where I was supervised by Prof. Xiaopeng Wei and Prof. Xin Yang and co-supervised by Rynson W.H. Lau and Bo Dong. From Jan 2022 to Jan 2023, I was a visiting Ph.D. student in the Sensors Group, Institute of Neuroinformatics, University of Zurich and ETH Zurich, Switzerland, supervised by Prof. Tobi Delbruck.
My primary research interest is in designing effective visual understanding models for vision systems. This work can help AI agents with scene-level understanding, reasoning, and decision making.
		
		
My recent endeavor is on scene confusing-discovery, which aims to mine confusing or special yet meaningful objects and regions in a scene. In particular, I focus on glass, mirrors, camouflaged objects, and salient objects, which are very common in daily-life scenes but can confuse vision systems because of their inherently special properties. Detecting and segmenting such objects therefore plays an essential role in accurate scene understanding and can benefit a wide range of computer vision, graphics, and multimedia applications, including image classification, visual tracking, content-aware image editing, medical image diagnosis, and robotic navigation. However, this task has not been fully explored and remains an unsolved and challenging problem. Glass and mirrors have no visual appearance of their own; they only transmit or reflect the appearance of their surroundings, which makes them fundamentally different from the common objects that state-of-the-art segmentation methods already handle well. A camouflaged object is "seamlessly" embedded in its surroundings, while a salient object is the most attention-grabbing one in the scene; both can easily be cluttered by the background in complex scenes. I am working toward exploring useful cues and effective methods for accurate segmentation.
		
		
Besides, my early work was on image super-resolution, which reconstructs a high-quality, visually satisfying high-resolution image from a low-resolution input and is a cornerstone for providing more detailed information for scene analysis and understanding.
	
| 2025 | ||
| 1. |  | SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost Haiyang Mei, Pengyu Zhang, Mike Zheng Shou. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 [arXiv|Open Access|Project Page] | 
| 2. |  | Can I Trust You? Advancing GUI Task Automation with Action Trust Score Haiyang Mei, Difei Gao, Xiaopeng Wei, Xin Yang, Mike Zheng Shou. The 33rd ACM International Conference on Multimedia (ACM MM) 2025 [Open Access|Project Page] | 
| 3. |  | You Only Communicate Once: One-shot Federated Low-Rank Adaptation of MLLM Binqian Xu, Haiyang Mei, Zechen Bai, Jinjin Gong, Rui Yan, Guo-Sen Xie, Yazhou Yao, Basura Fernando, Xiangbo Shu. The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS) 2025 [PDF|Project Page] | 
| 4. |  | InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback Henry Hengyuan Zhao, Wenqi Pei, Yifei Tao, Haiyang Mei, Mike Zheng Shou. The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2025 [PDF|Project Page] | 
| 5. |  | FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data Binqian Xu, Xiangbo Shu, Haiyang Mei, Guosen Xie, Basura Fernando, Mike Zheng Shou, Jinhui Tang. arXiv:2411.14717 [PDF|Project Page] | 
| 2024 | ||
| 1. |  | Skip \n: A Simple Method to Reduce Hallucination in Large Vision-Language Models Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shou. The Twelfth International Conference on Learning Representations Workshops (ICLRW) 2024 [PDF|Project Page] | 
| 2. |  | Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition Yang Wang, Haiyang Mei, Qirui Bao, Ziqi Wei, Mike Zheng Shou, Haizhou Li, Bo Dong, Xin Yang. The 33rd International Joint Conference on Artificial Intelligence (IJCAI) 2024 [PDF|Project Page] | 
| 3. |  | One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Zechen Bai, Tong He, Haiyang Mei, Pichao Wang, Ziteng Gao, Joya Chen, Lei Liu, Zheng Zhang, Mike Zheng Shou. The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) 2024 [PDF|Project Page] | 
| 4. |  | DoFIT: Domain-aware Federated Instruction Tuning with Alleviated Catastrophic Forgetting Binqian Xu, Xiangbo Shu, Haiyang Mei, Zechen Bai, Basura Fernando, Mike Zheng Shou, Jinhui Tang. The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) 2024 [PDF|Project Page] | 
| 5. |  | Steel Sheet Counting from an Image with a Two-Stream Network Zhiling Cui, Haiyang Mei, Wen Dong, Ziqi Wei, Zheng Lv, Dongsheng Zhou, Xin Yang. IEEE Transactions on Instrumentation & Measurement (TIM) 2024 [PDF|Project Page] | 
| 2023 | ||
| 1. |  | Deep Polarization Reconstruction with PDAVIS Events Haiyang Mei, Zuowen Wang, Xin Yang, Xiaopeng Wei, Tobi Delbruck. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 [PDF|Project Page] | 
| 2. |  | Camouflaged Object Segmentation with Omni Perception Haiyang Mei, Ke Xu, Yunduo Zhou, Yang Wang, Haiyin Piao, Xiaopeng Wei, Xin Yang. International Journal of Computer Vision (IJCV) 2023 [PDF|Project Page] | 
| 3. |  | Distraction-Aware Camouflaged Object Segmentation Haiyang Mei, Xin Yang, Yunduo Zhou, Ge-Peng Ji, Xiaopeng Wei, Deng-Ping Fan. SCIENTIA SINICA Informationis (SSI) 2023 [PDF|Project Page] | 
| 4. |  | A Method for Generating Adversarial Patterns in Facial Recognition with Visual Camouflage Qirui Bao, Haiyang Mei, Huilin Wei, Zheng Lv, Yuxin Wang, Erwei Yin, Xin Yang. CADCG 2023 [PDF|Project Page] | 
| 5. |  | Event-Enhanced Multi-Modal Spiking Neural Network for Dynamic Obstacle Avoidance Yang Wang, Bo Dong, Yuji Zhang, Yunduo Zhou, Haiyang Mei, Ziqi Wei, Xin Yang. ACM International Conference on Multimedia (MM) 2023 [PDF|Project Page] | 
| 6. |  | Live Demo: E2P–Events to Polarization Reconstruction from PDAVIS Events Tobi Delbruck, Zuowen Wang, Haiyang Mei, Germain Haessig, Damien Joubert, Justin Haque, Yingkai Chen, Moritz B. Milde, Viktor Gruev. The IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023 [PDF|Project Page] | 
| 7. |  | Exploiting Polarized Material Cues for Robust Car Detection Wen Dong, Haiyang Mei, Ziqi Wei, Ao Jin, Sen Qiu, Qiang Zhang, Xin Yang. Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI) 2024 [PDF|Project Page] | 
| 2022 | ||
| 1. |  | Glass Segmentation using Intensity and Spectral Polarization Cues Haiyang Mei, Bo Dong, Wen Dong, Jiaxi Yang, Seung-Hwan Baek, Felix Heide, Pieter Peers, Xiaopeng Wei, Xin Yang. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 [PDF|Project Page] | 
| 2. |  | Progressive Glass Segmentation Letian Yu*, Haiyang Mei*, Wen Dong, Ziqi Wei, Li Zhu, Yuxin Wang, Xin Yang. (* joint first authors) IEEE Transactions on Image Processing (TIP) 2022 [PDF|Project Page] | 
| 3. |  | Large-Field Contextual Feature Learning for Glass Detection Haiyang Mei, Xin Yang, Letian Yu, Qiang Zhang, Xiaopeng Wei, Rynson W.H. Lau. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022 [PDF|Project Page] | 
| 4. |  | Mirror Segmentation via Semantic-Aware Contextual Contrasted Feature Learning Haiyang Mei, Letian Yu, Ke Xu, Yang Wang, Xin Yang, Xiaopeng Wei, Rynson W.H. Lau. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 2022 [PDF|Project Page] | 
| 2021 | ||
| 1. |  | Depth-Aware Mirror Segmentation Haiyang Mei, Bo Dong, Wen Dong, Pieter Peers, Xin Yang, Qiang Zhang, Xiaopeng Wei. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021 (Oral) [PDF|Project Page] | 
| 2. |  | Camouflaged Object Segmentation with Distraction Mining Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, Deng-Ping Fan. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021 [PDF|Project Page] | 
| 3. |  | Exploring Dense Context for Salient Object Detection Haiyang Mei, Yuanyuan Liu, Ziqi Wei, Dongsheng Zhou, Xiaopeng Wei, Qiang Zhang, Xin Yang. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) 2021 [PDF|Project Page] | 
| 4. |  | A Two-Stage Attentive Network for Single Image Super-Resolution Jiqing Zhang, Chengjiang Long, Yuxin Wang, Haiyin Piao, Haiyang Mei, Xin Yang, Baocai Yin. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) 2021 [PDF|Project Page] | 
| 2020 | ||
| 1. |  | Don't Hit Me! Glass Detection in Real-World Scenes Haiyang Mei, Xin Yang, Yang Wang, Yuanyuan Liu, Shengfeng He, Qiang Zhang, Xiaopeng Wei, Rynson W.H. Lau. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020 [PDF|Project Page] | 
| 2. |  | Multi-Context And Enhanced Reconstruction Network For Single Image Super Resolution Jiqing Zhang, Chengjiang Long, Yuxin Wang, Xin Yang, Haiyang Mei, Baocai Yin. IEEE International Conference on Multimedia and Expo (ICME) 2020 [PDF|Project Page] | 
| 2019 | ||
| 1. |  | Where Is My Mirror? Xin Yang*, Haiyang Mei*, Ke Xu, Xiaopeng Wei, Baocai Yin, Rynson Lau. (* joint first authors) IEEE International Conference on Computer Vision (ICCV) 2019 [PDF|Project Page] | 
| 2018 | ||
| 1. |  | DRFN: Deep Recurrent Fusion Network for Single Image Super-Resolution with Large Factors Xin Yang, Haiyang Mei, Jiqing Zhang, Ke Xu, Baocai Yin, Qiang Zhang, Xiaopeng Wei. IEEE Transactions on Multimedia (TMM) 2018 [PDF|Project Page] | 
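| 1.   An Image Super-Resolution Upscaling Method Using Regression Tree Fields.   201710859709.0 | 
| 2.   A Camouflaged Object Image Segmentation Method Based on Information Mining.   CN112750140B | 
| 3.   A Mirror Image Segmentation Method Based on Depth Perception.   CN112767418B | 
| 4.   A Method for Glass Detection in Real Scenes.   CN111339917B | 
| 5.   A Mirror Detection Method Based on Multi-Task Collaboration.   CN111339919B | 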
| 6.   Method for Glass Detection in Real Scenes.   US011361534B2 |