Haiwen Diao

About Me

PROFESSIONAL PATH

I am currently a Research Fellow at MMLab@NTU, Nanyang Technological University, under the supervision of Prof. Ziwei Liu. Before that, I obtained my doctoral degree at IIAU-Lab, Dalian University of Technology, under the guidance of Prof. Huchuan Lu.

Apr. 2025 -- Present: Research fellow at MMLab@NTU with Prof. Ziwei Liu on Large Fundamental Models, Inference-Time Scaling and Reasoning.
Sep. 2023 -- Mar. 2025: Research intern at BAAI with Dr. Xinlong Wang on Large Multi-Modality Models for Understanding, Generation, and Unification.
Jan. 2023 -- Aug. 2023: Remote collaboration with Prof. Long Chen from HKUST and Dr. Bo Wan from KU Leuven on Parameter / Memory-Efficient Transfer Learning.
Jun. 2020 -- Mar. 2021: Research intern at Tencent AI Lab with Dr. Ying Zhang and Dr. Lin Ma on Image-Text Retrieval, Cross-Modal Boosting and Metric Learning.

Research Pursuits: Develop an efficient and reliable mechanism that can proficiently recognize visual-semantic perception, contextualize fine-grained interactions across modalities, and mimic human-like judgement and decision-making capabilities.
1. Vision-Language Retrieval:
SGRAF (AAAI'21), RCAR (TIP'23), DBL (TIP'24), GSSF (TIP'24)
2. Efficient Transfer Learning:
UniPT (CVPR'24), SHERL (ECCV'24), ReSoRA (ACMMM'25)
3. Multi-Modality Perception:
EVE (NeurIPS'24), EVEv2 (ICCV'25), NEO (2025)
DenseFusion (NeurIPS'24), Infinity-MM (2024), Visual Jigsaw (2025)
4. Multi-Modality Generation:
NOVA (ICLR'25), MoTrans (ACMMM'24)
5. Multi-Modality Unification:
ETT (NeurIPS'25)

Open Resources: [Awesome_Matching_Pretraining_Transfering]
[Awesome_Image_Text_Retrieval_Benchmark]