About Me
Currently, I am a Research Intern at StepFun, mentored by Dr. Quan Sun. Before that, I was a Research Intern at MSRA, mentored by Dr. Dong Chen. I also closely work with Jingwei Wu, Guopeng Li, Ruichuan An.
My research interests focus on unified understanding and generation in MLLM. Please email me if you want to collaborate on academic research or have any questions.
📝 Publications
GEBench: Benchmarking Image Generation Models as GUI Environments
Haodong Li, Juanxi Tian, Jingwei Wu, Quan Sun, Guopeng Li, Huanyu Zhang, Yanlin Lai, Ruichuan An, Hongbo Peng, Yuhong Dai, Chenxi Li, Chunmei Qing, Zheng Ge, Xiangyu Zhang, Daxin Jiang
ArXiv, 2026
[
ArXiv] [
Code]
UniBench: Benchmarking Unified Any-to-Any Interleaved Multimodal Learning
Yanlin Li, Minghui Guo, Kaiwen Zhang, Shize Zhang, Yiran Zhao,
Haodong Li, Congyue Zhou, Weijie Zheng, Yushen Yan, Shengqiong Wu, Wei Ji, Lei Cui, †Furu Wei, Hao Fei, Mong-Li Lee, Wynne Hsu
CVPR 2026
[
ArXiv] [
Code]
GENIUS: Generative Fluid Intelligence Evaluation Suite
Ruichuan An, Sihan Yang, Ziyu Guo, Wei Dai, Zijun Shen,
Haodong Li, Renrui Zhang, Xinyu Wei, Guopeng Li, Wenshan Wu, Wentao Zhang
ArXiv, 2026
[
ArXiv] [
Code]
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
Dongzhi Jiang, Renrui Zhang,
Haodong Li, Zhuofan Zong, Ziyu Guo, Jun He, Claire Guo, Junyan Ye, Rongyao Fang, Weijia Li, Rui Liu, †Hongsheng Li
CVPR 2026 Findings
[
ArXiv] [
Code]
VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing
Huanyu Zhang, Xuehai Bai, Chengzu Li, Chen Liang, Haochen Tian,
Haodong Li, Ruichuan An, Yifan Zhang, Anna Korhonen, Zhang Zhang, Liang Wang, Tieniu Tan
ArXiv, 2026
[
ArXiv] [
Code]
Chain of Mindset: Reasoning with Adaptive Cognitive Modes
Tianyi Jiang, Arctanx An, Hengyi Feng, Naixin Zhai,
Haodong Li, Xiaomin Yu, Jiahui Liu, Hanwen Du, Shuo Zhang, Zhi Yang, Jie Huang, Yuhua Li, Yongxin Ni, Huacan Wang, Ronghao Chen
ArXiv, 2026
[
ArXiv] [
Code]
M2A: Multimodal Memory Agent with Dual-Layer Hybrid Memory for Long-Term Personalized Interactions
Junyu Feng, Binxiao Xu, Jiayi Chen, Mengyu Dai, Cenyang Wu,
Haodong Li, Bohan Zeng, Yunliu Xie, Hao Liang, Ming Lu, Wentao Zhang
ArXiv, 2026
[
ArXiv] [
Code]
AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning
Binxiao Xu, Junyu Feng, Xiaopeng Lin,
Haodong Li, Zhiyuan Feng, Bohan Zeng, Shaolin Lu, Ming Lu, Qi She, Wentao Zhang
ArXiv, 2026
[
ArXiv] [
Code]
R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging
Yanlin Lai, Mitt Huang, Hangyu Guo, Xiangfeng Wang,
Haodong Li, Shaoxiong Zhan, Liang Zhao, Chengyuan Yao, Yinmin Zhang, Qi Han, Chun Yuan, Zheng Ge, Xiangyu Zhang, Daxin Jiang
ArXiv, 2026
[
ArXiv] [
Code]
💻 Experiences
2025.10 - Present, Research Intern, StepFun, Foundation Model Group
2025.03 - 2025.09, Research Intern, Microsoft Research Asia (MSRA), Visual Computing Group
🖼️ Gallery
You can zoom images by clicking 🤫