About Me

I am an incoming Ph.D. student at The Chinese University of Hong Kong (CUHK), advised by Prof. Pheng Ann Heng. I am also a member of OTeam, working closely with Odin Zhang. Before this, I received my bachelor degree in Artificial Intelligence from South China University of Technology.

Currently, I am a Research Intern at StepFun, mentored by Dr. Quan Sun. Before that, I was a Research Intern at Microsoft Research Asia(MSRA), mentored by Dr. Dong Chen.

My research interests include visual generation and visual understanding. Feel free to contact me by email if you are interested in collaborating with me. You can also discuss co-advising opportunities with my bros: Ruichuan An (PKU), Huanyu Zhang (CASIA), Yuhong Dai (StepFun), and Juanxi Tian (NTU).

🔥 News

  • 2026.02: 🎉Two papers (UNIM and Draco) are accepted by CVPR 2026.

📝 Selected Publications Full Publications List

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

CoCo
Haodong Li, Chunmei Qing, Huanyu Zhang, Dongzhi Jiang, Yihang Zou, Hongbo Peng, Dingming Li, Yuhong Dai, ZePeng Lin, Juanxi Tian, Yi Zhou, Siqi Dai
ArXiv, 2026
[ArXiv] [Code]

GEBench: Benchmarking Image Generation Models as GUI Environments

GEBench
Haodong Li, Juanxi Tian, Jingwei Wu, Quan Sun, Guopeng Li, Huanyu Zhang, Yanlin Lai, Ruichuan An, Hongbo Peng, Yuhong Dai, Chenxi Li, Chunmei Qing, Zheng Ge, Xiangyu Zhang, Daxin Jiang
ArXiv, 2026
[ArXiv] [Code] [Page]

Draco: Draft as CoT for Text-to-Image Preview and Rare Concept Generation

Draco
Dongzhi Jiang, Renrui Zhang, Haodong Li, Zhuofan Zong, Ziyu Guo, Jun He, Claire Guo, Junyan Ye, Rongyao Fang, Weijia Li, Rui Liu, †Hongsheng Li
CVPR 2026 Findings
[ArXiv] [Code]

UNIM: A Unified Any-to-Any Interleaved Multimodal Benchmark

UNIM
Yanlin Li, Minghui Guo, Kaiwen Zhang, Shize Zhang, Yiran Zhao, Haodong Li, Congyue Zhou, Weijie Zheng, Yushen Yan, Shengqiong Wu, Wei Ji, Lei Cui, †Furu Wei, Hao Fei, Mong-Li Lee, Wynne Hsu
CVPR 2026
[Page]

💻 Experiences

StepFun
2025.10 - Present, Research Intern, StepFun, Foundation Model Group
MSRA
2025.03 - 2025.09, Research Intern, Microsoft Research Asia (MSRA), Visual Computing Group

You can zoom images by clicking 🤫