avatar

Xu YUAN(原 旭)

PhD Candidate in Computer Science, Hong Kong Polytechnic University

Biography

☀️ I’m currently a second-year PhD student at the Department of Computing, The Hong Kong Polytechnic University (PolyU), advised by Prof. Qing Li and Prof. Wenqi Fan. I got my Master’s degree from Harbin Institute of Technology, Shenzhen (HITSZ) in March 2024, under the supervision of Prof. Zheng Zhang. Before that, I received my Bachelor’s degree from Harbin Institute of Technology at Weihai in June 2021.

News

🔥 [Apr, 2026]   Our paper “mKG-RAG” has been accepted by SIGIR 2026! 🎉 🎉 🎉

🔥 [Feb, 2026]   Our paper “SUPERGLASSES” has been accepted by CVPR 2026 Findings! 🎉 🎉 🎉

🔥 [Dec, 2025]   Our paper “QA-Dragon” has been accepted by 2025 KDD Cup! 🎉 🎉 🎉

🔥 [May, 2025]   Our paper “HiBench” has been accepted by KDD 2025! 🎉 🎉 🎉

🔥 [Dec, 2024]   One paper has been accepted by AAAI 2025! 🎉 🎉 🎉

Research Interest

🚀 [Multimodal Large Language Models]: Retrieval-Augmented Generation, Instruction Tuning

🚀 [Multimedia Learning]: Deep Hashing Retrieval, Multimodal Retrieval

🚀 [Trustworthy Maching Learning]: Adversarial Examples, Backdoor Learning

Education

  • PhD Candidate in Computer Science, 2024.09-2027.09 (expected)
    The Hong Kong Polytechnic University, advised by Prof. Qing Li and Prof. Wenqi Fan

  • M.E. in Computer Science and Technology, 2021.09 - 2024.03
    Harbin Institute of Technology, Shenzhen, advised by Prof. Zheng Zhang

  • B.E. in Computer Science and Technology, 2017.09 - 2021.06
    Harbin Institute of Technology, Weihai

Experiences

  • March. 2024 - August. 2024: Research Intern
    • TAO Technology, Alibaba Group, Beijing
    • Working on vision and language understanding

Publications

(*) beside authors’ names indicates equal contributions.

mKG-RAG: Multimodal Knowledge Graph-Enhanced RAG for Visual Question Answering

Xu Yuan, Liangbo Ning, Wenqi Fan, and Qing Li

Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2026. [CCF-A]

SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses

Zhuohang Jiang*, Xu Yuan*, Haohao Qu, Shanru Lin, Kanglong Liu, Wenqi Fan, Qing Li

Findings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026. [CCF-A]

HiBench: Benchmarking LLMs Capability on Hierarchical Structure Reasoning

Zhuohang Jiang*, Pangjing Wu*, Ziran Liang*, Peter Q. Chen*, Xu Yuan*, Ye Jia*, Tu Jiancheng*, Chen Li, Peter H. F. Ng, Li Qing

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025. [CCF-A]

Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model

Xu Yuan*, Li Zhou*, Zenghui Sun, Zikun Zhou, and Jinsong Lan

Proceedings of the 39th Association for the Advancement of Artificial Intelligence (AAAI), 2025. [CCF-A]

This work is done during an internship at Alibaba Group.

BadCM: Invisible Backdoor against Cross-Modal Learning

Zheng Zhang*, Xu Yuan*, Lei Zhu, Jingkuan Song, and Liqiang Nie

IEEE Transactions on Image Processing (TIP), 33: 2558-2571, 2024. [CCF-A]

Semantic-Aware Adversarial Training for Reliable Deep Hashing

Xu Yuan, Zheng Zhang, Xunguang Wang, and Lin Wu

IEEE Transactions on Information Forensics and Security (TIFS), 18: 4681-4694, 2023. [CCF-A]

Academic Service

  • Serving as a conference reviewer of ICML, CVPR, AAAI, KDD, SIGIR, and ACM MM.
  • Serving as a journal reviewer of TMM, TIFS, TKDD, and Information Fusion.

Honors and Awards

  • 中国电子学会优秀硕士论文, 2025
  • PolyU Presidential PhD Fellowship, 2024
  • Outstanding Master Thesis Award, Harbin Institute of Technology, 2024
  • Binxing Fang Scholarship, 2023
  • Outstanding Graduate, Harbin Institute of Technology, 2021.
  • Outstanding Bachelor Thesis Award, Harbin Institute of Technology, 2021.
  • First price, the 19th National Undergraduate Robotics Contest (ROBOCON), China, 2020.