avatar

Xu YUAN(原 旭)

PhD Candidate in Computer Science, Hong Kong Polytechnic University

Biography

☀️ I’m currently a second-year PhD student at the Department of Computing, The Hong Kong Polytechnic University (PolyU), advised by Prof. Qing Li and Prof. Wenqi Fan. I got my Master’s degree from Harbin Institute of Technology, Shenzhen (HITSZ) in March 2024, under the supervision of Prof. Zheng Zhang. Before that, I received my Bachelor’s degree from Harbin Institute of Technology at Weihai in June 2021.

News

🔥 [Apr, 2026]   Our paper “mKG-RAG” has been accepted by SIGIR 2026! 🎉 🎉 🎉

🔥 [Feb, 2026]   Our paper “SUPERGLASSES” has been accepted by CVPR 2026 Findings! 🎉 🎉 🎉

🔥 [Dec, 2025]   Our paper “QA-Dragon” has been accepted by 2025 KDD Cup! 🎉 🎉 🎉

🔥 [May, 2025]   Our paper “HiBench” has been accepted by KDD 2025! 🎉 🎉 🎉

🔥 [Dec, 2024]   One paper has been accepted by AAAI 2025! 🎉 🎉 🎉

Research Interest

🚀 [Multimodal Large Language Models]: Retrieval-Augmented Generation, Instruction Tuning

🚀 [Multimedia Learning]: Deep Hashing Retrieval, Multimodal Retrieval

🚀 [Trustworthy Maching Learning]: Adversarial Examples, Backdoor Learning

Education

  • PhD Candidate in Computer Science, 2024.09-2027.09 (expected)
    The Hong Kong Polytechnic University, advised by Prof. Qing Li and Prof. Wenqi Fan

  • M.E. in Computer Science and Technology, 2021.09 - 2024.03
    Harbin Institute of Technology, Shenzhen, advised by Prof. Zheng Zhang

  • B.E. in Computer Science and Technology, 2017.09 - 2021.06
    Harbin Institute of Technology, Weihai

Experiences

  • March. 2024 - August. 2024: Research Intern
    • TAO Technology, Alibaba Group, Beijing
    • Working on vision and language understanding

Publications

(*) beside authors’ names indicates equal contributions.

mKG-RAG: Leveraging Multimodal Knowledge Graphs in Retrieval-Augmented Generation for Knowledge-intensive VQA

Xu Yuan, Liangbo Ning, Qingqing Ye, Wenqi Fan, and Qing Li

Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2026. [CCF-A]

SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses

Zhuohang Jiang*, Xu Yuan*, Haohao Qu, Shanru Lin, Kanglong Liu, Wenqi Fan, Qing Li

Findings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026. [CCF-A]

HiBench: Benchmarking LLMs Capability on Hierarchical Structure Reasoning

Zhuohang Jiang*, Pangjing Wu*, Ziran Liang*, Peter Q. Chen*, Xu Yuan*, Ye Jia*, Tu Jiancheng*, Chen Li, Peter H. F. Ng, Qing Li

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025. [CCF-A]

Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model

Xu Yuan*, Li Zhou*, Zenghui Sun, Zikun Zhou, and Jinsong Lan

Proceedings of the 39th Association for the Advancement of Artificial Intelligence (AAAI), 2025. [CCF-A]

This work is done during an internship at Alibaba Group.

BadCM: Invisible Backdoor against Cross-Modal Learning

Zheng Zhang*, Xu Yuan*, Lei Zhu, Jingkuan Song, and Liqiang Nie

IEEE Transactions on Image Processing (TIP), 33: 2558-2571, 2024. [CCF-A]

Semantic-Aware Adversarial Training for Reliable Deep Hashing

Xu Yuan, Zheng Zhang, Xunguang Wang, and Lin Wu

IEEE Transactions on Information Forensics and Security (TIFS), 18: 4681-4694, 2023. [CCF-A]

Academic Service

  • Serving as a conference reviewer of ICML, CVPR, AAAI, KDD, SIGIR, and ACM MM.
  • Serving as a journal reviewer of TMM, TIFS, TKDD, and Information Fusion.

Honors and Awards

  • 中国电子学会优秀硕士论文, 2025
  • PolyU Presidential PhD Fellowship, 2024
  • Outstanding Master Thesis Award, Harbin Institute of Technology, 2024
  • Binxing Fang Scholarship, 2023
  • Outstanding Graduate, Harbin Institute of Technology, 2021.
  • Outstanding Bachelor Thesis Award, Harbin Institute of Technology, 2021.
  • First price, the 19th National Undergraduate Robotics Contest (ROBOCON), China, 2020.