VisionXLab

@RethinkLab

Xue Yang

Assistant ProfessorPh.D. SupervisorShanghai Jiao Tong University

Links

Xue Yang

Assistant Professor, Ph.D. Supervisor

School of Automation and Intelligent Sensing, Shanghai Jiao Tong University

800 Dongchuan Road, Shanghai, 200240, China

📧 [email protected], [email protected], [email protected]

我正在寻找自驱力较强的攻读硕/博士 (2028年保研、拿到创智/中关村/河套等国家AI学院offer，提前进组实习是必须的，越早越好) 的学生、实习生（长期招收），与严骏驰教授共同指导，目标是在智能体、多模态大模型、空间智能、遥感影像解译等课题上做出有影响力的工作。请随时通过电子邮件与我联系。

Looking for self-motivated students (Master/Ph.D. 2028 spring & fall), interns to join us, co-supervised by Prof. Junchi Yan, with the goal of doing impactful work on the topic of Agentic AI, Multimodal Large Language Model, Spatial Intelligence, Remote Sensing Image Interpretation, etc. Please do not hesitate to contact me via email.

🔑 Research Interests

My research Citations: 13126 interests include Agentic AI, Multimodal Large Language Model, Spatial Intelligence, Remote Sensing Image Interpretation, etc.

🔥 Latest News

2026-06

I will serve as Senior Program Committee for AAAI 2027

2026-06

Four papers related to VLM (GRADE, EvoTok), Visual Grounding (RGBT-GroundBench) and Continual Learning (SCL-MGSM) are accepted by ECCV 2026. Congratulations to Mingxin Liu, Ziqian Fan (Sophomore), Zhaokai Wang, Leyao Gu (Sophomore), Zirun Zhu (Sophomore), Yan Li, Ning Liao, Ruilin Li. 🎉🎉🎉

2026-05

We have open-sourced SkillOpt jointly with Microsoft Research Asia 🔥🔥🔥

2026-05

Congratulations to doctoral student Ziyang Gong for receiving the first CCF doctoral student funding program. 🧨🧨🧨

2026-05

Serving as the Associate Editor (AE) for Visual Intelligence.

2026-05

One paper related to Spatial Intelligence (Holi-Spatial, Oral, 168/23918=0.7%) is accepted by ICML 2026. Congratulations to Prof. Zhihang Zhong. 🎉🎉🎉

2026-05

CitationClaw-v2 is released. Cheaper and more accurate.

2026-05

Serving as the Sponsor Chair for the 4th SCS-CV. 🧨🧨🧨

2026-05

One paper related to RS-VLM survey (GeoChef) is accepted by GRSM. Congratulations to Prof. Yue Zhou. 🧨🧨🧨

2026-05

GeoViS has been selected as a Candidate for the CVPR 2026 Best Paper Award. 🧨🧨🧨

2026-05

One paper related to AI4SCI (Molecular Detoxification, Oral, 6.1%) is accepted by KDD 2026, AI for Sciences Track. Congratulations to Fei Lin and Ziyang Gong. 🎉🎉🎉

2026-05

Two paper related to Streaming Video (PhoStream) and Spatial Intelligence (Holi-Spatial, Spotlight, 536/23918=2.2%) are accepted by ICML 2026. Congratulations to Xudong Lu and Prof. Zhihang Zhong. 🎉🎉🎉

2026-04

One paper related to object detection in remote sensing images is accepted by TCSVT.

2026-02

Five paper related to Safety of LLMs are accepted by ACL 2026 (two Main Conference, three Findings). Congratulations to Yu Tian. 🎉🎉🎉

2026-03

CitationClaw is released. Turning Every Citation into Explainable Impact.

🔥 Recent Works

Equal contribution

Corresponding author

Project Leader

PRCV

Context-Aware Aerial Object Detection Leveraging Inter-Object and Background Relationships (PRCV, 2026) Citation: 3

OBBDetectionRSConference

Botao Ren

Botian Xu

Xue Yang

Yifan Pu

Jingyi Wang

Zhidong Deng

PDF

arXiv

【IG-Bench】Ideas Have Genomes Benchmarking Scientific Lineage Reasoning and Lineage-Grounded Idea Generation (arXiv, 2026) Citation: 0

VLM & MLLM & LLMAI4SCIBenchmarkPreprint

Yifan Zhou

Qihao Yang

Yan Li

Donggang Li

Xiru Hu

Hokin Deng

Ziyang Gong

Xuanyi Zhou

Huacan Wang

Xiangchao Yan

Wanghan Xu

Wenlong Zhang

Shaofeng Zhang

Yue Zhou

Yifan Yang

Zhihang Zhong

Xue Yang

PDFHomepageCode

Tech. Report

【ACE-Brain-0.5】A Unified Embodied Foundational Model for Physical Agentic AI (Tech. Report, 2026) Citation: 0

VLM & MLLM & LLMSpatial IntelligencePreprintFirst/Correspondence

Ziyang Gong

Haoming Gu

Zehang Luo

Tianyi Zhang

Tao Tao

Yixiao Chi

Zhe Liu

Lingsi Zhu

Jingyuan Liu

Anke Tang

Songze Li

Yilun Kong

Ningjing Liu

Tianyu Zhu

Yunpeng Qing

Shuang Luo

Xiang Liu

Shi Fu

Dawei Nie

Sixiang Liu

Zhexi Wen

Feng Pan

Xiaofeng Wang

Zhi Hou

Chunxiao Liu

Xue Yang

Junchi Yan

Hengshuang Zhao

Dacheng Tao

Xiaogang Wang

PDFCodeHuggingFaceHomepage

arXiv

【DisciplineGen-1M】A Large-Scale Dataset for Multidisciplinary Visual Generation and Editing (arXiv, 2026) Citation: 0

VLM & MLLM & LLMAI4SCIBenchmarkPreprint

Zhaokai Wang

Mingxin Liu

Zirun Zhu

Ziqian Fan

Yiguo He

Mohan Zhang

Leyao Gu

Xiangyu Zhao

Ning Liao

Shaofeng Zhang

Xuanhe Zhou

Zhihang Zhong

Junchi Yan

Xue Yang

PDFHomepage

ECCV

【EvoTok】A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation (ECCV, 2026) Citation: 0

VLM & MLLM & LLMConferenceFirst/Correspondence

Yan Li

Ning Liao

Xiangyu Zhao

Shaofeng Zhang

Xiaoxing Wang

Yifan Yang

Junchi Yan

Xue Yang

PDFCode

ECCV

【GRADE】Benchmarking Discipline-Informed Reasoning in Image Editing (ECCV, 2026) Citation: 3

VLM & MLLM & LLMBenchmarkConferenceFirst/Correspondence

Mingxin Liu

Ziqian Fan

Zhaokai Wang

Leyao Gu

Zirun Zhu

Yiguo He

Yuchen Yang

Changyao Tian

Xiangyu Zhao

Ning Liao

Shaofeng Zhang

Qibing Ren

Zhihang Zhong

Xuanhe Zhou

Junchi Yan

Xue Yang

PDFHomepageCodeDataset

ECCV

【RGBT-Ground】Visual Grounding Beyond RGB in Complex Real-World Scenarios (ECCV, 2026) Citation: 2

MMF & RGBTPEFTDatasetConference

Tianyi Zhao

Jiawen Xi

Linhui Xiao

Junnan Li

Xue Yang

Maoxun Yuan

Xingxing Wei

PDF

ECCV

【SCL-MGSM】Enhancing Pretrained Model-based Continual Representation Learning via Guided Random Projection (ECCV, 2026) Citation: 3

Continual LearningConferenceFirst/Correspondence

Ruilin Li

Heming Zou

Xiufeng Yan

Zheming Liang

Jie Yang

Chenliang Li

Xue Yang

PDFHomepage

GRSM

Data-Driven Vision-Language Models for Remote Sensing A survey (GRSM, 2026) Citation: 0

VLM & MLLM & LLMSurveyRSJournalFirst/Correspondence

Yue Zhou

Shujun Zhao

Xue Yang

Ruigang Li

Tianwen Zhang

Mengcheng Lan

Chaofeng Chen

Lingfei Ma

Hongjie He

Jonathan Li

PDFHomepage

arXiv

【PhotoFlow】Agentic 3D Virtual Photography Missions (arXiv, 2026) Citation: 0

Spatial IntelligenceDatasetAI AgentPreprint

Jiarui Guo

Haojia Wei

Yiming Zhang

Yifei Liu

Yuning Gong

Hongjie Zhang

Xue Yang

Zhihang Zhong

PDFHomepageCodeDataset

arXiv

【SkillLens】From Raw Experience to Skill Consumption A Systematic Study of Model-Generated Agent Skills (arXiv, 2026) Citation: 5

AI AgentPreprint

Zisu Huang

Jingwen Xu

Yifan Yang

Ziyang Gong

Qihao Yang

Muzhao Tian

Xiaohua Wang

Changze Lv

Xuemei Gao

Qi Dai

Bei Liu

Kai Qiu

Xue Yang

Dongdong Chen

Xiaoqing Zheng

Chong Luo

PDFHomepageCode

arXiv

【SkillOpt】Executive Strategy for Self-Evolving Agent Skills (arXiv, 2026) Citation: 23

AI AgentPreprintFirst/Correspondence

Yifan Yang

Ziyang Gong

Weiquan Huang

Qihao Yang

Ziwei Zhou

Zisu Huang

Yan Li

Xuemei Gao

Qi Dai

Bei Liu

Kai Qiu

Yuqing Yang

Dongdong Chen

Xue Yang

Chong Luo

PDFHomepageCodeVideo

arXiv

【SpaceDG】Benchmarking Spatial Intelligence under Visual Degradation (arXiv, 2026) Citation: 0

Spatial IntelligenceBenchmarkLow levelPreprint

Xiaolong Zhou

Yifei Liu

Ziyang Gong

Jiarui Li

Qiyue Zhao

Muyao Niu

Yuanyuan Gao

Le Ma

Xue Yang

Hongjie Zhang

Zhihang Zhong

PDFCode

KDD

Oral

【ToxiMol】Breaking Bad Molecules Are MLLMs Ready for Structure-Level Molecular Detoxification? (KDD, 2026) Citation: 4

VLM & MLLM & LLMAI4SCIMedicalBenchmarkFirst/CorrespondenceConference

Fei Lin

Ziyang Gong

Cong Wang

Yonglin Tian

Tengchao Zhang

Yonglin Tian

Yining Jiang

Ji Dai

Chao Guo

Xiaotong Yu

Xue Yang

Gen Luo

Fei-Yue Wang

PDFCodeDataset

ACL

【SafeSteer】A Decoding-level Defense Mechanism for Multimodal Large Language Models (ACL, 2026) Citation: 1

VLM & MLLM & LLMSafetyConference

Xinyi Zeng

Xue Yang

Jingyuan Zhang

Huanqian Yan

Xiang Chen

Kaiwen Wei

Hankun Kang

Yu Tian

PDF