About
Education
Experience
Selected Papers [All]
* equal contribution
-
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
Xin Xu, Clive Bai, Kai Yang, Tianhao Chen, Yang Wang, Saiyong Yang, Can Yang
-
On Predictability of Reinforcement Learning Dynamics for Large Language Models
Yuchen Cai, Ding Cao, Xin Xu, Zijun Yao, Yuqing Huang, Zhenyu Tan, Benyi Zhang, Guiquan Liu, Junfeng Fang
-
Advancing Multimodal Reasoning Capabilities of Multimodal Large Language Models via Visual Perception Reward
Tong Xiao, Xin Xu, Zhenya Huang, Hongyu Gao, Quan Liu, Qi Liu, Enhong Chen
-
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
Yuchen Yan, Jin Jiang, Zhenbang Ren, Yijun Li, Xudong Cai, Yang Liu, Xin Xu, Mengdi Zhang, Jian Shao, Yongliang Shen, Jun Xiao, Yueting Zhuang
-
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task
Yuchen Yan, Yongliang Shen, Yang Liu, Jin Jiang, Xin Xu, Mengdi Zhang, Jian Shao, Yueting Zhuang
ICLR 2026 [Paper]
-
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling
Tianhao Chen*, Xin Xu*, Zijing Liu, Pengxiang Li, Xinyuan Song, Ajay Kumar Jaiswal, Fan Zhang, Jishan Hu, Yang Wang, Hao Chen, Shizhe Diao, Shiwei Liu, Yu Li, Lu Yin, Can Yang
-
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification
Chengwu Liu, Ye Yuan, Yichun Yin, Yan Xu, Xin Xu, Zaoyu Chen, Yasheng Wang, Lifeng Shang, Qun Liu, Ming Zhang
-
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models
Xin Xu*, Qiyun Xu*, Tong Xiao, Tianhao Chen, Yuchen Yan, Jiaxin Zhang, Shizhe Diao, Can Yang, Yang Wang
-
UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models
Xin Xu*, Jiaxin Zhang*, Tianhao Chen*, Zitong Chao, Jishan Hu, Can Yang
-
Can LLMs Solve Longer Math Word Problems Better?
Xin Xu*, Tong Xiao*, Zitong Chao, Zhenya Huang, Can Yang, Yang Wang
-
$S^3$cMath: Spontaneous Step-Level Self-Correction Makes Large Language Models Better Mathematical Reasoners
Yuchen Yan, Jin Jiang, Yang Liu, Yixin Cao, Xin Xu, Xunliang Cai, Jian Shao
AAAI 2025 [Paper]
-
UCS: A Unified Approach to Cell Segmentation for Subcellular Spatial Transcriptomics
Yuheng Chen, Xin Xu, Xiaomeng Wan, Jiashun Xiao, Can Yang
Small Methods (Q1) [Paper]
-
Can We Verify Step by Step for Incorrect Answer Detection?
Xin Xu, Shizhe Diao, Can Yang, Yang Wang
Preprints
-
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
Xin Xu, Clive Bai, Kai Yang, Tianhao Chen, Yangkun Chen, Weijie Liu, Hao Chen, Yang Wang, Saiyong Yang, Can Yang
[Paper] [Code] [Datasets & Models]
-
ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
Kun Liang, Clive Bai, Xin Xu, Chenming Tang, Sanwoo Lee, Weijie Liu, Saiyong Yang, Yunfang Wu
[Paper]
-
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
Kai Yang, Xin Xu, Yangkun Chen, Weijie Liu, Jiafei Lyu, Zichuan Lin, Deheng Ye, Saiyong Yang
-
Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning
Xin Xu*, Tianhao Chen*, Fan Zhang, Wanlong Liu, Pengxiang Li, Ajay Kumar Jaiswal, Yuchen Yan, Jishan Hu, Yang Wang, Hao Chen, Shiwei Liu, Shizhe Diao, Can Yang, Lu Yin
Award
Academic Services
Conference Reviewer: NeurIPS, ICLR, ACL, EMNLP
Journal Reviewer: Annals of Applied Statistics
Teaching Assistant
Matrix Algebra and Applications at HKUST, 2025.2-2025.5
Calculus at HKUST, 2024.9-2024.12
Sampling at HKUST, 2024.2-2024.5
Applied Statistics at HKUST, 2023.9-2023.12
Calculus at HKUST, 2023.3-2023.5
Probability and Mathematical Statistics at USTC, 2021.9-2022.1
Multivariable Calculus at USTC, 2021.3-2021.7
Mathematical Analysis I at USTC, 2020.9-2021.1
Miscellaneous
I love sports, board games, and Chinese poker.