Fan Yang

📬 yang DOT fan DOT acm DOT org

My name is Fan Yang (杨凡 in Chinese). I am a systems researcher and research manager of the Systems Research Group (SRG) at Microsoft Research Asia (MSR-Asia). I joined MSR-Asia after receiving my bachelor’s and doctoral degrees in Computer Science from Nanjing University.

My research passion lies in computer systems. My recent focus is on exploring the fundamental principles of systems for Artificial Intelligence (AI). I am among the first to discover and advocate now well-known design principles for AI systems, including the hardware-aligned tile abstraction for AI compilers and relaxed monotonicity for vector stores. Some techniques and solutions derived from these principles have been open-sourced and adopted by Microsoft products such as Azure, M365, and Bing, and the corresponding research results have appeared in top systems conferences such as OSDI and SOSP. Some open-source projects, such as OpenPAI and NNI, have even incubated new businesses. More recently, I have been passionate about the co-design of AI algorithms and systems, which I believe will define the next chapter of AI. In the past, I worked on large-scale systems, such as graph systems. I co-developed GraM, a high-performance graph engine that set a new speed record for trillion-scale graph analytics.

As a researcher, I also engage in academic service, including serving on the program committees of ASPLOS (2022), ChinaSys (19th), EuroSys (2023, 2025, 2026), and OSDI (2026).

We have a few FTE openings. Please send me your resume if interested.

We recruit interns year-round; details here.

news

Dec 05, 2025	On a podcast discussing AI reasoning
Oct 01, 2025	USENIX ;login: published our article on WaferLLM (OSDI'25)
Sep 11, 2025	VentureBeat covered rStar2-Agent
Jul 16, 2025	rStar-Math and LIPS picked as "the most groundbreaking AI papers from the first half of 2025" by Turing Post
Jun 17, 2025	An MSR blog introduces our work on AI reasoning

selected publications

ArXiv

Vibe Reasoning: Eliciting Frontier AI Mathematical Capabilities – A Case Study on IMO 2025 Problem 6

Jiaao Wu, and 3 more authors

ArXiv, 2025

@article{vibereasoning25,
  title = {Vibe Reasoning: Eliciting Frontier AI Mathematical Capabilities -- A Case Study on IMO 2025 Problem 6},
  author = {Wu, Jiaao and Zhang, Xian and Yang, Fan and Dong, Yinpeng},
  year = {2025},
  journal = {ArXiv},
}

SOSP

TrainVerify: Equivalence-Based Verification for Distributed LLM Training

Yunchi Lu, and 6 more authors

In SOSP, 2025. ArXiv version available.

@inproceedings{trainverify25,
  title = {TrainVerify: Equivalence-Based Verification for Distributed LLM Training},
  author = {Lu, Yunchi and Miao, Youshan and Tan, Cheng and Huang, Peng and Zhu, Yi and Zhang, Xian and Yang, Fan},
  year = {2025},
  booktitle = {{SOSP}},
}

OSDI

WaferLLM: A Wafer-Scale LLM Inference System

Congjie He, and 7 more authors

In 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2025. An introductory article appeared in ;login:.

@inproceedings{waferllm25,
  title = {WaferLLM: A Wafer-Scale LLM Inference System},
  author = {He, Congjie and Huang, Yeqi and Mu, Pei and Miao, Ziming and Xue, Jilong and Ma, Lingxiao and Yang, Fan and Mai, Luo},
  year = {2025},
  booktitle = {19th {USENIX} Symposium on Operating Systems Design and Implementation, {OSDI}},
}

ArXiv

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Yaoqi Chen, and 17 more authors

ArXiv, 2025

@article{chen2025retroinfervectorstorageapproachscalable,
  title = {RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference},
  author = {Chen, Yaoqi and Zhang, Jinkai and Lu, Baotong and Zhang, Qianxi and Zhang, Chengruidong and Luo, Jingjia and Liu, Di and Jiang, Huiqiang and Chen, Qi and Liu, Jing and Ding, Bailu and Yan, Xiao and Jiang, Jiawei and Chen, Chen and Zhang, Mingxing and Yang, Yuqing and Yang, Fan and Yang, Mao},
  year = {2025},
  journal = {ArXiv},
}