Fan Yang
Systems researcher, Research Manager of SRG@MSR-Asia
personal email: yang DOT fan AT 163 DOT com
work email: fanyang AT microsoft DOT com
My name is Fan Yang (杨凡 in Chinese). I am a systems researcher and research manager of the Systems Research Group (SRG) at Microsoft Research Asia (MSR-Asia). I joined MSR-Asia after receiving my bachelor’s and doctoral degrees in Computer Science from Nanjing University.
My research passion lies in Computer Systems. My recent focus is on exploring the fundamental principles of the systems for Artificial Intelligence (AI). I am among the first to discover and advocate the now well-known design principles for AI systems, including the tile abstraction for AI compilers and the relaxed monotonicity for vector stores. Some techniques and solutions derived from these principles have been open-sourced and adopted by Microsoft products like Azure, M365, and Bing, and the corresponding research results have appeared in top systems conferences like OSDI/SOSP. Some open-source projects like OpenPAI or NNI even incubated new businesses. More recently, I have been passionate about the co-design of AI algorithms and systems, which I believe will define the next chapter of AI. In the past, I worked on large-scale systems, such as graph systems. I co-developed GraM, a high-performance graph engine that set a new speed record for trillion-scale graph analytics.
As a researcher, I also engage in public academic service, including serving on the program committees of ASPLOS (2022), EuroSys (2023, 2025), and ChinaSys (19th).
news
Date | News |
---|---|
Aug 26, 2024 | Internship Opportunities at SRG |
Aug 20, 2024 | MSR-Asia StarTrack Scholars Program |
latest posts
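Date | Post |
---|---|
Aug 21, 2024 | Multiple innovations from Microsoft Research Asia bridge the gap between low-bit quantization of large models and on-device deployment |
Aug 16, 2024 | Two small models verify each other and rival large models? Microsoft's rStar does not even use CoT or fine-tuning (机器之心) |
Aug 01, 2024 | To prospective interns |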
selected publications
- RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval. arXiv, 2024