Brief Introduction

Hi, welcome to my homepage! I am Yan Li/李炎. I received my PhD degree in Computer Software and Theory from Software Engineering Institue, School of Computer Science, Peking University. Currently, I am a Research Engineer at Huawei. My research mainly focuses on large-scale deep learning systems, performance optimization of LLM inference engine, system-model co-design, etc.

Education

PhD in Computer Software and Theory

2018 - 2023
Peking University (co-advised by Prof. Hong Mei & Prof. Donggang Cao)
  • Scholarship of Schlumberger, Dec. 2019.
  • Scholarship of Schlumberger, Dec. 2020.

BSc in Computer Science

2014 - 2018
Peking University
  • Award of Scientific Research, Dec. 2017.
  • Meritorious Winner of Mathematical Contest in Modeling, Apr. 2017.
  • Scholarship of Freshman, Sep. 2014.
  • Courses: Lab. on Operating Systems(94)/Computer Network(94)/Advanced Algebra(92)/Lab. on Compiler Design(91)/Mathematical Logic(91)/Mathematical Analysis(90)/etc.

Experiences

Research Engineer

2023.07 - Now
Huawei, Beijing

Research Intern

2022.02 - 2022.07
Bytedance, Beijing

Intern of Recommendation Algorithm Engineer

2018.01 - 2018.08
Bytedance, Beijing

Intern of Operating System Development Engineer

2017.07 - 2017.08
Huawei, Beijing

Publications

Here are some of my publications.

Semantic Parallelism: Redefining Efficient MoE Inference via Model-Data Co-Scheduling
Yan Li*, Zhengyu Zhang*, Zhengang Wang, Pengfei Chen, Pengfei Zheng
ICLR 2026
Centrum: Escape from the Gaussian Process World! Enhancing Database Auto-tuning with Tree-Ensemble Bayesian Optimization
Yuanhao Lai, Pengfei Zheng, Chenpeng Ji, Yan Li, Songhang Zhang, Rutao Zhang, Zhengang Wang, Yunfei Du
SIGMOD 2025
Performance Modeling for Cloud-hosted Deep Learning Services with Hybrid Representation
Yan Li, Donggang Cao, Hong Mei
arxiv
SamProf: Top-down Performance Analysis for Neural Networks via Instruction Sampling
Yan Li, Pengcheng Li, Donggang Cao, Hong Mei
axriv
Sectum: Accurate Latency Prediction for TEE-hosted Deep Learning Inference
Yan Li, Junming Ma, Donggang Cao, Hong Mei
ICDCS 2022
SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud
Yan Li, Bo An, Junming Ma, Donggang Cao, Yasha Wang, Hong Mei
ICDCS 2020
PrTaurus: An Availability-Enhanced EMR Service on Preemptible Cloud Instances
Junming Ma, Yan Li, Xiangqun Chen, Donggang Cao
ICWS 2020
Cross-domain Workloads Performance Prediction via Runtime Metrics Transferring
Yan Li, Junming Ma, Donggang Cao
The 11th International Workshop on Joint Cloud Computing (JCC2020), Oxford, UK, Aug 3-6, 2020
CloudMeter: A Tool To Select the Best Cloud Service
Dian Jin, Yan Li, Donggang Cao
The 11th International Workshop on Joint Cloud Computing (JCC2020), Oxford, UK, Aug 3-6, 2020
DCStore: A Deduplication-Based Cloud-of-Clouds Storage Service
Bo An, Yan Li, Junming Ma, Donggang Cao, Gang Huang
ICWS 2019
Comparison between Chunk-based and Layer-based Container Image Storage Approaches: an Empirical Study
Yan Li, Bo An, Junming Ma, Donggang Cao
The 10th International Workshop on Joint Cloud Computing (JCC2019), San Francisco, USA, Apr 4-9, 2019
GPU Scheduling for Short Tasks in Private Cloud
Jialun Shao, Junming Ma, Yan Li, Bo An, Donggang Cao
The 10th International Workshop on Joint Cloud Computing (JCC2019), San Francisco, USA, Apr 4-9, 2019
BDViewer - A Web-Based Big Data Processing and Visualization Tool
Yan Li, Bo An, Junming Ma, Donggang Cao
COMPSAC 2018