Brief Introduction
Hi, welcome to my homepage! I am Yan Li/李炎. I received my PhD degree in Computer Software and Theory from the Software Engineering Institute, School of Computer Science, Peking University. Currently, I am a Research Engineer at Huawei. My research mainly focuses on large-scale deep learning systems, performance optimization of LLM inference engines, and system-model co-design.
Education
- Scholarship of Schlumberger, Dec. 2019.
- Scholarship of Schlumberger, Dec. 2020.
- Award of Scientific Research, Dec. 2017.
- Meritorious Winner of Mathematical Contest in Modeling, Apr. 2017.
- Scholarship of Freshman, Sep. 2014.
- Courses: Lab. on Operating Systems (94) / Computer Network (94) / Advanced Algebra (92) / Lab. on Compiler Design (91) / Mathematical Logic (91) / Mathematical Analysis (90) / etc.
Experiences
Publications
Here are some of my publications.
Semantic Parallelism: Redefining Efficient MoE Inference via Model-Data Co-Scheduling
ICLR 2026
Centrum: Escape from the Gaussian Process World! Enhancing Database Auto-tuning with Tree-Ensemble Bayesian Optimization
SIGMOD 2025
Performance Modeling for Cloud-hosted Deep Learning Services with Hybrid Representation
arXiv
SamProf: Top-down Performance Analysis for Neural Networks via Instruction Sampling
arXiv
Sectum: Accurate Latency Prediction for TEE-hosted Deep Learning Inference
ICDCS 2022
SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud
ICDCS 2020
PrTaurus: An Availability-Enhanced EMR Service on Preemptible Cloud Instances
ICWS 2020
Cross-domain Workloads Performance Prediction via Runtime Metrics Transferring
The 11th International Workshop on Joint Cloud Computing (JCC2020), Oxford, UK, Aug 3-6, 2020
CloudMeter: A Tool To Select the Best Cloud Service
The 11th International Workshop on Joint Cloud Computing (JCC2020), Oxford, UK, Aug 3-6, 2020
DCStore: A Deduplication-Based Cloud-of-Clouds Storage Service
ICWS 2019
Comparison between Chunk-based and Layer-based Container Image Storage Approaches: an Empirical Study
The 10th International Workshop on Joint Cloud Computing (JCC2019), San Francisco, USA, Apr 4-9, 2019
GPU Scheduling for Short Tasks in Private Cloud
The 10th International Workshop on Joint Cloud Computing (JCC2019), San Francisco, USA, Apr 4-9, 2019
BDViewer - A Web-Based Big Data Processing and Visualization Tool
COMPSAC 2018
