Education

  • University of Virginia,        Ph.D. in Computer Science,   GPA 4.0,   Aug 2022 - Present
  • George Mason University, Ph.D. in Computer Science,   GPA 4.0,   Aug 2021 - Jul 2022

Experience

  • University of Virginia,                  Research/Teaching Assistant,      Aug 2022 - Present
  • Snowflake AI Research,                Research Intern,                             Sep 2025 - Dec 2025
  • Samsung Semiconductor, Inc.,   Research Intern,                             May 2024 - Aug 2024
  • Argonne National Laboratory,    Research Intern,                             May 2022 - Aug 2022
  • George Mason University,           Research/Teaching Assistant,      Aug 2021 - May 2022

Publication

[MLSys ’26]
MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing.
(Ninth Annual Conference on Machine Learning and Systems (MLSys’26).)
Zhaoyuan Su, Zeyu Zhang, Tingfeng Lan, Zirui Wang, Haiying Shen, Juncheng Yang, Yue Cheng .
[MLSys ’26]
λScale: Enabling Fast Scaling for Serverless Large Language Model Inference.
(Tenth Annual Conference on Machine Learning and Systems (MLSys’26).)
Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Zirui Wang, Yue Cheng, Wei Wang, Ao Wang, Ruichuan Chen .
[arXiv Preprint]
ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates.
Tingfeng Lan, Yusen Wu, Bin Ma, Zhaoyuan Su, Rui Yang, Tekin Bicer, Masahiro Tanaka, Olatunji Ruwase, Dong Li, Yue Cheng .
[NSDI ’26]
Towards Efficient LLM Storage Reduction via Tensor Deduplication and Delta Compression.
(The 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI’26).)
Zirui Wang, Tingfeng Lan, Zhaoyuan Su, Juncheng Yang, Yue Cheng .
[VLDB ’24]
Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask.
(50th International Conference on Very Large Data Bases (VLDB’24).)
Zhaoyuan Su, Ammar Ahmed, Zirui Wang, Ali Anwar, Yue Cheng .
[DRBSD ’22]
Understanding Impact of Lossy Compression on Derivative-related Metrics in Scientific Datasets.
(The 8th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD’22), affiliated with SC’22.)
Zhaoyuan Su, Sheng Di, Ali Murat Gok, Yue Cheng, Franck Cappello .

Professional Service

Conference Organizer and Community Services

2024 HotStorage, Web Chair.

External Journal Reviews

2025 Nature Machine Intelligence.
2024 TOS, ACM Transactions on Storage.
2023 Neural Processing Letters.

External Conference and Workshop Reviews

2026 FAST, USENIX Conference on File and Storage Technologies.
2026 EuroSys, European Conference on Computer Systems.
2025 SoCC, ACM Symposium on Cloud Computing.
2025 PPoPP, Symposium on Principles and Practice of Parallel Programming.
2025 NSDI, USENIX Symposium on Networked Systems Design and Implementation.
2025 NeurIPS, Conference on Neural Information Processing Systems.
2025 HPDC, ACM International Symposium on High-Performance Parallel and Distributed Computing.
2025 FAST, USENIX Conference on File and Storage Technologies.
2025 ATC, USENIX Annual Technical Conference.
2024 IPDPS, IEEE International Parallel & Distributed Processing Symposium.
2024 CLOUD, IEEE International Conference on Cloud Computing.
2024 HPDC, ACM International Symposium on High-Performance Parallel and Distributed Computing.
2024 HotStorage, USENIX Workshop on Hot Topics in Storage and File Systems.
2023 IPDPS, IEEE International Parallel & Distributed Processing Symposium.
2023 CLOUD, IEEE International Conference on Cloud Computing.
2023 HotStorage, USENIX Workshop on Hot Topics in Storage and File Systems.
2023 Cluster, IEEE Cluster Conference.