Education

  • University of Virginia, Ph.D. in Computer Science, GPA 4.0, Aug 2022 - Present
  • George Mason University, Ph.D. in Computer Science, GPA 4.0, Aug 2021 - Jul 2022

Experience

  • University of Virginia, Research/Teaching Assistant, Aug 2022 - Present
  • Samsung Semiconductor, Research Intern, May 2024 - Aug 2024
  • Argonne National Laboratory, Research Intern, May 2022 - Aug 2022
  • George Mason University, Research/Teaching Assistant, Aug 2021 - May 2022

Publications

[arXiv Preprint]
MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Layer Swapping and KV Cache Resizing.
Zhaoyuan Su, Tingfeng Lan, Zirui Wang, Juncheng Yang, Yue Cheng.
[arXiv Preprint]
λScale: Enabling Fast Scaling for Serverless Large Language Model Inference.
Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Zirui Wang, Yue Cheng, Wei Wang, Ao Wang, Ruichuan Chen.
[arXiv Preprint]
ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates.
Tingfeng Lan, Yusen Wu, Bin Ma, Zhaoyuan Su, Rui Yang, Tekin Bicer, Dong Li, Yue Cheng.
[NSDI ’26]
Towards Efficient LLM Storage Reduction via Tensor Deduplication and Delta Compression.
(The 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI’26).)
Zirui Wang, Tingfeng Lan, Zhaoyuan Su, Juncheng Yang, Yue Cheng.
[VLDB ’24]
Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask.
(The 50th International Conference on Very Large Data Bases (VLDB’24).)
Zhaoyuan Su, Ammar Ahmed, Zirui Wang, Ali Anwar, Yue Cheng.
[DRBSD ’22]
Understanding Impact of Lossy Compression on Derivative-related Metrics in Scientific Datasets.
(The 8th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD’22), affiliated with SC’22.)
Zhaoyuan Su, Sheng Di, Ali Murat Gok, Yue Cheng, Franck Cappello.