† denotes equal contribution.
2026
-
Harnessing Idle Compute at the Edge for Foundation Model Training
Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, and
Mahesh Marina In EuroMLSys, co-located with EuroSys, 2026
-
Towards Automated RAN Configuration Tuning in Cellular Networks with Causal Learning
Leyang Xue, Bolun Zhang,
Mahesh Marina , He Yan, Yu Zhou, Cheuk Yiu Ip, and James Klosowski
In HotMobile, 2026
-
CausalTune: Causal Learning based Automated Cellular RAN Configuration Tuning Framework
Leyang Xue†, Bolun Zhang
†, Yibo Ma,
Mahesh Marina , He Yan, Yu Zhou, Cheuk Yiu Ip, Senthil Dhandapani, and
1 more author In SIGCOMM, 2026
-
BatchGen: An Architecture for Scalable and Efficient Batch Inference
Tairan Xu†, Leyang Xue†, Zhan Lu†, Jinfu Deng, Hongyang Xiao, Yinsicheng Jiang, Congjie He, Matej Sandor, and 2 more authors
In OSDI, 2026
-
Morphling: Emulator for Distributed Machine Learning at the Edge
Leyang Xue, Yufeng Xia, Eren Mendi, Ismaeel Bashir, Jiaxun Yang, Myungjin Lee, and
Mahesh K. Marina In EdgeSys, 2026
Best Paper Award at EdgeSys 2026.
2025
-
Towards Decentralized and Sustainable Foundation Model Training with the Edge
Leyang Xue, Meghana Madhyastha, Randal Burns, Myungjin Lee, and
Mahesh K. Marina SIGENERGY Energy Inform. Rev., 2025
-
On Harnessing Idle Compute at the Edge for Foundation Model Training
Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, and
Mahesh K. Marina 2025
-
Poster: On Harnessing Idle Compute at the Edge for Foundation Model Training
Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, and
Mahesh Marina In MobiCom, 2025
-
HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
In The 45th IEEE International Conference on Distributed Computing Systems (ICDCS), 2025
-
TUBO: A Tailored ML Framework for Reliable Network Traffic Forecasting
In The 45th IEEE International Conference on Distributed Computing Systems (ICDCS), 2025
-
MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching
Tairan Xu,
Leyang Xue, Zhan Lu, Adrian Jackson, and
Luo Mai 2025
-
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
Yao Fu, Yinsicheng Jiang,
Yeqi Huang, Ping Nie, Zhan Lu,
Leyang Xue, Congjie He, Man-Kit Sit, and
6 more authors In NeurIPS Datasets & Benchmarks Track, 2025
-
Towards Energy Efficient 5G vRAN Servers
In NSDI, 2025
2024
-
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache
2024
-
ServerlessLLM: Low-Latency Serverless Inference for Large Language Models
In OSDI, 2024
2022
-
PAINT: Path Aware Iterative Network Tomography for Link Metric Inference
In ICNP, 2022