Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. Towards Automated RAN Configuration Tuning in Cellular Networks with Causal Learning
    Leyang Xue, Bolun Zhang, Mahesh Marina , He Yan, Yu Zhou, Cheuk Yiu Ip, and James Klosowski
    In HotMobile, 2026

2025

  1. Towards Decentralized and Sustainable Foundation Model Training with the Edge
    Leyang Xue, Meghana Madhyastha, Randal Burns, Myungjin Lee, and Mahesh K. Marina
    SIGENERGY Energy Inform. Rev., 2025
  2. On Harnessing Idle Compute at the Edge for Foundation Model Training
    Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, and Mahesh Marina
    2025
  3. HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routings
    Leyang Xue, Yao FuLuo Mai, and Mahesh Marina
    In ICDCS (In Conjunction Events), 2025
  4. TUBO: A Tailored ML Framework for Reliable Network Traffic Forecasting
    Zhihang Yuan, Leyang Xue, Waleed Ahsan, and Mahesh Marina
    In ICDCS (In Conjunction Events), 2025
  5. MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching
    Tairan Xu, Leyang Xue, Zhan Lu, and Luo Mai
    2025
  6. MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
    Yao Fu, Yinsicheng Jiang, Yeqi Huang, Ping Nie, Zhan Lu, Leyang Xue, Congjie He, Man-Kit Sit, and 5 more authors
    In NeurIPS Datasets & Benchmarks Track, 2025
  7. Towards Energy Efficient 5G vRAN Servers
    In NSDI, 2025

2024

  1. MoE-Infinity: Offloading-Efficient MoE Model Serving
    Leyang Xue, Yao Fu, Zhan Lu, Luo Mai, and Mahesh K. Marina
    2024
  2. ServerlessLLM: Low-Latency Serverless Inference for Large Language Models
    Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii UstiugovYuvraj Patel, and Luo Mai
    In OSDI, 2024

2022

  1. PAINT: Path Aware Iterative Network Tomography for Link Metric Inference
    Leyang Xue, Mahesh K. Marina, Geng Li, and Kai Zheng
    In ICNP, 2022