Leyang Xue

School of Informatics, The University of Edinburgh

Informatics Forum

Edinburgh, Scotland

I am a final-year PhD student at The University of Edinburgh, supervised by Prof. Mahesh Marina and Prof. Luo Mai. I received my B.Eng. degree in Electronic and Computer Engineering from Shanghai Jiao Tong University in Aug. 2018.

My research interests lie at the intersection of machine learning and distributed systems. My goal is to build efficient systems for large-scale machine learning jobs. My current research focuses on the cost and energy efficiency of training and inference for large language models (LLMs) in both cloud and edge environments.

Selected Publications

  1. MoE-Infinity: Offloading-Efficient MoE Model Serving
    Leyang Xue, Yao Fu, Zhan Lu, Luo Mai, and Mahesh K. Marina
    2024
  2. ServerlessLLM: Low-Latency Serverless Inference for Large Language Models
    Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, and Luo Mai
    In OSDI, 2024