Marcel Wagenländer
PhD candidate
Open to Industry Opportunities
Expected Graduation: August 2026
I am actively exploring industry opportunities for Research Scientist or Research Engineer roles in ML Systems. I want to apply my expertise in distributed training and inference to build the systems behind frontier LLMs at real-world scale.
Location: Europe.
About

I’m an AI systems researcher and a PhD candidate in the Large-Scale Data & Systems (LSDS) group at Imperial College London, where I have been working under the supervision of Professor Peter Pietzuch since 2021. My research focuses on Machine Learning Systems (MLSys) for scaling large language models, spanning distributed training and inference, and I enjoy working across the stack, from Triton kernels through PyTorch to cluster orchestration.
My early work, Tenplex (SOSP'24), introduced dynamic parallelism that lets distributed deep learning jobs change their resources at runtime. My current research, Scepsy, targets high-performance serving of agentic LLM workflows.
In summer 2025, I was a Research Scientist Intern at Meta, where I modelled the serving of block-diffusion language models inside Meta’s internal performance simulator, showing that they add no extra latency over autoregressive models because attention is memory-bound.
I also collaborate with my second advisor, Mark van der Wilk, on Gaussian Processes (GP), where our research aims to improve benchmarking of GP approximations for comparability and enhance the performance of GP inference.
Prior to joining the LSDS group, I completed both my Bachelor of Science and Master of Science in Informatics at the Technical University of Munich.
Publications
Conferences
Tenplex: Changing Resources of Deep Learning Jobs using Parallelizable Tensor Collections
Marcel Wagenländer, Guo Li, Bo Zhao, Luo Mai, and Peter Pietzuch
Symposium on Operating Systems Principles (SOSP), 2024
KungFu: Making Training in Distributed Machine Learning Adaptive
Luo Mai, Guo Li, Marcel Wagenländer, Konstantinos Fertakis, Andrei-Octavian Brabete, and Peter Pietzuch
USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020
Preprints
Tangram: Hiding GPU Heterogeneity for Efficient LLM Parallelization
Yanda Tao, Pedro F. Silvestre, Marcel Wagenländer, and Peter Pietzuch
arXiv
Scepsy: Serving Agentic Workflows Using Aggregate LLM Pipelines
Marcel Wagenländer*, Otto White*, Britannio Jarrett, Pedro F. Silvestre, Yanda Tao, Guo Li, Huanzhou Zhu, Llúis Vilanova, Peter Pietzuch
arXiv
Recommendations for Baselines and Benchmarking Approximate Gaussian Processes
Sebastian W. Ober*, Artem Artemev*, Marcel Wagenländer*, Rudolfs Grobins, Mark van der Wilk
arXiv
Workshops
Spotnik: Designing Distributed Machine Learning for Transient Cloud Resources
Marcel Wagenländer, Luo Mai, Guo Li, and Peter Pietzuch
USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), 2020
Curriculum vitae
Curriculum vitae
hello ɑ marcel.systems