@bariskasikci: Super stoked that UW SyFI (https://syfi.cs.washington.edu) members won a number of prizes at the MLSys'26 competition, …
Summary
UW SyFI Lab members won multiple prizes at the MLSys'26 competition (NVIDIA Track), including 1st place in GDN Track Full-Agent Approach, 2nd in GDN Track Agent-Assisted, and 3rd in DSA Track Full-Agent Approach.
View Cached Full Text
Cached at: 05/24/26, 12:14 AM
Super stoked that UW SyFI (https://syfi.cs.washington.edu) members won a number of prizes at the MLSys’26 competition, NVIDIA Track. Hugre congrats to @KeisukeKamahori , @sudopowr , Yile Gu, Wei Shen, Steven Gao! Thanks to @nvidia , @modal , and the Flashinfer team for the support.
1st place in the GDN Track — Full-Agent Approach 2nd place in the GDN Track — Agent-Assisted Approach 3rd place in the DSA Track — Full-Agent Approach
SyFI Lab | SyFI Lab
Source: https://syfi.cs.washington.edu/ HomePeoplePublicationsTalksNewsBlogThe SyFI Lab at the University of Washington builds efficient and resilient infrastructure for the future of AI. As applications grow more complex, we bridge the gap between next-gen models and heterogeneous hardware through cross-stack innovation, delivering scalable, open-source systems validated by industrial partners.
Our research targets three key areas:
- Efficient AI: Optimizing algorithms and systems to maximize performance for training and inference.
- Flexible AI: Architecting systems that seamlessly adapt to diverse tasks, strategies, and model structures.
- Resilient AI: Ensuring AI system reliability at scale while leveraging AI to improve infrastructure robustness.
Blog Posts
Let AI Agents Write Your Serving Stack with VibeServe
May 12, 2026
We present VibeServe, a multi-agent system that synthesizes a complete LLM serving runtime end-to-end, specialized to a user-specified model, hardware, and workload.
SyFI in January 2026: A Big Month for Systems-Driven AI Research
January 31, 2026
January 2026 was a milestone month for the SyFI Lab, with six papers published across MLSys and ICLR—spanning inference, training, scheduling, retrieval, and model architecture.
Meet LLMc: Beating All Compression with LLMs
October 03, 2025
We present LLMc, an open-source tool to compress natural language using LLMs as the world’s most reference-packed dictionary.
Talks
The Wrong Contract at Every Layer: Redesigning OS and Hardware for AI Datacenters
May 15, 2026Dimitrios Skarlatos — Carnegie Mellon University
Abstract
The AI datacenter stack is built on hardware-software contracts and abstractions that were never designed for the demands of the workloads they now serve. Memory systems strain under terabyte-scale capacity, heterogeneous AI accelerators have been forcefully pressed into deployment, and AI’s appetite for data increasingly collides with the privacy realities that infrastructure provides today. With datacenters projected to consume over 1,000 TWh annually, renegotiating the system and hardware stack is no longer optional. In this talk, I will share my research journey redesigning these contracts and where they lead. I will start with memory management, presenting Contiguitas and Learned Virtual Memory (LVM), a progression of OS and hardware redesigns that tackle virtual memory from fragmentation in production datacenters to near-ideal address translation. I will then discuss LithOS, the first operating system for efficient ML on GPUs, giving the OS the control over heterogeneous accelerators it has so far lacked. Finally, I will briefly describe how Cinnamon and Cerium bring these threads together to make encrypted AI practical at scale. I will conclude with open questions on how AI itself can help renegotiate the hardware-software contract, and where we go from here.
Speaker Bio
Dimitrios Skarlatos is an assistant professor in the Computer Science Department at Carnegie Mellon University. His research bridges computer architecture and operating systems with a focus on AI datacenter efficiency, privacy, and scalability. His work has been deployed in production datacenters and upstreamed into the Linux kernel. He has received the IEEE CS TCCA Young Computer Architect Award, the NSF CAREER Award, the Intel Rising Star Award, a Linux Foundation Faculty Award, an ISCA Best Paper Award, two ASPLOS Best Paper Awards, a CACM Research Highlight, four IEEE MICRO Top Picks, the joint ACM SIGARCH & IEEE CS TCCA Outstanding Dissertation Award, and over a dozen industry faculty awards from Meta, Intel, Amazon, Oracle, AMD, and VMware. His recent work led to the founding of LithosAI, a startup where he serves as the CEO.
Similar Articles
@KeisukeKamahori: Very excited to share that our team at @UWSyFi won multiple prizes at the FlashInfer AI Kernel Generation Contest in #M…
University of Washington SyFI team won multiple prizes at the FlashInfer AI Kernel Generation Contest held during MLSys2026, with support from NVIDIA and Modal.
@TheTuringPost: AutoScientists – a research lab made of agents @Harvard researchers connected agents into a self-organizing scientific …
Harvard researchers present AutoScientists, a multi-agent system that forms self-organizing scientific teams without a central coordinator, achieving strong results on BioML-Bench and optimization tasks.
@ChengleiSi: Excited to share these preliminary results on our internal autoresearch system @Recursive_SI, where we achieve SOTA on …
Recursive's automated AI research system achieves state-of-the-art results on NanoChat, NanoGPT Speedrun, and GPU kernel benchmarks by automating the research loop without task-specific adaptations, and open-sourcing artifacts for further inspection.
@marksaroufim: It was an honor to give the keynote at MLSys Covered how AI systems have evolved, why AI is needed to improve them, why…
Mark Saroufim gave a keynote at the MLSys conference covering the evolution of AI systems, why AI is needed to improve them, and promising future directions. The recording will be released soon.
@pdhsu: Beautiful work - the Weissman lab at MIT strikes again!
The article highlights research from the Weissman lab at MIT, praising their recent contributions.