@RisingSayak: I realized that what I cannot profile, I cannot optimize. This is why I embarked on a little project in Diffusers, to t…

X AI KOLs Following 05/22/26, 03:18 PM Tools

Summary

Sayak Paul describes a project to profile and optimize Diffusers pipelines using torch.compile, and announces a tutorial series by Ari G. on the topic.

I realized that what I cannot profile, I cannot optimize. This is why I embarked on a little project in Diffusers, to try to profile important pipelines, identify bottlenecks for torch.compile, and fix them. Got decent results. I documented the process and invited the community to apply the same. @ariG23498 decided to take it a notch further by formulating an entire series of tutorials around the topic, starting from compiling simple torch ops and how to make sense of their profile traces. Follow his space to stay updated. It's an incredibly helpful skill to have, especially if you're in the optimization business. Even if you're not, it gives a good mental model of what's going on in those SMs.

Original Article

View Cached Full Text

Cached at: 05/23/26, 03:58 AM

I realized that what I cannot profile, I cannot optimize.

This is why I embarked on a little project in Diffusers, to try to profile important pipelines, identify bottlenecks for torch.compile, and fix them. Got decent results.

I documented the process and invited the community to apply the same.

@ariG23498 decided to take it a notch further by formulating an entire series of tutorials around the topic, starting from compiling simple torch ops and how to make sense of their profile traces.

Follow his space to stay updated.

It’s an incredibly helpful skill to have, especially if you’re in the optimization business. Even if you’re not, it gives a good mental model of what’s going on in those SMs.

@RisingSayak: I realized that what I cannot profile, I cannot optimize. This is why I embarked on a little project in Diffusers, to t…

Similar Articles

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

Journey in optimising Elixir application

@MaximeRivest: https://x.com/MaximeRivest/status/2055293570119065875

@leloykun: [WIP] Blog post on Lean4-to-TileLang Tensor Program Superoptimizer here:

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

Submit Feedback

Similar Articles

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

Journey in optimising Elixir application

@MaximeRivest: https://x.com/MaximeRivest/status/2055293570119065875

@leloykun: [WIP] Blog post on Lean4-to-TileLang Tensor Program Superoptimizer here:

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines