coreml

Tag

Cards List
#coreml

Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra

arXiv cs.LG · 2026-05-19 Cached

This paper presents a systematic optimization study of real-time diffusion model inference on the Apple M3 Ultra, achieving 22.7 FPS at 512x512 resolution using CoreML conversion and a distillation model, revealing that CUDA-optimized techniques do not directly transfer to Apple's unified memory architecture.

0 favorites 0 likes
← Back to home

Submit Feedback