SP^3: Spherical Priors for Plug-and-Play Restoration
Summary
This paper introduces SP³, a method using Spherical Encoder priors for Plug-and-Play image restoration, achieving perceptual quality comparable to zero-shot diffusion priors while being 3–630× faster across tasks.
View Cached Full Text
Cached at: 06/16/26, 11:31 AM
Paper page - SP^3: Spherical Priors for Plug-and-Play Restoration
Source: https://huggingface.co/papers/2606.16396 Hi everyone, excited to share our new paper on Plug-and-Play restoration using Spherical Priors.
The main idea is to replace the usual denoiser/diffusion prior in PnP restoration with a Spherical Encoder prior. SP³ returns a natural-looking image after each iteration, and subsequent iterations improve the quality and consistency. This “anytime” restoration behavior is a key advantage over diffusion-based methods that require choosing the number of denoising steps in advance, and running all the way to the end.
In our experiments, SP³ reaches perceptual quality comparable to zero-shot diffusion and flow priors while being 3–630× faster across restoration tasks.
Happy to answer questions about the algorithm, results, or the sphere-prior design.
Similar Articles
VidSplat: Gaussian Splatting Reconstruction with Geometry-Guided Video Diffusion Priors
VidSplat is a training-free generative reconstruction framework that uses video diffusion priors to recover complete 3D scenes from sparse inputs by synthesizing novel views.
PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution
PRISM is a diffusion-based framework for text image super-resolution that uses flow-matching prior rectification and uncertainty-aware residual encoding to improve accuracy under severe degradation, achieving state-of-the-art performance with millisecond-level inference.
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
This paper proposes GASP, a framework that injects geometric priors into vision-language models via deep supervision with contrastive and depth consistency losses, achieving significant improvements on 3D spatial reasoning benchmarks without using 3D VQA data.
Efficient Image Synthesis with Sphere Latent Encoder
This paper proposes Sphere Latent Encoder, an efficient few-step image generation framework that performs denoising entirely in a spherical latent space, achieving high-quality 256×256 images with significantly reduced computational cost and improved FID scores on ImageNet-1K.
Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence
This paper introduces a post-training framework that leverages 3D priors from SAM3D to improve semantic correspondence in 2D foundation features, addressing issues like left-right confusion and repeated parts. The method uses instance-specific 3D reconstruction without pose annotations or spherical geometry shortcuts.