U-TTT: Towards Generalizable PET Image Denoising via Test-Time Training
Summary
This paper introduces U-TTT, a U-shaped deep learning model with test-time training layers and dual-domain adaptation for robust PET image denoising under distribution shifts, achieving state-of-the-art performance across different dose levels and scanner types.
Cached at: 06/10/26, 09:44 AM
Source: https://huggingface.co/papers/2606.11032
Abstract
A novel U-shaped deep learning model with test-time training layers and dual-domain adaptation mechanisms achieves robust PET image denoising under distribution shifts.
Existing deep learning models for Positron Emission Tomography (PET) image denoising often suffer from severe performance degradation under distribution shifts, fundamentally restricting their robust clinical deployment. This lack of generalization stems from the conventional paradigm of fixed-parameter models that cannot adapt to variations in test data (e.g., dose levels or scanner types) after training. To overcome this limitation and achieve robust generalization, we introduce U-TTT, a novel U-shaped model that integrates Test-Time Training (TTT) layers to dynamically adjust model parameters during inference through self-supervision, thereby adapting to the specific characteristics of each test instance. Furthermore, to comprehensively capture the complex degradations of 3D PET data, U-TTT features a dual-domain adaptation mechanism comprising a Spatial Test-Time Training (S-TTT) layer and a Frequency Test-Time Training (F-TTT) layer. The S-TTT layer captures and corrects spatial structural degradations, while the F-TTT layer suppresses global noise spectra and restores delicate high-frequency details. Extensive experiments demonstrate that U-TTT achieves state-of-the-art PET denoising performance and exhibits superior generalization under challenging distribution shifts, including both unseen dose levels and unseen scanners. Our code will be available at https://github.com/Yaziwel/U-TTT.
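To make the idea of a test-time training layer concrete, here is a minimal NumPy sketch of the general TTT pattern: a small inner weight matrix is updated during inference by one gradient step per token on a self-supervised reconstruction loss. This is an illustration only, not the U-TTT implementation. The function name `ttt_linear`, the linear inner model, the additive-noise corruption, and the learning rate are all assumptions chosen for the sketch; the abstract does not specify the inner model or the self-supervised task.

```python
import numpy as np

def ttt_linear(tokens, lr=0.05, noise_std=0.1, seed=0):
    """Toy TTT layer (hypothetical, not the paper's S-TTT or F-TTT layer).

    The inner weights W act as the layer's hidden state. For every token,
    W takes one gradient step on the loss ||W @ x_corrupt - x||^2, then the
    updated W produces the output for that token.
    """
    rng = np.random.default_rng(seed)
    n, d = tokens.shape
    W = np.eye(d)                      # inner weights, adapted at test time
    outputs = np.empty_like(tokens)
    losses = []
    for t in range(n):
        x = tokens[t]
        # Self-supervised task (assumed): reconstruct x from a noisy copy.
        x_corrupt = x + noise_std * rng.standard_normal(d)
        residual = W @ x_corrupt - x
        losses.append(float(residual @ residual))
        # Gradient of the squared error with respect to W.
        grad = 2.0 * np.outer(residual, x_corrupt)
        W = W - lr * grad              # parameters change during inference
        outputs[t] = W @ x             # output uses the adapted weights
    return outputs, W, losses

# Usage: eight 4-dimensional tokens, e.g. flattened patches of a test image.
tokens = 0.5 * np.random.default_rng(1).standard_normal((8, 4))
outputs, W, losses = ttt_linear(tokens)
print(outputs.shape, W.shape, len(losses))
```

The point of the sketch is that the layer's parameters depend on the test instance itself rather than staying fixed after training. A frequency-domain counterpart could, for example, apply the same kind of update to coefficients obtained with `np.fft.fftn`, but the abstract does not say how the F-TTT layer is constructed.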
Similar Articles
UniPET: a universal network for high-quality PET image denoising across varied dose reduction factors
UniPET is a universal network for PET image denoising that handles varying dose reduction factors using domain generalization and region-aware learning, achieving state-of-the-art performance.
Linearizing Vision Transformer with Test-Time Training
This paper proposes a method to convert pretrained Softmax attention models into linear-complexity Test-Time Training (TTT) architectures, achieving comparable text-to-image quality to fine-tuned Softmax models while significantly accelerating inference. The approach is validated by linearizing Stable Diffusion 3.5, resulting in SD3.5-T^5 with 1.32x speedup at 1K resolution.
Back to Repair: A Minimal Denoising Network for Time Series Anomaly Detection
This paper introduces JuRe (Just Repair), a minimal denoising network for time series anomaly detection that matches or exceeds complex neural baselines on the TSB-AD and UCR benchmarks, demonstrating that a proper manifold-projection training objective is more important than architectural complexity.
Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation
This paper proposes Decoupled Residual Denoising Diffusion Models (DRDD) for unified and data-efficient image-to-image translation, decoupling noise diffusion for domain harmonization from residual diffusion for semantic mapping.
Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction
This paper investigates parallel-in-time algorithms for training recurrent neural networks in dynamical systems reconstruction, proposing GTF-DEER that enables stable learning over long sequences and improves reconstruction accuracy.