αDepth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

Hugging Face Daily Papers 05/29/26, 12:00 AM Papers

Summary

αDepth introduces a layered representation with Circular Alpha Representation (CAR) to address soft boundary challenges in stereo conversion, achieving state-of-the-art performance without manual guidance.

Accurately modeling soft boundaries, e.g., hair and defocus blur, is a fundamental challenge in stereo conversion due to the ambiguous blending of foreground and background. Existing depth models primarily predict single-layer depth, leading to ambiguity in depth correspondence at soft boundaries. While matting techniques can capture opacity for layered modeling, they often struggle in complex scenes with multiple targets and usually require user intervention. This paper introduces αDepth, a layered representation that decomposes soft boundaries for high-fidelity stereo conversion. Specifically, we first resolve mixed color and depth ambiguity by estimating layered color and depth values at soft boundaries. Considering complex multi-target scenes, we design a Circular Alpha Representation (CAR) that shifts the paradigm from global target extraction to local boundary decomposition. Unlike prior matting methods restricted to a single foreground/background, CAR enables efficient scene-level inference without manual guidance. Extensive evaluations demonstrate that αDepth achieves state-of-the-art performance in stereo conversion, eliminating background bleeding and structural distortions at soft boundaries.

Original Article

View Cached Full Text

Cached at: 06/03/26, 11:37 AM

Paper page - αDepth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

Source: https://huggingface.co/papers/2606.00386

Abstract

αDepth introduces a layered representation with Circular Alpha Representation (CAR) to address soft boundary challenges in stereo conversion through local boundary decomposition and efficient scene-level inference.

Accurately modelingsoft boundaries, e.g., hair and defocus blur, is a fundamental challenge instereo conversiondue to the ambiguous blending of foreground and background. Existing depth models primarily predict single-layer depth, leading to ambiguity in depth correspondence atsoft boundaries. Whilematting techniquescan capture opacity for layered modeling, they often struggle in complex scenes with multiple targets and usually require user intervention. This paper introduces αDepth, alayered representationthat decomposessoft boundariesfor high-fidelitystereo conversion. Specifically, we first resolve mixed color and depth ambiguity by estimating layered color and depth values atsoft boundaries. Considering complex multi-target scenes, we design aCircular Alpha Representation(CAR) that shifts the paradigm from global target extraction to local boundary decomposition. Unlike prior matting methods restricted to a single foreground/background, CAR enables efficientscene-level inferencewithout manual guidance. Extensive evaluations demonstrate that αDepth achieves state-of-the-art performance instereo conversion, eliminatingbackground bleedingandstructural distortionsatsoft boundaries.

View arXiv page View PDF Add to collection

Get this paper in your agent:

hf papers read 2606\.00386

Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2606.00386 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2606.00386 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2606.00386 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

αDepth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

Paper page - αDepth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

Abstract

Models citing this paper0

Datasets citing this paper0

Spaces citing this paper0

Collections including this paper0

Similar Articles

@RuohanZhang76: Excited to introduce StereoPolicy, led by @EvansXuHan. StereoPolicy is an effective way to add geometric cues to modern…

Unified Panoramic Geometry Estimation via Multi-View Foundation Models

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking

Submit Feedback

Similar Articles

@RuohanZhang76: Excited to introduce StereoPolicy, led by @EvansXuHan. StereoPolicy is an effective way to add geometric cues to modern…

Unified Panoramic Geometry Estimation via Multi-View Foundation Models

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking