Function2Scene: 3D Indoor Scene Layout from Functional Specifications
Summary
Function2Scene generates 3D indoor layouts from functional descriptions by parsing user needs and applying design constraints through an iterative refinement loop combining geometric analysis, LLM reasoning, and VLM assessment, outperforming baselines in satisfying functional requirements.
View Cached Full Text
Cached at: 06/01/26, 03:18 AM
Paper page - Function2Scene: 3D Indoor Scene Layout from Functional Specifications
Source: https://huggingface.co/papers/2605.30819
Abstract
Function2Scene generates 3D indoor layouts from functional descriptions by parsing user needs and applying design constraints through an iterative refinement process combining geometric analysis, language modeling, and visual assessment.
Mosttext-driven 3D indoor scene synthesismethods generate rooms from object-centric prompts, asking what furniture should be placed rather than how the space is used. Yet in real interior design, a layout is judged by how well it supports its occupants, e.g., theiractivitiesand physical needs. We introduce Function2Scene, a framework for generating 3D indoor layouts fromfunctional specifications, i.e.,natural-language design briefsdescribing who will use a room and what they need to do there. Given such a specification, our system parsesoccupant personasandactivities, derives a customized set offunctional design constraintsfrom ataxonomy of 17 criteriaspanning spatial, ergonomic, activity, and environmental considerations, and uses these constraints to guide layout generation. Rather than relying on an LLM to directly produce a final scene, Function2Scene performs iterative evaluation and refinement through a tool-augmentedcheck-and-repair loop, combininggeometric measurements,LLM-based contextual reasoning, andVLM-based visual assessment. Experiments on 30 professionally written interior-design cases show that Function2Scene produces layouts that better satisfy functional requirements than recent LLM-based scene synthesis baselines, with our results preferred in 94.3% of pairwise comparisons. Our work reframes text-driven indoor scene synthesis from placing plausible objects to designing spaces that support human use.
View arXiv pageView PDFProject pageAdd to collection
Get this paper in your agent:
hf papers read 2605\.30819
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2605.30819 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2605.30819 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2605.30819 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
@Av1dlive: Andrej Karpathy quietly shipped the best second brain idea in years not an app. a pattern. let an llm maintain a wiki o…
Andrej Karpathy popularized a pattern where an LLM maintains a wiki of notes from personal sources, and a free Claude Code plugin for Obsidian implements it with simple commands, enabling queryable and compounding knowledge.
@RealYDT: https://x.com/RealYDT/status/2065641164016120201
Microsoft has launched the MarkItDown tool, which converts various document formats (such as PDF, Word, Excel) into Markdown, making it easier for large models and RAG systems to process enterprise data.
@tferriss: NEW blog post is up! "Has AI Already Killed How-To Nonfiction? Sales Trends, My Personal Data, and What It Might Mean f…
Tim Ferriss shares a blog post analyzing how AI and LLMs have disrupted how-to nonfiction book sales, using his own sales data as evidence.
I Am Not a Reverse Centaur
Miguel Grinberg explains his refusal to accept unsolicited pull requests generated by LLMs, insisting on human involvement to avoid becoming a 'reverse centaur'.
SecureLens - A self-hosted AppSec agent and CLI scanner
SecureLens is an open-source, self-hosted security auditing tool that uses an LLM-powered async pipeline to triage codebases and probe web infrastructure, with an interactive REPL for follow-up questions and patch generation.