Semantic distance as routing layer: an on-device, serverless alternative to the central-index model
Summary
Proposes a decentralized information discovery system using on-device embedding models and peer-to-peer gossip, eliminating the need for central indexes like search engines.
Similar Articles
Decoupling Search from Reasoning: A Vendor-Agnostic Grounding Architecture for LLM Agents
This paper presents Decoupled Search Grounding (DSG), a vendor-agnostic architecture that separates search retrieval from LLM reasoning, enabling explicit control over provider routing, caching, and output contracts. Experiments show DSG nearly matches native search accuracy at 91% lower cost and 68% lower latency.
The Data Center Moves to Your Machine (4 minute read)
Perplexity unveiled a hybrid local-cloud inference system at Computex 2026 that intelligently routes queries between on-device and cloud models, building on its earlier Personal Computer agent.
On-Device Neural Architecture Search
Proposes a lightweight neural architecture search performed directly on the deployment device for near-sensor computing, validated on sEMG sign language and fault diagnosis datasets, achieving improved accuracy and reduced RAM occupancy.
Self-Describing Structured Data with Dual-Layer Guidance: A Lightweight Alternative to RAG for Precision Retrieval in Large-Scale LLM Knowledge Navigation
SDSR proposes lightweight self-describing structured data with dual-layer guidance to exploit LLM primacy bias, achieving 100% routing accuracy without vector DBs.
Rethinking Cross-Layer Information Routing in Diffusion Transformers
This paper proposes Diffusion-Adaptive Routing (DAR), a learnable, timestep-adaptive residual replacement that improves cross-layer information flow in Diffusion Transformers, leading to significant training acceleration and quality improvements.