inference-routing

Tag

Cards List
#inference-routing

@Hevalon: this tuesday, i'm publishing a guide on how to build a complete Agentic system with a harness to support sandboxing, pa…

X AI KOLs Timeline · 4d ago Cached

A guide on building a secure agentic system with sandboxing, parallel sub-agents, tool calling with control policies, inference routing, and protection against injection and role escalation attacks, to be published by Evangelos Pappas.

0 favorites 0 likes
#inference-routing

IR3DE: A Linear Router for Large Language Models

Hugging Face Daily Papers · 2026-06-04 Cached

IR3DE is a ridge regression-based router that selects domain-expert LLMs for different tasks, achieving competitive performance while enabling dynamic addition or removal of experts without retraining.

0 favorites 0 likes
#inference-routing

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers

arXiv cs.LG · 2026-05-29 Cached

Introduces LoRe, a training-free wrapper that enforces per-step interaction budgets for iterative graph solvers, achieving substantial speedups and memory reductions on combinatorial optimization problems like MIS and TSP.

0 favorites 0 likes
#inference-routing

INAR-VL: Input-Aware Routing for Edge-Cloud Vision-Language Inference

arXiv cs.LG · 2026-05-20

INAR-VL proposes a lightweight routing system for edge-cloud vision-language inference that dynamically selects between edge and cloud models based on query complexity, achieving significant latency and energy reductions while preserving near-cloud accuracy.

0 favorites 0 likes
← Back to home

Submit Feedback