Tag
A guide on building a secure agentic system with sandboxing, parallel sub-agents, tool calling with control policies, inference routing, and protection against injection and role escalation attacks, to be published by Evangelos Pappas.
IR3DE is a ridge regression-based router that selects domain-expert LLMs for different tasks, achieving competitive performance while enabling dynamic addition or removal of experts without retraining.
Introduces LoRe, a training-free wrapper that enforces per-step interaction budgets for iterative graph solvers, achieving substantial speedups and memory reductions on combinatorial optimization problems like MIS and TSP.
INAR-VL proposes a lightweight routing system for edge-cloud vision-language inference that dynamically selects between edge and cloud models based on query complexity, achieving significant latency and energy reductions while preserving near-cloud accuracy.