resource-management

#resource-management

The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs

arXiv cs.AI ↗ · 6d ago Cached

This paper formulates LLM inference budget allocation as a constrained optimization problem, proposing CLEAR to reallocate resources from low-utility queries to those near emergence thresholds, achieving up to 3× accuracy improvement under tight budgets.

0 favorites 0 likes

#resource-management

HADT: A Heterogeneous Multi-Agent Differential Transformer for Autonomous Earth Observation Satellite Cluster

arXiv cs.AI ↗ · 2026-06-01 Cached

This paper proposes HADT, a transformer-based architecture for autonomous resource management in heterogeneous satellite clusters for Earth observation, using differential attention and relational tokenization. Experiments show significant improvements over baselines and strong adaptability to varying cluster sizes.

0 favorites 0 likes

#resource-management

Online Allocation with Unknown Shared Supply

arXiv cs.AI ↗ · 2026-05-11 Cached

This paper introduces the Online Shared Supply Allocation problem and proposes a deterministic threshold-proportional policy (GPA) that achieves a 4/3-approximation to the offline optimum. It also includes a learning-augmented extension to handle imperfect forecasts and demonstrates superior performance in synthetic and real-world experiments.

0 favorites 0 likes

#resource-management

Retrieval-Conditioned Topology Selection with Provable Budget Conservation for Multi-Agent Code Generation

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces RGAO, a retrieval-guided adaptive orchestration framework for multi-agent code generation that dynamically selects topology based on code complexity. It provides a formal budget algebra ensuring provable resource conservation while significantly reducing routing errors compared to baseline methods.

0 favorites 0 likes

#resource-management

2b or not 2b ? Custom LLM Scheduling Competition [P]

Reddit r/MachineLearning ↗ · 2026-04-23

A Kaggle competition challenges participants to build a scheduler that decides whether to run a 2B-parameter model on MMLU questions to minimize a weighted cost based on compute and accuracy trade-offs.

0 favorites 0 likes

resource-management

The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs

HADT: A Heterogeneous Multi-Agent Differential Transformer for Autonomous Earth Observation Satellite Cluster

Online Allocation with Unknown Shared Supply

Retrieval-Conditioned Topology Selection with Provable Budget Conservation for Multi-Agent Code Generation

2b or not 2b ? Custom LLM Scheduling Competition [P]

Submit Feedback