model-selection

Tag

Cards List
#model-selection

When Offline Selectors Cannot Beat the Best Single Model: A Diagnostic Study on edX Dropout Prediction

arXiv cs.LG · 22h ago Cached

This paper proposes a three-stage diagnostic framework to identify why offline model selectors fail to beat the best single model, applying it to dropout prediction on edX clickstream data. The study finds that the bottleneck is local representational ambiguity rather than learner choice or distribution shift, recommending state redesign or new data collection over further algorithm tuning.

0 favorites 0 likes
#model-selection

MOSAIC: Modular Orchestration for Structured Agentic Intelligence and Composition

arXiv cs.AI · 2d ago Cached

MOSAIC introduces a structured agentic framework for automated data science that uses memory-grounded model selection and workflow construction, validated on financial time-series tasks. It outperforms AutoML and agentic baselines.

0 favorites 0 likes
#model-selection

From Rashomon Theory to PRAXIS: Efficient Decision Tree Rashomon Sets

arXiv cs.LG · 2d ago Cached

PRAXIS is a new algorithm that efficiently approximates the Rashomon set of near-optimal decision trees, achieving orders of magnitude improvement in runtime and memory while maintaining near-perfect recall.

0 favorites 0 likes
#model-selection

@svpino: How to match the complexity of the problem you want to solve with the proper model. You want an inference router. In th…

X AI KOLs Following · 2026-05-26 Cached

Svpino demonstrates how to use an inference router to match problem complexity with the appropriate AI model, suggesting users should stop interacting with models directly.

0 favorites 0 likes
#model-selection

HyDRA: Hybrid Dynamic Routing Architecture for Heterogeneous LLM Pools

arXiv cs.CL · 2026-05-19 Cached

HyDRA is a hybrid dynamic routing architecture for heterogeneous LLM pools that predicts fine-grained capability requirements per query and selects the cheapest capable model via shortfall matching, achieving up to 72.5% cost savings with quality maintained. It is deployed in GitHub Copilot's VS Code Chat auto-mode and decouples routing from model catalog, requiring no retraining when models change.

0 favorites 0 likes
#model-selection

Forecasting Downstream Performance of LLMs With Proxy Metrics

Hugging Face Daily Papers · 2026-05-18 Cached

This paper introduces proxy metrics based on token-level statistics from expert-written solutions to forecast downstream LLM performance, significantly outperforming loss-based methods in model selection, pretraining data selection, and training-time forecasting.

0 favorites 0 likes
#model-selection

The Frontier-Only Narrative Is a Financing Story, Not an Architecture Story

Reddit r/artificial · 2026-05-15

This article argues that the narrative that only frontier AI models are necessary for production is driven by financing needs, not architectural reality. It highlights that smaller, efficient models like Phi-4, Claude Haiku, and routing solutions like RouteLLM offer cost-effective alternatives, and most enterprises waste tokens by defaulting to large models.

0 favorites 0 likes
#model-selection

Layer-wise Representation Dynamics: An Empirical Investigation Across Embedders and Base LLMs

arXiv cs.LG · 2026-05-14 Cached

This paper introduces Layer-wise Representation Dynamics (LRD), a framework with three measurement families to analyze how hidden states change across layers in language models. Applied to 31 models on 30 MTEB tasks, LRD reveals architectural differences and enables label-free model selection and inference-time layer pruning.

0 favorites 0 likes
#model-selection

Same model, different harness: 30-50 point performance swing. But teams still pick agents by model name.

Reddit r/AI_Agents · 2026-05-09

The article highlights that agent harnesses cause a 30-50 point performance swing compared to model selection, arguing that teams should focus on instance-level verification rather than just model names.

0 favorites 0 likes
#model-selection

@amitiitbhu: New article: LLM Routing Read here: https://outcomeschool.com/blog/llm-routing…

X AI KOLs Timeline · 2026-05-09 Cached

A tutorial blog post explaining LLM Routing — the practice of directing user queries to the most appropriate LLM based on cost, latency, and quality. Covers routing strategies, anatomy of an LLM router, and comparisons with Mixture of Experts.

0 favorites 0 likes
#model-selection

Toto

Product Hunt · 2026-05-08

Toto is a tool that routes context-rich tasks to the best AI model for the job.

0 favorites 0 likes
← Back to home

Submit Feedback