alpacaeval

#alpacaeval

more models more better. one expensive model is losing to three cheap ones, and there's a paper on it

Reddit r/artificial ↗ · 2d ago

A mixture-of-agents paper (arxiv 2406.04692) shows that a committee of cheap open models can outperform GPT-4o on AlpacaEval 2.0 by leveraging decorrelated errors, and the author shares similar real-world findings where multiple cheap models catch more bugs than a single expensive model.

0 favorites 0 likes

alpacaeval

more models more better. one expensive model is losing to three cheap ones, and there's a paper on it

Submit Feedback