Tag
This paper presents a complementary evaluation of PlanGPT, a large language model for automated planning, using plan cost and plan generation time as metrics. It finds that PlanGPT performs no better than a greedy search strategy.
The author introduces the site plan for effectiveTPS, a tool designed to compare local AI models using a new 'effective TPS' metric alongside raw speed and latency. It aims to provide a simple leaderboard that emphasizes useful output quality over raw marketing numbers.