rubric

#rubric

Open-source procurement rubric for agentic AI vendors, I scored 5 of them and want feedback on the methodology

Reddit r/AI_Agents · 2026-06-11

The author built an open-source rubric tool that evaluates agentic AI vendor documentation on tool-call correctness, loop termination, and multi-step state coherence, used it to score five vendors (Anthropic, OpenAI, LangGraph, Sierra, Salesforce), and asks for feedback on the methodology and on a possible bias toward public documentation depth.

Using Large Language Models to Support High Volume Application Review for an Undergraduate Research Program

arXiv cs.CL · 2026-06-05

This paper describes the development of an LLM-based tool using OpenAI's GPT models to evaluate approximately 1,200 Statements of Purpose for Purdue's SURF program, processing them in 4.6 hours and accelerating the review process compared to traditional human grading.

Can Vision Language Models Be Adaptive in Mathematics Education? A Learner Model-based Rubric Study

arXiv cs.CL · 2026-05-18

This paper proposes a learner model-based rubric to evaluate the adaptivity of Vision Language Models (VLMs) in mathematics education. Experiments show measurable differences in adaptivity across models and reveal that current VLMs struggle to produce consistent learner-adaptive instructional responses.
