educational-ai

#educational-ai

Evaluating Nonuniform Dependability Across Response Conditions: A Conditional Generalizability Framework Illustrated in Automated Essay Scoring

arXiv cs.CL ↗ · 2026-07-15 Cached

This paper proposes a conditional generalizability framework to evaluate nonuniform dependability across response conditions in automated essay scoring.

0 favorites 0 likes

#educational-ai

VectorizationLLM: Smart Vectorization Based AI Assistant

arXiv cs.AI ↗ · 2026-07-10 Cached

This paper presents VectorizationLLM, a specialized LLM built on Google open-weight models and a RAG knowledge base, designed to help students learn smart vectorization, Fourier analysis, and differential equations in MATLAB without providing direct answers.

0 favorites 0 likes

#educational-ai

Cross-Dataset Bloom Question Classification: Supervised Models and Prompted LLMs

arXiv cs.CL ↗ · 2026-06-15 Cached

This paper evaluates cross-dataset generalization of supervised ML/DL models and prompted LLMs for automatic Bloom's taxonomy classification of assessment questions, finding that LLMs are more robust across diverse educational contexts.

0 favorites 0 likes

#educational-ai

IntElicit: Eliciting and Assessing Contextualized Creativity via Dialogue Policy Optimization

arXiv cs.AI ↗ · 2026-06-11 Cached

IntElicit is a framework that uses dialogue policy optimization with a decomposed process reward mechanism to elicit and assess contextualized creativity through adaptive AI interviewing, reducing confounders like domain knowledge and engagement. Experiments show it improves creative outcomes over static assessment methods.

0 favorites 0 likes

#educational-ai

Measuring the impact of learning with AI in Sierra Leone and beyond

Google DeepMind Blog ↗ · 2026-06-08 Cached

A pre-registered trial in Sierra Leone found that AI-powered Guided Learning significantly improved math scores, achieving 1.2 to 1.7 years of progress in eight weeks, while teachers reported enhanced professional growth and a shift toward facilitation roles.

0 favorites 0 likes

#educational-ai

Elmes*: Automated Construction of Fine-Grained Evaluation Rubrics for Large Language Models in Long-Tail Educational Scenarios

arXiv cs.LG ↗ · 2026-06-08 Cached

This paper introduces Elmes+, an automated framework for constructing fine-grained evaluation rubrics for LLMs in long-tail educational scenarios, and presents the Edu-330 benchmark covering 330 scenarios across 11 subjects. The framework uses a multi-agent engine and self-evolving module to co-optimize evaluation criteria and test data, revealing multidimensional educational capability differences among top LLMs.

0 favorites 0 likes

#educational-ai

TeachObs: A Human-Validated Benchmark for Multimodal Teaching Observation and Model Evaluation

arXiv cs.CL ↗ · 2026-06-01 Cached

TeachObs introduces a human-validated benchmark for multimodal teaching observation, consisting of 30 classroom videos annotated with segment-level binary codes and lesson-level expert ratings, and evaluates five frontier LLMs across three tracks, finding no single model consistently outperforms and that model evaluations overrate procedurally clear lessons.

0 favorites 0 likes

#educational-ai

Towards Just-in-Time Adaptive Feedback: Enhancing Student Learning via Knowledge-Grounded LLM

arXiv cs.CL ↗ · 2026-05-27 Cached

This paper presents a framework that uses domain-specific expert knowledge to ground large language models for providing Just-in-Time adaptive feedback to students based on their written reasoning, achieving over 80% improvement in student performance in a large university course.

0 favorites 0 likes

#educational-ai

Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation

arXiv cs.CL ↗ · 2026-05-26 Cached

This paper presents a modular pipeline for educational analogy generation, decomposing the task into four stages and evaluating 12 LLMs and 7 embedding models. Results show that sub-concept grounding improves explanation quality and retrieval precision, with a novel LLM-as-a-judge evaluation validated against human annotations.

0 favorites 0 likes

#educational-ai

Agentic AI Ecosystems in Higher Education: A Perspective on AI Agents to Emerging Inclusive, Agentic Multi-Agent AI Framework for Learning, Teaching and Institutional Intelligence

arXiv cs.AI ↗ · 2026-05-15 Cached

This paper presents a forward-looking perspective on agentic multi-agent AI platforms in higher education, addressing the need for integrated, inclusive systems that support learning, teaching, and institutional operations. It identifies gaps in current fragmented AI tools and proposes directions for scalable, human-aligned multi-agent ecosystems.

0 favorites 0 likes

#educational-ai

RETUYT-INCO at BEA 2026 Shared Task 2: Meta-prompting in Rubric-based Scoring for German

arXiv cs.CL ↗ · 2026-05-13 Cached

This paper details the RETUYT-INCO team's participation in the BEA 2026 Shared Task 2, introducing a meta-prompting approach for rubric-based scoring of German short answers.

0 favorites 0 likes

#educational-ai

MBP-KT: Learning Global Collaborative Information from Meta-Behavioral Pattern for Enhanced Knowledge Tracing

arXiv cs.AI ↗ · 2026-05-12 Cached

This paper introduces MBP-KT, a framework for enhanced knowledge tracing that leverages meta-behavioral patterns to extract global collaborative information from learner interactions, improving performance across various downstream models.

0 favorites 0 likes

#educational-ai

Improving Lexical Difficulty Prediction with Context-Aligned Contrastive Learning and Ridge Ensembling

arXiv cs.CL ↗ · 2026-05-12 Cached

This paper introduces Context-Aligned Contrastive Regression to improve lexical difficulty prediction by addressing cross-lingual alignment and ordinal structure challenges in language learning datasets.

0 favorites 0 likes

#educational-ai

NSMQ Riddles: A Benchmark of Scientific and Mathematical Riddles for Quizzing Large Language Models

arXiv cs.CL ↗ · 2026-05-11 Cached

This paper introduces NSMQ Riddles, a novel benchmark using scientific and mathematical riddles from Ghana's National Science and Maths Quiz to evaluate Large Language Models, addressing the underrepresentation of Global South datasets in AI research.

0 favorites 0 likes

#educational-ai

Evaluating Adaptive Personalization of Educational Readings with Simulated Learners

arXiv cs.CL ↗ · 2026-04-21 Cached

Researchers from Arizona State University present a framework for evaluating adaptive personalization of educational reading materials using theory-grounded simulated learners, incorporating memory models, misconception revision, and Bayesian Knowledge Tracing. Experiments across three subjects show adaptive reading significantly improved outcomes in computer science but had mixed results in chemistry and biology.

0 favorites 0 likes

educational-ai

Submit Feedback