Tag
This paper evaluates how well supervised ML/DL models and prompted LLMs generalize across datasets when automatically classifying assessment questions by Bloom's taxonomy level, finding that the LLMs are more robust than the supervised models across diverse educational contexts.