Tag
This paper introduces Grammatical Error Representation (GER), a novel method for retrieving in-context demonstrations based on error patterns rather than semantic similarity, significantly improving multilingual grammatical error correction performance in LLMs with in-context learning.
Proposes CoCoGEC, a counterfactual generation framework that alters error-irrelevant contexts in GEC training data to improve model robustness, achieving significant F0.5 gains on perturbed benchmarks.
Introduces ArabiGEE, the first comprehensive Arabic grammatical error explanation taxonomy with a hierarchical structure spanning orthographic, morphological, syntactic, and lexical dimensions, comprising 27 error types, 140 correction types, and 324 explanations.
This paper refines word-based grammatical error annotation for L2 Korean by addressing problems in existing resources, including surface target realization and single-reference evaluation, and demonstrates improvements using KoBART-based correction.