Towards a Linguistic Evaluation of Narratives: A Quantitative Stylistic Framework
Summary
A preprint proposes a 33-feature quantitative linguistic framework that distinguishes professionally edited from self-published books and outperforms existing story-level evaluation metrics.
# Towards a Linguistic Evaluation of Narratives: A Quantitative Stylistic Framework

Source: [https://arxiv.org/abs/2604.19261](https://arxiv.org/abs/2604.19261) · [View PDF](https://arxiv.org/pdf/2604.19261)

> Abstract: The evaluation of narrative quality remains a complex challenge, as it involves subjective factors such as plot, character development, and emotional impact. This work proposes a quantitative approach to narrative assessment by focusing on the linguistic dimension as a primary indicator of quality. The paper presents a methodology for the automatic evaluation of narrative based on the extraction of a comprehensive set of 33 quantitative linguistic features categorized into lexical, syntactic, and semantic groups. To test the model, an experiment was conducted on a specialized corpus of 23 books, including canonical masterpieces and self-published works. Through a similarity matrix, the system successfully clustered the narratives, distinguishing almost perfectly between professionally edited and self-published texts. Furthermore, the methodology was validated against a human-annotated dataset; it significantly outperforms traditional story-level evaluation metrics, demonstrating the effectiveness of quantitative linguistic features in assessing narrative quality.

## Submission history

From: Alessandro Maisto [[view email](https://arxiv.org/show-email/2b9a5b8d/2604.19261)]

**[v1]** Tue, 21 Apr 2026 09:21:40 UTC (827 KB)
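To make the abstract's pipeline concrete, here is a minimal sketch of the general approach it describes: extract quantitative stylistic features per text, build a cosine similarity matrix, and cluster. The paper's 33 features are not enumerated on this page, so the handful below are hypothetical lexical/syntactic stand-ins, not the authors' actual feature set, and the clustering method is an assumed choice (average-linkage agglomerative via SciPy).

```python
# Illustrative sketch only: features and clustering method are assumptions,
# not the paper's documented implementation.
import re
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import squareform

def extract_features(text: str) -> np.ndarray:
    """A few quantitative stylistic features for one text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[a-z']+", text.lower())
    n = max(len(words), 1)
    counts: dict[str, int] = {}
    for w in words:
        counts[w] = counts.get(w, 0) + 1
    return np.array([
        len(counts) / n,                                # type-token ratio (lexical)
        sum(1 for c in counts.values() if c == 1) / n,  # hapax legomena ratio (lexical)
        sum(map(len, words)) / n,                       # mean word length (lexical)
        n / max(len(sentences), 1),                     # mean sentence length (syntactic)
        text.count(",") / n,                            # comma density (syntactic proxy)
    ])

def similarity_matrix(texts: list[str]) -> np.ndarray:
    """Cosine similarity between z-scored feature vectors, one row per text."""
    X = np.vstack([extract_features(t) for t in texts])
    X = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-9)   # put features on one scale
    X /= np.linalg.norm(X, axis=1, keepdims=True) + 1e-9
    return X @ X.T

def cluster(texts: list[str], k: int = 2) -> np.ndarray:
    """Agglomerative clustering on (1 - similarity) distances."""
    S = similarity_matrix(texts)
    D = np.clip(1.0 - S, 0.0, None)
    np.fill_diagonal(D, 0.0)
    Z = linkage(squareform(D, checks=False), method="average")
    return fcluster(Z, t=k, criterion="maxclust")       # one cluster label per text
```

In the paper's experiment, the analogous step is applied to a corpus of 23 books, and with k = 2 the clusters separate professionally edited from self-published texts almost perfectly; the sketch above only shows the shape of such a pipeline, not its reported feature set or results.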
Similar Articles
BIASEDTALES-ML: A Multilingual Dataset for Analyzing Narrative Attribute Distributions in LLM-Generated Stories
Researchers introduce BIASEDTALES-ML, a large-scale multilingual dataset of ~350,000 LLM-generated children's stories across eight languages, designed to analyze narrative attribute distributions and cross-lingual bias patterns in language model outputs. The work reveals significant cross-lingual variability, highlighting limitations of English-centric bias evaluations.
Reward Modeling for Scientific Writing Evaluation
This paper proposes SciRM, a family of cost-efficient open-source reward models tailored for evaluating scientific writing, trained through a two-stage framework that optimizes evaluation preferences and reasoning capabilities. The models generalize across diverse scientific writing tasks without requiring task-specific retraining, addressing limitations of existing LLM-based judges on domain-specific evaluation criteria.
Saying More Than They Know: A Framework for Quantifying Epistemic-Rhetorical Miscalibration in Large Language Models
Introduces a framework to quantify how LLMs overstate certainty through rhetorical devices, revealing model-agnostic patterns of epistemic-rhetorical miscalibration.
SwanNLP at SemEval-2026 Task 5: An LLM-based Framework for Plausibility Scoring in Narrative Word Sense Disambiguation
SwanNLP presents an LLM-based framework for plausibility scoring in narrative word sense disambiguation at SemEval-2026 Task 5, using structured reasoning and dynamic few-shot prompting to predict human-perceived plausibility of word senses in short stories. The work demonstrates that commercial large-parameter LLMs with few-shot prompting and model ensembling effectively replicate human judgment patterns in realistic narrative contexts.
From Benchmarking to Reasoning: A Dual-Aspect, Large-Scale Evaluation of LLMs on Vietnamese Legal Text
This paper presents a comprehensive dual-aspect evaluation framework for large language models on Vietnamese legal text simplification, combining quantitative benchmarking (Accuracy, Readability, Consistency) with qualitative error analysis across GPT-4o, Claude 3 Opus, Gemini 1.5 Pro, and Grok-1.