LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

arXiv cs.CL Papers

Summary

This paper presents a framework for Arabic financial sentiment analysis using LLMs, tailored for the Saudi market, integrating news and social media data to capture investor sentiment.

arXiv:2605.19714v1 Announce Type: new Abstract: Investor sentiment shapes financial markets, yet modeling sentiment in Arabic financial contexts remains challenging due to linguistic complexity and limited resources. We present an Arabic NLP framework for large-scale financial sentiment analysis tailored to the Saudi market, integrating official financial news and social media to capture institutional and public investor sentiment. The framework constructs a large Arabic financial corpus through a multi-stage pipeline encompassing data collection, cleaning, deduplication, entity linking, and sentiment annotation. Transformer-based NER combined with a curated company lexicon links textual mentions to canonical company identifiers, with sentiment labels assigned using a five-class scheme. The resulting dataset of 84K samples supports company-level sentiment aggregation and analysis of sentiment dynamics relative to stock market behavior on the Saudi Exchange. Experimental results demonstrate reliable and scalable Arabic financial sentiment analysis.
Original Article
View Cached Full Text

Cached at: 05/20/26, 08:26 AM

# LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets
Source: [https://arxiv.org/abs/2605.19714](https://arxiv.org/abs/2605.19714)
[View PDF](https://arxiv.org/pdf/2605.19714)

> Abstract:Investor sentiment shapes financial markets, yet modeling sentiment in Arabic financial contexts remains challenging due to linguistic complexity and limited resources\. We present an Arabic NLP framework for large\-scale financial sentiment analysis tailored to the Saudi market, integrating official financial news and social media to capture institutional and public investor sentiment\. The framework constructs a large Arabic financial corpus through a multi\-stage pipeline encompassing data collection, cleaning, deduplication, entity linking, and sentiment annotation\. Transformer\-based NER combined with a curated company lexicon links textual mentions to canonical company identifiers, with sentiment labels assigned using a five\-class scheme\. The resulting dataset of 84K samples supports company\-level sentiment aggregation and analysis of sentiment dynamics relative to stock market behavior on the Saudi Exchange\. Experimental results demonstrate reliable and scalable Arabic financial sentiment analysis\.

## Submission history

From: Enrico Lopedoto \[[view email](https://arxiv.org/show-email/fa62f7bd/2605.19714)\] **\[v1\]**Tue, 19 May 2026 11:50:33 UTC \(563 KB\)

Similar Articles

Agentic Trading: When LLM Agents Meet Financial Markets

arXiv cs.AI

This paper presents a systematic survey and evidence map of 77 studies on LLM-based trading agents, finding that architectural experimentation is expanding rapidly but evaluation protocols, execution semantics, and reproducibility remain critical bottlenecks.

Consistency Analysis of Sentiment Predictions using Syntactic & Semantic Context Assessment Summarization (SSAS)

arXiv cs.CL

This paper presents SSAS (Syntactic & Semantic Context Assessment Summarization), a framework designed to improve consistency in LLM-based sentiment prediction by reducing noise and variance through hierarchical classification and iterative summarization. Empirical evaluation on three industry-standard datasets shows up to 30% improvement in data quality and reliability for enterprise decision-making.

Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents

arXiv cs.LG

This paper investigates the behavioral alignment and representation dynamics of LLM agents in financial trading, introducing the TradeArena testbed and finding measurable pre-failure signatures in planning embeddings that can predict drawdowns with high accuracy across multiple frontier models and stress conditions.