LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

arXiv cs.CL 05/20/26, 04:00 AM Papers

arabic-nlp financial-sentiment llm arabic-language sentiment-analysis saudi-market nlp-framework

Summary

This paper presents a framework for Arabic financial sentiment analysis using LLMs, tailored for the Saudi market, integrating news and social media data to capture investor sentiment.

arXiv:2605.19714v1 Announce Type: new Abstract: Investor sentiment shapes financial markets, yet modeling sentiment in Arabic financial contexts remains challenging due to linguistic complexity and limited resources. We present an Arabic NLP framework for large-scale financial sentiment analysis tailored to the Saudi market, integrating official financial news and social media to capture institutional and public investor sentiment. The framework constructs a large Arabic financial corpus through a multi-stage pipeline encompassing data collection, cleaning, deduplication, entity linking, and sentiment annotation. Transformer-based NER combined with a curated company lexicon links textual mentions to canonical company identifiers, with sentiment labels assigned using a five-class scheme. The resulting dataset of 84K samples supports company-level sentiment aggregation and analysis of sentiment dynamics relative to stock market behavior on the Saudi Exchange. Experimental results demonstrate reliable and scalable Arabic financial sentiment analysis.

Original Article

View Cached Full Text

Cached at: 05/20/26, 08:26 AM

# LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets
Source: [https://arxiv.org/abs/2605.19714](https://arxiv.org/abs/2605.19714)
[View PDF](https://arxiv.org/pdf/2605.19714)

> Abstract:Investor sentiment shapes financial markets, yet modeling sentiment in Arabic financial contexts remains challenging due to linguistic complexity and limited resources\. We present an Arabic NLP framework for large\-scale financial sentiment analysis tailored to the Saudi market, integrating official financial news and social media to capture institutional and public investor sentiment\. The framework constructs a large Arabic financial corpus through a multi\-stage pipeline encompassing data collection, cleaning, deduplication, entity linking, and sentiment annotation\. Transformer\-based NER combined with a curated company lexicon links textual mentions to canonical company identifiers, with sentiment labels assigned using a five\-class scheme\. The resulting dataset of 84K samples supports company\-level sentiment aggregation and analysis of sentiment dynamics relative to stock market behavior on the Saudi Exchange\. Experimental results demonstrate reliable and scalable Arabic financial sentiment analysis\.

## Submission history

From: Enrico Lopedoto \[[view email](https://arxiv.org/show-email/fa62f7bd/2605.19714)\] **\[v1\]**Tue, 19 May 2026 11:50:33 UTC \(563 KB\)

LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

Similar Articles

Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models

SAHM: A Benchmark for Arabic Financial and Shari'ah-Compliant Reasoning

Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model

Automated Scoring of Arabic Text Using Large Language Models: A Literature Review

Benchmarking Frontier LLMs on Arabic Cultural and Sociolinguistic Knowledge: A Cross-Evaluation Framework with Human SME Ground Truth

Submit Feedback

Similar Articles

Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models

SAHM: A Benchmark for Arabic Financial and Shari'ah-Compliant Reasoning

Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model

Automated Scoring of Arabic Text Using Large Language Models: A Literature Review

Benchmarking Frontier LLMs on Arabic Cultural and Sociolinguistic Knowledge: A Cross-Evaluation Framework with Human SME Ground Truth