Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model

arXiv cs.CL 06/25/26, 04:00 AM Papers

arabic-nlp sentiment-analysis spam-detection marbert customer-satisfaction twitter deep-learning

Summary

This paper presents a sentiment analysis and spam detection system for Arabic tweets using the MARBERT model, trained on a dataset of 24,513 tweets to improve customer service for Saudi Telecom Company.

arXiv:2606.25495v1 Announce Type: new Abstract: Saudi Telecom Company (STC) is among the most popular companies in Saudi Arabia, with many customers. Yet, there is still a big room for improvement in users' satisfaction. Social media is the most robust platform to gauge users' satisfaction and determine their sentiments and critics. Twitter is among the most popular social media platform in this regard. STC customers prefer to use Twitter to write their feedback because it's a fast way to get responses due to the STC customer services account. One way to achieve customer demands and improve customer service is using the Sentiment Analysis tool. Sentiment Analysis on Twitter is highly used because of the significant number of tweets and the different opinions. Likewise, Deep learning is the best existing Sentiment Analysis method, and it has diverse models. Bidirectional Encoder Representations from Transformers (BERT) model is one of the deep learning models which have achieved excellent results in Sentiment Analysis for Natural Language Processing (NLP). NLP is mainly investigated in the English language. However, for Arabic, there is a significant gap to be filled. This study trained the proposed model using MARBERT and measured the performance using f1-score, precision, and recall metrics. We trained the model with an Arabic dataset of 24,513 tweets, including 1,437 positive, 13,828 negative, 5,694 neutral, 1,221 sarcasm, and 2,297 indeterminate tweets. The main goal is to analyze the tweets and get the sentiment to improve STC customer service. The proposed scheme is promising in terms of accuracy in contrast to existing techniques in the literature.

Original Article

View Cached Full Text

Cached at: 06/25/26, 05:12 AM

# Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model
Source: [https://arxiv.org/abs/2606.25495](https://arxiv.org/abs/2606.25495)
[View PDF](https://arxiv.org/pdf/2606.25495)

> Abstract:Saudi Telecom Company \(STC\) is among the most popular companies in Saudi Arabia, with many customers\. Yet, there is still a big room for improvement in users' satisfaction\. Social media is the most robust platform to gauge users' satisfaction and determine their sentiments and critics\. Twitter is among the most popular social media platform in this regard\. STC customers prefer to use Twitter to write their feedback because it's a fast way to get responses due to the STC customer services account\. One way to achieve customer demands and improve customer service is using the Sentiment Analysis tool\. Sentiment Analysis on Twitter is highly used because of the significant number of tweets and the different opinions\. Likewise, Deep learning is the best existing Sentiment Analysis method, and it has diverse models\. Bidirectional Encoder Representations from Transformers \(BERT\) model is one of the deep learning models which have achieved excellent results in Sentiment Analysis for Natural Language Processing \(NLP\)\. NLP is mainly investigated in the English language\. However, for Arabic, there is a significant gap to be filled\. This study trained the proposed model using MARBERT and measured the performance using f1\-score, precision, and recall metrics\. We trained the model with an Arabic dataset of 24,513 tweets, including 1,437 positive, 13,828 negative, 5,694 neutral, 1,221 sarcasm, and 2,297 indeterminate tweets\. The main goal is to analyze the tweets and get the sentiment to improve STC customer service\. The proposed scheme is promising in terms of accuracy in contrast to existing techniques in the literature\.

## Submission history

From: Abrar Alotaibi \[[view email](https://arxiv.org/show-email/23359762/2606.25495)\] **\[v1\]**Wed, 24 Jun 2026 07:22:39 UTC \(1,058 KB\)

Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model

Similar Articles

MentalMARBERT: Domain-Adaptive Pre-training and Two-Stage Fine-Tuning for Arabic Mental Health Disorders Detection

LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

Automated Scoring of Arabic Text Using Large Language Models: A Literature Review

Linear Semantic Segmentation for Low-Resource Spoken Dialects

An End-to-End Hybrid Framework for Rumour Detection in Low-Resources Algerian Dialect

Submit Feedback

Similar Articles

MentalMARBERT: Domain-Adaptive Pre-training and Two-Stage Fine-Tuning for Arabic Mental Health Disorders Detection

LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

Automated Scoring of Arabic Text Using Large Language Models: A Literature Review

Linear Semantic Segmentation for Low-Resource Spoken Dialects

An End-to-End Hybrid Framework for Rumour Detection in Low-Resources Algerian Dialect