arabic-nlp

#arabic-nlp

Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model

arXiv cs.CL · yesterday

This paper presents a sentiment analysis and spam detection system for Arabic tweets using the MARBERT model, trained on a dataset of 24,513 tweets to improve customer service for Saudi Telecom Company.

Analyzing and Encoding the Al-Mawrid Arabic-English Dictionary with the ISO Lexical Markup Framework and TEI Lex-0

arXiv cs.CL · 2026-06-17

This paper presents a methodology for digitizing the Al-Mawrid Arabic-English dictionary using ISO LMF and TEI Lex-0 standards, achieving high parsing accuracy and precision, and addressing gaps in Arabic lexical infrastructure.

QIAS 2026: Overview of the Shared Task on Islamic Inheritance Reasoning

arXiv cs.CL · 2026-06-15

This paper presents an overview of the QIAS 2026 shared task on Islamic inheritance reasoning, evaluating LLMs on multi-step legal and numerical reasoning using the MAWARITH benchmark.

MentalMARBERT: Domain-Adaptive Pre-training and Two-Stage Fine-Tuning for Arabic Mental Health Disorders Detection

arXiv cs.CL · 2026-06-12

This paper presents MentalMARBERT, a domain-adapted Arabic language model for detecting mental health disorders from social media text. The framework uses domain-adaptive pre-training and a two-stage fine-tuning approach, achieving 0.877 accuracy and 0.861 macro-F1 on a newly constructed Arabic mental health dataset of 50,670 tweets.

Cohesion-6K: An Arabic Dataset for Analyzing Social Cohesion and Conflict in Online Discourse

arXiv cs.CL · 2026-05-22

Introduces Cohesion-6K, a dataset of 6,000 Arabic Facebook posts about the Israeli Occupation of Palestine, annotated both manually and with ChatGPT assistance across categories ranging from conflict to cohesion. Analysis shows that conflict-oriented posts receive 2 to 4 times more engagement than resolution-oriented ones.

Audience Engagement with Arabic Women's Social Empowerment and Wellbeing: A Decadal Corpus

arXiv cs.CL · 2026-05-22

This paper presents the Arabic Women and Society Corpus, a ten-year collection of over 250,000 Arabic Facebook posts related to women's empowerment and social wellbeing, with engagement metrics for analyzing gender discourse and sentiment.

Building Arabic NLP from the Ground Up: Twenty Years of Lessons, Failures, and Open Problems

arXiv cs.CL · 2026-05-21

A comprehensive overview of twenty years of Arabic NLP research, discussing lessons, failures, and open problems in the field.

LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

arXiv cs.CL · 2026-05-20

This paper presents a framework for Arabic financial sentiment analysis using LLMs, tailored for the Saudi market, integrating news and social media data to capture investor sentiment.

SAHM: A Benchmark for Arabic Financial and Shari'ah-Compliant Reasoning

arXiv cs.CL · 2026-04-22

Researchers release SAHM, the first Arabic financial benchmark, with 14,380 expert-verified instances covering Shari'ah-compliant reasoning. Results show large performance gaps across the 20 LLMs evaluated.

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

Hugging Face Blog · 2026-04-21

QIMMA is a new quality-first Arabic LLM leaderboard introduced by TII UAE that validates benchmarks before evaluation to ensure accurate performance measurement. It addresses systematic quality issues in existing Arabic NLP benchmarks through a rigorous multi-stage validation pipeline.

QU-NLP at QIAS 2026: Multi-Stage QLoRA Fine-Tuning for Arabic Islamic Inheritance Reasoning

arXiv cs.CL · 2026-04-21

This paper presents Qatar University's multi-stage QLoRA fine-tuning approach on Qwen3-4B for Arabic Islamic inheritance reasoning. Domain adaptation on Islamic fatwa records, followed by task-specific training on 12,000 structured inheritance cases, achieves a 90% MIR-E score, matching commercial systems such as Gemini-2.5-flash with minimal computational resources.

Beyond MCQ: An Open-Ended Arabic Cultural QA Benchmark with Dialect Variants

arXiv cs.CL · 2026-04-20

This paper introduces the first parallel Arabic cultural QA benchmark spanning Modern Standard Arabic and multiple dialects, converting multiple-choice questions to open-ended formats and evaluating LLMs with chain-of-thought reasoning to address gaps in culturally grounded and dialect-specific knowledge.
