natural-language-processing

#natural-language-processing

SARA: Unlocking Multilingual Knowledge in Mixture-of-Experts via Semantically Anchored Routing Alignment

arXiv cs.CL ↗ · 2026-06-25 Cached

This paper proposes SARA, a framework that aligns routing distributions of multilingual inputs using Jensen-Shannon divergence to improve expert sharing for low-resource languages in sparse Mixture-of-Experts models. Experiments on Qwen3-30B-A3B and Phi-3.5-MoE-instruct show improvements on multilingual benchmarks.

0 favorites 0 likes

#natural-language-processing

Automatic Generation of Highlights for Academic Paper Via Prompt-based Learning

arXiv cs.CL ↗ · 2026-06-25 Cached

This paper investigates prompt-based learning for automatically generating highlights of academic papers, using models like GPT-2, T5, and ChatGPT, and shows that ChatGPT with few-shot prompts achieves performance comparable to or better than supervised methods without requiring task-specific training data.

0 favorites 0 likes

#natural-language-processing

The cognitive, affective, and behavioral expression of self-stigma among people who use drugs in online substance use communities

arXiv cs.CL ↗ · 2026-06-25 Cached

This paper develops a codebook for self-stigma among people who use drugs and analyzes 72,115 Reddit posts to examine prevalence, co-occurrence, and temporal patterns of cognitive, affective, and behavioral stigma indicators, finding that self-stigma is expressed as an integrated phenomenon with behavioral indicators often preceding core indicators.

0 favorites 0 likes

#natural-language-processing

Automatic Part-of-Speech Tagging of Arabic-English Dictionary Senses through WordNet

arXiv cs.CL ↗ · 2026-06-24 Cached

This paper proposes a resource-light algorithm to automatically assign part-of-speech tags to senses in the Al-Mawrid Arabic-English bilingual dictionary by transferring tags from English WordNet after disambiguation, achieving high accuracy with minimal cost.

0 favorites 0 likes

#natural-language-processing

T2D-Bench: Evidence-Gated Evaluation of LLM Outputs for Type 2 Diabetes Using a Multi-Layer Clinical-Lifestyle Knowledge Graph

arXiv cs.AI ↗ · 2026-06-24 Cached

T2D-Bench is a benchmark for evaluating LLM outputs for Type 2 Diabetes using a multi-layer clinical-lifestyle knowledge graph. It reveals that current LLMs fail evidence-path checks in about a third of cases.

0 favorites 0 likes

#natural-language-processing

Exploring Academic Influence of Algorithms by Co-occurrence Network Based on Full-text of Academic Papers

arXiv cs.AI ↗ · 2026-06-24 Cached

This paper constructs large-scale algorithm co-occurrence networks from the full text of academic papers to study the collective influence of algorithms in NLP, finding that classic, high-performing, and intersectional algorithms hold central network positions.

0 favorites 0 likes

#natural-language-processing

RASC+: Retrieval-Constrained LLM Adjudication for Clinical Value Set Authoring

arXiv cs.CL ↗ · 2026-06-24 Cached

This paper introduces RASC+, a retrieval-constrained LLM adjudication method for clinical value set authoring that improves candidate-pool recall and selection precision over prior RASC baselines, demonstrating that blinded LLM adjudication with Qwen3-based retrieval significantly outperforms direct generation.

0 favorites 0 likes

#natural-language-processing

Evaluating LLM Usage for Efficient and Explainable Numerical and Classified Implicit Sentiment Analysis of Product Desirability

arXiv cs.CL ↗ · 2026-06-24 Cached

This paper presents a scalable framework using LLMs for implicit sentiment analysis of product desirability from qualitative feedback, achieving up to 0.97 Pearson correlation and 94% accuracy while providing explanations, with GPT-4o-mini offering similar performance at 94% lower cost.

0 favorites 0 likes

#natural-language-processing

Diffusion Language Models: An Experimental Analysis

arXiv cs.AI ↗ · 2026-06-20 Cached

A systematic experimental analysis evaluating eight state-of-the-art Diffusion Language Models across multiple benchmarks, analyzing trade-offs between generation quality and computational efficiency.

0 favorites 0 likes

#natural-language-processing

Why do AI systems still struggle to interpret uncertainty in human conversation?

Reddit r/artificial ↗ · 2026-06-19

The article discusses why AI systems have difficulty interpreting uncertainty and ambiguity in human conversation, highlighting ongoing challenges in natural language understanding.

0 favorites 0 likes

#natural-language-processing

@jreuben1: Speech and Language Processing (3rd ed. draft) Dan Jurafsky and James H. Martin https://web.stanford.edu/~jurafsky/slp3…

X AI KOLs Following ↗ · 2026-06-19 Cached

The Jan 6, 2026 draft of the 3rd edition of 'Speech and Language Processing' by Dan Jurafsky and James H. Martin is released, featuring a revised structure with a focus on large language models and updated chapters.

0 favorites 0 likes

#natural-language-processing

Approximate Structured Diffusion for Sequence Labelling

arXiv cs.CL ↗ · 2026-06-18 Cached

This paper introduces Approximate Structured Diffusion, a method that combines conditional random fields (CRFs) with discrete diffusion for sequence labelling. It uses a CRF conditioned on noisy label sequences and approximate mean-field inference, achieving a 16.5% error reduction on POS tagging.

0 favorites 0 likes

#natural-language-processing

PEC-Home: Interpretation of Progressively Elliptical Commands in Smart Homes

arXiv cs.CL ↗ · 2026-06-18 Cached

This paper introduces PEC-Home, a simulated home dataset for interpreting progressively elliptical commands in smart homes, and finds that current LLM-based assistants struggle with such commands due to referential and intention ambiguity.

0 favorites 0 likes

#natural-language-processing

Possible or Definite? A Benchmark for Evaluating Diagnostic Uncertainty Preservation in Clinical Text

arXiv cs.CL ↗ · 2026-06-18 Cached

This paper introduces a benchmark of 1,200 clinical documents with 9,184 uncertainty annotations to evaluate whether LLMs preserve diagnostic uncertainty in clinical text, finding that LLMs often fail to preserve original uncertainty cues and struggle with nuanced distinctions.

0 favorites 0 likes

#natural-language-processing

Speaking in Self-Assessing Tongues: On the Verbalized Confidence of LLMs in Machine Translation

arXiv cs.CL ↗ · 2026-06-17 Cached

This paper investigates verbalized methods for extracting LLM confidence in machine translation outputs, comparing them with internal token probabilities. The study finds that while both approaches perform similarly in error detection and calibration, there is little correlation between internal and verbalized confidence measures.

0 favorites 0 likes

#natural-language-processing

Characterizing Narrative Content in Web-scale LLM Pretraining Data

Hugging Face Daily Papers ↗ · 2026-06-17 Cached

A fine-grained study of narrative features in web-scale LLM pretraining data, introducing NarraBERT and NarraDolma to measure narrative patterns and their distribution across sources.

0 favorites 0 likes

#natural-language-processing

Sumi: Open Uniform Diffusion Language Model from Scratch

Hugging Face Daily Papers ↗ · 2026-06-17 Cached

Sumi is a 7B uniform diffusion language model pretrained from scratch on 1.5T tokens, achieving competitive performance on knowledge and reasoning tasks while being fully open-source with released weights and training recipe.

0 favorites 0 likes

#natural-language-processing

Distilling Examples into Task Instructions: Enhanced In-Context Learning for Real-World B2B Conversations

arXiv cs.CL ↗ · 2026-06-16 Cached

This paper introduces the Call Playbook dataset for classifying real-world B2B conversations and proposes methods to distill examples into compact, interpretable task instructions, achieving 99% token reduction and up to 7% AUC improvement over traditional in-context learning.

0 favorites 0 likes

#natural-language-processing

Do LLMs Reliably Identify Correct Information Units in Aphasic Discourse?

arXiv cs.AI ↗ · 2026-06-16 Cached

This study investigates whether instruction-tuned LLMs (Llama-3.1-8B, Qwen2.5-7B, Mistral-7B, Phi-3-mini) can reliably classify Correct Information Units in aphasic discourse transcripts. Few-shot prompting yields competitive F1 scores (0.776–0.817) for three models, but performance varies by severity and human agreement remains insufficient for fully autonomous use.

0 favorites 0 likes

#natural-language-processing

CoCoGEC: Counterfactual Generation for Robust Grammatical Error Correction

arXiv cs.CL ↗ · 2026-06-16 Cached

Proposes CoCoGEC, a counterfactual generation framework that alters error-irrelevant contexts in GEC training data to improve model robustness, achieving significant F0.5 gains on perturbed benchmarks.

0 favorites 0 likes

natural-language-processing

Submit Feedback