Angeliki Giannou, co-inventor of Looped Transformers, has successfully defended her PhD thesis and is set to begin a new role. Dimitris Papailiopoulos shared his congratulations on social media.
This paper presents a large-scale analysis of four harmful language detection datasets, examining how annotator characteristics and linguistic features interact to influence annotation variation. It highlights intersectional effects and warns against generalizing findings across different datasets.
This paper details the YEZE system for SemEval-2026 Task 9, which detects online polarization in 22 languages using a heterogeneous ensemble of XLM-RoBERTa and mDeBERTa models.
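A heterogeneous ensemble like YEZE's typically combines its members' per-class probabilities by soft voting. The paper does not publish its exact aggregation rule, so the sketch below shows the standard averaged-probability variant with illustrative names; it is an assumption, not the system's code.

```python
def soft_vote(model_probs):
    """Soft voting: average each class's probability across ensemble
    members and return the index of the highest-scoring class.

    model_probs: list of per-model probability vectors, e.g. one from
    an XLM-RoBERTa head and one from an mDeBERTa head (illustrative).
    """
    n_models = len(model_probs)
    n_classes = len(model_probs[0])
    # Mean probability per class over all models.
    avg = [sum(p[c] for p in model_probs) / n_models for c in range(n_classes)]
    return max(range(n_classes), key=avg.__getitem__)


# Two of three models lean toward class 1, so the ensemble picks it
# even though one confident model prefers class 0.
print(soft_vote([[0.6, 0.4], [0.3, 0.7], [0.2, 0.8]]))  # → 1
```

Soft voting lets a confident minority model be outvoted only when the majority's combined probability mass is larger, which is why it usually beats hard majority voting on closely related classifiers.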
This paper introduces IRC-Bench, a benchmark for recognizing implicit entities in first-person reminiscences using contextual cues rather than explicit mentions. It evaluates various LLM and retrieval configurations, finding QLoRA-adapted Llama 3.1 8B to be the top performer in open-world settings.
This paper proposes an evidence-based model to automatically generate query keywords from query-free summarization datasets, enabling the creation of query-focused summarization datasets. Experimental results show that summaries generated using evidence-based queries achieve competitive ROUGE scores compared to original queries.
A researcher named Hongcan Guo teases a brand-new approach to text modeling, but the tweet provides no technical details.
Hugging Face has released version 5.8.0 of the Transformers library, a widely used open-source framework for natural language processing and deep learning.
Researchers develop KokborokMT, a neural MT system for the low-resource Kokborok language, achieving BLEU scores of 17.30 en→trp and 38.56 trp→en by fine-tuning NLLB-200 on a 36k-sentence parallel corpus.
Hugging Face released version 5.6.0 of its popular transformers library.
This paper proposes Product-of-Experts (PoE) training to reduce dataset artifacts in Natural Language Inference, downweighting examples where biased models are overconfident. PoE nearly preserves accuracy on SNLI (89.10% vs. 89.30%) while reducing bias reliance by ~4.85 percentage points.
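The core of Product-of-Experts debiasing is that the main model's predictions are multiplied with a biased model's predictions (equivalently, their log-probabilities are summed) before computing the training loss, so examples the biased model already gets right contribute little gradient. A minimal sketch of that combination step, with function names that are illustrative rather than taken from the paper:

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def poe_logits(main_logits, bias_logits):
    """Product of Experts: multiply the two distributions by summing
    their log-probabilities. Training cross-entropy is then taken on
    these combined scores, with gradients flowing only into the main
    model (the biased model is frozen)."""
    return [a + b for a, b in zip(log_softmax(main_logits),
                                  log_softmax(bias_logits))]


# Main model is undecided, biased model is confident in class 0:
# the combined score is dominated by the bias, so the cross-entropy
# loss on a class-0 label is already low and the main model is
# pushed only weakly on this artifact-laden example.
combined = poe_logits([1.0, 1.0, 1.0], [3.0, 0.0, 0.0])
```

At test time the biased model is dropped and the main model is used alone, which is why accuracy stays close to the baseline while reliance on artifacts falls.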
University of Memphis researchers propose HAMR, a model-agnostic meta-learning framework that uses bi-level optimization and neighborhood-aware resampling to adaptively reweight hard examples and minority classes across six imbalanced NLP datasets.
Researchers from National Taiwan University propose replacing fixed translation-based prompting strategies in multilingual LLMs with lightweight learned classifiers that route each instance to either native or translation-based prompting. Their analysis across 10 languages and 4 benchmarks shows no single strategy is universally optimal, with translation benefiting low-resource languages most, and the learned routing achieving statistically significant improvements over fixed strategies.
MeasHalu is a novel framework for mitigating scientific measurement hallucinations in LLMs through a two-stage reasoning-aware fine-tuning strategy and progressive reward curriculum. It introduces a fine-grained taxonomy of measurement-specific hallucinations and demonstrates improved accuracy on the MeasEval benchmark.
Researchers from Tianjin University and Alibaba Group propose EA-RLVR, a reinforcement learning framework with verifiable rewards that improves cross-cultural entity translation in LLMs by activating parametric knowledge already encoded during pre-training, without relying on external knowledge bases. Training on 7k samples boosts Qwen3-14B's entity translation accuracy from 23.66% to 31.87% on unseen entities.
Researchers from the University of British Columbia propose an unsupervised graph-based system for organizing arguments from online debates by constructing interaction graphs and applying community detection to reveal diverse viewpoint distributions. The approach requires no training data and aims to help users navigate complex argumentative landscapes and combat filter bubbles.
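The paper does not specify which community-detection algorithm it applies to the interaction graphs, so as a hedged illustration of the general idea, here is the classic label-propagation heuristic in pure Python: every argument node starts in its own community and repeatedly adopts the label most common among its neighbours, so densely interacting clusters of arguments converge to shared labels without any training data.

```python
def label_propagation(adj, iters=10):
    """Asynchronous label propagation for community detection.

    adj maps each node to a list of neighbours (assumed symmetric).
    Nodes are assumed to be integers here purely so ties can be
    broken deterministically by the smallest label.
    """
    labels = {n: n for n in adj}
    for _ in range(iters):
        for n in sorted(adj):
            counts = {}
            for nb in adj[n]:
                counts[labels[nb]] = counts.get(labels[nb], 0) + 1
            if counts:
                # Most frequent neighbour label; smallest label wins ties.
                labels[n] = min(counts, key=lambda l: (-counts[l], l))
    return labels


# Two triangles joined by nothing: nodes 0-2 and 3-5 each collapse
# into a single community.
graph = {0: [1, 2], 1: [0, 2], 2: [0, 1],
         3: [4, 5], 4: [3, 5], 5: [3, 4]}
communities = label_propagation(graph)
```

Real systems typically use sturdier methods (e.g. Louvain-style modularity optimization), but the propagation loop above captures why no labels or training are needed: community structure emerges from the edge pattern alone.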
This paper investigates how informal text (slang, emoji, Gen-Z filler tokens) degrades NLI accuracy in ELECTRA-small and RoBERTa-large models, identifying two distinct failure mechanisms—tokenization failure (emoji mapped to [UNK]) and distribution shift (out-of-domain noise tokens)—and proposes targeted mitigations that recover accuracy without harming clean-text performance.
This position paper argues that audio misinformation on platforms like podcasts and WhatsApp voice notes is structurally different from text-based misinformation, carrying unique persuasive properties through prosody and conversational dynamics that existing fact-checking pipelines fail to address. The authors call for a rethinking of verification pipelines tailored to the spoken and conversational nature of audio media.
Researchers from Arizona State University present a framework for evaluating adaptive personalization of educational reading materials using theory-grounded simulated learners, incorporating memory models, misconception revision, and Bayesian Knowledge Tracing. Experiments across three subjects show adaptive reading significantly improved outcomes in computer science but had mixed results in chemistry and biology.
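Bayesian Knowledge Tracing, one component of the simulated learners above, maintains a single mastery probability per skill: after each response it conditions that probability on whether the answer was correct (accounting for slips and guesses), then applies a learning transition. A minimal sketch of one BKT step; the parameter values are illustrative defaults, not the paper's fitted values:

```python
def bkt_update(p_mastery, correct, p_transit=0.3, p_slip=0.1, p_guess=0.2):
    """One Bayesian Knowledge Tracing step.

    First apply Bayes' rule to condition mastery on the observed
    response, then add the probability of learning the skill on
    this opportunity (the transition step).
    """
    if correct:
        num = p_mastery * (1 - p_slip)              # mastered and no slip
        den = num + (1 - p_mastery) * p_guess       # ... or lucky guess
    else:
        num = p_mastery * p_slip                    # mastered but slipped
        den = num + (1 - p_mastery) * (1 - p_guess)
    p_cond = num / den
    return p_cond + (1 - p_cond) * p_transit


# A correct answer raises the mastery estimate; a wrong one lowers it,
# though the transition term keeps it from collapsing to zero.
after_correct = bkt_update(0.5, True)
after_wrong = bkt_update(0.5, False)
```

Running the update over a response sequence yields the per-skill mastery trajectory that an adaptive reader can use to decide when to advance or revisit material.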
This paper presents a hybrid framework for detecting alarming or distressed student verbal responses by combining a text classifier (content-based) and an audio classifier (prosodic features), aimed at expediting human review in Automated Verbal Response Scoring systems. The approach addresses a safety gap in automated scoring pipelines where at-risk student responses may otherwise go unnoticed.
Researchers from University of Utah and CMU propose FragMend, an interpretability-based approach for vocabulary expansion in LLMs that addresses token over-fragmentation in non-Latin script languages. Their method outperforms frequency-based vocabulary selection and baseline embedding initialization by ~20 points for several underrepresented languages.