named-entity-recognition

#named-entity-recognition

LC-ICL: Label-Guided Contrastive In-Context Learning for Robust Information Extraction

arXiv cs.CL ↗ · 2d ago Cached

This paper proposes LC-ICL, a novel few-shot technique that uses both correct and incorrect examples with error-cause labels to improve large language models' performance on information extraction tasks like named entity recognition and relation extraction.

0 favorites 0 likes

#named-entity-recognition

A Study of Temporal Fusion Strategies for Named Entity Recognition in Historical Texts

arXiv cs.CL ↗ · 3d ago Cached

This paper systematically studies how temporal metadata can be structurally embedded into named entity recognition (NER) models for historical texts. Experiments with absolute and relative temporal representations injected via early or late fusion mechanisms show that late fusion strategies yield more robust performance on French and German historical datasets.

0 favorites 0 likes

#named-entity-recognition

Overview of HIPE-2026: Person-Place Relation Extraction from Multilingual Historical Texts

arXiv cs.CL ↗ · 2026-06-25 Cached

This paper presents the results of HIPE-2026, the third edition of the HIPE evaluation series, which focuses on temporally grounded person-place relation extraction from multilingual historical documents in French, German, and English. Seventeen participating teams were evaluated on predictive accuracy, computational efficiency, and cross-domain generalization.

0 favorites 0 likes

#named-entity-recognition

AAbAAC: An Annotated Corpus for Autoimmunity Information Extraction

arXiv cs.AI ↗ · 2026-06-12 Cached

AAbAAC is a manually annotated corpus of 115 PubMed abstracts for autoimmunity information extraction, focusing on entities like autoimmune diseases and autoantibodies. The study demonstrates improved NER performance after fine-tuning on this corpus.

0 favorites 0 likes

#named-entity-recognition

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

arXiv cs.AI ↗ · 2026-06-10 Cached

This paper investigates instruction finetuning of DeepSeek-R1-8B using LoRA and NEFTune for financial named-entity recognition, achieving a micro-F1 of 0.912 and outperforming several baseline models.

0 favorites 0 likes

#named-entity-recognition

SMADE-IE: Sparse Multi-Agent Framework with Evidence-Driven Debate for Zero-Shot Information Extraction

arXiv cs.CL ↗ · 2026-06-04 Cached

SMADE-IE is a sparse multi-agent framework for zero-shot information extraction that uses an Adaptive Mode Selector and Evidence-Driven Debate mechanism with Toulmin-style argumentation and Bayesian updates to outperform existing baselines on 9 benchmarks across NER, RE, and JERE tasks while improving token efficiency.

0 favorites 0 likes

#named-entity-recognition

The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP

arXiv cs.CL ↗ · 2026-06-03 Cached

This paper introduces ChristBERT, a family of domain-specific RoBERTa-based language models for German clinical NLP, and evaluates three domain adaptation strategies (continued pre-training, pre-training from scratch, and vocabulary adaptation) on medical named entity recognition and text classification tasks, achieving state-of-the-art results.

0 favorites 0 likes

#named-entity-recognition

Beyond Agreement: Scoring Panel-Surfaced Biomedical Entity Candidates for Curator Triage

arXiv cs.CL ↗ · 2026-06-01 Cached

This paper introduces BioConCal, a supervised scorer that uses inference-time panel and candidate features to rank biomedical entity candidates surfaced by LLM panels, significantly improving over raw agreement for curator triage.

0 favorites 0 likes

#named-entity-recognition

Specialty-Specific Medical Language Model for Immune-Mediated Diseases

arXiv cs.CL ↗ · 2026-05-29 Cached

This paper presents a specialty-specific medical language model for extracting information from clinical narratives about immune-mediated and infectious diseases, using a BiLSTM-CNN-Char architecture trained on a curated corpus of 371 case reports, achieving an F1 score of 0.89.

0 favorites 0 likes

#named-entity-recognition

LELA: An End-to-end LLM-based Entity Linking Framework with Zero-shot Domain Adaptation

arXiv cs.AI ↗ · 2026-05-27 Cached

LELA is an LLM-based entity linking framework that combines zero-shot NER and entity disambiguation into an end-to-end Python library, validated across diverse settings.

0 favorites 0 likes

#named-entity-recognition

Curation and Extraction of Drug-Related Entities from Reddit Platform

arXiv cs.CL ↗ · 2026-05-27 Cached

Introduces ReDose, a dataset of 6,435 Reddit posts annotated for drug, dose, and effect entities, and benchmarks various models including BiomedBERT, Llama-3 70B, and GPT-4 for extraction.

0 favorites 0 likes

#named-entity-recognition

Shakespeare’s World – I thought this would be simple but

Hacker News Top ↗ · 2026-05-26 Cached

The author used spaCy NER and Claude to extract place names from Shakespeare's works, then built an interactive map with MapLibre, OpenCage, and Stadia Maps, filtering places by play and displaying quotes.

0 favorites 0 likes

#named-entity-recognition

UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing

arXiv cs.LG ↗ · 2026-05-20 Cached

UCCI proposes a calibration-first router for LLM cascades that uses isotonic regression to map token-level margin uncertainty to error probability, achieving a 31% cost reduction on a production NER workload while maintaining micro-F1=0.91 and reducing expected calibration error from 0.12 to 0.03.

0 favorites 0 likes

#named-entity-recognition

Concordance Comparison as a Means of Assembling Local Grammars

arXiv cs.CL ↗ · 2026-05-13 Cached

This paper presents a method for comparing concordances of local grammars to optimize Named Entity Recognition for person names in Portuguese, achieving improved F-measure scores on the HAREM dataset.

0 favorites 0 likes

#named-entity-recognition

I trained a NER model on 33,000 Indian Supreme Court judgments (1950–2024) CASE_CITATION hits 97.76% F1, +17 points over the only prior baseline [P]

Reddit r/MachineLearning ↗ · 2026-05-07

Released en_legal_ner_ind_trf v0.1, an InLegalBERT model fine-tuned on 33,000 Indian Supreme Court judgments, achieving a 97.76% F1 score on case citations and significantly outperforming previous baselines.

0 favorites 0 likes

#named-entity-recognition

A Mechanism and Optimization Study on the Impact of Information Density on User-Generated Content Named Entity Recognition

arXiv cs.CL ↗ · 2026-04-22 Cached

ArXiv preprint identifies low information density as the root cause of NER performance collapse on noisy user-generated content and introduces the Window-Aware Optimization Module (WOM) that boosts F1 by up to 4.5% on WNUT2017.

0 favorites 0 likes

#named-entity-recognition

DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition

arXiv cs.CL ↗ · 2026-04-20 Cached

DiZiNER is a framework that uses disagreement between multiple LLMs to refine task instructions for zero-shot named entity recognition, achieving state-of-the-art results on 14 out of 18 benchmarks and significantly reducing the performance gap between zero-shot and supervised systems.

0 favorites 0 likes

#named-entity-recognition

@mayhewsw: New paper: I know the hotness is all in 10xing compute scale, and telling things to think step by step with tool use, b…

X AI KOLs Following ↗ · 2026-04-19

Authors release Universal NER v2, a named-entity recognition paper presented at LREC 2026 that deliberately eschews modern scaling and tool-use trends.

0 favorites 0 likes

named-entity-recognition

Submit Feedback