bert

#bert

The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP

arXiv cs.CL ↗ · yesterday Cached

This paper introduces ChristBERT, a family of domain-specific RoBERTa-based language models for German clinical NLP, and evaluates three domain adaptation strategies (continued pre-training, pre-training from scratch, and vocabulary adaptation) on medical named entity recognition and text classification tasks, achieving state-of-the-art results.

0 favorites 0 likes

#bert

A Fine-Tuned BERT Classifier for Personal-Letter Titles in Late-Ming and Early-Qing Collected Works

arXiv cs.CL ↗ · 2026-05-25 Cached

This paper presents Lepton, a fine-tuned BERT classifier that predicts whether a title in Classical Chinese wenji table-of-contents is a personal letter or a preface, leveraging 5,438 hand-labeled titles from late-Ming and early-Qing literati.

0 favorites 0 likes

#bert

Leveraging Large Language Models for Sentiment Analysis: Multi-Modal Analysis of Decentraland's MANA Token

arXiv cs.CL ↗ · 2026-05-21 Cached

This paper uses a BERT-based large language model for sentiment analysis of Decentraland's Discord community to enhance MANA token price prediction, demonstrating that a multi-modal LSTM incorporating sentiment, trading volume, and market capitalization outperforms a price-only baseline.

0 favorites 0 likes

#bert

Shortcut Solutions Learned by Transformers Impair Continual Compositional Reasoning

arXiv cs.LG ↗ · 2026-05-08 Cached

This research paper investigates how shortcut solutions learned by Transformer models, specifically BERT, impair their ability to perform continual compositional reasoning. It contrasts BERT with ALBERT, finding that ALBERT's recurrent nature offers better inductive bias for continual learning tasks.

0 favorites 0 likes

#bert

I trained a NER model on 33,000 Indian Supreme Court judgments (1950–2024) CASE_CITATION hits 97.76% F1, +17 points over the only prior baseline [P]

Reddit r/MachineLearning ↗ · 2026-05-07

Released en_legal_ner_ind_trf v0.1, an InLegalBERT model fine-tuned on 33,000 Indian Supreme Court judgments, achieving a 97.76% F1 score on case citations and significantly outperforming previous baselines.

0 favorites 0 likes

#bert

Foundational Study on Authorship Attribution of Japanese Web Reviews for Actor Analysis

arXiv cs.CL ↗ · 2026-04-21

A foundational study on applying stylometric authorship attribution to threat intelligence, using Japanese Rakuten reviews to compare TF-IDF+LR, BERT embedding, BERT fine-tuning, and metric learning methods. BERT-FT performed best overall, but TF-IDF+LR proved more stable and efficient when scaling to hundreds of authors.

0 favorites 0 likes

#bert

The Prose of Proteins - A Lesson in Taste and Vision through the Work of Brian Hie

ML at Berkeley ↗ · 2024-04-11

This article profiles researcher Brian Hie, highlighting how his unique background in literature and computer science informed the development of ESM, a BERT-like model for protein sequences.

0 favorites 0 likes

bert

The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP

A Fine-Tuned BERT Classifier for Personal-Letter Titles in Late-Ming and Early-Qing Collected Works

Leveraging Large Language Models for Sentiment Analysis: Multi-Modal Analysis of Decentraland's MANA Token

Shortcut Solutions Learned by Transformers Impair Continual Compositional Reasoning

I trained a NER model on 33,000 Indian Supreme Court judgments (1950–2024) CASE_CITATION hits 97.76% F1, +17 points over the only prior baseline [P]

Foundational Study on Authorship Attribution of Japanese Web Reviews for Actor Analysis

The Prose of Proteins - A Lesson in Taste and Vision through the Work of Brian Hie

Submit Feedback