bert

Tag

Cards List
#bert

Shortcut Solutions Learned by Transformers Impair Continual Compositional Reasoning

arXiv cs.LG · 2026-05-08 Cached

This research paper investigates how shortcut solutions learned by Transformer models, specifically BERT, impair their ability to perform continual compositional reasoning. It contrasts BERT with ALBERT, finding that ALBERT's recurrent nature offers better inductive bias for continual learning tasks.

0 favorites 0 likes
#bert

I trained a NER model on 33,000 Indian Supreme Court judgments (1950–2024) CASE_CITATION hits 97.76% F1, +17 points over the only prior baseline [P]

Reddit r/MachineLearning · 2026-05-07

Released en_legal_ner_ind_trf v0.1, an InLegalBERT model fine-tuned on 33,000 Indian Supreme Court judgments, achieving a 97.76% F1 score on case citations and significantly outperforming previous baselines.

0 favorites 0 likes
#bert

Foundational Study on Authorship Attribution of Japanese Web Reviews for Actor Analysis

arXiv cs.CL · 2026-04-21

A foundational study on applying stylometric authorship attribution to threat intelligence, using Japanese Rakuten reviews to compare TF-IDF+LR, BERT embedding, BERT fine-tuning, and metric learning methods. BERT-FT performed best overall, but TF-IDF+LR proved more stable and efficient when scaling to hundreds of authors.

0 favorites 0 likes
#bert

The Prose of Proteins - A Lesson in Taste and Vision through the Work of Brian Hie

ML at Berkeley · 2024-04-11

This article profiles researcher Brian Hie, highlighting how his unique background in literature and computer science informed the development of ESM, a BERT-like model for protein sequences.

0 favorites 0 likes
← Back to home

Submit Feedback