Computational conceptual history of scientific concepts: From early digital methods to LLMs

arXiv cs.CL Papers

Summary

This paper situates large language models within the broader history of computational approaches to concept analysis in the history, philosophy, and sociology of science (HPSS), reviewing methodological challenges and LLM-based case studies for lexical semantic change detection. It covers corpus construction, operationalization, and evaluation across both pre-LLM and LLM-era workflows.

arXiv:2606.04118v1 Announce Type: new Abstract: This article situates large language models (LLMs) within the longer history of computational approaches to concept analysis in the history, philosophy, and sociology of science (HPSS). We examine what LLMs add to existing methods, how they inherit longstanding problems, and review recent case studies that employ them. In the first part, we reconstruct computational conceptual history before LLMs by bringing together three strands of work: early digital methods in HPSS, distributional approaches from digital history and related research, and lexical semantic change detection. We provide an overview of the main challenges and opportunities, focusing on corpus construction, operationalization and modelling choices, and evaluation and interpretation. In the second part, we turn to the era of LLMs, starting with a short introduction to LLMs before reviewing LLM-based work on lexical semantic change detection and relevant case studies in HPSS. We then revisit the earlier methodological questions, showing how issues of corpus construction, model choice and training data, operationalization trade-offs, and evaluation and interpretation play out in LLM-based workflows.
Original Article
View Cached Full Text

Cached at: 06/05/26, 02:12 AM

# Computational conceptual history of scientific concepts: From early digital methods to LLMs
Source: [https://arxiv.org/abs/2606.04118](https://arxiv.org/abs/2606.04118)
[View PDF](https://arxiv.org/pdf/2606.04118)

> Abstract:This article situates large language models \(LLMs\) within the longer history of computational approaches to concept analysis in the history, philosophy, and sociology of science \(HPSS\)\. We examine what LLMs add to existing methods, how they inherit longstanding problems, and review recent case studies that employ them\. In the first part, we reconstruct computational conceptual history before LLMs by bringing together three strands of work: early digital methods in HPSS, distributional approaches from digital history and related research, and lexical semantic change detection\. We provide an overview of the main challenges and opportunities, focusing on corpus construction, operationalization and modelling choices, and evaluation and interpretation\. In the second part, we turn to the era of LLMs, starting with a short introduction to LLMs before reviewing LLM\-based work on lexical semantic change detection and relevant case studies in HPSS\. We then revisit the earlier methodological questions, showing how issues of corpus construction, model choice and training data, operationalization trade\-offs, and evaluation and interpretation play out in LLM\-based workflows\.

## Submission history

From: Michael Zichert \[[view email](https://arxiv.org/show-email/a4714919/2606.04118)\] **\[v1\]**Tue, 2 Jun 2026 18:28:29 UTC \(275 KB\)

Similar Articles

LLM Wiki v2 (16 minute read)

TLDR AI

This post presents a pattern for building personal knowledge bases using LLMs, offering a structured approach for leveraging large language models in knowledge management.

Learning to reason with LLMs

OpenAI Blog

OpenAI publishes an article exploring reasoning techniques with LLMs through cipher-decoding examples, demonstrating step-by-step problem-solving approaches and pattern recognition in language models.