Computational conceptual history of scientific concepts: From early digital methods to LLMs
Summary
This paper situates large language models within the broader history of computational approaches to concept analysis in the history, philosophy, and sociology of science (HPSS), reviewing methodological challenges and LLM-based case studies for lexical semantic change detection. It covers corpus construction, operationalization, and evaluation across both pre-LLM and LLM-era workflows.
View Cached Full Text
Cached at: 06/05/26, 02:12 AM
# Computational conceptual history of scientific concepts: From early digital methods to LLMs Source: [https://arxiv.org/abs/2606.04118](https://arxiv.org/abs/2606.04118) [View PDF](https://arxiv.org/pdf/2606.04118) > Abstract:This article situates large language models \(LLMs\) within the longer history of computational approaches to concept analysis in the history, philosophy, and sociology of science \(HPSS\)\. We examine what LLMs add to existing methods, how they inherit longstanding problems, and review recent case studies that employ them\. In the first part, we reconstruct computational conceptual history before LLMs by bringing together three strands of work: early digital methods in HPSS, distributional approaches from digital history and related research, and lexical semantic change detection\. We provide an overview of the main challenges and opportunities, focusing on corpus construction, operationalization and modelling choices, and evaluation and interpretation\. In the second part, we turn to the era of LLMs, starting with a short introduction to LLMs before reviewing LLM\-based work on lexical semantic change detection and relevant case studies in HPSS\. We then revisit the earlier methodological questions, showing how issues of corpus construction, model choice and training data, operationalization trade\-offs, and evaluation and interpretation play out in LLM\-based workflows\. ## Submission history From: Michael Zichert \[[view email](https://arxiv.org/show-email/a4714919/2606.04118)\] **\[v1\]**Tue, 2 Jun 2026 18:28:29 UTC \(275 KB\)
Similar Articles
LLMs with in-context learning for Algorithmic Theoretical Physics
This paper investigates using Large Language Models, specifically Claude, interfaced with a Computer Algebra System (Maple) to perform algorithmic computations in theoretical physics, such as analyzing cosmological perturbations.
What are They Thinking? Delineation, Probing and Tracking of Concepts in LLMs
This paper presents a methodology for delineating concepts and training linear probes to detect them in LLM embeddings, using four example concepts across three models. The work aims to enable scalable monitoring of LLM internal representations.
Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms
This paper investigates how large language models perform arithmetic operations by analyzing internal mechanisms through early decoding, revealing that proficient models exhibit a clear division of labor between attention and MLP modules in reasoning tasks.
LLM Wiki v2 (16 minute read)
This post presents a pattern for building personal knowledge bases using LLMs, offering a structured approach for leveraging large language models in knowledge management.
Learning to reason with LLMs
OpenAI publishes an article exploring reasoning techniques with LLMs through cipher-decoding examples, demonstrating step-by-step problem-solving approaches and pattern recognition in language models.