Position: Hippocampal Explicit Memory Is the Cornerstone for AGI

arXiv cs.AI Papers

Summary

This position paper argues that integrating explicit memory, analogous to human hippocampal memory, is essential for advancing LLMs toward AGI. It draws on neuroscience to propose that higher-order cognitive functions require explicit memory beyond implicit statistical learning.

arXiv:2606.11245v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks, raising expectations for Artificial General Intelligence (AGI). This position paper argues that integrating explicit memory is the cornerstone for advancing LLMs toward AGI. The key reason is that the underlying learning mechanism of LLMs is highly analogous to human implicit memory. However, higher-order cognitive functions necessary for AGI, such as long-term strategic planning, metacognition, and symbolic reasoning, heavily rely on hippocampal explicit memory and cannot arise solely from implicit statistical learning. Drawing on findings from neuroscience, I advance this perspective and complement it with computational requirements for artificial explicit memory systems, hoping to foster further research and lay the groundwork for explicit memory integration.
Original Article
View Cached Full Text

Cached at: 06/11/26, 01:46 PM

# Position: Hippocampal Explicit Memory Is the Cornerstone for AGI
Source: [https://arxiv.org/html/2606.11245](https://arxiv.org/html/2606.11245)
###### Abstract

Large Language Models \(LLMs\) have demonstrated remarkable capabilities across various tasks, raising expectations for Artificial General Intelligence \(AGI\)\. This position paper argues that integrating explicit memory is the cornerstone for advancing LLMs toward AGI\. The key reason is that the underlying learning mechanism of LLMs is highly analogous to human implicit memory\. However, higher\-order cognitive functions necessary for AGI, such as long\-term strategic planning, metacognition, and symbolic reasoning, heavily rely on hippocampal explicit memory and cannot arise solely from implicit statistical learning\. Drawing on findings from neuroscience, I advance this perspective and complement it with computational requirements for artificial explicit memory systems, hoping to foster further research and lay the groundwork for explicit memory integration\.

Machine Learning, ICML, Hippocampal Memory, Explicit Memory, Human\-Level AI, AGI, LLMs, NeuroAI

## 1Introduction

Large Language Models \(LLMs\) have recently achieved remarkable success in the field of natural language processing\(Comaniciet al\.,[2025](https://arxiv.org/html/2606.11245#bib.bib134); Singhet al\.,[2026](https://arxiv.org/html/2606.11245#bib.bib133)\), opening new possibilities in artificial intelligence\(Grattafioriet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib3); Yanget al\.,[2025](https://arxiv.org/html/2606.11245#bib.bib7); DeepSeek\-AIet al\.,[2025](https://arxiv.org/html/2606.11245#bib.bib25)\)\. Trained on vast amounts of text data, these models comprehend context and demonstrate near\-human performance in tasks such as writing, question\-answering, code generation, and conversational assistance\(Weiet al\.,[2022](https://arxiv.org/html/2606.11245#bib.bib48); Srivastavaet al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib2); Lianget al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib80)\)\. Notably, advanced models like GPT\-5, utilizing hundreds of billions of parameters, can solve complex problems and have practical applications across various domains\(Heet al\.,[2025](https://arxiv.org/html/2606.11245#bib.bib53); Chakrabartyet al\.,[2022](https://arxiv.org/html/2606.11245#bib.bib18); Colomboet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib81)\)\. Some researchers even argue that LLMs can be considered the initial stages of Artificial General Intelligence \(AGI\), due to their ability to understand and address intricate issues\(Bubecket al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib16)\)\.

However, despite their impressive performance, LLMs still face significant challenges such as hallucination, difficulties in planning, and limitations in logical reasoning\(Gendronet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib37); Yaoet al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib124); Tianet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib106); Songet al\.,[2025](https://arxiv.org/html/2606.11245#bib.bib96); Valmeekamet al\.,[2022](https://arxiv.org/html/2606.11245#bib.bib114)\)\. Since LLMs cannot store and use information dynamically over long periods, memory has always been considered a notable weakness\(Goertzel,[2023](https://arxiv.org/html/2606.11245#bib.bib12); Fenget al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib103); Zhonget al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib132); Morriset al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib71)\)\. I regard memory as the key point for addressing these issues and also for developing higher\-order capabilities essential for AGI\(Collinset al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib21)\), such as dynamic learning, reflection, and metacognition\. While there are various discussions regarding the definition of AGI, I consider Human\-Level AI to be the defining criterion for AGI: an AI system with general, human\-level ability to learn, reason, and apply knowledge across all cognitive tasks and domains\. Although memory is commonly perceived as merely information storage, in reality, it is an integral component of all learning processes in humans\(Atkinson and Shiffrin,[1968](https://arxiv.org/html/2606.11245#bib.bib82); Anderson,[1982](https://arxiv.org/html/2606.11245#bib.bib9)\)\. Constraints on memory are directly linked to limitations in cognitive ability, highlighting its pivotal role in the evolution of artificial intelligence\.

In this paper, I argue that the fundamental learning mechanism of LLMs can be compared to learning based on implicit memory in biological systems\. I also suggest that the cognitive functions of LLMs are restricted to implicit learning\. Based on this perspective,I claim that realizing AGI should involve the integration of explicit memory\.

![Refer to caption](https://arxiv.org/html/2606.11245v1/x1.png)

Figure 1:Comparison of three memory and learning paradigms\.\(1\)Human implicit memory forms through repeated practice and produces automatic, spontaneous responses \(e\.g\., predicting the next note in a rehearsed melody\)\.\(2\)Human explicit memory supports conscious, knowledge\-based problem solving:17×617\\times 6is decomposed via number concepts, the definition of multiplication, and arithmetic facts to explicitly compute102102\.\(3\)An LLM, given the same prompt, emits a reasoning\-like trace ending in102102\. Although the output looks like explicit reasoning, the underlying process is closer to operating an abacus: the correct answer can be produced automatically by just following learned manipulation rules \(rule indices are illustrative\), with no semantic understanding of the numbers or the procedure\.To support this claim, the remainder of this paper is organized as follows\.[Section2](https://arxiv.org/html/2606.11245#S2)reviews the fundamental concepts and mechanisms of human explicit and implicit memory systems\.[Section3](https://arxiv.org/html/2606.11245#S3)analyzes the underlying learning processes of current LLMs, illustrating their structural and functional resemblance to implicit memory\.[Section4](https://arxiv.org/html/2606.11245#S4)explores the distinct cognitive functions supported by each memory system, to clarify the operational boundaries of each memory\. Building upon these insights,[Section5](https://arxiv.org/html/2606.11245#S5)details the specific limitations of current LLMs and highlights why explicit memory is essential for higher\-order cognition\.[Section6](https://arxiv.org/html/2606.11245#S6)then formally defines the computational requirements for developing an artificial explicit memory system\.[Section7](https://arxiv.org/html/2606.11245#S7)examines alternative perspectives,[Section8](https://arxiv.org/html/2606.11245#S8)discusses broader implications, and[Section9](https://arxiv.org/html/2606.11245#S9)concludes the paper with directions for future research\. In the Appendices,[AppendixA](https://arxiv.org/html/2606.11245#A1)reviews recent advancements in neural memory\.[AppendixB](https://arxiv.org/html/2606.11245#A2)further discusses theoretical and practical considerations such as substrate independence and causality\. Finally,[AppendixC](https://arxiv.org/html/2606.11245#A3)presents empirical examples demonstrating the absence of explicit memory in current LLMs\.

## 2Explicit and Implicit Memories

LLMs naturally possess memory, and understanding the memorization process in LLMs can be greatly enriched by drawing parallels with biological systems\. Human memory is structured to store and utilize multiple types of information\(Tulving,[1972](https://arxiv.org/html/2606.11245#bib.bib108); Baddeley and Hitch,[1974](https://arxiv.org/html/2606.11245#bib.bib5)\), enabling various cognitive functions\. This section provides the basic concepts and operation mechanisms in each memory system\. Please refer to[Figure1](https://arxiv.org/html/2606.11245#S1.F1)for illustrative examples of explicit and implicit memory systems in humans and LLMs\.

### 2\.1Basic Concepts

Explicit MemoryExplicit memory, also known as declarative memory, handles information related to facts, events, or experiences\(Tulving,[1985](https://arxiv.org/html/2606.11245#bib.bib109)\)\. Explicit memory is typically divided into two main categories: episodic memory and semantic memory\. Episodic memory preserves personal experiences tied to specific times and places\(Tulving,[2002](https://arxiv.org/html/2606.11245#bib.bib110)\), so it is closely related to autobiographical memory and is instrumental in reconstructing past experiences\(Addiset al\.,[2007](https://arxiv.org/html/2606.11245#bib.bib4)\)\. In contrast, semantic memory refers to the memory of general knowledge, concepts, language, and facts, which can be accessed without relying on specific experiential contexts\(Pattersonet al\.,[2007](https://arxiv.org/html/2606.11245#bib.bib78)\)\. Explicit memory is primarily associated with higher\-level cognitive functions, such as active learning, problem\-solving, and language comprehension\. Episodic memory, in particular, is linked to executive functions like self\-reflective thinking and future planning\(Schacter and Addis,[2007](https://arxiv.org/html/2606.11245#bib.bib90); Schacteret al\.,[2017](https://arxiv.org/html/2606.11245#bib.bib91)\), while semantic memory is essential for knowledge\-based reasoning, such as language processing and categorization\(Binder and Desai,[2011](https://arxiv.org/html/2606.11245#bib.bib14)\)\. The hippocampus and medial temporal lobe are responsible for these processes\(Squire,[2004](https://arxiv.org/html/2606.11245#bib.bib58)\)\.

Implicit MemoryImplicit memory operates mainly by influencing performance and behavior through unconscious learning\(Schacter,[1987](https://arxiv.org/html/2606.11245#bib.bib89)\)\. It is typically divided into procedural memory, classical conditioning, and priming\. Procedural memory mainly involves motor skills and habits, such as riding a bicycle or playing a musical instrument, which are performed automatically without needing explicit recall\(Doyon and Benali,[2005](https://arxiv.org/html/2606.11245#bib.bib28)\)\. These skills are developed through repeated practice and learning, allowing them to be executed effortlessly\(Cohen and Squire,[1980](https://arxiv.org/html/2606.11245#bib.bib74)\)\. Classical conditioning forms stimulus\-stimulus associations, while habit learning forms stimulus\-response associations through extensive repetition\(Clark and Squire,[1998](https://arxiv.org/html/2606.11245#bib.bib20); Graybiel,[2008](https://arxiv.org/html/2606.11245#bib.bib42)\)\. Implicit memory is a key factor in cognitive and behavioral abilities such as habituation, automated skill performance, and emotional learning\. This type of memory is closely connected to brain structures like the amygdala, cerebellum, and basal ganglia\(Squire and Dede,[2015](https://arxiv.org/html/2606.11245#bib.bib99)\)\.

### 2\.2Formation Mechanism

Explicit MemoryThe formation of explicit memory occurs through strong synaptic strengthening in the main neural circuits of the hippocampus and medial temporal lobe when new events are encountered\(Scoville and Milner,[1957](https://arxiv.org/html/2606.11245#bib.bib94); Squire and Zola\-Morgan,[1991](https://arxiv.org/html/2606.11245#bib.bib98)\)\. The hippocampus encodes events through sparse representations that index distributed cortical patterns of experience\(Teyler and Rudy,[2007](https://arxiv.org/html/2606.11245#bib.bib104); Goodeet al\.,[2020](https://arxiv.org/html/2606.11245#bib.bib39)\)\. Specifically, the entorhinal cortex \(EC\) serves as the gateway between the neocortex and hippocampus, relaying experience into the hippocampal trisynaptic pathway \(EC→\\rightarrowdentate gyrus→\\rightarrowCA3→\\rightarrowCA1\)\. This pathway facilitates memory encoding, where each stage exhibits distinct synaptic plasticity\(Andersenet al\.,[1971](https://arxiv.org/html/2606.11245#bib.bib8)\)\.

First, in the dentate gyrus \(DG\), neural signals are sparsely encoded\(Jung and McNaughton,[1993](https://arxiv.org/html/2606.11245#bib.bib52)\), selectively activating a small subset of neurons and thereby strongly inducing long\-term potentiation \(LTP\)\. If the neurons were densely coded, a large number of neurons would be redundantly activated, leading to overlap and interference between distinct concepts and episodes\. However, sparse coding ensures that even similar information is clearly separated \(pattern separation\), allowing distinct and non\-overlapping memory representations\(Yassa and Stark,[2011](https://arxiv.org/html/2606.11245#bib.bib125); Leutgebet al\.,[2007](https://arxiv.org/html/2606.11245#bib.bib50)\)\.

Next, the CA3 region binds these sparsely active neurons through its recurrent collateral circuit, forming an autoassociative network\. Because NMDA receptor\-dependent LTP can be induced by only a few stimulations, CA3 rapidly stores the co\-active elements of an episode as a single bound pattern that can later be recovered from a partial cue\(Nakazawaet al\.,[2002](https://arxiv.org/html/2606.11245#bib.bib73); Rebolaet al\.,[2017](https://arxiv.org/html/2606.11245#bib.bib85)\)\.

Finally, the CA1 region serves as the output stage of the hippocampus\(Nakazawaet al\.,[2004](https://arxiv.org/html/2606.11245#bib.bib72)\), integrating information received from CA3 and transmitting it to the cerebral cortex and other brain areas\. By regulating the hippocampal output pathways, CA1 contributes to the comprehensive management of memory, facilitating the transition of temporary hippocampal memories into long\-lasting neocortical representations\(Malenka and Bear,[2004](https://arxiv.org/html/2606.11245#bib.bib86)\)\.

Implicit MemoryImplicit memory is primarily formed in subcortical structures such as the basal ganglia, cerebellum, and amygdala\(Alexanderet al\.,[1986](https://arxiv.org/html/2606.11245#bib.bib6); Conway,[2020](https://arxiv.org/html/2606.11245#bib.bib22)\)\. Among these, the basal ganglia are crucial in decision\-making for behavior through the cortico\-basal ganglia\-thalamocortical loop\(Packard and Knowlton,[2002](https://arxiv.org/html/2606.11245#bib.bib76)\), which repeatedly receives dense neuronal inputs from the cerebral cortex\. For example, Medium Spiny Neurons \(MSNs\) in the striatum of the basal ganglia express D1 \(direct pathway\) and D2 \(indirect pathway\) dopamine receptors and respond sensitively to reward prediction and dopamine signals\(Hikosaka and Isoda,[2010](https://arxiv.org/html/2606.11245#bib.bib46); Gerfen and Surmeier,[2011](https://arxiv.org/html/2606.11245#bib.bib38)\)\.

The main stages of learning occur as follows:

1. 1\.Prediction: Anticipating a reward based on performing a specific behavior in the current context\.
2. 2\.Action: When the behavior is performed, signals from the cortex are densely transmitted to the striatal MSNs\.
3. 3\.Outcome Evaluation: Assessing whether the behavior resulted in a reward, failure, or punishment\.
4. 4\.Synaptic Changes: Strengthening the direct pathway if the reward is greater than expected, strengthening the indirect pathway otherwise\.

Since dopamine\-driven synaptic strengthening or weakening happens after behavior, a one\-time result typically does not cause significant synaptic change\(Schultzet al\.,[1997](https://arxiv.org/html/2606.11245#bib.bib121)\)\. Repeated trials are necessary to solidify the connection between a specific situation and the corresponding behavior, creating a stimulus\-action mapping\. Once a behavior becomes part of procedural memory or habit, it is performed or inhibited automatically when the same situation occurs, demonstrating implicit learning\. Overall, implicit memory develops gradually over time, requiring numerous repetitions and trial\-and\-error\(Montagueet al\.,[2004](https://arxiv.org/html/2606.11245#bib.bib70)\)\.

ComparisonExplicit memory activates a selective set of neurons \(sparse coding\) to create clear memories\. In contrast, implicit memory relies on behavior and outcomes, gradually recruiting many neurons \(dense coding\) that operate simultaneously\. Explicit memory forms rapidly and powerfully, whereas implicit memory develops through long\-term repetition, marking a key difference between the two types of memory\(Squire and Zola,[1996](https://arxiv.org/html/2606.11245#bib.bib59); Reber,[2013](https://arxiv.org/html/2606.11245#bib.bib84)\)\.

### 2\.3Retrieval Mechanism

Explicit MemoryExplicit memory retrieval centers on the hippocampal\-cortical loop, where contextual cues are essential\(McClellandet al\.,[1995](https://arxiv.org/html/2606.11245#bib.bib66); Wiltgenet al\.,[2010](https://arxiv.org/html/2606.11245#bib.bib120); Kumaranet al\.,[2016](https://arxiv.org/html/2606.11245#bib.bib27)\)\. For example, when presented with the word “summer”, this cue is transmitted through the entorhinal cortex \(EC\) to activate the CA3 region\. The CA3 region performs pattern completion through its recurrent circuitry, which can reconstruct an entire stored memory pattern even when only a few neurons are activated\(Nakazawaet al\.,[2002](https://arxiv.org/html/2606.11245#bib.bib73); Rolls,[2013](https://arxiv.org/html/2606.11245#bib.bib87)\)\. For instance, the word “summer” might activate neurons associated with sensory elements like the beach, sand, the sound of waves, and the smell of the breeze\.

This reconstructed memory pattern is then relayed back to the broader cortex\(Eichenbaum,[2000](https://arxiv.org/html/2606.11245#bib.bib29); Takashimaet al\.,[2006](https://arxiv.org/html/2606.11245#bib.bib1)\)\. Passing through the CA1 region, the pattern interacts bidirectionally with various areas of the cerebral cortex, such as the prefrontal cortex, sensory cortex, and association cortex, synchronizing different elements of the memory\(Frankland and Bontempi,[2005](https://arxiv.org/html/2606.11245#bib.bib34); Mattar and Daw,[2018](https://arxiv.org/html/2606.11245#bib.bib64)\)\.

In summary, the process of retrieving explicit memory can be outlined as follows:

1. 1\.A contextual cue or intention initiates the hippocampal\-cortical loop\.
2. 2\.Signals from the EC activate specific neurons in the CA3 region\.
3. 3\.The CA3 region’s recurrent circuitry reconstructs the entire memory pattern based on the activated neurons\.
4. 4\.The reconstructed pattern is relayed through the CA1 region to the cerebral cortex, synchronizing the elements of the memory\.
5. 5\.Top\-down attention from the prefrontal cortex modulates the retrieval process, enabling coherent and integrated memory to emerge consciously\.

This process allows rich and multi\-layered memories to be replayed from a single cue or piece of information\. Through this process, the hippocampus reactivates the distributed neural patterns of a past episode, effectively recreating the entire context of the original experience\.

Implicit MemoryImplicit memory retrieval happens automatically\(Graybiel,[2008](https://arxiv.org/html/2606.11245#bib.bib42); Hikosaka and Isoda,[2010](https://arxiv.org/html/2606.11245#bib.bib46)\)\. When a familiar stimulus appears, stored stimulus\-response patterns in the striatum are activated via MSNs\(Yin and Knowlton,[2006](https://arxiv.org/html/2606.11245#bib.bib126)\)\. For example, if you suddenly need to stop while driving, your foot instinctively presses the brake\. This occurs because cortical signals reach the striatum, exciting MSNs previously associated with that action\(Kreitzer and Malenka,[2008](https://arxiv.org/html/2606.11245#bib.bib56)\)\. The MSNs then facilitate action selection through either the direct \(D1\) or indirect \(D2\) pathway\. The direct pathway enables the action, while the indirect pathway suppresses certain actions\(Tritsch and Sabatini,[2012](https://arxiv.org/html/2606.11245#bib.bib107)\)\. In short, implicit memory retrieval reactivates learned behaviors automatically in response to familiar cues\.

ComparisonExplicit and implicit memory retrieval operate through distinct mechanisms tailored to their unique functions\. In explicit memory retrieval, sparse coding plays a key role in pattern completion\. A small number of neurons, activated by a cue, trigger the reconstruction of the entire memory through highly selective connections\. This process allows the memory system to retrieve detailed and multi\-faceted experiences by filling in missing details and linking related sensory and contextual elements into a coherent whole\. Implicit memory, on the other hand, operates via learned stimulus\-response patterns\. When a familiar stimulus is encountered, the system activates the corresponding response pattern automatically, allowing previously learned behaviors or routines to be executed efficiently\. Unlike explicit memory, implicit retrieval does not reconstruct a broader memory context but instead focuses on directly accessing and applying specific learned associations\.

## 3Nature of Learning in LLMs

This section examines the learning and reasoning processes of LLMs, focusing on how their low\-level mechanisms relate to explicit and implicit memory systems\. In doing so, it reveals key distinctions from hippocampal explicit memory and indicates that the core characteristics of LLMs more closely resemble implicit learning and habituation\.

Gradual LearningExplicit memory can encode new episodes vividly with just one exposure when strong stimuli are present or attention is engaged\. In contrast, implicit memory relies on gradually accumulating associations between stimuli, actions, and rewards \(or punishments\)\. For instance, repeated input projection strengthens or weakens synapses incrementally, guided by dopamine signals reflecting reward prediction errors\. Similarly, LLM training follows a comparable pattern\. Model parameters do not change drastically with one or two examples\. Instead, the model adjusts its weights cumulatively by repeatedly observing numerous examples within a vast corpus, aiming to reduce errors \(loss\)\. This gradual and repetitive learning mirrors the characteristics of implicit memory formation\.

Dense and Distributed CodingExplicit memory is characterized by sparse coding to ensure pattern separation\. In contrast, subcortical circuits receive dense signals from the cortex\. LLMs exhibit a similar structure, with their network broadly interconnected\. Their activations for specific contexts are not confined to isolated nodes but occur across a wide range of parameters\. These features of distributed coding and large\-scale parallel activations are closely aligned with the implicit processing of the basal ganglia\.

Error\-Driven LearningImplicit learning involves dopamine neurons strengthening or weakening synapses based on reward prediction errors\. In contrast, learning in explicit memory relies on the associations of events or pieces of information, rather than on an error\-based mechanism\. The hippocampus rapidly encodes new events, combining elements presented together in time and space into a single episode\. For LLMs, the loss function works in a similar way to dopamine signals, adjusting model parameters through backpropagation based on the difference between the actual output and the ground truth across large datasets\. Thus, the error minimization mechanism based on prediction errors is a key shared feature of implicit memory and LLMs, distinguishing them from hippocampal learning\. This distinction suggests that the underlying learning mechanism of LLMs is closely aligned with implicit learning\.

Automatic ActionExplicit memory retrieval through the hippocampus can reconstruct past episodes richly from just one contextual cue\. In contrast, implicit memory retrieval in the striatum triggers automatic procedural responses without reconstructing any past episodes\. In the case of LLMs, when specific prompts \(contexts\) are provided, the process does not involve reconstructing an entire episode like the hippocampus\. Instead, LLMs follow a procedure where their learned parameters automatically determine the optimal next word\. This resembles how humans unconsciously execute habits or conditioned responses based on implicit memory, suggesting that LLM retrieval mechanisms are based on the stimulus\-response mapping of the basal ganglia\.

Fundamental SimilaritiesIn summary, the low\-level learning mechanism of LLMs is closer to implicit memory systems, specifically habituation and procedural learning, than to explicit memory processes\. This perspective provides valuable insights into understanding LLMs as systems that require gradual and repetitive training, and exhibit the automatic retrieval characteristics of human implicit learning\.

## 4Functional Specificity of Memory

Explicit and implicit memory serve distinct cognitive roles and are not easily interchangeable\. This section explores the unique cognitive abilities supported by each memory system\. By examining these differences, I indirectly reveal the cognitive functions that LLMs, which parallel implicit memory, are capable of possessing and those they cannot\.

Pattern RecognitionPattern recognition is the ability of humans to identify specific rules or consistencies in a complex environment\. For instance, people can quickly analyze facial features like eyes, nose, and mouth to identify a specific person\. In language comprehension, even when spelling is incorrect or sentences are incomplete, humans can capture the meaning by recognizing patterns in words and context\.

Research on statistical learning demonstrates that human pattern recognition and perception are deeply rooted in the implicit memory system\.Fiser and Aslin \([2002a](https://arxiv.org/html/2606.11245#bib.bib31),[b](https://arxiv.org/html/2606.11245#bib.bib32)\)showed that infants have the ability to automatically learn statistical patterns of co\-occurrences and conditional probabilities between visual objects\.Turk\-Browneet al\.\([2005](https://arxiv.org/html/2606.11245#bib.bib111)\)revealed that visual statistical learning is automatic and implicit, despite being gated by selective attention\. Additionally,Batterink and Paller \([2017](https://arxiv.org/html/2606.11245#bib.bib60)\)used EEG measurements to track brain activity related to implicit learning while participants detected patterns in word sequences\. Ultimately, pattern recognition is an automated information processing function based on implicit memory, which is consistently supported by historical theory and modern experimental evidence\.

Language AbilityThe interaction between explicit memory and implicit memory in language processing has been demonstrated through various studies, emphasizing that fluent communication heavily relies on implicit memory\(Graf and Schacter,[1985](https://arxiv.org/html/2606.11245#bib.bib41)\)\. Ullman’s Declarative/Procedural model\(Ullman,[2004](https://arxiv.org/html/2606.11245#bib.bib113)\)posits that the mental lexicon, which stores word\-specific knowledge, depends on explicit memory, while implicit memory plays a key role in the automatic application of grammatical rules\. Research on patients with hippocampal damage\(Vargha\-Khademet al\.,[1997](https://arxiv.org/html/2606.11245#bib.bib115); Schmolcket al\.,[2002](https://arxiv.org/html/2606.11245#bib.bib93)\)showed that even when episodic memory is impaired, semantic memory remains intact, enabling normal language abilities\. This suggests that language fluency operates independently of explicit memory\.

In contrast, studies revealed that dysfunctions in the basal ganglia disrupt grammatical processing and automated language use, highlighting the critical contribution of implicit memory in procedural language functions\(Liebermanet al\.,[1992](https://arxiv.org/html/2606.11245#bib.bib61); Ullmanet al\.,[1997](https://arxiv.org/html/2606.11245#bib.bib112)\)\. Finally,Boothet al\.\([2007](https://arxiv.org/html/2606.11245#bib.bib15)\)showed that the basal ganglia and cerebellum interact with brain regions responsible for phonological processing, supporting the refinement and automation of linguistic processes\. This underscores the essential role of implicit memory in natural and flexible communication\. Therefore, language ability can be understood as relying on a dual memory system, where explicit memory processes new information during the initial stages of learning, and repeated use transitions it into implicit memory for automation\.

Logical ReasoningMathematical and logical reasoning rely heavily on semantic memory, a subset of explicit memory, as evidenced by various neuroscience studies\.Friedrich and Friederici \([2013](https://arxiv.org/html/2606.11245#bib.bib35)\)found that the interaction between the prefrontal cortex and hippocampal systems is essential for processing the semantics of mathematical logic\.Menon \([2016](https://arxiv.org/html/2606.11245#bib.bib67)\)further explained that declarative memory is an integral part of forming associative memories, which allows for generalization beyond superficial problem attributes\. Additionally, research byEvanset al\.\([2024](https://arxiv.org/html/2606.11245#bib.bib30)\)showed that declarative memory is a key predictor of elementary school students’ mathematical skills, whereas procedural memory has little influence\. AlthoughFayol and Thevenot \([2012](https://arxiv.org/html/2606.11245#bib.bib68)\)observed that simple operations like addition and subtraction can be performed through procedural memory, the converging evidence suggests that complex mathematical and logical reasoning critically depends on explicit memory\.

Executive FunctionExecutive functions are key components of higher\-order cognitive abilities in humans\. They encompass various subdomains that help achieve goals, solve problems, and adapt to changing environments\. In particular, cognitive flexibility, planning, decision\-making, and task\-switching are critical skills that enable individuals to adapt to changes, achieve goals systematically, make optimal decisions, and efficiently transition between tasks\. Explicit memory is a crucial factor in executive functions through the retrieval of past experiences, which supports complex cognitive processes such as planning and decision\-making\.

Kleinet al\.\([2002](https://arxiv.org/html/2606.11245#bib.bib54)\)explained that explicit memory provides contextually relevant information to decision\-making systems, enabling more accurate judgments\. This role of memory is also demonstrated in multitasking studies byBurgesset al\.\([2000](https://arxiv.org/html/2606.11245#bib.bib17)\), who found that retrospective memory supports prospective memory and planning abilities, while damage to related brain regions can impair these functions\.Hassabis and Maguire \([2007](https://arxiv.org/html/2606.11245#bib.bib45)\)emphasized that the scene construction process of episodic memory is crucial for future\-oriented thinking and goal\-directed actions\.Whittingtonet al\.\([2020](https://arxiv.org/html/2606.11245#bib.bib119)\)computationally demonstrated that the hippocampal formation organizes knowledge into structured relational maps\.

On the other hand, deficits in explicit memory are closely linked to impaired executive functions\. For example,Johnset al\.\([2012](https://arxiv.org/html/2606.11245#bib.bib51)\)reported that individuals with Down syndrome who performed poorly on explicit memory tasks showed weaknesses in executive functions such as working memory and cognitive flexibility\. Lastly,Pedrazaet al\.\([2024](https://arxiv.org/html/2606.11245#bib.bib79)\)demonstrated that implicit statistical learning can compete with executive functions and even negatively impact declarative learning processes, such as goal\-directed behaviors\. In summary, explicit memory provides the foundation for executive functions by enabling the retrieval and manipulation of information\. It is essential for higher\-order processes like planning, judgment, and future\-oriented thinking\.

MetacognitionExplicit processes are central to metacognition and reflection, enabling individuals to monitor and regulate their cognitive behaviors\. The feeling of knowing \(FOK\) exemplifies this, as explored byIraket al\.\([2019](https://arxiv.org/html/2606.11245#bib.bib47)\), who linked FOK judgments to specific neural components, emphasizing the role of explicit mechanisms in predicting memory retrieval\.Nelson \([1990](https://arxiv.org/html/2606.11245#bib.bib105)\)similarly highlighted that FOK relies on explicit control and monitoring processes to resolve the paradox of predicting recognition of inaccessible items\.Fleming and Dolan \([2012](https://arxiv.org/html/2606.11245#bib.bib33)\)further tied metacognitive accuracy to the prefrontal cortex, showing that reflective judgments depend on explicit representations\. This aligns withKlein and Gangi \([2010](https://arxiv.org/html/2606.11245#bib.bib55)\)’s view of the self as a system of interrelated explicit processes, crucial for integrating episodic memory and semantic knowledge\. In nonhuman primates,Hamptonet al\.\([2020](https://arxiv.org/html/2606.11245#bib.bib43)\)provided evidence of explicit memory systems supporting metacognition, reinforcing their evolutionary significance\. These studies illustrate that explicit mechanisms are essential for reflective thought, enabling self\-awareness and adaptive decision\-making\.

Mental SimulationEpisodic memory, as a constructive and flexible system, plays a critical role in mental simulation by enabling the recombination of past experiences to envision future scenarios\(Schacter and Addis,[2007](https://arxiv.org/html/2606.11245#bib.bib90)\)\. This capacity for “mental time travel” is uniquely human and vital for adaptive planning\(Suddendorf and Corballis,[2007](https://arxiv.org/html/2606.11245#bib.bib101)\)\. Episodic memory and simulation share the default mode network, which supports scene construction and autobiographical thought, dividing into distinct subsystems handling memory retrieval and reflective social cognition\(Hassabis and Maguire,[2007](https://arxiv.org/html/2606.11245#bib.bib45); Andrews\-Hannaet al\.,[2014](https://arxiv.org/html/2606.11245#bib.bib10)\)\. Moreover, while episodic and semantic details interact dynamically, episodic simulation often demands greater cognitive control to construct vivid hypothetical scenarios\(Benoit and Schacter,[2015](https://arxiv.org/html/2606.11245#bib.bib13); Devittet al\.,[2017](https://arxiv.org/html/2606.11245#bib.bib26)\)\. Together, these findings highlight how episodic memory underpins the mental simulations that enable humans to anticipate and navigate the future\.

In conclusion, cognitive functions are closely linked to specific memory systems, and understanding these dynamics allows us to better identify the strengths and weaknesses of LLMs\. Due to their exceptional performance in implicit memory\-based processes, LLMs excel at recognizing patterns, generating fluent language, and leveraging statistical associations within large datasets\. However, their various limitations become apparent in tasks requiring explicit memory, which will be discussed in detail in the next section\.

## 5Need for Explicit Memory

The previous section examined how explicit and implicit memory systems influence distinct cognitive functions\. This section addresses the key limitations that LLMs must overcome to evolve into AGI\. It is notable that these limitations closely align with the inherent constraints of implicit memory, which lacks mechanisms for rapid and context\-rich learning\. Explicit memory, in contrast, offers a potential pathway to overcoming these challenges by enabling higher\-order cognitive abilities of humans\. Refer to[AppendixC](https://arxiv.org/html/2606.11245#A3)for actual examples showing these weaknesses\.

### 5\.1Higher\-Order Learning

Humans utilize a wider range of higher\-order learning mechanisms based on explicit memory\. First, humans are capable ofdynamic learning, continuously acquiring new knowledge over time\. In contrast, the learning process in LLMs is strictly confined to the training stage, which limits their potential to function as real\-time learners\.One\-shot learningis especially crucial for immediate adaptation of agents in settings such as narrative memory, personalized contexts, or unpredictable situations\. This instant learning is also enabled by explicit memory, through substantial synaptic weight adjustments after just a single experience\.

Furthermore,episodic learning, which combines knowledge with its source and spatiotemporal context, forms the foundation of the “feeling of knowing” and metacognition\. This type of learning grants individuals their unique self\-narratives\. Humans can also learn through semantic memory, which allows the understanding of abstract relationships found in books without requiring direct experience or links to rewards\. This capacity forabstract learningis a distinctive ability that sets humans apart from other primates\.

Finally, human explicit memory systems are built on associations between knowledge structures\. When a new concept is introduced, it is not treated as an isolated sample\. Instead, it integrates into existing semantics by forming associations with them, enablingcontinual learningby updating or expanding knowledge in a natural way\. This higher\-order learning is essential for bridging the gap between LLMs and AGI and strongly supports the need for explicit memory\.

### 5\.2Metacognition

Hallucinationhas been a well\-known and persistent issue in LLMs since their early stages\. In essence, hallucination can be attributed to a lack ofmetacognition\. Metacognition entails the ability to distinguish between what one knows and does not know, as well as to keep track of the source and context of one’s knowledge\. Since LLMs lack episodic metadata about their knowledge and learn by forming probabilistic distributions based on data, the occurrence of hallucinations is an expected byproduct\.

The problem ofconsistency, where LLMs provide different answers to the same question under subtle noise, is also inevitable due to the absence of metacognition and any grounding for the knowledge\. In contrast, humans can store their actions as memories in specific situations, reflect on the outcomes later, and engage inmental simulationsorreflections\. This ability relies on episodic memory, which is essential for cognitive abilities related to metacognition\.

### 5\.3Logical Reasoning

As mentioned earlier, simple arithmetic is often automatically mapped into human procedural memory through repeated practice\. However, whenarithmetic problemsbecome more complex and involve longer numbers, humans can solve them given enough time, while LLMs tend to struggle\. Logical reasoning, such asmathematical reasoning, causal reasoning, or deductive reasoning, heavily relies on concepts stored in semantic memory and the associations between them\. The difficulty of logical reasoning lies in the fact that the core logical concepts and their relationships are so abstract that the possible variations of corresponding low\-level representations can be practically infinite\. For example, the principle of multiplication is simple, but the specific expressions representing multiplication equations are infinite\. Such representational diversity makes statistical learning difficult\. Solving such high\-level logical problems using only statistical habituation, without relying on semantic memory, would be nearly impossible\.

### 5\.4Executive Function

Executive functions encompass a variety of sub\-cognitive functions\. Among these, regulatory abilities likeinhibitionandtask\-switchingare not prominently displayed in current LLMs, as they are not necessary without episodic memory\.Planning, while long considered a weakness of LLMs, has seen significant improvement with repeated efforts and the development of models specialized in reasoning\. Executive functions heavily rely on short\-term storage, often referred to as working memory\. Within the context window that corresponds to working memory, LLMs demonstrate strong reasoning capabilities, allowing for planning within a limited scope\. However, to reach human\-level capabilities, such as writing a 1,000\-page book or completing a year\-long project broken down into daily tasks, LLMs must maintain coherence beyond their context limits, requiring episodic memory\. If such advancements are realized, the need for regulation and task\-switching will become evident, as they will allow systematic goal\-setting and the ability to filter out unnecessary tasks while carrying out the work\.

## 6Computational Requirements for Artificial Explicit Memory System

In this section, I define an explicit memory system that can artificially replicate the aforementioned functions and specify the computational properties required for such a memory system\. The memory system is treated as a module that receives a dense embedding from upstream representational processes\. The system is defined as a functionfmemoryf\_\{\\mathrm\{memory\}\}:

fmemory\\displaystyle f\_\{\\mathrm\{memory\}\}:I→O\\displaystyle:I\\rightarrow O\(1\)I\\displaystyle I=\(E,M\)\\displaystyle=\(E,M\)O\\displaystyle O=\(Δ​M,Y\)\.\\displaystyle=\(\\Delta M,Y\)\.The input consists of the current dense embeddingE∈ℝdE\\in\\mathbb\{R\}^\{d\}and the current memory stateMM\. The output comprises the memory updateΔ​M\\Delta Mand the retrieved dense embeddingY∈ℝdY\\in\\mathbb\{R\}^\{d\}\. The internal structure ofMMis specified incrementally through the following requirements\.

Sparse IndexingThe memory system generates a sparse index from the dense input, where most elements are zero:

S\\displaystyle S=sparsify​\(E\)∈ℝn,\\displaystyle=\\mathrm\{sparsify\}\(E\)\\in\\mathbb\{R\}^\{n\},\(2\)‖S‖0\\displaystyle\\\|S\\\|\_\{0\}=∑i=1n𝟏​\[Si≠0\]≪n\.\\displaystyle=\\sum\_\{i=1\}^\{n\}\\mathbf\{1\}\[S\_\{i\}\\neq 0\]\\ll n\.LetFS=\{i∣Si≠0\}F\_\{S\}=\\\{i\\mid S\_\{i\}\\neq 0\\\}denote the set of activated dimensions\. The memory stateMMincludes a key\-value mappingP∈ℝn×dP\\in\\mathbb\{R\}^\{n\\times d\}, where each rowPi∈ℝdP\_\{i\}\\in\\mathbb\{R\}^\{d\}represents the dense embedding referenced by sparse index dimensionii\. The mappingPPbinds the sparse index to its dense source:

P⊤​S≈E\.P^\{\\top\}S\\approx E\.\(3\)This binding ensures that each active sparse dimension functions as a pointer to its corresponding dense representation\.

Error\-Independent UpdateMemory updates should not be driven by error minimization; instead,Δ​M\\Delta Mshould mainly be computed as a function ofEEandMM, with no direct dependence on predictionppor prediction erroree:

∇p,eΔ​M=𝟎\.\\nabla\_\{p,e\}\\Delta M=\\mathbf\{0\}\.\(4\)This does not mean memory must be entirely decoupled from prediction\. Predictions or perceived errors may still influence memory\. What the constraint excludes is the case where prediction error dominantly drives the update via error\-driven learning systems such as gradient descent\.

Associative ConstructionThe memory stateMMfurther includes an associative matrixA∈ℝn×nA\\in\\mathbb\{R\}^\{n\\times n\}that captures relationships among sparse indices\. Each piece of information is not stored independently but is interconnected with the existing memory\. For a given sparse indexSS, the updateΔ​A\\Delta Astrengthens the connections among all activated dimensions, following the principle of “fire together, wire together”:

Δ​Ai,j\>0,∀i,j∈FS\.\\Delta A\_\{i,j\}\>0,\\quad\\forall i,j\\in F\_\{S\}\.\(5\)
Pattern SeparationSparse indexing alone does not guarantee separation\.sparsify\\mathrm\{sparsify\}must additionally satisfy a non\-expansion property such that sparse codes are less similar than their dense inputs\. Formally, for an appropriately normalized semantic similarity measuresim\\mathrm\{sim\}:

sim​\(S1,S2\)<sim​\(E1,E2\)\.\\mathrm\{sim\}\(S\_\{1\},S\_\{2\}\)<\\mathrm\{sim\}\(E\_\{1\},E\_\{2\}\)\.\(6\)Without this property, two distinct inputsE1E\_\{1\}andE2E\_\{2\}could collapse to overlapping sparse codes, becoming indistinguishable to the associative matrixAAand undermining both pattern completion and key\-value lookup\.

Pattern CompletionAn explicit memory system can restore information from partial input through pattern completion, which proceeds in two stages\. Given a partial inputEpartialE^\{\\mathrm\{partial\}\}, the system first generates the corresponding sparse index, then performs index completion viaAAand dense reconstruction viaPP:

Spartial\\displaystyle S^\{\\mathrm\{partial\}\}=sparsify​\(Epartial\),\\displaystyle=\\mathrm\{sparsify\}\(E^\{\\mathrm\{partial\}\}\),\(7\)Sretrieved\\displaystyle S^\{\\mathrm\{retrieved\}\}=σ​\(A​Spartial\),FSpartial⊂FSretrieved,\\displaystyle=\\sigma\(A\\,S^\{\\mathrm\{partial\}\}\),\\quad F\_\{S^\{\\mathrm\{partial\}\}\}\\subset F\_\{S^\{\\mathrm\{retrieved\}\}\},Y\\displaystyle Y=P⊤​Sretrieved∈ℝd\.\\displaystyle=P^\{\\top\}S^\{\\mathrm\{retrieved\}\}\\in\\mathbb\{R\}^\{d\}\.Here,σ\\sigmadenotes any selective readout that preserves sparsity \(e\.g\., top\-kk, thresholding, or winner\-take\-all\)\. IfElearnedE^\{\\mathrm\{learned\}\}was previously stored and the partial sparse code is a subset of its full code,FSpartial⊂FSlearnedF\_\{S^\{\\mathrm\{partial\}\}\}\\subset F\_\{S^\{\\mathrm\{learned\}\}\}, thenY≈ElearnedY\\approx E^\{\\mathrm\{learned\}\}\.

DynamicityThe memory state must evolve over time, with changesΔ​Mt\\Delta M\_\{t\}at timettdepending on the inputEtE\_\{t\}and the current memory stateMt=\(At,Pt\)M\_\{t\}=\(A\_\{t\},P\_\{t\}\):

It\\displaystyle I\_\{t\}=\(Et,Mt\),\\displaystyle=\(E\_\{t\},M\_\{t\}\),\(8\)Ot\\displaystyle O\_\{t\}=\(Δ​Mt,Yt\),\\displaystyle=\(\\Delta M\_\{t\},Y\_\{t\}\),Mt\+1\\displaystyle M\_\{t\+1\}=Mt\+Δ​Mt=\(At\+Δ​At,Pt\+Δ​Pt\)\.\\displaystyle=M\_\{t\}\+\\Delta M\_\{t\}=\(A\_\{t\}\+\\Delta A\_\{t\},\\;P\_\{t\}\+\\Delta P\_\{t\}\)\.
High and Instant PlasticityExplicit memory must enable immediate storage of information after a single experience\. OnceEtE\_\{t\}is learned at timett, the system must have the capacity to retrieve it via pattern completion at the immediate following timestept\+1t\+1using only partial inputEtpartialE\_\{t\}^\{\\mathrm\{partial\}\}:

fmemory​\(Etpartial,Mt\+1\)\\displaystyle f\_\{\\mathrm\{memory\}\}\(E\_\{t\}^\{\\mathrm\{partial\}\},M\_\{t\+1\}\)=\(Δ​Mt\+1,Yt\+1\),\\displaystyle=\(\\Delta M\_\{t\+1\},Y\_\{t\+1\}\),\(9\)Yt\+1\\displaystyle Y\_\{t\+1\}≈Et\.\\displaystyle\\approx E\_\{t\}\.
Adaptive ForgettingMemory capacity is necessarily bounded, and the encoding of new patterns could come at the cost of existing connections\. When a new sparse patternSSis encoded, connections between active and inactive dimensions undergo heterosynaptic weakening:

i∈FS,j∉FS⟹Δ​Ai,j≤0\.i\\in F\_\{S\},\\;j\\notin F\_\{S\}\\implies\\Delta A\_\{i,j\}\\leq 0\.\(10\)Only connections involving currently active dimensions undergo plasticity, while connections between two inactive dimensions remain undisturbed\.

This section outlines the eight computational requirements necessary for an explicit memory system to enable artificial neural networks to acquire higher\-order cognitive functions\. These requirements identify the most essential conditions rather than provide a complete biological replica; thus, simplifications such as the use of a unified bidirectionalPPor the omission of cortical consolidation remain areas for future refinement\. I review the findings of recent studies through the lens of these computational requirements in[AppendixA](https://arxiv.org/html/2606.11245#A1)\.

## 7Alternative Views

As highlighted throughout this paper, there is a fundamental gap between current LLMs and AGI\. Existing approaches confined to implicit learning mechanisms alone cannot bridge this divide\. I argue that integrating an explicit memory system is essential to overcome this limitation, though this position naturally invites alternative perspectives\.

The most direct counterargument is that current models have already crossed the threshold of AGI\. Microsoft researchers presented their view of GPT\-4 as an early, incomplete AGI\(Bubecket al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib16)\), while a researcher at OpenAI claimed AGI has been achieved with models like o1\. However, these are minority views, and the dominant opinion is that AGI or Human\-Level AI has not been realized\.

Beyond the definition of AGI, another viewpoint holds that explicit memory is not necessarily required to achieve AGI\. However, since the discussion on the necessity of explicit memory is still in its early stages, counterarguments against it are not yet widely articulated\. Indeed, memory is generally considered an essential component of AGI\(Kudithipudiet al\.,[2022](https://arxiv.org/html/2606.11245#bib.bib57); Goertzel,[2023](https://arxiv.org/html/2606.11245#bib.bib12); Fenget al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib103)\)\. The debate is more likely to focus on alternative ways to integrate memory into AI rather than questioning its necessity\.

A closely related view, Complementary Learning Systems theory\(McClellandet al\.,[1995](https://arxiv.org/html/2606.11245#bib.bib66); Kumaranet al\.,[2016](https://arxiv.org/html/2606.11245#bib.bib27)\), shares my premise that a hippocampus\-like module is needed but treats LLMs as analogues of neocortical semantic \(explicit\) memory\. Yet, as[AppendixC](https://arxiv.org/html/2606.11245#A3)shows, LLMs fail to produce the rule\-respecting, flexibly retrievable knowledge that defines semantic memory, reverting to rigid stimulus–response behavior\. Thus, I maintain that LLMs operate fundamentally as implicit systems rather than semantic ones\.

Shanget al\.\([2024](https://arxiv.org/html/2606.11245#bib.bib95)\)proposed a solution using LLMs as processors while integrating AI\-native long\-term memory in a systematic way to achieve AGI\. This approach first builds a natural language memory, which is then compressed into a neural network\-based personal model to enable continuous learning and personalized reasoning\. Through this process, AI is expected to develop personalized intelligence and adaptive capabilities, bringing it closer to AGI\.

## 8Discussion

Current SuccessIf explicit memory is as essential as I argue, one might expect current LLMs to be far more limited than they are, given that they already handle tasks well beyond the reach of human implicit memory\. I attribute this to two primary factors\. The first is scale\. A key constraint on human implicit memory is the experience and time it requires, yet LLMs are exposed during pre\-training alone to more text than a person reads in a lifetime, pushing implicit\-style learning to a biologically unattainable degree\. The second factor is optimization\. Although implicit learning is error\-driven, the brain cannot optimize via per\-weight gradients as artificial networks do\. Storing a gradient for every weight is an overhead no organism could sustain, and exploiting it yields clear gains in certain domains\. In short, the success of LLMs does not contradict their alignment with implicit memory; humans are simply bound by biological limits far tighter than the theoretical ceiling of the implicit system, and have never fully realized its potential\.

TestabilityThe central claim of this paper rests on the premise that current LLMs are fundamentally implicit memory systems and therefore lack explicit memory\. This hypothesis must ultimately be confirmed or refuted empirically\. Showing that a model does not operate explicitly is relatively straightforward\. As shown in[AppendixC](https://arxiv.org/html/2606.11245#A3), when a model fails on problems requiring the exact same knowledge it successfully applies in closely related cases, it indicates a lack of explicit processing\. Proving the converse, however, is significantly more challenging\. Because LLMs serve an enormous user base, their failures are rapidly collected and integrated back into training data by providers\. Consequently, even when such problems are eventually resolved, the improvement cannot prove that the task is now solved explicitly rather than through expanded statistical coverage\.

The most reliable behavioral evidence for explicit learning therefore requires deliberate omission from the training data together with a test of adaptation to the omitted task\. For instance, if a model trained without any multiplication problem sets could perform compound multiplication after learning only addition and the definition of multiplication, this would be strong evidence of explicit memory\. An indirect signal comes from tasks that require tracking a rapidly changing state over a long horizon, such as complex entity tracking, which lies beyond the reach of implicit memory\. It is essential that the state not be written explicitly into the tokens; otherwise, the task reduces to a chain of automatic rule applications and no longer probes explicit processing\.

Further RequirementsReaching AGI necessitates not only the development of the explicit memory system itself but also broader theoretical considerations such as determining the precise nature of the input information \(e\.g\., contextual episodes, self\-actions, long\-term temporal dependencies\) and how explicit memory can interact with other forms of memory\. Furthermore, because the proposed memory system functions by indexing representations produced by upstream computation, its effectiveness fundamentally depends on whether the LLM’s internal reasoning itself operates in an explicit manner\. This remains an open question that warrants dedicated investigation\.

## 9Conclusion

The path to AGI requires addressing significant limitations in current LLMs\. While LLMs excel in tasks that involve implicit memory, such as pattern recognition and statistical learning, they lack explicit memory capabilities\. This limits their ability to perform dynamic learning, long\-term reasoning, and metacognitive tasks essential for AGI\.

Explicit memory is critical for storing and retrieving episodic and semantic information, enabling models to update knowledge over time, reason logically, and simulate hypothetical scenarios\. These capabilities allow LLMs to overcome challenges like hallucination, inconsistency, and deficiencies in long\-term planning\. Key features include sparse coding, pattern completion, and dynamic updating mechanisms, which mirror human memory processes\.

To achieve AGI, research can significantly benefit from prioritizing the integration of explicit memory into LLMs and exploring its interaction with other memory systems\. This shift will empower LLMs to perform complex, coherent tasks and adapt in real\-time, moving closer to the versatility and depth of human cognition\. I hope that the theoretical framework will guide the evolution of robust explicit memory systems and contribute to the broader NeuroAI agenda of leveraging validated biological principles to advance artificial intelligence\(Hassabiset al\.,[2017](https://arxiv.org/html/2606.11245#bib.bib44); Zadoret al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib129)\)\.

## References

- D\. R\. Addis, A\. T\. Wong, and D\. L\. Schacter \(2007\)Remembering the past and imagining the future: common and distinct neural substrates during event construction and elaboration\.Neuropsychologia45\(7\),pp\. 1363–1377\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- G\. E\. Alexander, M\. R\. DeLong, and P\. L\. Strick \(1986\)Parallel organization of functionally segregated circuits linking basal ganglia and cortex\.Annu\. Rev\. Neurosci\.9\(1\),pp\. 357–381\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p5.1)\.
- P\. Andersen, T\. V\. P\. Bliss, and K\. K\. Skrede \(1971\)Lamellar organization of hippocampal excitatory pathways\.Experimental Brain Research13\(2\),pp\. 222–238\.External Links:[Document](https://dx.doi.org/10.1007/BF00234087),ISBN 1432\-1106,[Link](https://doi.org/10.1007/BF00234087)Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p1.3)\.
- J\. Anderson \(1982\)Acquisition of cognitive skill\.Psychological Review89,pp\. 369–406\.External Links:[Document](https://dx.doi.org/10.1037/0033-295X.89.4.369),ISBN 9781483214467Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- J\. R\. Andrews\-Hanna, R\. Saxe, and T\. Yarkoni \(2014\)Contributions of episodic retrieval and mentalizing to autobiographical thought: evidence from functional neuroimaging, resting\-state connectivity, and fMRI meta\-analyses\.Neuroimage91,pp\. 324–335\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p11.1)\.
- R\.C\. Atkinson and R\.M\. Shiffrin \(1968\)Human memory: a proposed system and its control processes\.K\. W\. Spence and J\. T\. Spence \(Eds\.\),Psychology of Learning and Motivation, Vol\.2,pp\. 89–195\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/S0079-7421%2808%2960422-3),ISSN 0079\-7421,[Link](https://www.sciencedirect.com/science/article/pii/S0079742108604223)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- A\. D\. Baddeley and G\. Hitch \(1974\)Working memory\.G\. H\. Bower \(Ed\.\),Psychology of Learning and Motivation, Vol\.8,pp\. 47–89\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/S0079-7421%2808%2960452-1),ISSN 0079\-7421,[Link](https://www.sciencedirect.com/science/article/pii/S0079742108604521)Cited by:[§2](https://arxiv.org/html/2606.11245#S2.p1.1)\.
- L\. J\. Batterink and K\. A\. Paller \(2017\)Online neural monitoring of statistical learning\.Cortex90,pp\. 31–45\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.cortex.2017.02.004),ISSN 0010\-9452,[Link](https://www.sciencedirect.com/science/article/pii/S0010945217300540)Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p3.1)\.
- A\. Behrouz, P\. Zhong, and V\. Mirrokni \(2024\)Titans: learning to memorize at test time\.arXiv \[cs\.LG\]\.External Links:[Link](http://arxiv.org/abs/2501.00663),2501\.00663Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p4.1)\.
- R\. G\. Benoit and D\. L\. Schacter \(2015\)Specifying the core network supporting episodic simulation and episodic memory by activation likelihood estimation\.Neuropsychologia75,pp\. 450–457\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p11.1)\.
- J\. R\. Binder and R\. H\. Desai \(2011\)The neurobiology of semantic memory\.Trends Cogn\. Sci\.15\(11\),pp\. 527–536\(en\)\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- J\. R\. Booth, L\. Wood, D\. Lu, J\. C\. Houk, and T\. Bitan \(2007\)The role of the basal ganglia and cerebellum in language processing\.Brain Res\.1133\(1\),pp\. 136–144\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p5.1)\.
- S\. Bubeck, V\. Chandrasekaran, R\. Eldan, J\. Gehrke, E\. Horvitz, E\. Kamar, P\. Lee, Y\. T\. Lee, Y\. Li, S\. Lundberg, H\. Nori, H\. Palangi, M\. T\. Ribeiro, and Y\. Zhang \(2023\)Sparks of artificial general intelligence: early experiments with gpt\-4\.External Links:[Link](https://www.microsoft.com/en-us/research/publication/sparks-of-artificial-general-intelligence-early-experiments-with-gpt-4/)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1),[§7](https://arxiv.org/html/2606.11245#S7.p2.1)\.
- P\. W\. Burgess, E\. Veitch, A\. de Lacy Costello, and T\. Shallice \(2000\)The cognitive and neuroanatomical correlates of multitasking\.Neuropsychologia38\(6\),pp\. 848–863\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p8.1)\.
- T\. Chakrabarty, V\. Padmakumar, and H\. He \(2022\)Help me write a poem \- instruction tuning as a vehicle for collaborative poetry writing\.InProceedings of the 2022 Conference on Empirical Methods in Natural Language Processing,Y\. Goldberg, Z\. Kozareva, and Y\. Zhang \(Eds\.\),Abu Dhabi, United Arab Emirates,pp\. 6848–6863\.External Links:[Document](https://dx.doi.org/10.18653/v1/2022.emnlp-main.460),[Link](https://aclanthology.org/2022.emnlp-main.460/)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- R\. E\. Clark and L\. R\. Squire \(1998\)Classical conditioning and brain systems: the role of awareness\.Science280\(5360\),pp\. 77–81\(en\)\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p2.1)\.
- N\. J\. Cohen and L\. R\. Squire \(1980\)Preserved learning and retention of pattern\-analyzing skill in amnesia: dissociation of knowing how and knowing that\.Science210\(4466\),pp\. 207–210\.External Links:[Document](https://dx.doi.org/10.1126/science.7414331),[Link](https://www.science.org/doi/abs/10.1126/science.7414331),https://www\.science\.org/doi/pdf/10\.1126/science\.7414331Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p2.1)\.
- K\. M\. Collins, I\. Sucholutsky, U\. Bhatt, K\. Chandra, L\. Wong, M\. Lee, C\. E\. Zhang, T\. Zhi\-Xuan, M\. Ho, V\. Mansinghka, A\. Weller, J\. B\. Tenenbaum, and T\. L\. Griffiths \(2024\)Building machines that learn and think with people\.Nature Human Behaviour8\(10\),pp\. 1851–1863\.External Links:[Document](https://dx.doi.org/10.1038/s41562-024-01991-9),ISBN 2397\-3374,[Link](https://doi.org/10.1038/s41562-024-01991-9)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- P\. Colombo, T\. P\. Pires, M\. Boudiaf, D\. Culver, R\. Melo, C\. Corro, A\. F\. T\. Martins, F\. Esposito, V\. L\. Raposo, S\. Morgado, and M\. Desa \(2024\)SaulLM\-7b: a pioneering large language model for law\.External Links:[Link](https://arxiv.org/abs/2403.03883),2403\.03883Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- G\. Comanici, E\. Bieber, M\. Schaekermann, I\. Pasupat, N\. Sachdeva, I\. Dhillon, M\. Blistein, O\. Ram, D\. Zhang, E\. Rosen, L\. Marris, S\. Petulla, C\. Gaffney, A\. Aharoni, N\. Lintz, T\. C\. Pais, H\. Jacobsson, I\. Szpektor, N\. Jiang,et al\.\(2025\)Gemini 2\.5: pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities\.External Links:2507\.06261,[Link](https://arxiv.org/abs/2507.06261)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- C\. M\. Conway \(2020\)How does the brain learn environmental structure? ten core principles for understanding the neurocognitive mechanisms of statistical learning\.Neurosci\. Biobehav\. Rev\.112,pp\. 279–299\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p5.1)\.
- P\. Das, S\. Chaudhury, E\. Nelson, I\. Melnyk, S\. Swaminathan, S\. Dai, A\. Lozano, G\. Kollias, V\. Chenthamarakshan, J\. Navratil, S\. Dan, and P\. Chen \(2024\)Larimar: large language models with episodic memory control\.InInternational Conference on Machine Learning,pp\. 10109–10126\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v235/das24a.html)Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p2.1)\.
- DeepSeek\-AI, D\. Guo, D\. Yang, H\. Zhang, J\. Song, R\. Zhang, R\. Xu, Q\. Zhu, S\. Ma, P\. Wang, X\. Bi, X\. Zhang, X\. Yu, Y\. Wu, Z\. F\. Wu, Z\. Gou, Z\. Shao, Z\. Li, Z\. Gao,et al\.\(2025\)DeepSeek\-r1: incentivizing reasoning capability in llms via reinforcement learning\.External Links:[Link](https://arxiv.org/abs/2501.12948),2501\.12948Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- A\. L\. Devitt, D\. R\. Addis, and D\. L\. Schacter \(2017\)Episodic and semantic content of memory and imagination: a multilevel analysis\.Mem\. Cognit\.45\(7\),pp\. 1078–1094\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p11.1)\.
- J\. Doyon and H\. Benali \(2005\)Reorganization and plasticity in the adult brain during learning of motor skills\.Curr\. Opin\. Neurobiol\.15\(2\),pp\. 161–167\(en\)\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p2.1)\.
- H\. Eichenbaum \(2000\)A cortical–hippocampal system for declarative memory\.Nature Reviews Neuroscience1\(1\),pp\. 41–50\.External Links:[Document](https://dx.doi.org/10.1038/35036213),ISBN 1471\-0048,[Link](https://doi.org/10.1038/35036213)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p2.1)\.
- T\. M\. Evans, D\. W\. Lipscomb, F\. S\. Earle, S\. N\. Del Tufo, J\. A\. G\. Lum, L\. E\. Cutting, and M\. T\. Ullman \(2024\)Declarative memory supports children’s math skills: a longitudinal study\.PLoS One19\(7\),pp\. e0304211\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p6.1)\.
- M\. Fayol and C\. Thevenot \(2012\)The use of procedural knowledge in simple addition and subtraction problems\.Cognition123\(3\),pp\. 392–403\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.cognition.2012.02.008),ISSN 0010\-0277,[Link](https://www.sciencedirect.com/science/article/pii/S0010027712000340)Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p6.1)\.
- T\. Feng, C\. Jin, J\. Liu, K\. Zhu, H\. Tu, Z\. Cheng, G\. Lin, and J\. You \(2024\)How far are we from AGI: are LLMs all we need?\.Transactions on Machine Learning Research\.Note:Survey CertificationExternal Links:ISSN 2835\-8856,[Link](https://openreview.net/forum?id=H2ZKqfNd0U)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1),[§7](https://arxiv.org/html/2606.11245#S7.p3.1)\.
- J\. Fiser and R\. N\. Aslin \(2002a\)Statistical learning of higher\-order temporal structure from visual shape sequences\.J\. Exp\. Psychol\. Learn\. Mem\. Cogn\.28\(3\),pp\. 458–467\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p3.1)\.
- J\. Fiser and R\. N\. Aslin \(2002b\)Statistical learning of new visual feature combinations by infants\.Proc\. Natl\. Acad\. Sci\. U\. S\. A\.99\(24\),pp\. 15822–15826\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p3.1)\.
- S\. M\. Fleming and R\. J\. Dolan \(2012\)The neural basis of metacognitive ability\.Philos\. Trans\. R\. Soc\. Lond\. B Biol\. Sci\.367\(1594\),pp\. 1338–1349\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p10.1)\.
- P\. W\. Frankland and B\. Bontempi \(2005\)The organization of recent and remote memories\.Nature Reviews Neuroscience6\(2\),pp\. 119–130\.External Links:[Document](https://dx.doi.org/10.1038/nrn1607),ISBN 1471\-0048,[Link](https://doi.org/10.1038/nrn1607)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p2.1)\.
- R\. M\. Friedrich and A\. D\. Friederici \(2013\)Mathematical logic in the human brain: semantics\.PLoS One8\(1\),pp\. e53699\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p6.1)\.
- G\. Gendron, Q\. Bao, M\. Witbrock, and G\. Dobbie \(2024\)Large language models are not strong abstract reasoners\.InProceedings of the Thirty\-Third International Joint Conference on Artificial Intelligence, IJCAI\-24,K\. Larson \(Ed\.\),pp\. 6270–6278\.Note:Main TrackExternal Links:[Document](https://dx.doi.org/10.24963/ijcai.2024/693),[Link](https://doi.org/10.24963/ijcai.2024/693)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- C\. R\. Gerfen and D\. J\. Surmeier \(2011\)Modulation of striatal projection systems by dopamine\.Annu\. Rev\. Neurosci\.34\(1\),pp\. 441–466\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p5.1)\.
- B\. Goertzel \(2023\)Generative ai vs\. agi: the cognitive strengths and weaknesses of modern llms\.External Links:[Link](https://arxiv.org/abs/2309.10371),2309\.10371Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1),[§7](https://arxiv.org/html/2606.11245#S7.p3.1)\.
- T\. D\. Goode, K\. Z\. Tanaka, A\. Sahay, and T\. J\. McHugh \(2020\)An integrated index: engrams, place cells, and hippocampal memory\.Neuron107\(5\),pp\. 805–820\.Note:doi: 10\.1016/j\.neuron\.2020\.07\.011External Links:[Document](https://dx.doi.org/10.1016/j.neuron.2020.07.011),ISBN 0896\-6273,[Link](https://doi.org/10.1016/j.neuron.2020.07.011)Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p1.3)\.
- I\. Goshen, M\. Brodsky, R\. Prakash, J\. Wallace, V\. Gradinaru, C\. Ramakrishnan, and K\. Deisseroth \(2011\)Dynamics of retrieval strategies for remote memories\.Cell147\(3\),pp\. 678–689\.Note:doi: 10\.1016/j\.cell\.2011\.09\.033External Links:[Document](https://dx.doi.org/10.1016/j.cell.2011.09.033),ISBN 0092\-8674,[Link](https://doi.org/10.1016/j.cell.2011.09.033)Cited by:[§B\.2](https://arxiv.org/html/2606.11245#A2.SS2.p3.1)\.
- P\. Graf and D\. L\. Schacter \(1985\)Implicit and explicit memory for new associations in normal and amnesic subjects\.Journal of Experimental Psychology: Learning, Memory, and Cognition11\(3\),pp\. 501–518\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p4.1)\.
- A\. Grattafiori, A\. Dubey, A\. Jauhri, A\. Pandey, A\. Kadian, A\. Al\-Dahle, A\. Letman, A\. Mathur, A\. Schelten, A\. Vaughan, A\. Yang, A\. Fan, A\. Goyal, A\. Hartshorn, A\. Yang, A\. Mitra, A\. Sravankumar, A\. Korenev, A\. Hinsvark,et al\.\(2024\)The llama 3 herd of models\.External Links:[Link](https://arxiv.org/abs/2407.21783),2407\.21783Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- A\. M\. Graybiel \(2008\)Habits, rituals, and the evaluative brain\.Annu\. Rev\. Neurosci\.31\(1\),pp\. 359–387\(en\)\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p2.1),[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p6.1)\.
- R\. R\. Hampton, J\. W\. M\. Engelberg, and R\. J\. Brady \(2020\)Explicit memory and cognition in monkeys\.Neuropsychologia138\(107326\),pp\. 107326\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p10.1)\.
- D\. Hassabis, D\. Kumaran, C\. Summerfield, and M\. Botvinick \(2017\)Neuroscience\-Inspired artificial intelligence\.Neuron95\(2\),pp\. 245–258\(en\)\.Cited by:[§9](https://arxiv.org/html/2606.11245#S9.p3.1)\.
- D\. Hassabis and E\. A\. Maguire \(2007\)Deconstructing episodic memory with construction\.Trends Cogn\. Sci\.11\(7\),pp\. 299–306\(en\)\.Cited by:[§B\.2](https://arxiv.org/html/2606.11245#A2.SS2.p2.1),[§4](https://arxiv.org/html/2606.11245#S4.p11.1),[§4](https://arxiv.org/html/2606.11245#S4.p8.1)\.
- K\. He, R\. Mao, Q\. Lin, Y\. Ruan, X\. Lan, M\. Feng, and E\. Cambria \(2025\)A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics\.External Links:[Link](https://arxiv.org/abs/2310.05694),2310\.05694Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- O\. Hikosaka and M\. Isoda \(2010\)Switching from automatic to controlled behavior: cortico\-basal ganglia mechanisms\.Trends Cogn\. Sci\.14\(4\),pp\. 154–161\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p5.1),[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p6.1)\.
- J\. Huang, X\. Chen, S\. Mishra, H\. S\. Zheng, A\. W\. Yu, X\. Song, and D\. Zhou \(2024\)Large language models cannot self\-correct reasoning yet\.InThe Twelfth International Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=IkmD3fKBPQ)Cited by:[§B\.3](https://arxiv.org/html/2606.11245#A2.SS3.SSS0.Px2.p4.1)\.
- M\. Irak, C\. Soylu, G\. Turan, and D\. Çapan \(2019\)Neurobiological basis of feeling of knowing in episodic memory\.Cogn\. Neurodyn\.13\(3\),pp\. 239–256\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p10.1)\.
- A\. Johns, J\. Homewood, R\. Stevenson, and A\. Taylor \(2012\)Implicit and explicit olfactory memory in people with and without down syndrome\.Res\. Dev\. Disabil\.33\(2\),pp\. 583–593\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p9.1)\.
- M\. W\. Jung and B\. L\. McNaughton \(1993\)Spatial selectivity of unit activity in the hippocampal granular layer\.Hippocampus3\(2\),pp\. 165–182\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p2.1)\.
- S\. Kadavath, T\. Conerly, A\. Askell, T\. Henighan, D\. Drain, E\. Perez, N\. Schiefer, Z\. Hatfield\-Dodds, N\. DasSarma, E\. Tran\-Johnson, S\. Johnston, S\. El\-Showk, A\. Jones, N\. Elhage, T\. Hume, A\. Chen, Y\. Bai, S\. Bowman, S\. Fort, D\. Ganguli, D\. Hernandez, J\. Jacobson, J\. Kernion, S\. Kravec, L\. Lovitt, K\. Ndousse, C\. Olsson, S\. Ringer, D\. Amodei, T\. Brown, J\. Clark, N\. Joseph, B\. Mann, S\. McCandlish, C\. Olah, and J\. Kaplan \(2022\)Language models \(mostly\) know what they know\.External Links:[Link](https://arxiv.org/abs/2207.05221),2207\.05221Cited by:[§B\.3](https://arxiv.org/html/2606.11245#A2.SS3.SSS0.Px2.p3.1)\.
- S\. B\. Klein, L\. Cosmides, J\. Tooby, and S\. Chance \(2002\)Decisions and the evolution of memory: multiple systems, multiple functions\.Psychol\. Rev\.109\(2\),pp\. 306–329\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p8.1)\.
- S\. B\. Klein and C\. E\. Gangi \(2010\)The multiplicity of self: neuropsychological evidence and its implications for the self as a construct in psychological research\.Ann\. N\. Y\. Acad\. Sci\.1191\(1\),pp\. 1–15\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p10.1)\.
- A\. C\. Kreitzer and R\. C\. Malenka \(2008\)Striatal plasticity and basal ganglia circuit function\.Neuron60\(4\),pp\. 543–554\.Note:doi: 10\.1016/j\.neuron\.2008\.11\.005External Links:[Document](https://dx.doi.org/10.1016/j.neuron.2008.11.005),ISBN 0896\-6273,[Link](https://doi.org/10.1016/j.neuron.2008.11.005)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p6.1)\.
- D\. Kudithipudi, M\. Aguilar\-Simon, J\. Babb, M\. Bazhenov, D\. Blackiston, J\. Bongard, A\. P\. Brna, S\. Chakravarthi Raja, N\. Cheney, J\. Clune, A\. Daram, S\. Fusi, P\. Helfer, L\. Kay, N\. Ketz, Z\. Kira, S\. Kolouri, J\. L\. Krichmar, S\. Kriegman, M\. Levin, S\. Madireddy, S\. Manicka, A\. Marjaninejad, B\. McNaughton, R\. Miikkulainen, Z\. Navratilova, T\. Pandit, A\. Parker, P\. K\. Pilly, S\. Risi, T\. J\. Sejnowski, A\. Soltoggio, N\. Soures, A\. S\. Tolias, D\. Urbina\-Meléndez, F\. J\. Valero\-Cuevas, G\. M\. van de Ven, J\. T\. Vogelstein, F\. Wang, R\. Weiss, A\. Yanguas\-Gil, X\. Zou, and H\. Siegelmann \(2022\)Biological underpinnings for lifelong learning machines\.Nature machine intelligence4\(3\),pp\. 196–210\(en\)\.External Links:[Document](https://dx.doi.org/10.1038/s42256-022-00452-0),ISSN 2522\-5839,2522\-5839,[Link](https://www.nature.com/articles/s42256-022-00452-0)Cited by:[§7](https://arxiv.org/html/2606.11245#S7.p3.1)\.
- D\. Kumaran, D\. Hassabis, and J\. L\. McClelland \(2016\)What learning systems do intelligent agents need? complementary learning systems theory updated\.Trends in Cognitive Sciences20\(7\),pp\. 512–534\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.tics.2016.05.004),ISSN 1364\-6613,[Link](https://www.sciencedirect.com/science/article/pii/S1364661316300432)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p1.1),[§7](https://arxiv.org/html/2606.11245#S7.p4.1)\.
- J\. K\. Leutgeb, S\. Leutgeb, M\. Moser, and E\. I\. Moser \(2007\)Pattern separation in the dentate gyrus and ca3 of the hippocampus\.Science315\(5814\),pp\. 961–966\.External Links:[Document](https://dx.doi.org/10.1126/science.1135801),[Link](https://www.science.org/doi/abs/10.1126/science.1135801),https://www\.science\.org/doi/pdf/10\.1126/science\.1135801Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p2.1)\.
- P\. Liang, R\. Bommasani, T\. Lee, D\. Tsipras, D\. Soylu, M\. Yasunaga, Y\. Zhang, D\. Narayanan, Y\. Wu, A\. Kumar, B\. Newman, B\. Yuan, B\. Yan, C\. Zhang, C\. Cosgrove, C\. D\. Manning, C\. Ré, D\. Acosta\-Navas, D\. A\. Hudson,et al\.\(2023\)Holistic evaluation of language models\.External Links:[Link](https://arxiv.org/abs/2211.09110),2211\.09110Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- P\. Lieberman, E\. Kako, J\. Friedman, G\. Tajchman, L\. S\. Feldman, and E\. B\. Jiminez \(1992\)Speech production, syntax comprehension, and cognitive deficits in parkinson’s disease\.Brain Lang\.43\(2\),pp\. 169–189\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p5.1)\.
- R\. C\. Malenka and M\. F\. Bear \(2004\)LTP and ltd: an embarrassment of riches\.Neuron44\(1\),pp\. 5–21\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.neuron.2004.09.012),ISSN 0896\-6273,[Link](https://www.sciencedirect.com/science/article/pii/S0896627304006087)Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p4.1)\.
- M\. G\. Mattar and N\. D\. Daw \(2018\)Prioritized memory access explains planning and hippocampal replay\.Nature Neuroscience21\(11\),pp\. 1609–1617\.External Links:[Document](https://dx.doi.org/10.1038/s41593-018-0232-z),ISBN 1546\-1726,[Link](https://doi.org/10.1038/s41593-018-0232-z)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p2.1)\.
- J\. L\. McClelland, B\. L\. McNaughton, and R\. C\. O’Reilly \(1995\)Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory\.Psychol\. Rev\.102\(3\),pp\. 419–457\(en\)\.Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p1.1),[§7](https://arxiv.org/html/2606.11245#S7.p4.1)\.
- V\. Menon \(2016\)Memory and cognitive control circuits in mathematical cognition and learning\.InProgress in Brain Research,Progress in brain research,pp\. 159–186\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p6.1)\.
- P\. R\. Montague, S\. E\. Hyman, and J\. D\. Cohen \(2004\)Computational roles for dopamine in behavioural control\.Nature431\(7010\),pp\. 760–767\.External Links:[Document](https://dx.doi.org/10.1038/nature03015),ISBN 1476\-4687,[Link](https://doi.org/10.1038/nature03015)Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p7.1)\.
- M\. R\. Morris, J\. Sohl\-Dickstein, N\. Fiedel, T\. Warkentin, A\. Dafoe, A\. Faust, C\. Farabet, and S\. Legg \(2024\)Position: levels of AGI for operationalizing progress on the path to AGI\.InProceedings of the 41st International Conference on Machine Learning,R\. Salakhutdinov, Z\. Kolter, K\. Heller, A\. Weller, N\. Oliver, J\. Scarlett, and F\. Berkenkamp \(Eds\.\),Proceedings of Machine Learning Research, Vol\.235,pp\. 36308–36321\.External Links:[Link](https://proceedings.mlr.press/v235/morris24b.html)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- K\. Nakazawa, T\. J\. McHugh, M\. A\. Wilson, and S\. Tonegawa \(2004\)NMDA receptors, place cells and hippocampal spatial memory\.Nature Reviews Neuroscience5\(5\),pp\. 361–372\.External Links:[Document](https://dx.doi.org/10.1038/nrn1385),ISBN 1471\-0048,[Link](https://doi.org/10.1038/nrn1385)Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p4.1)\.
- K\. Nakazawa, M\. C\. Quirk, R\. A\. Chitwood, M\. Watanabe, M\. F\. Yeckel, L\. D\. Sun, A\. Kato, C\. A\. Carr, D\. Johnston, M\. A\. Wilson, and S\. Tonegawa \(2002\)Requirement for hippocampal CA3 NMDA receptors in associative memory recall\.Science297\(5579\),pp\. 211–218\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p3.1),[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p1.1)\.
- T\. O\. Nelson \(1990\)Metamemory: a theoretical framework and new findings\.G\. H\. Bower \(Ed\.\),Psychology of Learning and Motivation, Vol\.26,pp\. 125–173\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/S0079-7421%2808%2960053-5),ISSN 0079\-7421,[Link](https://www.sciencedirect.com/science/article/pii/S0079742108600535)Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p10.1)\.
- M\. G\. Packard and B\. J\. Knowlton \(2002\)Learning and memory functions of the basal ganglia\.Annu\. Rev\. Neurosci\.25\(1\),pp\. 563–593\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p5.1)\.
- S\. Park and J\. Bak \(2024\)Memoria: resolving fateful forgetting problem through human\-inspired memory architecture\.InProceedings of the 41st International Conference on Machine Learning,R\. Salakhutdinov, Z\. Kolter, K\. Heller, A\. Weller, N\. Oliver, J\. Scarlett, and F\. Berkenkamp \(Eds\.\),Proceedings of Machine Learning Research, Vol\.235,pp\. 39587–39615\.External Links:[Link](https://proceedings.mlr.press/v235/park24a.html)Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p3.1)\.
- K\. Patterson, P\. J\. Nestor, and T\. T\. Rogers \(2007\)Where do you know what you know? the representation of semantic knowledge in the human brain\.Nature Reviews Neuroscience8\(12\),pp\. 976–987\.External Links:[Document](https://dx.doi.org/10.1038/nrn2277),ISBN 1471\-0048,[Link](https://doi.org/10.1038/nrn2277)Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- F\. Pedraza, B\. C\. Farkas, T\. Vékony, F\. Haesebaert, R\. Phelipon, I\. Mihalecz, K\. Janacsek, R\. Anders, B\. Tillmann, G\. Plancher, and D\. Németh \(2024\)Evidence for a competitive relationship between executive functions and statistical learning\.npj Science of Learning9\(1\),pp\. 30\.External Links:[Document](https://dx.doi.org/10.1038/s41539-024-00243-9),ISBN 2056\-7936,[Link](https://doi.org/10.1038/s41539-024-00243-9)Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p9.1)\.
- S\. Ramirez, X\. Liu, P\. Lin, J\. Suh, M\. Pignatelli, R\. L\. Redondo, T\. J\. Ryan, and S\. Tonegawa \(2013\)Creating a false memory in the hippocampus\.Science341\(6144\),pp\. 387–391\.External Links:[Document](https://dx.doi.org/10.1126/science.1239073),[Link](https://www.science.org/doi/abs/10.1126/science.1239073),https://www\.science\.org/doi/pdf/10\.1126/science\.1239073Cited by:[§B\.2](https://arxiv.org/html/2606.11245#A2.SS2.p3.1)\.
- P\. J\. Reber \(2013\)The neural basis of implicit learning and memory: a review of neuropsychological and neuroimaging research\.Neuropsychologia51\(10\),pp\. 2026–2042\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p8.1)\.
- N\. Rebola, M\. Carta, and C\. Mulle \(2017\)Operation and plasticity of hippocampal CA3 circuits: implications for memory encoding\.Nat\. Rev\. Neurosci\.18\(4\),pp\. 208–220\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p3.1)\.
- E\. Rolls \(2013\)The mechanisms for pattern completion and pattern separation in the hippocampus\.Frontiers in Systems Neuroscience7\.External Links:[Document](https://dx.doi.org/10.3389/fnsys.2013.00074),ISSN 1662\-5137,[Link](https://www.frontiersin.org/journals/systems-neuroscience/articles/10.3389/fnsys.2013.00074)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p1.1)\.
- D\. L\. Schacter and D\. R\. Addis \(2007\)The cognitive neuroscience of constructive memory: remembering the past and imagining the future\.Philos\. Trans\. R\. Soc\. Lond\. B Biol\. Sci\.362\(1481\),pp\. 773–786\(en\)\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1),[§4](https://arxiv.org/html/2606.11245#S4.p11.1)\.
- D\. L\. Schacter, R\. G\. Benoit, and K\. K\. Szpunar \(2017\)Episodic future thinking: mechanisms and functions\.Curr\. Opin\. Behav\. Sci\.17,pp\. 41–50\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- D\. L\. Schacter \(1987\)Implicit memory: history and current status\.Journal of Experimental Psychology: Learning, Memory, and Cognition13\(3\),pp\. 501–518\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p2.1)\.
- P\. Schlegel, Y\. Yin, A\. S\. Bates, S\. Dorkenwald, K\. Eichler, P\. Brooks, D\. S\. Han, M\. Gkantia, M\. dos Santos, E\. J\. Munnelly, G\. Badalamente, L\. Serratosa Capdevila, V\. A\. Sane, A\. M\. C\. Fragniere, L\. Kiassat, M\. W\. Pleijzier, T\. Stürner, I\. F\. M\. Tamimi, C\. R\. Dunne,et al\.\(2024\)Whole\-brain annotation and multi\-connectome cell typing of drosophila\.Nature634\(8032\),pp\. 139–152\.External Links:[Document](https://dx.doi.org/10.1038/s41586-024-07686-5),ISBN 1476\-4687,[Link](https://doi.org/10.1038/s41586-024-07686-5)Cited by:[§B\.1](https://arxiv.org/html/2606.11245#A2.SS1.p2.1)\.
- H\. Schmolck, E\. A\. Kensinger, S\. Corkin, and L\. R\. Squire \(2002\)Semantic knowledge in patient H\.M\. and other patients with bilateral medial and lateral temporal lobe lesions\.Hippocampus12\(4\),pp\. 520–533\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p4.1)\.
- M\. Schrimpf, I\. A\. Blank, G\. Tuckute, C\. Kauf, E\. A\. Hosseini, N\. Kanwisher, J\. B\. Tenenbaum, and E\. Fedorenko \(2021\)The neural architecture of language: integrative modeling converges on predictive processing\.Proceedings of the National Academy of Sciences118\(45\),pp\. e2105646118\.External Links:[Document](https://dx.doi.org/10.1073/pnas.2105646118),[Link](https://www.pnas.org/doi/abs/10.1073/pnas.2105646118),https://www\.pnas\.org/doi/pdf/10\.1073/pnas\.2105646118Cited by:[§B\.1](https://arxiv.org/html/2606.11245#A2.SS1.p3.1)\.
- W\. Schultz, P\. Dayan, and P\. R\. Montague \(1997\)A neural substrate of prediction and reward\.Science275\(5306\),pp\. 1593–1599\.External Links:[Document](https://dx.doi.org/10.1126/science.275.5306.1593),[Link](https://www.science.org/doi/abs/10.1126/science.275.5306.1593),https://www\.science\.org/doi/pdf/10\.1126/science\.275\.5306\.1593Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p7.1)\.
- W\. B\. Scoville and B\. Milner \(1957\)Loss of recent memory after bilateral hippocampal lesions\.J\. Neurol\. Neurosurg\. Psychiatry20\(1\),pp\. 11–21\(en\)\.Cited by:[§B\.2](https://arxiv.org/html/2606.11245#A2.SS2.p2.1),[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p1.3)\.
- J\. Shang, Z\. Zheng, J\. Wei, X\. Ying, F\. Tao, and Mindverse Team \(2024\)AI\-native memory: a pathway from LLMs towards AGI\.arXiv \[cs\.CL\]\.External Links:[Link](http://arxiv.org/abs/2406.18312),2406\.18312Cited by:[§7](https://arxiv.org/html/2606.11245#S7.p5.1)\.
- A\. Singh, A\. Fry, A\. Perelman, A\. Tart, A\. Ganesh, A\. El\-Kishky, A\. McLaughlin, A\. Low, A\. Ostrow, A\. Ananthram, A\. Nathan, A\. Luo, A\. Helyar, A\. Madry, A\. Efremov, A\. Spyra, A\. Baker\-Whitcomb, A\. Beutel, A\. Karpenko,et al\.\(2026\)OpenAI gpt\-5 system card\.External Links:2601\.03267,[Link](https://arxiv.org/abs/2601.03267)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- P\. Song, P\. Han, and N\. Goodman \(2025\)A survey on large language model reasoning failures\.In2nd AI for Math Workshop @ ICML 2025,External Links:[Link](https://openreview.net/forum?id=hsgMn4KBFG)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- L\. R\. Squire and S\. Zola\-Morgan \(1991\)The medial temporal lobe memory system\.Science253\(5026\),pp\. 1380–1386\(en\)\.Cited by:[§B\.2](https://arxiv.org/html/2606.11245#A2.SS2.p2.1),[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p1.3)\.
- L\. R\. Squire and A\. J\. O\. Dede \(2015\)Conscious and unconscious memory systems\.Cold Spring Harb\. Perspect\. Biol\.7\(3\),pp\. a021667\(en\)\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p2.1)\.
- L\. R\. Squire and S\. M\. Zola \(1996\)Structure and function of declarative and nondeclarative memory systems\.Proceedings of the National Academy of Sciences93\(24\),pp\. 13515–13522\.External Links:[Document](https://dx.doi.org/10.1073/pnas.93.24.13515),[Link](https://www.pnas.org/doi/abs/10.1073/pnas.93.24.13515),https://www\.pnas\.org/doi/pdf/10\.1073/pnas\.93\.24\.13515Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p8.1)\.
- L\. R\. Squire \(2004\)Memory systems of the brain: a brief history and current perspective\.Neurobiology of Learning and Memory82\(3\),pp\. 171–177\.Note:Multiple Memory SystemsExternal Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.nlm.2004.06.005),ISSN 1074\-7427,[Link](https://www.sciencedirect.com/science/article/pii/S1074742704000735)Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- A\. Srivastava, A\. Rastogi, A\. Rao, A\. A\. M\. Shoeb, A\. Abid, A\. Fisch, A\. R\. Brown, A\. Santoro, A\. Gupta, A\. Garriga\-Alonso, A\. Kluska, A\. Lewkowycz, A\. Agarwal, A\. Power, A\. Ray, A\. Warstadt, A\. W\. Kocurek, A\. Safaya, A\. Tazarv,et al\.\(2023\)Beyond the imitation game: quantifying and extrapolating the capabilities of language models\.Transactions on Machine Learning Research\.Note:Featured CertificationExternal Links:ISSN 2835\-8856,[Link](https://openreview.net/forum?id=uyTL5Bvosj)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- T\. Suddendorf and M\. C\. Corballis \(2007\)The evolution of foresight: what is mental time travel, and is it unique to humans?\.Behav\. Brain Sci\.30\(3\),pp\. 299–313; discussion 313–51\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p11.1)\.
- Y\. Sun, S\. Hu, G\. Zhou, K\. J\. Zheng, H\. Hajishirzi, N\. Dziri, and D\. Song \(2026\)OMEGA: can LLMs reason outside the box in math? evaluating exploratory, compositional, and transformative generalization\.InThe Thirty\-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track,External Links:[Link](https://openreview.net/forum?id=gBxpAKxg22)Cited by:[§B\.3](https://arxiv.org/html/2606.11245#A2.SS3.SSS0.Px1.p1.1)\.
- B\. Szigeti, P\. Gleeson, M\. Vella, S\. Khayrulin, A\. Palyanov, J\. Hokanson, M\. Currie, M\. Cantarelli, G\. Idili, and S\. Larson \(2014\)OpenWorm: an open\-science approach to modeling caenorhabditis elegans\.Front Comput Neurosci8,pp\. 137\(en\)\.Cited by:[§B\.1](https://arxiv.org/html/2606.11245#A2.SS1.p2.1)\.
- A\. Takashima, K\. M\. Petersson, F\. Rutters, I\. Tendolkar, O\. Jensen, M\. J\. Zwarts, B\. L\. McNaughton, and G\. Fernández \(2006\)Declarative memory consolidation in humans: a prospective functional magnetic resonance imaging study\.Proceedings of the National Academy of Sciences103\(3\),pp\. 756–761\.External Links:[Document](https://dx.doi.org/10.1073/pnas.0507774103),[Link](https://www.pnas.org/doi/abs/10.1073/pnas.0507774103),https://www\.pnas\.org/doi/pdf/10\.1073/pnas\.0507774103Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p2.1)\.
- T\. J\. Teyler and J\. W\. Rudy \(2007\)The hippocampal indexing theory and episodic memory: updating the index\.Hippocampus17\(12\),pp\. 1158–1169\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1002/hipo.20350),[Link](https://onlinelibrary.wiley.com/doi/abs/10.1002/hipo.20350),https://onlinelibrary\.wiley\.com/doi/pdf/10\.1002/hipo\.20350Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p1.3)\.
- Y\. Tian, A\. Ravichander, L\. Qin, R\. Le Bras, R\. Marjieh, N\. Peng, Y\. Choi, T\. Griffiths, and F\. Brahman \(2024\)MacGyver: are large language models creative problem solvers?\.InProceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies \(Volume 1: Long Papers\),K\. Duh, H\. Gomez, and S\. Bethard \(Eds\.\),Mexico City, Mexico,pp\. 5303–5324\.External Links:[Document](https://dx.doi.org/10.18653/v1/2024.naacl-long.297),[Link](https://aclanthology.org/2024.naacl-long.297/)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- N\. X\. Tritsch and B\. L\. Sabatini \(2012\)Dopaminergic modulation of synaptic transmission in cortex and striatum\.Neuron76\(1\),pp\. 33–50\(en\)\.Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p6.1)\.
- E\. Tulving \(1972\)Episodic and semantic memory\.Organization of memory\.,pp\. xiii, 423–xiii, 423\.Cited by:[§2](https://arxiv.org/html/2606.11245#S2.p1.1)\.
- E\. Tulving \(1985\)Memory and consciousness\.Canadian Psychology / Psychologie canadienne26\(1\),pp\. 1–12\.Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- E\. Tulving \(2002\)Episodic memory: from mind to brain\.Annual Review of Psychology53\(Volume 53, 2002\),pp\. 1–25\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1146/annurev.psych.53.100901.135114),ISSN 1545\-2085,[Link](https://www.annualreviews.org/content/journals/10.1146/annurev.psych.53.100901.135114)Cited by:[§2\.1](https://arxiv.org/html/2606.11245#S2.SS1.p1.1)\.
- N\. B\. Turk\-Browne, J\. Jungé, and B\. J\. Scholl \(2005\)The automaticity of visual statistical learning\.J\. Exp\. Psychol\. Gen\.134\(4\),pp\. 552–564\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p3.1)\.
- M\. Turpin, J\. Michael, E\. Perez, and S\. R\. Bowman \(2023\)Language models don’t always say what they think: unfaithful explanations in chain\-of\-thought prompting\.InThirty\-seventh Conference on Neural Information Processing Systems,External Links:[Link](https://openreview.net/forum?id=bzs4uPLXvi)Cited by:[§B\.3](https://arxiv.org/html/2606.11245#A2.SS3.SSS0.Px2.p2.1)\.
- M\. T\. Ullman, S\. Corkin, M\. Coppola, G\. Hickok, J\. H\. Growdon, W\. J\. Koroshetz, and S\. Pinker \(1997\)A neural dissociation within language: evidence that the mental dictionary is part of declarative memory, and that grammatical rules are processed by the procedural system\.J\. Cogn\. Neurosci\.9\(2\),pp\. 266–276\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p5.1)\.
- M\. T\. Ullman \(2004\)Contributions of memory circuits to language: the declarative/procedural model\.Cognition92\(1\-2\),pp\. 231–270\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p4.1)\.
- K\. Valmeekam, A\. Olmo, S\. Sreedharan, and S\. Kambhampati \(2022\)Large language models still can’t plan \(a benchmark for LLMs on planning and reasoning about change\)\.InNeurIPS 2022 Foundation Models for Decision Making Workshop,External Links:[Link](https://openreview.net/pdf?id=wUU-7XTL5XO)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- F\. Vargha\-Khadem, D\. G\. Gadian, K\. E\. Watkins, A\. Connelly, W\. Van Paesschen, and M\. Mishkin \(1997\)Differential effects of early hippocampal pathology on episodic and semantic memory\.Science277\(5324\),pp\. 376–380\(en\)\.Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p4.1)\.
- W\. Wang, L\. Dong, H\. Cheng, X\. Liu, X\. Yan, J\. Gao, and F\. Wei \(2023\)Augmenting language models with long\-term memory\.InAdvances in Neural Information Processing Systems,A\. Oh, T\. Naumann, A\. Globerson, K\. Saenko, M\. Hardt, and S\. Levine \(Eds\.\),Vol\.36,pp\. 74530–74543\.External Links:[Link](https://proceedings.neurips.cc/paper_files/paper/2023/file/ebd82705f44793b6f9ade5a669d0f0bf-Paper-Conference.pdf)Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p1.1)\.
- Y\. Wang, Y\. Gao, X\. Chen, H\. Jiang, S\. Li, J\. Yang, Q\. Yin, Z\. Li, X\. Li, B\. Yin, J\. Shang, and J\. Mcauley \(2024\)MEMORYLLM: towards self\-updatable large language models\.InProceedings of the 41st International Conference on Machine Learning,R\. Salakhutdinov, Z\. Kolter, K\. Heller, A\. Weller, N\. Oliver, J\. Scarlett, and F\. Berkenkamp \(Eds\.\),Proceedings of Machine Learning Research, Vol\.235,pp\. 50453–50466\.External Links:[Link](https://proceedings.mlr.press/v235/wang24s.html)Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p2.1)\.
- J\. Wei, Y\. Tay, R\. Bommasani, C\. Raffel, B\. Zoph, S\. Borgeaud, D\. Yogatama, M\. Bosma, D\. Zhou, D\. Metzler, E\. H\. Chi, T\. Hashimoto, O\. Vinyals, P\. Liang, J\. Dean, and W\. Fedus \(2022\)Emergent abilities of large language models\.External Links:[Link](https://arxiv.org/abs/2206.07682),2206\.07682Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- J\. C\. R\. Whittington, T\. H\. Muller, S\. Mark, G\. Chen, C\. Barry, N\. Burgess, and T\. E\. J\. Behrens \(2020\)The tolman\-eichenbaum machine: unifying space and relational memory through generalization in the hippocampal formation\.Cell183\(5\),pp\. 1249–1263\.e23\.Note:doi: 10\.1016/j\.cell\.2020\.10\.024External Links:[Document](https://dx.doi.org/10.1016/j.cell.2020.10.024),ISBN 0092\-8674,[Link](https://doi.org/10.1016/j.cell.2020.10.024)Cited by:[§4](https://arxiv.org/html/2606.11245#S4.p8.1)\.
- B\. J\. Wiltgen, M\. Zhou, Y\. Cai, J\. Balaji, M\. G\. Karlsson, S\. N\. Parivash, W\. Li, and A\. J\. Silva \(2010\)The hippocampus plays a selective role in the retrieval of detailed contextual memories\.Curr\. Biol\.20\(15\),pp\. 1336–1344\(en\)\.Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p1.1)\.
- C\. Xiao, P\. Zhang, X\. Han, G\. Xiao, Y\. Lin, Z\. Zhang, Z\. Liu, and M\. Sun \(2024\)InfLLM: training\-free long\-context extrapolation for LLMs with an efficient context memory\.InThe Thirty\-eighth Annual Conference on Neural Information Processing Systems,External Links:[Link](https://openreview.net/pdf?id=bTHFrqhASY)Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p1.1)\.
- D\. L\. K\. Yamins, H\. Hong, C\. F\. Cadieu, E\. A\. Solomon, D\. Seibert, and J\. J\. DiCarlo \(2014\)Performance\-optimized hierarchical models predict neural responses in higher visual cortex\.Proceedings of the National Academy of Sciences111\(23\),pp\. 8619–8624\.External Links:[Document](https://dx.doi.org/10.1073/pnas.1403112111),[Link](https://www.pnas.org/doi/abs/10.1073/pnas.1403112111),https://www\.pnas\.org/doi/pdf/10\.1073/pnas\.1403112111Cited by:[§B\.1](https://arxiv.org/html/2606.11245#A2.SS1.p3.1)\.
- A\. Yang, B\. Yang, B\. Zhang, B\. Hui, B\. Zheng, B\. Yu, C\. Li, D\. Liu, F\. Huang, H\. Wei, H\. Lin, J\. Yang, J\. Tu, J\. Zhang, J\. Yang, J\. Yang, J\. Zhou, J\. Lin, K\. Dang,et al\.\(2025\)Qwen2\.5 technical report\.External Links:[Link](https://arxiv.org/abs/2412.15115),2412\.15115Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p1.1)\.
- H\. Yang, Z\. Lin, W\. Wang, H\. Wu, Z\. Li, B\. Tang, W\. Wei, J\. Wang, Z\. Tang, S\. Song, C\. Xi, Y\. Yu, K\. Chen, F\. Xiong, L\. Tang, and W\. E \(2024\)Memory3\\text\{Memory\}^\{3\}: Language modeling with explicit memory\.arXiv \[cs\.CL\]\.External Links:[Link](http://arxiv.org/abs/2407.01178),2407\.01178Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p1.1)\.
- S\. Yao, D\. Yu, J\. Zhao, I\. Shafran, T\. Griffiths, Y\. Cao, and K\. Narasimhan \(2023\)Tree of thoughts: deliberate problem solving with large language models\.InAdvances in Neural Information Processing Systems,A\. Oh, T\. Naumann, A\. Globerson, K\. Saenko, M\. Hardt, and S\. Levine \(Eds\.\),Vol\.36,pp\. 11809–11822\.External Links:[Link](https://proceedings.neurips.cc/paper_files/paper/2023/file/271db9922b8d1f4dd7aaef84ed5ac703-Paper-Conference.pdf)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- M\. A\. Yassa and C\. E\. L\. Stark \(2011\)Pattern separation in the hippocampus\.Trends Neurosci\.34\(10\),pp\. 515–525\(en\)\.Cited by:[§2\.2](https://arxiv.org/html/2606.11245#S2.SS2.p2.1)\.
- H\. H\. Yin and B\. J\. Knowlton \(2006\)The role of the basal ganglia in habit formation\.Nature Reviews Neuroscience7\(6\),pp\. 464–476\.External Links:[Document](https://dx.doi.org/10.1038/nrn1919),ISBN 1471\-0048,[Link](https://doi.org/10.1038/nrn1919)Cited by:[§2\.3](https://arxiv.org/html/2606.11245#S2.SS3.p6.1)\.
- Z\. Yin, Q\. Sun, Q\. Guo, Z\. Zeng, Q\. Cheng, X\. Qiu, and X\. Huang \(2024\)Explicit memory learning with expectation maximization\.InProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing,Stroudsburg, PA, USA,pp\. 16618–16635\.External Links:[Document](https://dx.doi.org/10.18653/v1/2024.emnlp-main.927),[Link](https://aclanthology.org/2024.emnlp-main.927.pdf)Cited by:[Appendix A](https://arxiv.org/html/2606.11245#A1.p2.1)\.
- A\. Zador, S\. Escola, B\. Richards, B\. Ölveczky, Y\. Bengio, K\. Boahen, M\. Botvinick, D\. Chklovskii, A\. Churchland, C\. Clopath, J\. DiCarlo, S\. Ganguli, J\. Hawkins, K\. Körding, A\. Koulakov, Y\. LeCun, T\. Lillicrap, A\. Marblestone, B\. Olshausen, A\. Pouget, C\. Savin, T\. Sejnowski, E\. Simoncelli, S\. Solla, D\. Sussillo, A\. S\. Tolias, and D\. Tsao \(2023\)Catalyzing next\-generation artificial intelligence through neuroai\.Nature Communications14\(1\),pp\. 1597\.External Links:[Document](https://dx.doi.org/10.1038/s41467-023-37180-x),ISBN 2041\-1723,[Link](https://doi.org/10.1038/s41467-023-37180-x)Cited by:[§9](https://arxiv.org/html/2606.11245#S9.p3.1)\.
- C\. Zhang, Y\. Jian, Z\. Ouyang, and S\. Vosoughi \(2024\)Working memory identifies reasoning limits in language models\.InProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing,Y\. Al\-Onaizan, M\. Bansal, and Y\. Chen \(Eds\.\),Miami, Florida, USA,pp\. 16896–16922\.External Links:[Document](https://dx.doi.org/10.18653/v1/2024.emnlp-main.938),[Link](https://aclanthology.org/2024.emnlp-main.938/)Cited by:[§B\.3](https://arxiv.org/html/2606.11245#A2.SS3.SSS0.Px1.p1.1)\.
- W\. Zhong, L\. Guo, Q\. Gao, H\. Ye, and Y\. Wang \(2024\)MemoryBank: enhancing large language models with long\-term memory\.Proceedings of the AAAI Conference on Artificial Intelligence38\(17\),pp\. 19724–19731\.External Links:[Document](https://dx.doi.org/10.1609/aaai.v38i17.29946),[Link](https://ojs.aaai.org/index.php/AAAI/article/view/29946)Cited by:[§1](https://arxiv.org/html/2606.11245#S1.p2.1)\.
- Z\. Zhou, R\. Tao, J\. Zhu, Y\. Luo, Z\. Wang, and B\. Han \(2024\)Can language models perform robust reasoning in chain\-of\-thought prompting with noisy rationales?\.InThe Thirty\-eighth Annual Conference on Neural Information Processing Systems,External Links:[Link](https://openreview.net/forum?id=FbuODM02ra)Cited by:[§B\.3](https://arxiv.org/html/2606.11245#A2.SS3.SSS0.Px1.p1.1)\.

## Appendix ARecent Progress

Fortunately, there is widespread agreement on the importance of memory, and efforts to integrate memory into language models are steadily continuing\(Wanget al\.,[2023](https://arxiv.org/html/2606.11245#bib.bib116); Xiaoet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib122); Yanget al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib123)\)\. However, many approaches still treat memory as a simple extension of context length or a kind of external storage\. At the same time, the lack of an underlying theoretical foundation leads people to focus on more superficial issues, preventing consistent progress in a clear direction\. I review some of the latest research to understand recent advancements and propose directions for further development\.

Yinet al\.\([2024](https://arxiv.org/html/2606.11245#bib.bib127)\)highlighted the importance of explicit memory and introduced a memory system that dynamically updates using input\-output mappings\. Similarly, MemoryLLM\(Wanget al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib117)\)separates dynamic parameters from static ones to introduce explicit memory\. Larimar\(Daset al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib24)\)emphasized quick memory updates and rapid learning, inspired by the human hippocampus and episodic memory\.

Memoria\(Park and Bak,[2024](https://arxiv.org/html/2606.11245#bib.bib77)\)constructs memory independently without tying it to prediction or loss and retrieves information based on associations\. Additionally, it applies a depression mechanism for forgetting, based on individual utility\. However, its internal associations are not incorporated within the memory contents, which are dense hidden states, and they are only used for retrieval\. Sparse coding would enable a natural combination with associative mechanisms, making the system even more robust\.

Titans\(Behrouzet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib11)\)focuses specifically on memorization during test time, emphasizing associative memory\. The consideration of forgetting mechanisms is promising, and it is particularly insightful to use surprise\-based mechanisms to determine memorization levels\. However, its memory updates are coupled with loss gradients, making them error\-dependent\. Forgetting is implemented through a global decay process, but incorporating a preservation mechanism that reflects memory utility could make it even more effective\.

As discussed above, explicit neural memory has seen steady and meaningful progress\. However, engineering efforts have naturally tended to prioritize approaches that provide immediate practical utility\. As a result, more resources have been allocated to improving directly usable systems such as RAG, rather than exploring a new fundamental structure of explicit memory itself\.

I believe the situation is now changing\. With the rapid advancement of LLM capabilities, the field has reached a point where discussing AGI is no longer speculative\. Achieving such an ambitious goal will likely require overcoming multiple fundamental bottlenecks, which in turn calls for broader and more balanced exploration\. In this context, I expect that increased attention, including perspectives such as mine, combined with more balanced resource allocation, will accelerate progress toward making explicit neural memory practically viable\.

## Appendix BFurther Discussions

### B\.1Substrate Independence

To clarify, I do not assume that LLMs must replicate the exact physical architecture of the human brain to achieve AGI\. Rather, my argument is based on the fact that the human brain remains the only successful natural implementation of the higher\-order cognitive functions expected of AGI\. Because the hippocampus and explicit memory are universally recognized as core drivers of these capabilities, adapting this proven structural blueprint represents a promising pathway for realizing AGI\.

This perspective is grounded in the principle of substrate independence, which is the premise that higher cognitive functions emerge from computational operations rather than from any specific physical substrate\. This principle is strongly supported by biological modeling and neurorobotics; for instance, mapping the connectomes ofC\. elegansandDrosophila\(Szigetiet al\.,[2014](https://arxiv.org/html/2606.11245#bib.bib102); Schlegelet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib92)\)onto artificial agents or robotic systems demonstrates that if the structural foundations and computational mechanisms are replicated, they yield behavioral and functional outcomes closely resembling those of the original organism, regardless of the biological or synthetic substrate\.

Furthermore, when this is extended to artificial systems, several studies have shown that low\-level structural alignments between the human brain and artificial neural networks can successfully translate into high\-level functional similarities\(Yaminset al\.,[2014](https://arxiv.org/html/2606.11245#bib.bib23); Schrimpfet al\.,[2021](https://arxiv.org/html/2606.11245#bib.bib63)\)\. Therefore, while transformers and mammalian brains are fundamentally different substrates, abstracting and applying the computational mechanisms of explicit memory remains a viable strategy for advancing LLMs\.

### B\.2Causality of Explicit Memory

While the central argument of this paper is supported by the differential functions of explicit and implicit memory, one might argue that explicit memory and higher\-order cognitive functions are merely correlated, lacking causal evidence that the former is the primary driver of these abilities\. In the absence of such causal evidence, it could be argued that integrating explicit memory into LLMs would not necessarily lead to the emergence of these capabilities\. Indeed, distinguishing between causation and correlation is critically important in neuroscience\. I would like to emphasize that the central premise of this paper, that the hippocampus\-based explicit memory system is a necessary condition for higher cognitive functions such as future planning and reasoning, is not based on mere correlation but on well\-established causal evidence accumulated over decades in neuroscience\.

The relationship between explicit memory and executive function discussed in the manuscript has been cross\-validated through studies of patients with bilateral hippocampal lesions, such as Patient H\.M\., as well as through various animal deficit models\(Scoville and Milner,[1957](https://arxiv.org/html/2606.11245#bib.bib94); Squire and Zola\-Morgan,[1991](https://arxiv.org/html/2606.11245#bib.bib98)\)\. The fact that loss of hippocampal function leads to the absence of explicit memory, which in turn directly results in impairments in goal\-directed behavior and the ability to simulate future scenarios, provides clear evidence of a causal relationship\. In other words, the loss of a specific brain structure directly causes the breakdown of particular cognitive abilities\(Hassabis and Maguire,[2007](https://arxiv.org/html/2606.11245#bib.bib45)\)\.

Another important issue concerns causal evidence at the level of specific subregions and detailed mechanisms within the hippocampus\. In this regard, many studies have demonstrated causality at both molecular and circuit levels\. For example,Goshenet al\.\([2011](https://arxiv.org/html/2606.11245#bib.bib40)\)used optogenetics to precisely and temporarily inhibit the CA1 subregion of the hippocampus in real time, showing that this circuit plays an essential causal role not only in recent memory but also in the retrieval of long\-term \(remote\) memories\. In addition,Ramirezet al\.\([2013](https://arxiv.org/html/2606.11245#bib.bib100)\)demonstrated that optogenetic activation of a specific ensemble of engram cells in the dentate gyrus is sufficient to reconstruct a prior contextual memory and even generate false memories\. This provides direct evidence that activity in a specific subcircuit can serve as a sufficient condition for memory retrieval\.

### B\.3LLM Cognitive Adequacy

One might question whether explicit memory is strictly necessary for LLMs, suggesting that implicit memory alone may suffice\. This inquiry arises from the observation that LLMs have already achieved human\-level or superhuman performance in specific domains within the aforementioned explicit memory tasks\. I address this question as follows\.

#### Logical Reasoning

Regarding logical and mathematical reasoning, I acknowledge that LLMs achieve strong performance on many benchmark problems\. However, this performance should be interpreted with caution\. Humans are typically able to apply newly learned concepts to solve novel problems with minimal exposure, whereas LLMs generally require extensive training over large distributions of similar problems\. Recent studies continue to show that LLMs struggle with problems that involve unfamiliar formulations or require flexible abstraction beyond their training distribution \(e\.g\., counterfactual reasoning or out\-of\-distribution mathematical generalization\)\(Sunet al\.,[2026](https://arxiv.org/html/2606.11245#bib.bib128); Zhouet al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib131); Zhanget al\.,[2024](https://arxiv.org/html/2606.11245#bib.bib130)\)\. This suggests that high performance on known problem types does not necessarily imply the acquisition of human\-level logical reasoning capabilities\.

#### Metacognition

It is true that recent LLMs show excellent performance in self\-correction and confidence calibration through chain\-of\-thought, and this appears to operate very similarly to human metacognition\. However, according to studies conducted so far, these capabilities differ from human metacognition\.

A study byTurpinet al\.\([2023](https://arxiv.org/html/2606.11245#bib.bib69)\)reported that the self\-correction or expression of uncertainty shown by LLMs during the chain\-of\-thought process is not a true reflection of their internal knowledge state, but rather a pattern learned through multitask learning and RLHF processes\. In other words, when a model outputs “I do not know this” or “I will think again,” it does not reflect metacognitive awareness, but rather triggers self\-validation in the middle of generation to receive higher rewards from human evaluators\.

AlthoughKadavathet al\.\([2022](https://arxiv.org/html/2606.11245#bib.bib88)\)reported the self\-evaluation capabilities of LLMs, they observed a limitation\. This is not domain\-general metacognition but rather a restricted prediction that only operates within a specific distribution\. This suggests that the capability of the model is another form of probabilistic prediction learning that depends on the prompt format and training data distribution\.

If true metacognition existed, the model would be able to recognize and correct its own reasoning errors without external feedback\. However, a study byHuanget al\.\([2024](https://arxiv.org/html/2606.11245#bib.bib49)\)demonstrated that there are fundamental limits to LLMs independently correcting logical errors in their responses using only their intrinsic capabilities, without the intervention of ground truth prompts or tools\.

In conclusion, as seen in these studies, although the uncertainty prediction and self\-correction abilities of LLMs have improved compared to the past, these abilities are more accurately interpreted as a facet of multi\-task learning where the model estimates the accuracy of generated text and utilizes that result as context\. This mechanism has limitations in clearly tracking the sources and boundaries of knowledge and in performing consistent self\-assessment even in unfamiliar environments\. In particular, considering reports that issues such as continuously occurring hallucinations and the tendency to provide contradictory answers depending on the context are still common in very long contexts or with uncommon inputs, it seems difficult to conclude that the model has acquired true metacognition yet\.

### B\.4RAG Insufficiency

Given the rapid advancements in Retrieval\-Augmented Generation \(RAG\) and agent systems, one might argue that the issues raised in this paper could be resolved using existing RAG\-based systems, rather than strictly relying on hippocampal explicit memory\. While industry techniques like RAG are excellent practical tools, their core functionalities differ fundamentally from the artificial explicit memory system proposed in this paper\.

RAG essentially functions as a static external hard drive that stores raw text or summaries\. Information is accumulated statically without any interpretation or reorganization among different pieces of knowledge\. Because retrieval mechanisms typically do not perform exhaustive searches, this static nature makes it incredibly difficult to intentionally locate outdated information in order to forget or modify it\.

The “explicitness” of RAG is fundamentally bound to the text modality\. In real\-world scenarios where AI is deployed on devices like mobile phones and exposed to a continuous, real\-time stream of visual and auditory information, applying RAG would require the system to constantly summarize all multimodal inputs into text, store them, and immediately retrieve them to make split\-second decisions\. This is computationally prohibitive\.

Most importantly, because RAG forces an “all\-or\-nothing” decision at the exact moment of exposure, the system must decide right then whether a piece of information is worth summarizing and storing\. In reality, the future utility of new information is rarely clear at the time it is first encountered\. A hippocampus\-inspired explicit memory system, by contrast, unconditionally stores vast amounts of episodic information first, and only later evaluates its utility to consolidate important memories\.

Furthermore, in a RAG framework, the model can only utilize information if it is explicitly injected into its immediate context window\. Consequently, the model never actually knows what it holds in its database until it is prompted with it\. This architectural limitation prevents the model from developing true metacognition, which is the active, self\-aware management of knowledge that my proposed explicit memory system aims to achieve\.

## Appendix CEmpirical Evidence for the Absence of Explicit Memory in LLMs

To investigate the boundaries of current LLMs, specifically regarding the absence of explicit memory and executive function, I present a series of illustrative failure cases in[Figures2](https://arxiv.org/html/2606.11245#A3.F2)and[3](https://arxiv.org/html/2606.11245#A3.F3)\. These examples demonstrate that despite the models’ advanced capabilities in complex reasoning, they struggle with tasks that require the rigorous stability and rule\-adherence characteristic of explicit memory systems\.

A crucial distinction must be drawn regarding the interpretation of these failures\. While the integration of explicit memory theoretically opens the door to solving such problems, merely overcoming these specific examples does not equate to the acquisition of explicit memory or its associated cognitive functions\. Practically, the errors could be mitigated by synthesizing similar datasets and fine\-tuning the model\. However, this approach remains fundamentally trapped within the paradigm of implicit memory\. It merely expands the model’s statistical coverage without altering the underlying processing mechanism\.

The essence of explicit memory lies not in the volume of memorized patterns, but in the principled application of rules\. For instance, humans do not require extensive training on hundreds of examples to sum the number ‘1’ a hundred times\. They simply apply the fundamental arithmetic principle of addition\. Therefore, true progress involves moving beyond increasing the implicit boundary of data coverage to achieving a system capable of explicit memorization and manipulation, mirroring the efficiency and stability of human explicit memory\.

![Refer to caption](https://arxiv.org/html/2606.11245v1/x2.png)

Figure 2:Illustrative examples demonstrating the absence of explicit memory in ChatGPT\-5, with bold text added for emphasis\.\(A\)The first case illustrates the model’s susceptibility to the priming effect, a characteristic of implicit memory\. While the model initially correctly answers that the sun rises in the east, the mere introduction of a context regarding an “Earth rotation reversal” hypothesis causes the model to override this fundamental fact and answer “West”\. This suggests that even basic facts are not treated as stable explicit memories but are implicit and malleable under context\.\(B\)The second case involves specific knowledge retrieval regarding StarCraft\. Although the model possesses the correct internal knowledge \(Templar Archives costs 200 gas\) as shown in the first turn, it fails to identify a factual error when the user asks a question based on a false premise \(150 gas\)\. Instead of correcting the user, the model fabricates a justification for the incorrect value, further indicating that the model’s knowledge formation relies on implicit weights rather than explicit fact\-checking\.![Refer to caption](https://arxiv.org/html/2606.11245v1/x3.png)

Figure 3:Examples highlighting the limitations of LLMs in tasks requiring semantic memory and executive function\.\(A\)This panel illustrates a failure in semantic memory involving basic logic\. The task requires summing a sequence of ‘1’s \(totaling 108\), a problem solvable by a human child in minutes\. However, the model incorrectly calculates the sum as 120\. This failure suggests that LLMs process basic logical operations differently from human semantic memory\. Notably, this error cannot be solely attributed to tokenization, as the model performs correctly when the number of terms is halved\.\(B\)This panel tests the model’s executive function, a capability closely linked to explicit memory\. The user instructs the model to swap ‘a’ and ‘e’ in its output\. The model fails to execute this rule immediately \(e\.g\., correctly writing “cen” instead of “can” but consistently failing elsewhere\)\. This highlights a critical divergence: while an LLM might require specific training to master such a task, human executive function allows individuals to successfully execute such novel, rule\-based tasks on the very first attempt without any prior practice\.\(C\)The Sudoku puzzle further demonstrates limitations in semantic memory and rule adherence\. Although ChatGPT\-5 demonstrates outstanding capability in solving complex Olympiad\-level mathematics, it fails to solve this Sudoku puzzle\. In contrast, even a human novice could logically deduce the solution within a few hours\. The model produces an incorrect grid while claiming success, indicating that it operates on implicit, probabilistic associations rather than an explicit application of logical rules\.

Similar Articles

Human-Inspired Memory Architecture for LLM Agents

arXiv cs.AI

Microsoft researchers propose a biologically-inspired memory architecture for LLM agents that incorporates mechanisms like sleep-phase consolidation and interference-based forgetting to manage persistent memory efficiently.

Can AGI be achieved with LLMs alone?

Reddit r/singularity

This post explores the debate among top AI figures regarding whether LLMs alone can achieve AGI or if additional breakthroughs like world models are required.