How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

arXiv cs.AI 05/19/26, 04:00 AM Papers
Summary
This paper uses EEG recordings to study neural dynamics when humans process AI-generated hallucinated content, revealing distinct cognitive patterns and differences between misjudged and correctly judged hallucinations.
arXiv:2605.16953v1 Announce Type: new Abstract: While AI-generated hallucinations pose considerable risks, the underlying cognitive mechanisms by which humans can successfully recognize or be misled by these hallucinations remain unclear. To address this problem, this paper explores humans' neural dynamics to characterize how the brain processes hallucinated content. We record EEG signals from 27 participants while they are performing a verification task to judge the correctness of image descriptions generated by a multi-modal large language model (MLLM). Based on an averaged event-related potential (ERP) study, we reveal that multiple cognitive processes, e.g., semantic integration, inferential processing, memory retrieval, and cognitive load, exhibit distinct patterns when humans process hallucinated versus non-hallucinated content. Notably, neural responses to hallucinations that were misjudged versus correctly judged by human participants showed significant differences. This indicates that misjudged AI-generated hallucinations failed to trigger the standard neurocognitive fact verification pathway.
Original Article
View Cached Full Text
Cached at: 05/19/26, 06:37 AM
# How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study
Source: [https://arxiv.org/html/2605.16953](https://arxiv.org/html/2605.16953)
###### Abstract

While AI\-generated hallucinations pose considerable risks, the underlying cognitive mechanisms by which humans can successfully recognize or be misled by these hallucinations remain unclear\. To address this problem, this paper explores humans’ neural dynamics to characterize how the brain processes hallucinated content\. We record EEG signals from 27 participants while they are performing a verification task to judge the correctness of image descriptions generated by a multi\-modal large language model \(MLLM\)\. Based on an averaged event\-related potential \(ERP\) study, we reveal that multiple cognitive processes, e\.g\., semantic integration, inferential processing, memory retrieval, and cognitive load, exhibit distinct patterns when humans process hallucinated versus non\-hallucinated content\. Notably, neural responses to hallucinations that were misjudged versus correctly judged by human participants showed significant differences\. This indicates that misjudged AI\-generated hallucinations failed to trigger the standard neurocognitive fact verification pathway\. The detailed code can be accessed openly through the url[https://github\.com/Promise\-Z5Q2SQ/EEG\-Hallucination](https://github.com/Promise-Z5Q2SQ/EEG-Hallucination)\.

hallucinations,brain signals,neuroscience,multimodal large language model

## 1Introduction

Over the past few years, AI models have made impressive progress, scaling in size, architecture sophistication, and capability\(Minaeeet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib48); Zhaoet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib49); Wuet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib50)\)\. These advances have enabled them to perform a wide range of tasks, from image captioning and generation\(Borji,[2022](https://arxiv.org/html/2605.16953#bib.bib53)\)to open\-ended conversation\(Touvronet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib51)\)and multi\-modal understanding\(Team,[2025](https://arxiv.org/html/2605.16953#bib.bib5); Yanget al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib52)\)\. However, one of their key drawbacks, hallucinations, i\.e\., the tendency to generate plausible yet factually incorrect content, has become a growing concern\. A series of studies has shown that such hallucinations can easily mislead humans to make erroneous decisions\(Sunet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib39)\)in critical domains such as healthcare, law, and finance\.

To address hallucinations in AI models, researchers have attempted to understand the origins of AI hallucinations, especially in multi\-modal large language models \(MLLMs\) and large language models LLMs\(Jiet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib29); Huanget al\.,[2025](https://arxiv.org/html/2605.16953#bib.bib30)\)\. Much of existing research has investigated hallucinations from the perspective of the model, i\.e\., exploring how aspects such as training data, prompt design, decoding or sampling strategies, and internal uncertainty or confidence measures contribute to the occurrence of hallucinations\(Maynezet al\.,[2020](https://arxiv.org/html/2605.16953#bib.bib31)\)\. Based on the investigations, researchers further work on detecting hallucinated content automatically and further deriving mechanisms to generate factually consistent content\. For example,\(Lewiset al\.,[2020](https://arxiv.org/html/2605.16953#bib.bib34)\)introduced retrieval\-augmented generation to augment the LLMs with factual knowledge\.Farquharet al\.\([2024](https://arxiv.org/html/2605.16953#bib.bib33)\)proposes using several verification steps during generation to check the consistency of the generated content\.

On the other hand, the effects of AI hallucinations on humans have been extensively studied\(Zhaiet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib37); Kimet al\.,[2025](https://arxiv.org/html/2605.16953#bib.bib38)\)\. Most of these studies are based on users’ explicit, post\-hoc judgments and behaviors during their interactions with AI hallucinations\(Barros,[2025](https://arxiv.org/html/2605.16953#bib.bib40)\)\. For example,Klingbeilet al\.\([2024](https://arxiv.org/html/2605.16953#bib.bib35)\)conducted a user study examining AI hallucinations with varying levels of fluency and presentation tone\. It reveals that humans are prone to being misled by content characterized by high fluency and professionalism\. This indicates that AI\-generated hallucinations vary significantly in their deceptiveness, resulting in different levels of risk to human cognition and decision\-making\.

However, few studies have examined, from a neuroscience perspective, how human brain activity patterns differ when viewing hallucinated versus non\-hallucinated content generated by AI models\. Existing theory has segmented human perception of information content into several stages based on humans’ EEG signals, progressing from early sensory encoding and attentional allocation to higher\-order semantic integration and memory retrieval\(Luck,[2014](https://arxiv.org/html/2605.16953#bib.bib2)\)\. Building on this theoretical framework, a critical question arises: at which processing stage does the detection of AI hallucinations succeed or fail? Investigating these temporal dynamics is essential to understanding why humans are susceptible to “plausible but incorrect” AI hallucinations\.

To fill this gap, we investigate the neurological mechanisms by which humans recognize hallucinations and gain insight into the development of MLLMs\. Specifically, we have the following research questions\.

- •RQ1\.Do neural signals exhibit significant differences across distinct temporal stages when participants view hallucinated versus non\-hallucinated contents?
- •RQ2\.If yes, is this difference modulated by whether participants correctly recognize the hallucination content?
- •RQ3\.Can we predict whether AI\-generated content contains hallucinations based on human neural signals?

To address these research questions, we collected EEG data from 27 participants\. Each participant viewed textual stimuli generated by an MLLM that included both hallucinated and non\-hallucinated content\. Participants were asked to judge whether the textual stimuli matched the image content \(i\.e\., whether they recognized any hallucinations\)\. On the basis of this paradigm, we conducted averaged event\-related potential \(ERP\) analyses \(detailed in Section[4\.2](https://arxiv.org/html/2605.16953#S4.SS2)\)\. Based on the EEG/ERP methods, we obtain a series of findings about the underlying mechanisms when humans are perceiving AI\-generated hallucinations\. The ERP analysis reveals a significant difference in brain signals when participants are processing hallucination words and non\-hallucination words\. And it suggests that multiple cognitive processes, such as semantic\-thematic integration, inferential processing, memory retrieval, and cognitive loading, are engaged in hallucination recognition\.

However, we also noticed a significant difference between the brain signals when the hallucination words are misjudged and correctly judged by participants\. In\-depth analysis across different temporal stages suggests that misjudged AI hallucinations exhibit different neurocognitive patterns, specifically affecting humans’ allocation of attention and inferential reasoning\. Inspired by those analyses, we further conduct a prediction experiment and show that EEG signals can be used to predict, at both the word\-level and sentence\-level, whether content contains hallucinations\. However, this prediction is not as effective for instances where human subjects fail to correctly identify the hallucinations\. This indicates that deceptive AI hallucinations may deceive humans at both neural and behavioral levels\.

## 2Related Work

### 2\.1Hallucination in LLMs

Hallucination in LLMs denotes fluent but ungrounded or factually incorrect outputs\(Jiet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib29)\)\. Prior work separates intrinsic drivers from extrinsic causes and advances two main strands: detection and mitigation\. For detection, post\-hoc verification with retrieval/KBs checks factuality in knowledge\-intensive tasks\(Lewiset al\.,[2020](https://arxiv.org/html/2605.16953#bib.bib34)\), while black\-box consistency methods flag unstable generations without instrumenting the model\(Manakulet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib43)\)\. AMBER offers an LLM\-free, type\-controlled benchmark, and object\-level studies document captioning\-specific hallucinations\(Wanget al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib1); Rohrbachet al\.,[2018](https://arxiv.org/html/2605.16953#bib.bib44)\)\. Complementary strategies include self\-critique and verifier pipelines to reject dubious claims and improved cross\-modal alignment in MLLMs to curb object and attribute hallucinations\(Manakulet al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib43); Rohrbachet al\.,[2018](https://arxiv.org/html/2605.16953#bib.bib44)\)\. Despite progress, most evaluations remain outcome\-based, leaving open when and how humans neurally register hallucinations\.

### 2\.2Neuroscience & AI

Event\-related potentials \(ERPs\) provide time\-resolved markers of cognitive processing\(Luck,[2014](https://arxiv.org/html/2605.16953#bib.bib2)\)\. N100 and P200 index perceptual and attentional allocation; P300 relates to task\-relevant salience; N400 tracks semantic integration and expectancy violations; and P600 reflects late reanalysis and monitoring, including “semantic P600” effects\(Bornkessel\-Schlesewsky and Schlesewsky,[2008](https://arxiv.org/html/2605.16953#bib.bib23)\)\. Computational–neuroscience links show partial convergence between language\-model representations and brain activity, and surprisal robustly correlates with N400 amplitude in sentence comprehension\(Schrimpfet al\.,[2021](https://arxiv.org/html/2605.16953#bib.bib45); Franket al\.,[2013](https://arxiv.org/html/2605.16953#bib.bib46); Yeet al\.,[2025](https://arxiv.org/html/2605.16953#bib.bib42)\)\. Beyond language, EEG studies of short\-video polarization demonstrate that cognitive impact may be invisible to surface behaviors yet measurable in neural signals, and that EEG features can predict exposure to polarized content\(Duet al\.,[2025a](https://arxiv.org/html/2605.16953#bib.bib36),[b](https://arxiv.org/html/2605.16953#bib.bib41)\)\. Building on these insights, we compare ERPs elicited by hallucinated versus non\-hallucinated words and condition effects on recognition\.

## 3Data Collection

![Refer to caption](https://arxiv.org/html/2605.16953v1/figure/procedure.png)Figure 1:The overall procedure of our data collection\. A\) The procedure of stimulus selection\. B\) The experimental trial flow consists of five stages: presenting an image \(S1\), showing a fixation cross \(S2\), displaying a sentence word\-by\-word \(S3\), the participant making a judgment about the sentence’s match to the image \(S4\), and finally proceeding to the next image \(S5\)\.In this section, we describe the collection of EEG and behavioral data from 27 participants while they completed the multi\-modal QA task designed for hallucination recognition\.

### 3\.1Participants

A total of 27 volunteers were recruited for this study, comprising 11 males and 16 females, aged between 19 and 30 years \(with an average of 24\)\. The sample comprised mostly college students, but also included several members of the general public, ensuring some diversity beyond the academic population\. The participants represented a range of disciplines \(e\.g\., computer science, mechanical engineering, chemistry, and environmental engineering\), spanning undergraduate to postgraduate levels\. Each individual completed the full experiment in approximately 1\.5 hours, including 30 minutes for equipment setup and task instructions\. Prior to participation, all individuals were informed that their time would be compensated at a rate equivalent to US$11\.8 per hour, contingent upon their completion of the study, to ensure the quality of the data collected for the study\.

### 3\.2Task preparation

To minimize bias arising from participants’ varying disciplinary backgrounds, we deliberately adopted a multimodal QA task that demands minimal prerequisite knowledge\. Our approach draws on the AMBER benchmark\(Wanget al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib1)\)—a multi‑dimensional, LLM‑free evaluation dataset for hallucination in MLLMs—which comprises 1,004 images derived from the MSCOCO\(Linet al\.,[2014](https://arxiv.org/html/2605.16953#bib.bib3)\)and includes detailed annotations on three types of hallucinations: entity, attribute, and relation\. Based on this benchmark, we generated responses to generative\-style prompts for each image via an MLLM\. The MLLM we chose in our study is Qwen2\.5\-VL\-3B\-Instruct\(Team,[2025](https://arxiv.org/html/2605.16953#bib.bib5); Wanget al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib6)\)\. Leveraging both the original hallucination annotations provided by AMBER and our own manual verification, we manually selected 60 image–response pairs where the Qwen2\.5\-VL\-3B\-Instruct generated hallucinatory content\. For each response, we carefully and manually screened the generated content, selecting one sentence that clearly contained hallucination and one sentence verified to be hallucination\-free, thereby constructing a set of strictly controlled and balanced stimuli for EEG testing\. To ensure that each sentence we selected is not an illusion in itself, we use GPT4\-HDM method\(Suet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib7)\)and input each sentence into GPT4 to let it judge whether it violates the common sense of the real world\. See Appendix[A\.3](https://arxiv.org/html/2605.16953#A1.SS3)for more details\. Following AMBER’s taxonomy of hallucination types, we categorized the stimuli as follows: 27 entity\-related, 13 relation\-related, and 26 attribute\-related \(attribute category further subdivided into action \(5\), count \(11\), and state \(10\)\)\. Representative cases and the full selection criteria are documented in our publicly accessible code repository\. The procedure of stimuli selection is shown in Fig[1](https://arxiv.org/html/2605.16953#S3.F1)A\.

### 3\.3Procedure

Before the main trials, participants first completed an entry questionnaire and signed informed consent regarding privacy and data security\. They then received detailed instructions explaining the primary tasks and the operational procedures, and were explicitly informed that they retained the right to withdraw from the study at any time without consequence\. Following orientation, participants carried out a series of training trials intended to help them become familiar with the formal experiment’s flow\. Each participant was also asked to select a random seed before the experiment began, which was used to randomize the order of stimulus categories in order to ensure that across participants, each hallucination type and image condition would be fairly and evenly presented\.

Once these preparatory steps were finished, each trial proceeded through stages S1 through S5 in sequence, as shown in Fig[1](https://arxiv.org/html/2605.16953#S3.F1)B\. In S1 \(Image Presentation\), an image is shown for 6000 milliseconds while participants are told to view it attentively, knowing there will be a later match judgment\. In S2 \(Fixation\), a central fixation cross is displayed for 1000 milliseconds to orient and stabilize visual attention\. In S3 \(Sentence Presentation\), a sentence description appears word by word: the first word \(e\.g\., “The”\) is shown for 750 milliseconds \(\(Yeet al\.,[2022](https://arxiv.org/html/2605.16953#bib.bib4)\)\), followed by each subsequent word for the same duration; the sentence may either contain a hallucination of a different type or be non\-hallucinated\. After the full description, in S4 \(Judgement\) participants are asked whether the sentence matches the image content via a binary choice \(Yes or No\), responding using key presses\. Finally, S5 introduces the next trial and cycles back to S1 when participants press the space key\. During the entire experiment, we continuously recorded EEG signals from each participant\. Using event triggers, we logged the onset times of all key stimulus events, so each segment of EEG could be aligned precisely to the relevant experimental stage\. In addition, for every single sentence shown, we recorded the participant’s judgment \(Yes/No\) about whether the description matched the image\.

## 4Result Analysis

In this section, we employ ERP analysis techniques to investigate how brain signal patterns differ when participants view hallucination versus non\-hallucination words, and synthesize these findings to outline the neural mechanisms by which humans recognize hallucinations\.

### 4\.1Statistic Analysis

Across the full set of 120 trials per participant, on average, participants answered 101\.14±6\.53 items correctly, yielding a mean overall accuracy of 84\.29%\. Considering hallucination categories, the mean recognition accuracy by type was relation: 90\.88%, entity: 89\.30%, and attribute: 86\.18%\. Statistical tests reveal that the accuracy across all hallucination categories does not differ significantly\. While the accuracies across these categories are fairly similar, the relation type had the highest performance and the attribute type the lowest, suggesting that relation\-based hallucinations may be easier for participants to detect, whereas attribute\-level hallucinations pose greater detection difficulty\.

### 4\.2ERP Analysis

ERP refers to brain voltages that are time\-locked to specific events and reflect neural responses elicited by those events\(Blackwood and Muir,[1990](https://arxiv.org/html/2605.16953#bib.bib8)\)\. One of its key advantages is the high temporal resolution it offers, and the sequence of ERP peaks provides precise insight into rapid neural processing stages \(\(Lucket al\.,[2000](https://arxiv.org/html/2605.16953#bib.bib9)\)\)\. ERP components are evoked amplitude in different post\-stimulus time windows, e\.g\., N100, N400 \(negative waves within 100ms, 400 ms\), and P200, and P600 \(positive waves within 200 ms, 600 ms\)\. These standard ERP markers index different cognitive operations\(Lucket al\.,[2000](https://arxiv.org/html/2605.16953#bib.bib9)\)\. In our analysis, we employ conventional ERP\-processing procedures including signal preprocessing, defining time windows of interest, and specifying regions of interest \(ROIs\) for comparing conditions\(Yeet al\.,[2022](https://arxiv.org/html/2605.16953#bib.bib4); Zhuet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib10); Yeet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib11)\)\. The method of preprocessing is detailed in Appendix[A\.4](https://arxiv.org/html/2605.16953#A1.SS4)\. To disentangle different ERP components, we partitioned the extracted time interval into several distinct time windows based on the approach introduced byLehmann and Skrandies \([1980](https://arxiv.org/html/2605.16953#bib.bib13)\)\. Their method identifies evoked scalp potential components by examining both their latency and their topographic pattern\. We computed the Global Field Power over the 50\-750 ms post\-stimulus interval, and delineated time windows around the peaks, as shown in Table[1](https://arxiv.org/html/2605.16953#S4.T1)\.

Table 1:The statistical significance test results for different ERP components across brain regions\. We use the repeated measures ANOVA test and adopt post\-hoc pair\-wise comparisons with FDR correction\. \* and \*\* indicate statistical significance at a level of p<0\.05, p<0\.001, respectively\.Time windowROIRM\-ANOVA testPost\-hoc test50–120 msr\-temporal, parietal\*HalluCorrect \>NoHallu \*occipital\*HalluCorrect \>HalluWrong \*120–280 mspre\-frontal, occipital\*HalluCorrect \>NoHallu \*frontal, central, l\-temporal\*HalluCorrect \>NoHallu \*\*central, parietal\*HalluCorrect \>HalluWrong \*280–550 msr\-temporal\*HalluCorrect \>NoHallu \*central\*HalluCorrect \>NoHallu \*\*550–750 mspre\-frontal, frontal, l\-temporal,r\-temporal, occipital\*HalluCorrect \>NoHallu \*parietal\*HalluCorrect \>HalluWrong \*central\*\*HalluCorrect \>NoHallu \*\*![Refer to caption](https://arxiv.org/html/2605.16953v1/figure/erp.png)Figure 2:A\) Comparison of ERP waveforms elicited by different stimulus word types in the central brain region, with shaded areas indicating the 95% confidence intervals\. B\) Time\-resolved topographic difference maps comparing HalluCorrect with NoHallu and HalluWrong words, respectively; highlighted electrodes denote brain regions showing significant effects in the post\-hoc analysis\.To facilitate subsequent analyses, we partitioned the EEG data according to both the stimulus word type and participants’ recognition performance\. We defined three categories:HalluCorrectfor hallucination words that the participant correctly recognized as hallucinated;NoHallufor non\-hallucination words correctly identified as non\-hallucinated; andHalluWrongfor hallucination words which participants failed to recognize \(i\.e\., words that were in fact hallucinations but were judged as non\-hallucinations\)\. We plot the grand\-average ERP waveforms for different stimulus word types in central brain region in Figure[2](https://arxiv.org/html/2605.16953#S4.F2)\. We segment the electrodes into seven brain regions according to their placement on the brain topography shown in Figure[2](https://arxiv.org/html/2605.16953#S4.F2)\. We applied a repeated\-measures ANOVA in a fixed time window for each brain region, followed by post\-hoc pair\-wise comparisons with FDR correction\. The statistical findings for the various time windows and regions of interest are presented in Table[1](https://arxiv.org/html/2605.16953#S4.T1)\. Below, we discuss the characteristic features of each component and its potential functional roles\.

N100 is an early component in the time window around 100 ms \(50–120 ms\)\. We employ the repeated measures ANOVA method and discover significant differences between the grand\-averaged N100 component in r\-temporal \(F\[1,26\]=4\.615, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.151\), parietal \(F\[1,26\]=4\.872, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.158\), and occipital \(F\[1,26\]=8\.271, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.233\)\. The N100 component is typically interpreted as reflecting very early visual perceptual processing, especially for low\-level visual features\(Yanget al\.,[2022](https://arxiv.org/html/2605.16953#bib.bib15)\)\. It often shows maximal expression in occipito\-parietal regions\. Recent work also indicates that the amplitude of N100 is closely linked with attentional allocation\. Larger N100 amplitudes have been observed when stimuli draw more attention, or when perceptual systems are required to allocate greater resources to processing salient or unexpected input\(Thorntonet al\.,[2007](https://arxiv.org/html/2605.16953#bib.bib14); Rutmanet al\.,[2010](https://arxiv.org/html/2605.16953#bib.bib16)\)\. The results of post\-hoc test indicate that during the recognition of HalluCorrect words, participants show enhanced N100 responses compared to NoHallu and HalluWrong words\. This suggests that the process of correctly identifying hallucination words recruits attention very early and imposes a higher cognitive load on the perceptual system, even before later semantic processing stages\.

P200 is the dominant component in the time window around 200 ms \(120–280 ms\)\. RM\-ANOVA reveals the significant differences between grand\-averaged P300 component in pre\-frontal \(F\[1,26\]=8\.575, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.248\), occipital \(F\[1,26\]=11\.246, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.302\), and frontal \(F\[1,26\]=18\.226, p<0\.001,ηp2\\eta\_\{p\}^\{2\}=0\.412\), central \(F\[1,26\]=15\.311, p<0\.001,ηp2\\eta\_\{p\}^\{2\}=0\.371\), l\-temporal \(F\[1,26\]=14\.037, p<0\.001,ηp2\\eta\_\{p\}^\{2\}=0\.351\), and central \(F\[1,26\]=11\.285, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.324\), parietal \(F\[1,26\]=12\.039, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.336\)\. The P200 component is generally understood to reflect early attentional engagement and decision\-related processing\. It has been associated with novelty detection, stimulus complexity, and perceptual salience, such that more complex or unexpected stimuli elicit larger P200 amplitudes\(Ghaniet al\.,[2020](https://arxiv.org/html/2605.16953#bib.bib17)\)\. Empirical work shows that P200 amplitude tends to increase with attentional load and with stimuli that violate perceptual or contextual expectations\(Kempet al\.,[2009](https://arxiv.org/html/2605.16953#bib.bib18); Polich,[2007](https://arxiv.org/html/2605.16953#bib.bib19)\)\. We observe enhanced P200 responses for HalluCorrect words compared to NoHallu and HalluWrong words\. This suggests that hallucination words impose greater demands on early stimulus selection\. The processing system flags such words as perceptually or lexically salient, because they diverge from semantic expectation or evoke conflict\.

N400 component is evoked around 400 ms after the stimulus \(280\-550 ms\)\. Significant differences are found in central \(F\[1,26\]=16\.442, p<0\.001,ηp2\\eta\_\{p\}^\{2\}=0\.387\), r\-temporal \(F\[1,26\]=7\.929, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.234\)\. The N400 component is widely understood to index the access and integration of semantic information\. It typically reaches its maximal amplitude at centro\-parietal electrode sites, reflecting the brain’s effort to reconcile a word’s meaning with its broader context\. Empirical findings show that less predictable or semantically incongruent words evoke larger N400 responses, consistent with the idea that the N400 is sensitive to violations of expectation and relates to retrieval from semantic memory\(Lauet al\.,[2008](https://arxiv.org/html/2605.16953#bib.bib20); Lindborget al\.,[2023](https://arxiv.org/html/2605.16953#bib.bib21); Michaelovet al\.,[2022](https://arxiv.org/html/2605.16953#bib.bib22)\)\. HalluCorrect words conflict with the visual context, hence they elicit greater N400 amplitudes than NoHallu words\. This suggests when the descriptive content generated by the model diverges from what is visually present or semantically expected, the cost of semantic integration increases\. We also note that the post\-hoc analysis does not reveal significant differences between HalluCorrect and HalluWrong words\. This may suggest that even when participants make incorrect judgments, they still subconsciously register semantic conflict\. However, due to limitations in perceptual processing \(e\.g\., N100, P200\) and higher\-level inferential or reanalysis processes \(e\.g\., P600\), they ultimately arrive at an incorrect decision\.

P600 waveform mainly appears in the time window around 600 ms \(550\-750 ms\)\. Through ANOVA, we find significant differences between grand\-averaged P600 component in pre\-frontal \(F\[1,26\]=5\.874, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.184\), frontal \(F\[1,26\]=7\.268, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.218\), l\-temporal \(F\[1,26\]=7\.517, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.224\), r\-temporal \(F\[1,26\]=8\.093, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.237\), occipitial \(F\[1,26\]=6\.561, p<0\.05,ηp2\\eta\_\{p\}^\{2\}=0\.202\), and central \(F\[1,26\]=31\.558, p<0\.001,ηp2\\eta\_\{p\}^\{2\}=0\.548\), and parietal \(F\[1,26\]=15\.142, p<0\.001,ηp2\\eta\_\{p\}^\{2\}=0\.0\.368\)\. The P600 \(or late positivity\) component is classically implicated in sentence processing tasks and shows its strongest responses at centro\-parietal electrode sites\. Originally, the P600 was discovered as an index of syntactic reanalysis and repair, reflecting efforts to restructure or repair comprehension\(Seyednozadiet al\.,[2021](https://arxiv.org/html/2605.16953#bib.bib25)\)\. More recently, research has shown that even in sentences that are grammatically correct, semantic conflict or non\-typicality can also provoke a P600, which is referred to as the “semantic P600”\(Bornkessel\-Schlesewsky and Schlesewsky,[2008](https://arxiv.org/html/2605.16953#bib.bib23); Brouweret al\.,[2012](https://arxiv.org/html/2605.16953#bib.bib24)\)\. In the context of hallucination recognition, once a word is identified as semantically hallucinatory, participants engage controlled reanalysis and decision/monitoring processes\. These processes recruit language\-time systems and posterior integration networks, consistent with the late positivity seen in P600\. Thus, correct detection of hallucination involves not only early sensory/attentional and semantic mismatch stages, but also later re\-evaluation and integration when the linguistic input conflicts with perceptual or expectation\-based models\.

Across all examined components and brain regions in the post\-hoc test, none of the comparisons between HalluWrong and NoHallu words reach statistical significance\. Full test statistics for each time window and ROI are provided in the Appendix[A\.5](https://arxiv.org/html/2605.16953#A1.SS5)\. Importantly, we conducted a post\-hoc sensitivity analysis to evaluate the detectability of these effects under our design\. It shows that our sample size provides approximately 80% power \(α\\alpha=0\.05\) to detect medium\-to\-large effects \(dz≥d\_\{z\}\\geq0\.462\)\. This indicates that there exists no medium\-to\-large ERP effects when comparing HalluWrong versus NoHallu words\. The ERP analysis across different hallucination types did not yield statistically significant results and is therefore not reported in this paper\.

### 4\.3Discussion

Overall, our findings advance understanding of the neural mechanisms by which humans recognize hallucinated content generated by MLLMs\. The ERP results clearly show that various cognitive processes are engaged at extremely fine temporal scales\. Specifically, differences in early perceptual attention and cognitive load \(P200/N100\), semantic\-thematic understanding \(N400\), inferential processing, and memory retrieval \(P600\) mechanisms underlie successful hallucination recognition \(addressingRQ1\)\. These observations are consistent with prior studies of comprehension mechanisms, which posit that unexpected or incongruent input requires more effortful retrieval and integration of memory and draws upon prediction error and expectancy effects\(Zhuet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib10)\)\.

It is worth noting that our results align with, and extend, findings from previous ERP studies\. For example,\(Yeet al\.,[2022](https://arxiv.org/html/2605.16953#bib.bib4)\)explored the neural mechanisms underlying reading comprehension, and\(Pinkosovaet al\.,[2022](https://arxiv.org/html/2605.16953#bib.bib26)\)investigated relevance judgments\. They both reported that answer words \(or words highly relevant to the task\) elicit larger ERP amplitudes compared to ordinary or low\-relevance words\. These patterns suggest that when a stimulus is more directly tied to achieving the experimental goal, participants tend to allocate more attentional and cognitive resources to those items\. By analogy, our results suggest that participants in our user study tended to devote more resources to items that were more directly aligned with achieving the experimental task goal, i\.e\., detection of hallucinated content\. While the precise mechanisms driving this attentional allocation remain beyond the scope of this paper, they represent an intriguing avenue for future research\.

On the other hand, when participants failed to detect the hallucinated content, we did not observe the typical neural activity associated with anomaly detection\. This may suggest that, due to the high linguistic fluency and contextual coherence, hallucinations produced by advanced models can not successfully trigger cognitive mechanisms related to fact verification\. This may lead to the formation of humans’ false beliefs\. In the task we designed, this process primarily operates through attention allocation and inferential processing\. This pattern implies that recognition \(or conscious awareness\) is a key trigger for abnormal neural responses\. Mere exposure to hallucinated content does not suffice to induce the enhanced ERP effects \(addressingRQ2\)\. This aspect distinguishes the present task from many prior word\-recognition or anomaly detection tasks\.

The findings from our study offer valuable insights:\(1\) AI model design:We reveal that human recognition of hallucinatory information encompasses distinct stages, including attentional engagement, semantic matching, and memory retrieval\. These insights could guide the design of more robust hallucination detection and mitigation systems\. For instance, MLLMs might benefit from a cooperation of modules that do not rely solely on passive prediction but are actively “aware” of deviations\.\(2\) AI impact on Human:Existing research aimed at mitigating AI hallucination often misses important human factors\. From our study, we reveal that the cognitive risk of hallucination content depends heavily on whether humans can successfully recognize it\. Therefore, reducing the cognitive risks associated with such hallucinations constitutes a key research direction\.\(3\) Human\-AI Interaction:We observe that conscious awareness \(e\.g\., P200\) is essential for triggering anomalous neural responses\. This suggests that user interfaces with active interventions may help reduce the risk of users accepting hallucinated content uncritically\. Design strategies should emphasize helping users notice and identify when content seems incongruent\.

## 5Prediction Experiments

To explore whether EEG signals can act as signals to predict whether the content generated by an MLLM contains hallucinations, we conducted word\-level and sentence\-level prediction experiments on the dataset we collected\. In this section, we detail the procedures and results of experiments\.

### 5\.1Experimental Setup

#### Task Definition

We formalize the prediction task as follows\. Let a stimulus sentence containllwords, and for each word, we extract EEG features during its presentation\. We denote the sequence of word\-level EEG featuresX=\[x1,x2,…,xl\]∈ℝl×dX=\[x\_\{1\},x\_\{2\},\\dots,x\_\{l\}\]\\in\\mathbb\{R\}^\{l\\times d\}as input, whereddis the dimension of the extracted features\. The model produces two outputs: word\-level predictionyw=\[yw,1,yw,2,…,yw,l\]∈\{0,1\}ly\_\{w\}=\[y\_\{w,1\},y\_\{w,2\},\\dots,y\_\{w,l\}\]\\in\\\{0,1\\\}^\{l\}and sentence\-level predictionys∈\{0,1\}y\_\{s\}\\in\\\{0,1\\\}\. For evaluation, we selected AUC as a metric\.

#### Feature Selection

To build input features for our prediction models, we combined Frequency\-Band\-based Features \(FBFs\) with Event\-Related Potential\-based Features \(ERPFs\)\. FBFs capture global spectral information, while ERPFs focus on specific, behaviorally relevant time windows indicated by our ERP analyses\. We selected four brain regions \(central, l\-temporal, r\-temporal, and occipital\) that consistently showed strong effects in our significance tests in the previous section\. We selected a set of time points within those time window and divided each into five equal segments\. For each of those four regions, we computed differential entropy for five standard EEG frequency bands\. Differential entropy is widely used to quantify complexity in EEG signals, and has been shown to be effective for classification tasks such as emotion recognition\(Chenet al\.,[2019](https://arxiv.org/html/2605.16953#bib.bib27); Duanet al\.,[2013](https://arxiv.org/html/2605.16953#bib.bib28)\)\. We concatenated them to create a 760\-dimensional input vector\.

#### Data splitting strategies

To examine whether the model’s performance is consistent across different participants and whether it generalizes robustly across varying participant data distributions, we employed two data splitting strategies\. Within\-subject paradigm means for each participant individually, we perform ten\-fold cross\-validation, and then average performance across folds\. Across\-subject paradigm means that we hold out one participant’s data as the test set and train the model on the other 26 participants’ data\.

#### Model selection

We selected support vector machine \(SVM\), gradient boosting decision tree \(GBDT\), and an attention\-based model\. Our rationale for not selecting more complex or highly specialized neural architectures is twofold: 1\) this task is novel, and to our knowledge, no dedicated model has previously been designed specifically; 2\) our goal in this work is to demonstrate the effectiveness of EEG as an implicit feedback signal for predicting hallucinated content\. More sophisticated architectures to push maximal performance remain a promising direction for future work\. For more model and training details, please refer to the code and Appendix[A\.7](https://arxiv.org/html/2605.16953#A1.SS7)\.

### 5\.2Results

Table 2:The classification results of word\-level and sentence\-level prediction\. Best results are inBold\. †/\* indicates the result is significantly different with p\-value<0\.05 compared to the best model and random, respectively\.SettingsModelsWord\-levelSentence\-levelAUCwithin\\text\{AUC\}\_\{\\text\{within\}\}AUCcross\\text\{AUC\}\_\{\\text\{cross\}\}AUCwithin\\text\{AUC\}\_\{\\text\{within\}\}AUCcross\\text\{AUC\}\_\{\\text\{cross\}\}HalluCorrectvsNoHalluSVM0\.9393\*0\.8631\*0\.9601†\*0\.9494†\*GBDT0\.9190†\*0\.8362†\*0\.9647\*0\.8531†\*attention0\.9113†\*0\.7955†\*0\.9673\*0\.9846\*HalluWrongvsNoHalluSVM0\.53300\.51200\.49350\.5457GBDT0\.49350\.52170\.48700\.5392attention0\.51510\.50340\.49660\.5684HalluCorrectvsHalluWrongSVM0\.54690\.46000\.48280\.4718GBDT0\.53160\.47480\.48970\.4728attention0\.48940\.54100\.50710\.4952Table[2](https://arxiv.org/html/2605.16953#S5.T2)presents the results of the word\-level and sentence\-level prediction classification\. When comparing HalluCorrect and NoHallu words, it shows that several models achieved strong performance, with SVM attaining the highest word\-level performance and the attention\-based model performing best at the sentence\-level prediction\. As expected, the cross\-subject AUC scores are generally lower than the within\-subject ones, likely due to inter\-subject variability in EEG signals\. Differences in brain anatomy, electrode placement, cognitive strategies, and noise make generalization across individuals more challenging\(Apicellaet al\.,[2024](https://arxiv.org/html/2605.16953#bib.bib47)\)\. Another consistent trend is that sentence\-level classification outperforms word\-level classification\. This is plausible because sentence\-level prediction allows the model to integrate information across all constituent words, capturing contextual dependencies and cumulative signals\. The attention\-based model in particular can exploit sequential dependencies via its internal weighting mechanism, which helps it better aggregate subtle signals across words\. The results indicate that the EEG signals we collected carry meaningful information for predicting hallucinated vs non\-hallucinated content\. We further experimented by comparing HalluWrong vs NoHallu and HalluCorrect vs HalluWrong words\. The results show that model performance drops significantly and does not exceed random chance by a meaningful margin\. This indicates that EEG signals contain discriminative information only when participants correctly recognize hallucinations\. When participants fail to detect hallucinated content, the model is unable to distinguish hallucinated words from non\-hallucinated words based on EEG signals\.

Overall, our experiments validate that, at both the word and sentence levels, EEG is a viable implicit feedback signal for detecting hallucination in generated content\. However, this prediction is reliable only when participants correctly recognize hallucination\. \(addressingRQ3\)

## 6Conclusion

In summary, this paper makes the following three contributions\. 1\) We collected and will release an EEG dataset from 27 participants, in which subjects viewed text generated by MLLM, including both hallucinated and non\-hallucinated content\. 2\) We performed ERP analyses to probe the neural mechanisms of human recognition of MLLM\-generated hallucinations and found that early attention and perceptual processing, semantic\-thematic integration, inferential reasoning, and memory retrieval are all involved at very fine temporal resolution\. Crucially, ERP differences between hallucination vs non\-hallucination words only emerge when participants correctly recognize hallucination content, indicating that endogenous cognitive mechanisms that attenuate conflict detection when the hallucinated content appears fluent and contextually plausible\. 3\) We demonstrated that it is possible to predict whether content contains hallucinations with EEG at both the word\-level and sentence\-level, but reliable prediction depends on correct recognition of hallucinated content by participants\.

Despite the promising findings, this study has several limitations that must be acknowledged\. 1\) Although we have a relatively large number of participants \(n = 27\), which helps statistical reliability, each participant in our dataset viewed relatively few hallucination words, since the EEG data collection equipment is not portable and the sessions are time\-consuming\. 2\) Our setup was constrained to a laboratory setting\. We made efforts to approximate real\-world conditions, but there remains a gap between them\. Factors such as ambient noise, participant movement, multitasking, natural reading behavior, and variations in attention in real life are not fully captured\.

Our empirical findings suggest that the detection of hallucinated content involves memory retrieval and semantic matching\. Those cognitive processes may depend on whether participants have relevant knowledge\. Future studies would be valuable to conduct experiments within groups possessing specialized backgrounds \(e\.g\., experts in medicine, law, science\) to assess how prior knowledge modulates EEG signatures of hallucination recognition\. Although our study examined several categories of hallucination \(relation, entity, attribute\), a more fine‐grained investigation is needed to understand how different kinds of semantic and perceptual violations produce distinct neural effects\. This would help map which categories are most difficult to detect, in which brain regions, and at what latencies, thereby informing both cognitive theory and model design\. Another promising direction is to move beyond offline analysis toward real\-time or adaptive human–AI interaction systems\. Future work could explore whether EEG\-based signals of hallucination recognition can be decoded online and fed back to user interfaces in real time, enabling dynamic warning mechanisms, adaptive response generation, or confidence calibration\.

## Acknowledgements

This work is supported by the Research Project of Quan Cheng Laboratory, China \(Grant No\. QCL20250105\)\.

## Impact Statement

This work aims to advance understanding of human–AI interaction and the neural mechanisms underlying the recognition of AI\-generated hallucinations\. All experiments involving human participants were conducted under formal ethical approval, with procedures designed to ensure participant safety, informed consent, and protection of privacy\. Detailed ethics approval information will be disclosed after the review process\. We believe this study poses minimal risk to participants and contributes positively to the development of safer and more trustworthy AI systems by providing insights into how humans perceive and evaluate potentially misleading AI\-generated content\.

## References

- A\. Apicella, P\. Arpaia, G\. D’Errico, D\. Marocco, G\. Mastrati, N\. Moccaldi, and R\. Prevete \(2024\)Toward cross\-subject and cross\-session generalization in eeg\-based emotion recognition: systematic review, taxonomy, and methods\.Neurocomputing604,pp\. 128354\.Cited by:[§5\.2](https://arxiv.org/html/2605.16953#S5.SS2.p1.1)\.
- S\. Barros \(2025\)I think, therefore i hallucinate: minds, machines, and the art of being wrong\.arXiv preprint arXiv:2503\.05806\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p3.1)\.
- D\. Blackwood and W\. J\. Muir \(1990\)Cognitive brain potentials and their application\.The British Journal of Psychiatry157\(S9\),pp\. 96–101\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p1.1)\.
- A\. Borji \(2022\)Generated faces in the wild: quantitative comparison of stable diffusion, midjourney and dall\-e 2\.arXiv preprint arXiv:2210\.00586\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- I\. Bornkessel\-Schlesewsky and M\. Schlesewsky \(2008\)An alternative perspective on “semantic p600” effects in language comprehension\.Brain research reviews59\(1\),pp\. 55–73\.Cited by:[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1),[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p6.7)\.
- H\. Brouwer, H\. Fitz, and J\. Hoeks \(2012\)Getting real about semantic illusions: rethinking the functional role of the p600 in language comprehension\.Brain research1446,pp\. 127–143\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p6.7)\.
- D\. Chen, R\. Miao, W\. Yang, Y\. Liang, H\. Chen, L\. Huang, C\. Deng, and N\. Han \(2019\)A feature extraction method based on differential entropy and linear discriminant analysis for emotion recognition\.Sensors19\(7\),pp\. 1631\.Cited by:[§5\.1](https://arxiv.org/html/2605.16953#S5.SS1.SSS0.Px2.p1.1)\.
- B\. Du, Z\. Ye, M\. Jankowska, Z\. Wu, Q\. Ai, Y\. Zhou, and Y\. Liu \(2025a\)EEG reveals the cognitive impact of polarized content in short video scenarios\.Scientific Reports15\(1\),pp\. 18277\.Cited by:[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1)\.
- B\. Du, Z\. Ye, Z\. Wu, M\. Jankowska, Q\. Ai, and Y\. Liu \(2025b\)Understanding the effect of opinion polarization in short video browsing\.InProceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval,pp\. 906–916\.Cited by:[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1)\.
- R\. Duan, J\. Zhu, and B\. Lu \(2013\)Differential entropy feature for eeg\-based emotion classification\.In2013 6th international IEEE/EMBS conference on neural engineering \(NER\),pp\. 81–84\.Cited by:[§5\.1](https://arxiv.org/html/2605.16953#S5.SS1.SSS0.Px2.p1.1)\.
- S\. Farquhar, J\. Kossen, L\. Kuhn, and Y\. Gal \(2024\)Detecting hallucinations in large language models using semantic entropy\.Nature630\(8017\),pp\. 625–630\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p2.1)\.
- S\. L\. Frank, L\. J\. Otten, G\. Galli, and G\. Vigliocco \(2013\)Word surprisal predicts n400 amplitude during reading\.Cited by:[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1)\.
- U\. Ghani, N\. Signal, I\. K\. Niazi, and D\. Taylor \(2020\)ERP based measures of cognitive workload: a review\.Neuroscience & Biobehavioral Reviews118,pp\. 18–26\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p4.7)\.
- L\. Huang, W\. Yu, W\. Ma, W\. Zhong, Z\. Feng, H\. Wang, Q\. Chen, W\. Peng, X\. Feng, B\. Qin,et al\.\(2025\)A survey on hallucination in large language models: principles, taxonomy, challenges, and open questions\.ACM Transactions on Information Systems43\(2\),pp\. 1–55\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p2.1)\.
- Z\. Ji, N\. Lee, R\. Frieske, T\. Yu, D\. Su, Y\. Xu, E\. Ishii, Y\. J\. Bang, A\. Madotto, and P\. Fung \(2023\)Survey of hallucination in natural language generation\.ACM computing surveys55\(12\),pp\. 1–38\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.16953#S2.SS1.p1.1)\.
- A\. H\. Kemp, P\. J\. Hopkinson, D\. F\. Hermens, D\. L\. Rowe, A\. L\. Sumich, C\. R\. Clark, W\. Drinkenburg, N\. Abdi, R\. Penrose, A\. McFarlane,et al\.\(2009\)Fronto\-temporal alterations within the first 200 ms during an attentional task distinguish major depression, non\-clinical participants with depressed mood and healthy controls: a potential biomarker?\.Human brain mapping30\(2\),pp\. 602–614\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p4.7)\.
- S\. S\. Kim, J\. W\. Vaughan, Q\. V\. Liao, T\. Lombrozo, and O\. Russakovsky \(2025\)Fostering appropriate reliance on large language models: the role of explanations, sources, and inconsistencies\.InProceedings of the 2025 CHI Conference on Human Factors in Computing Systems,pp\. 1–19\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p3.1)\.
- A\. Klingbeil, C\. Grützner, and P\. Schreck \(2024\)Trust and reliance on ai—an experimental study on the extent and costs of overreliance on ai\.Computers in Human Behavior160,pp\. 108352\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p3.1)\.
- E\. F\. Lau, C\. Phillips, and D\. Poeppel \(2008\)A cortical network for semantics:\(de\) constructing the n400\.Nature reviews neuroscience9\(12\),pp\. 920–933\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p5.2)\.
- D\. Lehmann and W\. Skrandies \(1980\)Reference\-free identification of components of checkerboard\-evoked multichannel potential fields\.Electroencephalography and clinical neurophysiology48\(6\),pp\. 609–621\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p1.1)\.
- P\. Lewis, E\. Perez, A\. Piktus, F\. Petroni, V\. Karpukhin, N\. Goyal, H\. Küttler, M\. Lewis, W\. Yih, T\. Rocktäschel,et al\.\(2020\)Retrieval\-augmented generation for knowledge\-intensive nlp tasks\.Advances in neural information processing systems33,pp\. 9459–9474\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.16953#S2.SS1.p1.1)\.
- T\. Lin, M\. Maire, S\. Belongie, J\. Hays, P\. Perona, D\. Ramanan, P\. Dollár, and C\. L\. Zitnick \(2014\)Microsoft coco: common objects in context\.InComputer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6\-12, 2014, Proceedings, Part V 13,pp\. 740–755\.Cited by:[§3\.2](https://arxiv.org/html/2605.16953#S3.SS2.p1.1)\.
- A\. Lindborg, L\. Musiolek, D\. Ostwald, and M\. Rabovsky \(2023\)Semantic surprise predicts the n400 brain potential\.Neuroimage: Reports3\(1\),pp\. 100161\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p5.2)\.
- S\. J\. Luck, G\. F\. Woodman, and E\. K\. Vogel \(2000\)Event\-related potential studies of attention\.Trends in cognitive sciences4\(11\),pp\. 432–440\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p1.1)\.
- S\. J\. Luck \(2014\)An introduction to the event\-related potential technique\.MIT press\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p4.1),[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1)\.
- P\. Manakul, A\. Liusie, and M\. J\. Gales \(2023\)Selfcheckgpt: zero\-resource black\-box hallucination detection for generative large language models\.arXiv preprint arXiv:2303\.08896\.Cited by:[§2\.1](https://arxiv.org/html/2605.16953#S2.SS1.p1.1)\.
- J\. Maynez, S\. Narayan, B\. Bohnet, and R\. McDonald \(2020\)On faithfulness and factuality in abstractive summarization\.arXiv preprint arXiv:2005\.00661\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p2.1)\.
- J\. A\. Michaelov, S\. Coulson, and B\. K\. Bergen \(2022\)So cloze yet so far: n400 amplitude is better predicted by distributional information than human predictability judgements\.IEEE Transactions on Cognitive and Developmental Systems15\(3\),pp\. 1033–1042\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p5.2)\.
- S\. Minaee, T\. Mikolov, N\. Nikzad, M\. Chenaghlu, R\. Socher, X\. Amatriain, and J\. Gao \(2024\)Large language models: a survey\.arXiv preprint arXiv:2402\.06196\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- Z\. Pinkosova, W\. J\. McGeown, and Y\. Moshfeghi \(2022\)Revisiting neurological aspects of relevance: an eeg study\.InInternational Conference on Machine Learning, Optimization, and Data Science,pp\. 549–563\.Cited by:[§4\.3](https://arxiv.org/html/2605.16953#S4.SS3.p2.1)\.
- J\. Polich \(2007\)Updating p300: an integrative theory of p3a and p3b\.Clinical neurophysiology118\(10\),pp\. 2128–2148\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p4.7)\.
- A\. Rohrbach, L\. A\. Hendricks, K\. Burns, T\. Darrell, and K\. Saenko \(2018\)Object hallucination in image captioning\.arXiv preprint arXiv:1809\.02156\.Cited by:[§2\.1](https://arxiv.org/html/2605.16953#S2.SS1.p1.1)\.
- A\. M\. Rutman, W\. C\. Clapp, J\. Z\. Chadick, and A\. Gazzaley \(2010\)Early top–down control of visual processing predicts working memory performance\.Journal of cognitive neuroscience22\(6\),pp\. 1224–1234\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p3.3)\.
- M\. Schrimpf, I\. A\. Blank, G\. Tuckute, C\. Kauf, E\. A\. Hosseini, N\. Kanwisher, J\. B\. Tenenbaum, and E\. Fedorenko \(2021\)The neural architecture of language: integrative modeling converges on predictive processing\.Proceedings of the National Academy of Sciences118\(45\),pp\. e2105646118\.Cited by:[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1)\.
- Z\. Seyednozadi, R\. Pishghadam, and M\. Pishghadam \(2021\)Functional role of the n400 and p600 in language\-related erp studies with respect to semantic anomalies: an overview\.Archives of Neuropsychiatry58\(3\),pp\. 249\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p6.7)\.
- W\. Su, C\. Wang, Q\. Ai, Y\. Hu, Z\. Wu, Y\. Zhou, and Y\. Liu \(2024\)Unsupervised real\-time hallucination detection based on the internal states of large language models\.arXiv preprint arXiv:2403\.06448\.Cited by:[§3\.2](https://arxiv.org/html/2605.16953#S3.SS2.p1.1)\.
- Y\. Sun, D\. Sheng, Z\. Zhou, and Y\. Wu \(2024\)AI hallucination: towards a comprehensive classification of distorted information in artificial intelligence\-generated content\.Humanities and Social Sciences Communications11\(1\),pp\. 1–14\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- Q\. Team \(2025\)Qwen2\.5\-vl\.External Links:[Link](https://qwenlm.github.io/blog/qwen2.5-vl/)Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1),[§3\.2](https://arxiv.org/html/2605.16953#S3.SS2.p1.1)\.
- A\. R\. D\. Thornton, M\. Harmer, and B\. A\. Lavoie \(2007\)Selective attention increases the temporal precision of the auditory n100 event\-related potential\.Hearing Research230\(1\-2\),pp\. 73–79\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p3.3)\.
- H\. Touvron, T\. Lavril, G\. Izacard, X\. Martinet, M\. Lachaux, T\. Lacroix, B\. Rozière, N\. Goyal, E\. Hambro, F\. Azhar,et al\.\(2023\)Llama: open and efficient foundation language models\.arXiv preprint arXiv:2302\.13971\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- J\. Wang, Y\. Wang, G\. Xu, J\. Zhang, Y\. Gu, H\. Jia, J\. Wang, H\. Xu, M\. Yan, J\. Zhang,et al\.\(2023\)Amber: an llm\-free multi\-dimensional benchmark for mllms hallucination evaluation\.arXiv preprint arXiv:2311\.07397\.Cited by:[§2\.1](https://arxiv.org/html/2605.16953#S2.SS1.p1.1),[§3\.2](https://arxiv.org/html/2605.16953#S3.SS2.p1.1)\.
- P\. Wang, S\. Bai, S\. Tan, S\. Wang, Z\. Fan, J\. Bai, K\. Chen, X\. Liu, J\. Wang, W\. Ge, Y\. Fan, K\. Dang, M\. Du, X\. Ren, R\. Men, D\. Liu, C\. Zhou, J\. Zhou, and J\. Lin \(2024\)Qwen2\-vl: enhancing vision\-language model’s perception of the world at any resolution\.arXiv preprint arXiv:2409\.12191\.Cited by:[§3\.2](https://arxiv.org/html/2605.16953#S3.SS2.p1.1)\.
- J\. Wu, W\. Gan, Z\. Chen, S\. Wan, and P\. S\. Yu \(2023\)Multimodal large language models: a survey\.In2023 IEEE International Conference on Big Data \(BigData\),pp\. 2247–2256\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- C\. Yang, C\. Wang, X\. Chen, B\. Xiao, N\. Fu, B\. Ren, and Y\. Liu \(2022\)Event\-related potential assessment of visual perception abnormality in patients with obstructive sleep apnea: a preliminary study\.Frontiers in Human Neuroscience16,pp\. 895826\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p3.3)\.
- Z\. Yang, L\. Li, K\. Lin, J\. Wang, C\. Lin, Z\. Liu, and L\. Wang \(2023\)The dawn of lmms: preliminary explorations with gpt\-4v \(ision\)\.arXiv preprint arXiv:2309\.17421\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- D\. Yao, Y\. Qin, S\. Hu, L\. Dong, M\. L\. Bringas Vega, and P\. A\. Valdés Sosa \(2019\)Which reference should we use for eeg and erp practice?\.Brain topography32\(4\),pp\. 530–549\.Cited by:[§A\.4](https://arxiv.org/html/2605.16953#A1.SS4.p1.1)\.
- Z\. Ye, Q\. Ai, Y\. Liu, M\. de Rijke, M\. Zhang, C\. Lioma, and T\. Ruotsalo \(2025\)Generative language reconstruction from brain recordings\.Communications Biology8\(1\),pp\. 346\.Cited by:[§2\.2](https://arxiv.org/html/2605.16953#S2.SS2.p1.1)\.
- Z\. Ye, X\. Xie, Q\. Ai, Y\. Liu, Z\. Wang, W\. Su, and M\. Zhang \(2024\)Relevance feedback with brain signals\.ACM Transactions on Information Systems42\(4\),pp\. 1–37\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p1.1)\.
- Z\. Ye, X\. Xie, Y\. Liu, Z\. Wang, X\. Chen, M\. Zhang, and S\. Ma \(2022\)Towards a better understanding of human reading comprehension with brain signals\.InProceedings of the ACM Web Conference 2022,pp\. 380–391\.Cited by:[§3\.3](https://arxiv.org/html/2605.16953#S3.SS3.p2.1),[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p1.1),[§4\.3](https://arxiv.org/html/2605.16953#S4.SS3.p2.1)\.
- C\. Zhai, S\. Wibowo, and L\. D\. Li \(2024\)The effects of over\-reliance on ai dialogue systems on students’ cognitive abilities: a systematic review\.Smart Learning Environments11\(1\),pp\. 28\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p3.1)\.
- W\. X\. Zhao, K\. Zhou, J\. Li, T\. Tang, X\. Wang, Y\. Hou, Y\. Min, B\. Zhang, J\. Zhang, Z\. Dong,et al\.\(2023\)A survey of large language models\.arXiv preprint arXiv:2303\.182231\(2\)\.Cited by:[§1](https://arxiv.org/html/2605.16953#S1.p1.1)\.
- S\. Zhu, X\. Xie, Z\. Ye, Q\. Ai, and Y\. Liu \(2024\)Comparing point\-wise and pair\-wise relevance judgment with brain signals\.Journal of the Association for Information Science and Technology75\(9\),pp\. 957–971\.Cited by:[§4\.2](https://arxiv.org/html/2605.16953#S4.SS2.p1.1),[§4\.3](https://arxiv.org/html/2605.16953#S4.SS3.p1.1)\.

## Appendix AAppendix

### A\.1Apparatus

The stimuli are presented on a desktop computer that has a 27\-inch monitor with a resolution of2560×14402560\\times 1440pixels and a refresh rate of 60 Hz\. Participants are required to use the keyboard to interact with the platform\. EEG signals are captured and amplified using a Scan NuAmps Express system \(Compumedics Ltd\., VIC, Australia\) and a 64\-channel Quik\-Cap \(Compumedical NeuroScan\)\. A laptop computer functions as a server to record EEG signals and triggers using Curry8 software\. Throughout the experiment, electrode\-scalp impedance is maintained under50kΩ50k\\Omega, and the sampling rate is set at 1000 Hz\.

### A\.2Pilot Study

We conducted a power analysis based on pilot data, which indicated that 19 participants are sufficient to achieve 80% statistical power atα=0\.05\\alpha=0\.05\. To ensure robustness, we recruited 27 participants, thereby providing sufficient statistical power to detect the reported effects\.

### A\.3GPT4\-HDM Verification

We further validated the selected textual stimuli using the GPT4\-HDM verification method, with a carefully designed prompt adapted from prior hallucination\-detection settings\. Specifically, we employed the following prompt to evaluate whether each text span contained non\-factual or hallucinated information:

> Given the following text span, your objective is to determine if the provided text contains non\-factual or hallucinated information\. You SHOULD give your judgment based on the world knowledge\. Text span:\[Provided Text\] Now, determine if the above text span contains non\-factual or hallucinated information\. The answer you give MUST be “Yes” or “No”\.

### A\.4Preprocess

We preprocess the EEG data using several steps: first, we re\-reference all recorded signals offline using the linked\-mastoids method to reduce reference bias \(\(Yaoet al\.,[2019](https://arxiv.org/html/2605.16953#bib.bib12)\)\); second, we apply notch, high\-pass, and low\-pass filters to eliminate environmental interference, slow voltage drift, and high\-frequency noise respectively; third, we extract epochs of interest and compute their averages to obtain ERP waveforms\. The epochs are defined from 200 ms before the presentation of each stimulus word to 800 ms after, covering the expected time window for relevant neural responses\.

### A\.5ERP Analysis

Table 3:Raw and FDR\-corrected p\-values \(HalluCorrect vs\. NoHallu words\)pre\-frontalfrontalcentrall\-temporalr\-temporalparietaloccipitalRaw p\-values50–120 ms0\.06090\.22820\.69160\.10840\.00820\.00930\.8691120–280 ms0\.00700\.00020\.00060\.00090\.04930\.07310\.0025280–550 ms0\.11480\.08490\.00040\.03790\.00920\.56210\.0304550–750 ms0\.02260\.01210\.00000\.01090\.00860\.10850\.0166FDR\-corrected p\-values50–120 ms0\.14220\.31940\.80690\.18970\.03260\.03260\.8691120–280 ms0\.00980\.00160\.00210\.00210\.05750\.07310\.0043280–550 ms0\.13390\.11890\.00280\.06630\.03210\.56210\.0663550–750 ms0\.02640\.02130\.00000\.02130\.02130\.10850\.0232Tables[3](https://arxiv.org/html/2605.16953#A1.T3)present the p\-values before and after FDR correction\(α=0\.05\\alpha=0\.05\), for different ERP components across brain regions, comparing HalluWrong vs\. NoHallu words\.

Table 4:Raw and FDR\-corrected p\-values \(HalluCorrect vs\. HalluWrong words\)pre\-frontalfrontalcentrall\-temporalr\-temporalparietaloccipitalRaw p\-values50–120 ms0\.16160\.35230\.09730\.18890\.09370\.61330\.0056120–280 ms0\.15210\.04550\.00260\.07180\.06310\.00170\.2117280–550 ms0\.03600\.02930\.03460\.29120\.02740\.04080\.1306550–750 ms0\.04130\.04590\.01890\.06190\.03820\.00550\.0668FDR\-corrected p\-values50–120 ms0\.26440\.4110\.2270\.26440\.2270\.61330\.0391120–280 ms0\.17750\.10050\.0090\.10050\.10050\.0090\.2117280–550 ms0\.05710\.05710\.05710\.29120\.05710\.05710\.1524550–750 ms0\.06430\.06430\.06430\.06680\.06430\.03820\.0668Tables[4](https://arxiv.org/html/2605.16953#A1.T4)present the p\-values before and after FDR correction\(α=0\.05\\alpha=0\.05\), for different ERP components across brain regions, comparing HalluWrong vs\. HalluWrong words\.

Table 5:Raw and FDR\-corrected p\-values \(HalluWrong vs\. NoHallu words\)pre\-frontalfrontalcentrall\-temporalr\-temporalparietaloccipitalRaw p\-values50\-120 ms0\.30950\.14990\.74890\.78160\.46520\.22900\.0801120\-280 ms0\.20670\.07750\.45560\.28850\.88810\.07960\.0750280\-550 ms0\.89630\.85530\.55620\.59280\.66420\.48440\.0958550\-750 ms0\.81550\.94070\.23790\.70530\.69840\.09240\.3060FDR\-corrected p\-values50–120 ms0\.54160\.52470\.78160\.78160\.65130\.53430\.5247120–280 ms0\.36170\.18570\.53150\.40390\.88810\.18570\.1857280–550 ms0\.89630\.89630\.89630\.89630\.89630\.89630\.6706550–750 ms0\.94070\.94070\.71400\.94070\.94070\.64680\.7140Table 6:The statistical significance test results \(F score\) for different ERP components across brain regions for HalluWrong vs\. NoHallu words\.F\[1,26\]pre\-frontalfrontalcentrall\-temporalr\-temporalparietaloccipital50\-120 ms1\.07622\.20690\.10470\.07860\.55001\.52073\.3269120\-280 ms1\.68053\.39030\.57431\.17590\.02023\.33873\.6863280\-550 ms0\.01730\.03400\.35580\.29350\.19300\.50382\.9955550\-750 ms0\.05560\.00561\.46220\.14630\.15363\.06171\.0923Tables[6](https://arxiv.org/html/2605.16953#A1.T6)and Table[5](https://arxiv.org/html/2605.16953#A1.T5)present the statistical significance test results \(F scores and p values before and after FDR correction\(α=0\.05\\alpha=0\.05\), respectively\) for different ERP components across brain regions, comparing HalluWrong vs\. NoHallu words\. The results indicate that for all ERP components and in all examined regions of interest, the differences between HalluWrong and NoHallu words are not statistically significant\.

### A\.6More Statistic Analysis

Table 7:Number of Correct Items per ParticipantIDCorrectIDCorrectIDCorrectP01100P10103P19104P0285P11112P20105P03106P12103P21102P04106P1390P22107P0598P14103P23106P06100P15106P24101P0793P1696P25102P0897P17107P2688P09112P1898P27111Table[7](https://arxiv.org/html/2605.16953#A1.T7)presents the distribution of the number of correct items answered by all participants, where the mean is approximately 101\.14, the standard deviation is 6\.53, and the coefficient of variation is 6\.46% — reflecting a relatively high level of consistency in the number of correct items among the participants\.

### A\.7Models

Selecting features at the ROI level, as we do in the main text, rather than individual electrodes, is a common practice in EEG research, as it helps reduce spurious correlations and improves robustness across subjects and trials\. Following this standard approach, our feature definition is based on predefined ROIs and ERP time windows, rather than fine\-grained, data\-driven selection tied to specific electrodes\. To further address the concern about optimistic bias, we conducted additional experiments where feature selection was performed independently within each training fold\. The results show that the selected features are highly consistent across folds, with the same major ROIs \(e\.g\., central, temporal, and occipital\) repeatedly identified\. Moreover, the corresponding classification performance shows no significant difference compared to our original setup\. This provides strong empirical evidence that our results are not driven by data leakage or overfitting\. From a cognitive neuroscience perspective, this consistency is also expected\. The selected ROIs and time windows align well with established neural substrates of visual and language processing, and ROI\-level aggregation is generally less sensitive to random noise or electrode\-specific variability than single\-electrode selection\. Therefore, our feature design reflects both empirical robustness and neuroscientific priors, supporting its validity and generalizability\.

The model structures and hyperparameters are as follows\. For all models, the input features first undergo a preprocessing pipeline, which includes mean imputation for any missing values, followed by standard scaling to normalize the data\.

SVM \(Support Vector Machine\) We use a Radial Basis Function \(RBF\) kernel\. The regularization parameterCCis set to11\. The model is configured to output probability estimates for classification\.

RF \(Random Forest\) We set the number of trees in the forest to100100\. All other parameters are kept at their default values as specified in the scikit\-learn library\.

GBDT \(Gradient Boosting Decision Trees\) We set the number of boosting stages to100100and the learning rate to0\.10\.1\. All other parameters are set to their default values\.

MLP \(Multi\-Layer Perceptron\) We implement a network using PyTorch\. The architecture consists of a single hidden layer with100100units, which uses a ReLU activation function\. A dropout layer with a probability of0\.50\.5is applied after the activation function for regularization\. The output layer is a linear layer that maps to the two output classes\.

Attention\-based model We use a Transformer Encoder architecture implemented in PyTorch\. The input features are first projected into an embedding space with a dimension of128128\. This is followed by a 2\-layer Transformer Encoder\. Each encoder layer utilizes a multi\-head attention mechanism with88attention heads and a dropout rate of0\.50\.5\. A final linear layer maps the encoder’s output to the class scores\.

Training Configuration for Deep Models For both deep learning models \(MLP and Attention\-based\), we use the cross\-entropy loss function and the Adam optimizer with a learning rate of10−310^\{\-3\}\. The models are trained for300300epochs with a batch size of3232\.

### A\.8More Results

Table 8:The classification results of word\-level and sentence\-level prediction\. Best results are inBold\. †/\* indicates the result is significantly different with p\-value<0\.05 compared to the best model and random, respectively\.SettingsModelsWord\-levelSentence\-levelAUCwithin\\text\{AUC\}\_\{\\text\{within\}\}AUCcross\\text\{AUC\}\_\{\\text\{cross\}\}AUCwithin\\text\{AUC\}\_\{\\text\{within\}\}AUCcross\\text\{AUC\}\_\{\\text\{cross\}\}HalluCorrectvsNoHalluSVM0\.9393\*0\.8631\*0\.9601†\*0\.9494†\*RF0\.9069†\*0\.7924†\*0\.9622†\*0\.9384†\*GBDT0\.9190†\*0\.8362†\*0\.9647\*0\.8531†\*MLP0\.9125†\*0\.8272†\*0\.9655\*0\.9824\*attention0\.9113†\*0\.7955†\*0\.9673\*0\.9846\*HalluWrongvsNoHalluSVM0\.53300\.51200\.49350\.5457RF0\.51200\.50060\.54570\.5367GBDT0\.49350\.52170\.48700\.5392MLP0\.54570\.53410\.53670\.5423attention0\.51510\.50340\.49660\.5684HalluCorrectvsHalluWrongSVM0\.54690\.460\.48280\.4718RF0\.53050\.46840\.47970\.4683GBDT0\.53160\.47480\.48970\.4728MLP0\.50530\.50140\.49660\.4793attention0\.48940\.5410\.50710\.4952Table 9:The recall results of word\-level and sentence\-level prediction\. † indicates the result is significantly different with p\-value<0\.05 compared to the best model\. \* indicates including HalluWrong words in the training\.SettingsModelsword\-levelsentence\-levelRecallwithinRecall\_\{within\}RecallcrossRecall\_\{cross\}RecallwithinRecall\_\{within\}RecallcrossRecall\_\{cross\}HalluCorrectvsNoHalluSVM0\.5740†0\.3669†0\.6610†0\.4655†RF0\.2629†0\.1169†0\.0404†0\.0162†GBDT0\.4104†0\.2132†0\.4602†0\.0346†MLP0\.6141†0\.3487†0\.8952†0\.8244attention0\.66170\.44210\.93100\.7407†HalluWrongvsNoHalluSVM0\.00000\.00000\.00230\.0000RF0\.00000\.00000\.00000\.0000GBDT0\.00230\.07920\.12010\.0310MLP0\.00000\.00280\.04710\.0523attention0\.00000\.00000\.08470\.0000HalluCorrectvsHalluWrongSVM1111RF0\.98860\.96410\.9980\.9941GBDT0\.88640\.95890\.96630\.9609MLP0\.93680\.96050\.98880\.9985attention0\.85560\.90790\.98320\.9806Table[8](https://arxiv.org/html/2605.16953#A1.T8)and Table[9](https://arxiv.org/html/2605.16953#A1.T9)shows the more results of word\-level and sentence\-level prediction\.

Table 10:Comparison between EEG\-based hallucination detection and representative AI\-based LVLM hallucination detection methods\.MethodTypeAUCEEG\-based \(cross\-subject, attention, ours\)Neural\-based0\.931Uncertainty\-based \(PPL\)Uncertainty\-based0\.876Uncertainty\-based \(Token Confidence\)Uncertainty\-based0\.892Confidence\-based \(Consistency via re\-asking\)Confidence\-based0\.881Self\-verification \(Verification prompt\)Self\-verification0\.851To compare our prediction results with existing LVLM hallucination detection approaches, we evaluated three representative AI\-based hallucination detection methods alongside our EEG\-based detection framework on the constructed hallucination dataset, as shown in Table[10](https://arxiv.org/html/2605.16953#A1.T10)\. The compared methods include uncertainty\-based detection, confidence\-based consistency checking, and self\-verification approaches\. We found that the EEG\-based detection method consistently achieved higher AUC scores than these conventional model\-based approaches\. We believe this advantage may arise from two main factors\. First, our EEG signals were derived from the averaged event\-related potential \(ERP\) responses of 27 participants, providing a more stable and less noisy representation of human cognitive processing\. Second, neural signals offer an intrinsic process\-level measurement that is not restricted to post\-hoc behavioral outputs or heuristic model\-internal statistics\. As a result, EEG signals may capture certain aspects of hallucination processing that are inaccessible to standard model\-based detection methods\.
How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

Similar Articles

AI Hallucinations Might Be More Human Than We’d Like to Admit

Growing number of AI hallucinations that are appearing in academic papers and articles

what happens if you instruct your go-to AI model to: "NEVER HALLUCINATE!!!"

This article about AI allucinations written by thehackernews, is literally written with AI lol... We need to do something to stop this phenomenon

How do you deal with AI "hallucinations" in your automations?

Submit Feedback

Similar Articles

AI Hallucinations Might Be More Human Than We’d Like to Admit
Growing number of AI hallucinations that are appearing in academic papers and articles
what happens if you instruct your go-to AI model to: "NEVER HALLUCINATE!!!"
This article about AI allucinations written by thehackernews, is literally written with AI lol... We need to do something to stop this phenomenon
How do you deal with AI "hallucinations" in your automations?