reliability-analysis

#reliability-analysis

LLM-Assisted Stance Detection in Scientific Discourse: A Test Case in Bayesian Cognitive Science

arXiv cs.CL · 2026-06-16

This paper presents an LLM-assisted method for stance detection in scientific discourse, specifically distinguishing realist from instrumentalist stances in Bayesian cognitive science articles. The approach combines theory-driven coding, expert annotations, and prompt optimization to achieve high reliability.

#reliability-analysis

When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory

arXiv cs.AI · 2026-05-11

This paper introduces a scale-conditioned evaluation protocol for agent memory, analyzing how reliability degrades as irrelevant sessions accumulate. It identifies specific failure regimes and usable-scale boundaries across different memory interfaces and LLMs.
