Computer Science Conferences Should Require Nonrepudiable Experimental Results
Summary
This paper argues that computer science conferences should require nonrepudiable experimental results to prevent tampering and denial, and introduces K-Veritas, a reference implementation for signed reports without accessing training data.
View Cached Full Text
Cached at: 05/20/26, 10:40 PM
Paper page - Computer Science Conferences Should Require Nonrepudiable Experimental Results
Source: https://huggingface.co/papers/2605.08586 Published on May 9
·
Submitted byhttps://huggingface.co/Mamadou2727
MKLDon May 20
Abstract
Thispositionpaperarguesthatcomputerscienceconferencesshouldrequiretamper-evident,nonrepudiableattestationsofexperimentalresults.Wenametheunderlyingproblemexperimentnonrepudiation:acompliantprotocolmustbindthenumbersinapapertoanactualexecutedcomputationinawaytheauthorcannotlateralterordeny.Thecurrentsystemreliesonself-reportedchecklists,optionalcodesharing,andauthor-controlledlogging.Noneofthesemechanismsanswerthequestionareviewercannotcheck:didthecodethepaperdescribesproducethenumbersthepaperreports?Wedefinetheproblemformally,statethesecuritypropertiesanycompliantprotocolmustsatisfy,anddescribeathreatmodelthatincludesattackscurrentapproachesdonotprevent.Toshowthattheproblemissolvable,webuiltK-Veritas,areferenceimplementationinGothatproducessignedreportswithoutaccessingtrainingdata.K-Veritasisatestbed,notafinishedanswer.Wecallonconferencesandthecommunitytotreatnonrepudiationasafirst-classrequirementandtohelpbuildanopen,independentstandardforit.
View arXiv pageView PDFProject pageAdd to collection
Get this paper in your agent:
hf papers read 2605\.08586
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2605.08586 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2605.08586 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2605.08586 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
OpenComputer: Verifiable Software Worlds for Computer-Use Agents
OpenComputer presents a framework for creating verifiable software environments for computer-use agents, integrating state verifiers, self-improving verification layers, task synthesis, and evaluation systems across 33 desktop applications. Experiments show its verifiers align better with human judgment than LLM-as-judge, and frontier agents struggle with end-to-end completion.
How did software get so reliable without proof? (1996)
This 1996 paper explores the reasons behind the increasing reliability of software despite the lack of formal proofs, discussing informal methods and engineering practices.
Kettle: Attested builds for verifiable software provenance
This paper introduces Kettle, an attested build system that generates cryptographically verifiable software provenance using Trusted Execution Environments (TEEs). It aims to eliminate the build infrastructure and operators from the trust surface by binding provenance documents directly to hardware-signed attestation reports.
ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence
ScientistOne introduces Chain-of-Evidence, a verifiability framework for autonomous research agents that ensures every claim is traceable to evidence, achieving zero hallucinated references, perfect score verification, and the highest method-code alignment across 75 papers while matching or exceeding human expert performance on frontier research tasks.
When Should an AI Scientist Stop? Verifiable Experiment Steering and Refusal for Autonomous Discovery
This paper introduces Cartograph, a verification layer for AI scientists that couples subspace experiment steering, ambiguity resolution, and library inadequacy detection. The framework outperforms baselines in autonomous discovery testbeds and retrospectively flags inconclusive claims in the A-Lab materials system.