measurement

#measurement

Linux latency measurements and compositor tuning

Lobsters Hottest ↗ · 4d ago Cached

A detailed investigation of Linux latency in gaming using a Teensy-based LDAT tool, measuring click-to-photon latency with various settings on Nvidia GPUs under KDE Wayland, comparing to Windows.

0 favorites 0 likes

#measurement

Large-scale semantic mapping of learner agency and autonomy reveals what measurement and generative AI research overlook

arXiv cs.AI ↗ · 5d ago Cached

This paper uses large-scale semantic analysis of over 14,000 publications to map definitions of learner agency and autonomy, revealing three dimensions and a systematic underrepresentation of the sociocultural dimension in existing scales. It argues that current generative AI research in education overly focuses on learning regulation, narrowing the behavioral repertoire for AI-mediated learning environments.

0 favorites 0 likes

#measurement

@saranormous: https://x.com/saranormous/status/2064510215056400652

X AI KOLs Following ↗ · 5d ago Cached

Despite rapid advances in AI coding agents like Devin, which have dramatically increased code writing and shipping, the article argues that the most valuable aspects of software engineering remain illegible to benchmarks and require human judgement and organizational coordination that cannot be easily automated.

0 favorites 0 likes

#measurement

The AI Epistemic Deference Index: A Continuous Measure of Sycophancy

arXiv cs.AI ↗ · 6d ago Cached

The paper introduces the AI Epistemic Deference Index (AEDI), a continuous measure of how much a model's expressed support for a factual claim shifts based on the user's stated attitude, and evaluates eight prominent models, finding substantial sycophancy with differences across providers.

0 favorites 0 likes

#measurement

PReMISE: Policy Rubrics as Measurement Specifications for LLM Judges

arXiv cs.AI ↗ · 2026-06-01 Cached

Introduces PReMISE, a framework for discovering and auditing policy-level rubrics for LLM judges along four axes: structural adequacy, reliability, preference fit, and adversarial robustness.

0 favorites 0 likes

#measurement

AI evaluation may bias perceptions: The importance of context in interpreting academic writing

arXiv cs.CL ↗ · 2026-05-27 Cached

This paper examines how estimates of AI use in scientific writing can be biased when evaluation methods ignore contextual differences across countries and fields, and proposes context-aware benchmarks for more accurate measurement.

0 favorites 0 likes

#measurement

Our voice agent's p99 was 280ms. Competitor's was 450ms. Users said ours felt slower. We measured why.

Reddit r/AI_Agents ↗ · 2026-05-26

A voice agent team found that despite lower end-to-end latency (280ms vs competitor's 450ms), users perceived it as slower due to poor barge-in interrupt rate (380ms vs 60ms). They identified three fixes—memory pinning, VAD threshold tuning, and smaller TTS chunks—that improved barge-in rate from 41% to 89% at 100ms, making users feel it's faster.

0 favorites 0 likes

#measurement

Screen Ruler

Product Hunt ↗ · 2026-05-23

Screen Ruler is a tool that provides on-screen measurements for designers and developers.

0 favorites 0 likes

#measurement

AI proficiency is becoming a hiring requirement but we still have no real way to measure it

Reddit r/ArtificialInteligence ↗ · 2026-05-22

The author explores the difficulty of accurately measuring AI proficiency in hiring, arguing that current certifications and tests focus on memorization rather than practical reasoning and evaluation.

0 favorites 0 likes

#measurement

All the Fancy Measuring Devices Used in Science Rely on Two Stone-Age Techniques

Wired ↗ · 2026-05-22 Cached

The article argues that despite modern scientific instruments, all measurements ultimately derive from two ancient techniques: comparison and counting, illustrated through examples like rulers and sundials.

0 favorites 0 likes

#measurement

Points are a weird and inconsistent unit of measure

Lobsters Hottest ↗ · 2026-05-13 Cached

A technical deep dive into the historical inconsistency of the typographic point unit, explaining why TeX (72.27 pt/inch) and Inkscape (72 pt/inch) use different definitions, rooted in 19th-century standardization and Donald Knuth's pragmatic adjustment.

0 favorites 0 likes

measurement

Submit Feedback