data-annotation

#data-annotation

The Indian workers training AI robots to take their jobs | AFP

Reddit r/ArtificialInteligence ↗ · 2026-06-11 Cached

Indian workers, paid 250 rupees per hour, strap phones to their heads to record themselves doing household chores, providing training data for AI robots. They film over 90 different scenes and angles of actions every day, highlighting the labor issues behind AI training.

0 favorites 0 likes

#data-annotation

@StartupArchive_: Alexandr Wang on why Paul Graham’s “Schlep Blindness” essay was seminal for Scale AI “One of the secrets to Scale AI — …

X AI KOLs Following ↗ · 2026-05-20 Cached

Scale AI CEO Alexandr Wang shares how Paul Graham's 'Schlep Blindness' essay inspired the company's focus on solving the unglamorous but critical problem of building high-quality data sets for machine learning.

0 favorites 0 likes

#data-annotation

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

arXiv cs.CL ↗ · 2026-05-11 Cached

This paper presents the construction of a Korean evaluation-annotated corpus (EVAD) for fine-grained aspect-based sentiment analysis in e-commerce reviews using Semi-Automatic Symbolic Propagation. It evaluates KoBERT and KcBERT models on the dataset, achieving high F1 scores in aspect-value pair recognition.

0 favorites 0 likes

#data-annotation

Understanding Annotator Safety Policy with Interpretability

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces Annotator Policy Models (APMs) by Apple, which use interpretability techniques to infer annotators' internal safety policies from their labeling behavior without requiring additional annotation effort. The authors demonstrate that APMs can accurately model these policies and distinguish between sources of annotation disagreement, such as operational failures, policy ambiguity, and value pluralism.

0 favorites 0 likes

#data-annotation

Who and What? Using Linguistic Features and Annotator Characteristics to Analyze Annotation Variation

arXiv cs.CL ↗ · 2026-05-08 Cached

This paper presents a large-scale analysis of four harmful language detection datasets, examining how annotator characteristics and linguistic features interact to influence annotation variation. It highlights intersectional effects and warns against generalizing findings across different datasets.

0 favorites 0 likes

#data-annotation

Tendem by Toloka

Product Hunt ↗ · 2026-05-06

Tendem by Toloka is a platform that connects AI developers with human experts for data annotation and training.

0 favorites 0 likes

data-annotation

The Indian workers training AI robots to take their jobs | AFP

@StartupArchive_: Alexandr Wang on why Paul Graham’s “Schlep Blindness” essay was seminal for Scale AI “One of the secrets to Scale AI — …

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

Understanding Annotator Safety Policy with Interpretability

Who and What? Using Linguistic Features and Annotator Characteristics to Analyze Annotation Variation

Tendem by Toloka

Submit Feedback