translate-gemma

Tag

Cards List
#translate-gemma

Follow-up to my TranslateGemma-12b benchmark post: human reviewers flagged 71% of the segments automated metrics rated clean

Reddit r/LocalLLaMA · 6d ago

A human review of TranslateGemma-12b's translations revealed that 71% of segments rated clean by automated metrics actually contained errors, highlighting significant gaps in metric-only evaluation for multilingual translation quality.

0 favorites 0 likes
← Back to home

Submit Feedback