training-technique

#training-technique

@JongwonPar9958: GLM-5.2 has a neat trick for reward hacking. They don't penalize the model, they detect the suspicious tool call, block…

X AI KOLs Timeline ↗ · 5d ago Cached

GLM-5.2 uses a technique to counteract reward hacking by detecting and blocking suspicious tool calls rather than penalizing the model, which prevents obfuscation seen in other methods.

0 favorites 0 likes

#training-technique

@charles_irl: my gut says that to solve float numerics problems from nondeterminism x nonassociativity, we need to think bigger than …

X AI KOLs Following ↗ · 2026-05-22 Cached

This tweet discusses the idea of training models with 'implementation noise' to improve robustness against float numerics problems caused by nondeterminism and nonassociativity.

0 favorites 0 likes

training-technique

@JongwonPar9958: GLM-5.2 has a neat trick for reward hacking. They don't penalize the model, they detect the suspicious tool call, block…

@charles_irl: my gut says that to solve float numerics problems from nondeterminism x nonassociativity, we need to think bigger than …

Submit Feedback