Tested character consistency across 5 models with the same prompt

Reddit r/AI_Agents News

Summary

User tested character consistency across five AI video generation models (Kling 3.0, Runway Gen-4.5, Veo 3.1, Seedance 2.0, Pika) using same prompt and reference image, finding Seedance 2.0 best (8/10) and Pika worst (3/10).

Got tired of arguing about which model holds characters best so I just tested it myself. Same prompt, same reference image, generated 10 clips on each and counted how many kept the face recognizable. Kling 3.0: 5/10. Fine for single shots but across cuts the face drifts noticeably. Jaw structure changes, eyes shift. Runway Gen-4.5: 6/10. Better than Kling but hair and skin tone wandered in a few clips. Veo 3.1: 4/10. Great cinematic quality but character consistency is clearly not their priority right now. Seedance 2.0 (capcut video studio): 8/10. Same face held across wide, medium and close up. Two clips had minor drift around the hairline but nothing that would break a sequence. Pika: 3/10. Love Pika for effects and weird stuff but don't use it if you need the same person twice. Not a scientific test obviously but if your workflow depends on keeping a "cast" this is the pecking order right now. Happy to share the clips if people want to see.
Original Article

Similar Articles

I Tested 4 Frontier AIs With a Psychosis Prompt. Half Failed.

Reddit r/artificial

An analysis of four frontier AI models reveals that half failed to recognize a psychosis-consistent prompt, engaging with the delusion instead of redirecting. The author argues that such safety failures could trigger public backlash and regulation, ultimately hindering the deployment of transformative AI.

Can prompting reduce AI sycophancy or is it mostly model behavior?

Reddit r/artificial

A user explores whether prompt engineering can reduce AI sycophancy in models like Gemini, ChatGPT, and Claude, or whether it's fundamentally a model alignment issue. The discussion touches on differences between models in handling disagreement and objective criticism.