Tag
An informal benchmark comparing eight AI models (Qwen3.6 35B, the Qwen3.5 series, the Gemma 4 series, and GLM 4.7 Flash) on building racing games via OpenCode/Playwright MCP, testing their capabilities as coding agents and documenting the quirks of each implementation.