Visual AI is undergoing a major shift: top tools no longer generate final outputs directly, but instead generate the source code behind them. a16z partner Yoko Li has published an in-depth analysis of this trend.
The article argues that the next frontier of visual AI is generating code (e.g., SVG, HTML/CSS, React components) instead of raw pixels, enabling editability, iteration, and integration into professional design and development workflows.
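To make the editability argument concrete, here is a minimal sketch in Python. It is not taken from the article: `make_badge` is a hypothetical stand-in for a model that emits SVG source, and `recolor` is an illustrative helper. The point is only that, when the output is code, a later change is a one-attribute edit rather than a full regeneration of pixels.

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"
# Serialize SVG elements without an "ns0:" prefix.
ET.register_namespace("", SVG_NS)


def make_badge(label: str, fill: str = "#2563eb") -> str:
    """Return a small SVG badge as source text.

    Hypothetical stand-in for model output: a real tool would generate
    this markup, here it is built by hand for illustration.
    """
    svg = ET.Element(
        f"{{{SVG_NS}}}svg",
        {"width": "160", "height": "48", "viewBox": "0 0 160 48"},
    )
    ET.SubElement(
        svg,
        f"{{{SVG_NS}}}rect",
        {"id": "bg", "width": "160", "height": "48", "rx": "8", "fill": fill},
    )
    text = ET.SubElement(
        svg,
        f"{{{SVG_NS}}}text",
        {"id": "label", "x": "80", "y": "30",
         "text-anchor": "middle", "fill": "#ffffff"},
    )
    text.text = label
    return ET.tostring(svg, encoding="unicode")


def recolor(svg_source: str, element_id: str, new_fill: str) -> str:
    """Change the fill of one element by id, leaving everything else intact."""
    root = ET.fromstring(svg_source)
    for element in root.iter():
        if element.get("id") == element_id:
            element.set("fill", new_fill)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    original = make_badge("Beta")
    edited = recolor(original, "bg", "#16a34a")  # targeted edit, no regeneration
    print(edited)
```

The same edit on a raster image would require re-prompting or manual retouching; with source output, the label text, layout, and every other attribute are untouched by the color change.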
Martin Scorsese joins Black Forest Labs as an advisor and uses FLUX for storyboarding, showcasing AI's role in visual creativity.
The EyeBench-V3 visual benchmark evaluates Claude Opus 4.8 and finds that it still fails basic vision tasks, similar to IBench. Adonis Singh introduced the benchmark in a Twitter thread.
OpenAI released ChatGPT Images 2.0, claiming a leap comparable to going from GPT-3 to GPT-5. Simon Willison benchmarked it against gpt-image-1 and Google's Nano Banana 2 and Nano Banana Pro using a "Where's Waldo"-style prompt featuring a raccoon with a ham radio; the results on the hide-and-seek task were mixed.
ChatGPT Images 2.0 in "Thinking" mode can turn 1,000-word prompts or 70-page PDFs into ready-to-use infographics, slide decks, and academic posters without manual editing.