Gemma 4 12B first coding agent test on a 4080 Super
Summary
A user tested Gemma 4 12B as a coding agent in VSCodium using the Pi Agent extension. The task was to create a Python script that reads logs and outputs JSON. The model handled the tool use autonomously and completed the task with zero bugs.
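The script the model produced is not reproduced here. For readers who want a feel for the kind of task involved, the following is a minimal sketch of a log-to-JSON script. The log line format (timestamp, level, message) and the names `LINE_RE`, `parse_log_lines`, and `logs_to_json` are assumptions made for illustration and are not taken from the original test.

```python
import json
import re

# Assumed (hypothetical) log format: "YYYY-MM-DD HH:MM:SS LEVEL message".
# The log format used in the original test is not known.
LINE_RE = re.compile(
    r"^(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) "
    r"(?P<level>[A-Z]+) "
    r"(?P<message>.*)$"
)


def parse_log_lines(lines):
    """Return one dict per line that matches the assumed format."""
    records = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:  # lines that do not match are skipped
            records.append(match.groupdict())
    return records


def logs_to_json(lines):
    """Serialize the parsed records as a JSON array."""
    return json.dumps(parse_log_lines(lines), indent=2)


if __name__ == "__main__":
    sample = [
        "2024-01-01 12:00:00 INFO service started",
        "2024-01-01 12:00:05 ERROR disk full",
        "not a log line",
    ]
    print(logs_to_json(sample))
```

In an agent setting, the model would typically write a file like this through its file-editing tool and then run it through a shell tool to check the output. The sketch covers only the script itself.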
Similar Articles
Gemma 4 2B handling structured JSON output + tool calling + reasoning traces correctly via Spring AI / LM Studio — including identifying a real Java bug in code review
A user tested Gemma 4 2B running locally via LM Studio and Spring AI for structured JSON output, tool calling, and reasoning traces. The model correctly identified a Java bug during code review and performed comparably to larger models.
Gemma 4 12B is my new main squeeze
The author shares their experience switching from Qwen 3.6 to Gemma 4 12B (Unsloth Q5_K_XL) for local coding, praising its plug-and-play setup, better syntax accuracy, and manageable VRAM usage despite a slight speed trade-off.
google/gemma-4-26B-A4B-it-assistant
Google DeepMind released Gemma 4 MTP drafters for the Gemma 4 family, enabling significant decoding speedups via speculative decoding while maintaining exact generation quality for low-latency applications.
google/gemma-4-E4B-it-assistant
Google DeepMind releases the Gemma 4 E4B instruction-tuned assistant model, featuring multimodal capabilities, reasoning improvements, and optimized speculative decoding for low-latency on-device applications.
Gemma 4 VLA Demo on Jetson Orin Nano Super
NVIDIA and Hugging Face publish a hands-on demo showing Gemma 4 running as a vision-language-action model entirely on the Jetson Orin Nano Super, using local STT/TTS and webcam input.