@MaximeRivest: Tool calling in open source LLMs is wildly different from one model to another. I just wipped up: http://chattemplatepl…
Summary
A new web tool, Chat Template Playground, lets users visualize how different open-source LLMs render their chat templates, highlighting differences in prompting and tokenization.
View Cached Full Text
Cached at: 06/03/26, 05:53 PM
Tool calling in open source LLMs is wildly different from one model to another.
I just wipped up: http://chattemplateplayground.com to help me vizualize how models on huggingface renders their chat
Same messages very different prompt and tokenization!
Try it and contrast gemma, with qwen, with gpt-oss, with kimi-2.6, with glm-5.1
Chat Template Playground
Source: https://chattemplateplayground.com/ Some models are gated (e.g. Meta Llama 3) or hosted in private repositories. Provide a Hugging Face User Access Token (Read) to fetch their templates directly.
Hugging Face API Token
This token is stored locally in your browser’s local storage and is sent only to the official Hugging Face API (huggingface.co).
Similar Articles
Markdown browser for LLMs
The author introduces TextWeb, an open-source tool that renders web pages as markdown for LLMs instead of using expensive vision models, featuring CLI and MCP server support.
froggeric/Qwen-Fixed-Chat-Templates
This repository provides fixed Jinja chat templates for Qwen 3.5 and 3.6, addressing rendering errors, token waste, and missing features in the official templates for engines like LM Studio and llama.cpp.
@OpenAIDevs: Introducing Euphony, an open-source tool for visualizing chat data and Codex session logs. Paste in a public URL or upl…
OpenAI released Euphony, an open-source tool that visualizes chat data and Codex session logs with filtering and editing features.
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
Chat2Workflow introduces a benchmark and agentic framework for generating executable visual workflows from natural language, showing current LLMs struggle with industrial-grade automation despite intent capture.
@royvanrijn: For curious developers I built "The Anatomy of an LLM", an interactive explainer showing how text becomes tokens, vecto…
An interactive visual guide that explains how large language models work, from tokenization through attention, transformer blocks, and text generation, built by Roy van Rijn.