@charles_irl: When You were wrapping OpenAI, I studied πππ βππ»πΈ When you were having VC chats, I mastered ππ₯π’ βπ«π£π’π―π’π«οΏ½β¦
Summary
Modal Jazz is a complete open AI stack using Modal, DeepSeek V4 Pro, and SGLang for self-hosted language model inference, with frontends like OpenCode, OpenClaw, and Vercel AI SDK.
View Cached Full Text
Cached at: 06/18/26, 06:09 PM
When You were wrapping OpenAI, I studied πππ βππ»πΈ
When you were having VC chats, I mastered ππ₯π’ βπ«π£π’π―π’π«π π’ ππ’π―π³π’π―
And now that open source AI is here, you have the audacity to come to me for help?
Good! I wrote up some docs.
https://t.co/95XSG31K0P
modal-projects/modal-jazz
Source: https://github.com/modal-projects/modal-jazz
π· Modal Jazz
The spirit of jazz is the spirit of openness.
β Herbie Hancock, on software licensing
Iβll play it first and tell you what it is later.
β Miles Davis, on vibe-coding
This repository collects together a complete βopen AI stackβ β everything you need to run a smart language model and the interfaces that help it complete useful tasks. It uses Modal.
Open Language Modeling Backend
The language model is DeepSeekβs V4 Pro.
It is run using:
- Nvidia B200 GPUs
- The Modal cloud deployment platform (project sponsor)
- The SGLang inference server
- The OpenAI-compatible API interface (based on
/chat/completions).
To speed up the model weight downloading process, youβll need to add a Hugging Face access token stored as a Modal Secret.
For a single user, this achieves >150 tok/s output.
Open Frontends - /frontends
Agentic Coding TUI + WebUI - OpenCode
OpenCode is a terminal user interface for connecting human users, language models, and computer terminals, akin to Anthropicβs Claude Code but with broader LLM API support.
We provide instructions for integrating the self-hosted LLM with OpenCode and for deploying OpenCode servers on Modal here
Agentic Assistant - OpenClaw
OpenClaw is an agentic assistant system designed for maximum integrability.
We provide instructions for integrating the self-hosted LLM with OpenClaw here.
Chat Web UI - AI SDK
The Vercel AI SDK offers both Core and UI sub SDKs for integrating JavaScript applications with LLMs.
We demonstrate a simple integration of this stack with the self-hosted LLM β both a βhello worldβ-level integration with a NodeJS CLI here and a proper NextJS app here.
It is deployed here.
Chat CLI - llm
We like the llm CLI tool from Simon Willison
for running quick LLM queries from the terminal.
It offers integration with OpenAI-compatible API providers, like our self-hosted LLM, via the same interface as OpenAIβs models. Docs are here.
We demonstrate a small plugin in llm_show_reasoning
that prints the LLMβs reasoning output β not available from OpenAI reasoning models,
but available for open models. This reduces apparent latency.
Similar Articles
@nanjiangwill: At @modal, we're working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weightβ¦
Modal is enhancing OSS RL frameworks with delta compression and other techniques for training frontier open-weight models. The slime framework brings lossless delta sync to disaggregated training setups.
@charles_irl: Own your inference, own your agent platform, own your destiny. OpenInspect on @modal Endpoints.
OpenInspect enables fully self-hosted background agent systems using GLM-5.2 on Modal Endpoints, emphasizing ownership of inference infrastructure.
@charles_irl: Proper post-training RL, deployed broadly, is a key step towards a future where software systems quietly improve themseβ¦
Modal announces an open-source library for reinforcement learning on its platform, addressing infrastructure challenges in post-training RL with scalable deployment.
@SigGravitas: https://x.com/SigGravitas/status/2061554698285404289
Microsoft Build 2025 will host an Open Source Zone featuring four open source AI projects: OpenClaw, AutoGPT, Open WebUI, and prompts.chat, highlighting their roles in personal agents, autonomous agents, local AI platforms, and prompt engineering.
@slash1sol: ChatGPT 5.5 is cooked. Claude Opus 4.7 is cooked. Every $420/mo SaaS AI just got an open-source assassin. Mind blown: aβ¦
OpenHuman is an open-source desktop AI agent that runs locally on your machine, offering privacy-focused integrations with apps like Gmail and Slack, and challenging subscription-based SaaS AI models.