@charles_irl: When You were wrapping OpenAI, I studied 𝕋𝕙𝕖 β„‚π•Œπ”»π”Έ When you were having VC chats, I mastered 𝔗π”₯𝔒 ℑ𝔫𝔣𝔒𝔯𝔒𝔫�…

X AI KOLs Following Tools

Summary

Modal Jazz is a complete open AI stack using Modal, DeepSeek V4 Pro, and SGLang for self-hosted language model inference, with frontends like OpenCode, OpenClaw, and Vercel AI SDK.

When You were wrapping OpenAI, I studied 𝕋𝕙𝕖 β„‚π•Œπ”»π”Έ When you were having VC chats, I mastered 𝔗π”₯𝔒 ℑ𝔫𝔣𝔒𝔯𝔒𝔫𝔠𝔒 𝔖𝔒𝔯𝔳𝔒𝔯 And now that open source AI is here, you have the audacity to come to me for help? Good! I wrote up some docs. https://t.co/95XSG31K0P
Original Article
View Cached Full Text

Cached at: 06/18/26, 06:09 PM

When You were wrapping OpenAI, I studied 𝕋𝕙𝕖 β„‚π•Œπ”»π”Έ

When you were having VC chats, I mastered 𝔗π”₯𝔒 ℑ𝔫𝔣𝔒𝔯𝔒𝔫𝔠𝔒 𝔖𝔒𝔯𝔳𝔒𝔯

And now that open source AI is here, you have the audacity to come to me for help?

Good! I wrote up some docs.

https://t.co/95XSG31K0P


modal-projects/modal-jazz

Source: https://github.com/modal-projects/modal-jazz

🎷 Modal Jazz

The spirit of jazz is the spirit of openness.

β€” Herbie Hancock, on software licensing

I’ll play it first and tell you what it is later.

β€” Miles Davis, on vibe-coding

This repository collects together a complete β€œopen AI stack” – everything you need to run a smart language model and the interfaces that help it complete useful tasks. It uses Modal.

Open Language Modeling Backend

The language model is DeepSeek’s V4 Pro.

It is run using:

  • Nvidia B200 GPUs
  • The Modal cloud deployment platform (project sponsor)
  • The SGLang inference server
  • The OpenAI-compatible API interface (based on /chat/completions).

To speed up the model weight downloading process, you’ll need to add a Hugging Face access token stored as a Modal Secret.

For a single user, this achieves >150 tok/s output.

Open Frontends - /frontends

Agentic Coding TUI + WebUI - OpenCode

OpenCode is a terminal user interface for connecting human users, language models, and computer terminals, akin to Anthropic’s Claude Code but with broader LLM API support.

We provide instructions for integrating the self-hosted LLM with OpenCode and for deploying OpenCode servers on Modal here

Agentic Assistant - OpenClaw

OpenClaw is an agentic assistant system designed for maximum integrability.

We provide instructions for integrating the self-hosted LLM with OpenClaw here.

Chat Web UI - AI SDK

The Vercel AI SDK offers both Core and UI sub SDKs for integrating JavaScript applications with LLMs.

We demonstrate a simple integration of this stack with the self-hosted LLM – both a β€œhello world”-level integration with a NodeJS CLI here and a proper NextJS app here.

It is deployed here.

Chat CLI - llm

We like the llm CLI tool from Simon Willison for running quick LLM queries from the terminal.

It offers integration with OpenAI-compatible API providers, like our self-hosted LLM, via the same interface as OpenAI’s models. Docs are here.

We demonstrate a small plugin in llm_show_reasoning that prints the LLM’s reasoning output – not available from OpenAI reasoning models, but available for open models. This reduces apparent latency.

Similar Articles

@SigGravitas: https://x.com/SigGravitas/status/2061554698285404289

X AI KOLs Following

Microsoft Build 2025 will host an Open Source Zone featuring four open source AI projects: OpenClaw, AutoGPT, Open WebUI, and prompts.chat, highlighting their roles in personal agents, autonomous agents, local AI platforms, and prompt engineering.