@exploraX_: https://x.com/exploraX_/status/2058847991264383485

X AI KOLs Timeline News

Summary

A curated list of 100 free open-source GitHub repos across categories like AI tools, self-hosted alternatives, dev essentials, and more, compiled by a content creator on X.

https://t.co/497AxEwvQs
Original Article
View Cached Full Text

Cached at: 05/25/26, 02:55 PM

100 free open-source github repos everyone needs to know about.

**in this article, I’ve curated 100 free open-source github repos worth knowing, the ones builders, creators, and curious people are actually using in 2026. **

hey, I’m m0h. researcher and content creator on X, 5+ years in. I write daily content like this breaking down AI, agents, and the tools shaping the new world, in a way that actually helps you use them.

follow + repost so others can see this, and bookmark it. you’ll want to come back to it.

the 100 repos sit inside 10 themed sections:

  • agentic & AI builder stack — claude code, codex, gemini cli skills, memory, context tools

  • self-hosted SaaS alternatives — kill the monthly bill, own your data

  • AI tools & LLM tooling — beyond agents, the LLM stack everyone uses

  • dev essentials — the boring tools you use weekly without thinking

  • learning, roadmaps & reference — the meta-resources that teach you the field

  • design & frontend — components, icons, frameworks

  • data, scraping & analytics — collection, processing, monitoring

  • security & privacy — secret scanners, hardening tools, password managers

  • web3, crypto & onchain — verified picks for builders and analysts

  • creator & content tools — what’s left for indie makers and content folk

some you’ve heard of, some you haven’t. all verified live. let’s get into it!

agentic coding & AI builder stack

the section that’s grown the most in the last 12 months. all of these turn claude code, codex, gemini cli, or cursor into something more — memory, skills, context, tools.

1. obra/superpowers

agentic skills framework + software development methodology. TDD, debugging, brainstorming, planning — battle-tested skills your AI agent actually uses. installs into claude code, codex cli, gemini cli, cursor, opencode, copilot cli.

  • license: MIT

  • catch: 200k+ stars means rapid changes. pin a version if you want stability for production work.

2. mksglu/context-mode

context window optimization for AI coding agents. sandboxes tool output and ships up to 98% reduction in context usage. 15 platforms supported. if your claude code session keeps running out of tokens, this is the fix.

  • license: MIT

  • catch: primary platform is claude code. other platforms work but get less love.

3. thedotmack/claude-mem

persistent memory across sessions. captures everything your agent does, compresses it with the claude agent SDK, injects relevant context back into future sessions. works with claude code, codex, gemini, opencode.

  • license: apache 2.0

  • catch: there’s a $CMEM solana token “officially embraced by the creator” — purely cosmetic to the tool, but be aware. open core today, pro tier coming later.

4. AgriciDaniel/claude-ads

comprehensive paid-ads audit and optimization skill. 250+ checks across google, meta, youtube, linkedin, tiktok, microsoft, apple ads. weighted scoring, parallel agents, AI creative generation. if you run paid ads, this audit saves hours.

  • license: MIT

  • catch: there’s a private mirror gated behind a paid community (AI marketing hub pro). the public MIT version is fully usable. read which features are which.

5. nowork-studio/toprank

open-source claude code skills for SEO, GEO, google ads, meta ads. covers technical audits, content writing, keyword research, ad management. turns your terminal into a marketing operator loop.

  • license: MIT

  • catch: the google ads + meta ads MCP servers route through notfair.co (the creators’ service). free auth, no API key, but you’re handing token storage to a third party.

6. microsoft/playwright-mcp

microsoft’s official MCP server for browser automation. gives any AI agent full browser control via accessibility trees (not screenshots — deterministic, token-efficient). plug into claude code, cursor, windsurf, codex.

  • license: apache 2.0

  • catch: runs locally. can’t bypass cloudflare bot detection at scale and won’t help with production scraping fleets. perfect for solo agents.

7. danny-avila/LibreChat

open-source chatgpt-style frontend that lets you bring your own keys (openai, anthropic, google, openrouter, ollama, etc.) in one chat UI. plugins, agents, vision, code interpreter, voice. self-host on docker.

  • license: MIT

  • catch: you still pay for the API keys you bring. the UI is free; the inference isn’t.

8. ollama/ollama

run llama, gemma, qwen, mistral, deepseek and 100+ other models locally with one command. official model registry, REST API, gpu acceleration. the default way most people run local LLMs today.

  • license: MIT

  • catch: you need real hardware. a 7B model wants 8GB+ RAM minimum, a 70B model needs serious VRAM. quantized models help but cost quality.

9. heygen-com/hyperframes

write HTML, render video. built specifically for AI agents. inspired by remotion but apache 2.0 instead of source-available. claude code, cursor, codex, gemini CLI can compose videos by writing HTML/CSS/JS.

  • license: apache 2.0

  • catch: uses headless chrome + ffmpeg under the hood. some compositions need git LFS. not a replacement for after effects.

10. anthropics/skills

official claude code skills from anthropic. PDF/DOCX/XLSX/PPTX manipulation, art generation, the reference implementations. read this repo to learn how skill files actually work.

  • license: MIT

  • catch: demonstrates “the official way” — your custom skills might need more flexibility than these patterns allow.

self-hosted SaaS alternatives

own your data, kill the monthly bill. these replace tools you’re probably paying for right now.

11. n8n-io/n8n

workflow automation that competes with zapier and make. 400+ integrations, AI nodes, self-hosted or cloud. if you’ve built anything in zapier, you can rebuild it here for free.

  • license: Sustainable Use License (fair-code, not OSI open source)

  • catch: the sustainable use license restricts use to internal business purposes — you can self-host for yourself or your company, but you can’t wrap it as a competing SaaS product. consulting and support services are allowed.

12. calcom/cal.com

calendly alternative. one-on-ones, group events, round-robin, team booking, stripe payments, routing forms. integrates with google/outlook/apple calendar, zoom, meet, teams.

  • license: AGPL-3.0 (core) + commercial license for /ee/ enterprise modules

  • catch: open core. SAML SSO and some enterprise features live under /ee/ and require a commercial license in production.

13. dani-garcia/vaultwarden

bitwarden-compatible server in rust. works with every official bitwarden client. unlimited users, full TOTP, attachments, organizations, sends. runs on a $5 VPS.

  • license: AGPL-3.0

  • catch: you are now your own password manager backend. if your server dies and you don’t have backups, your vault dies with it. set up automated backups before you trust this with anything.

14. bitwarden/server

the official bitwarden server. if you want first-party support and the corporate-blessed stack, use this instead of vaultwarden. heavier (C#, multiple containers) but officially supported.

  • license: GPL-3.0 (core) — some enterprise components are commercially licensed

  • catch: much heavier resource footprint than vaultwarden. vaultwarden = small. bitwarden/server = real infrastructure.

15. plausible/analytics

google analytics alternative that’s actually privacy-friendly. no cookies, no GDPR popup needed, lightweight script. self-host or use their hosted version.

  • license: AGPL-3.0

  • catch: less feature-dense than GA4. if you need ecommerce attribution, multi-touch funnels, or deep audience segmentation, you’ll outgrow it.

16. immich-app/immich

google photos alternative. self-hosted photo and video backup with mobile apps, facial recognition, object detection, places. iOS + android clients. genuinely the best self-hosted photo manager that exists right now.

  • license: AGPL-3.0

  • catch: still pre-1.0. the team explicitly tells you to keep an additional backup of your photos. don’t trust it as your only copy yet.

17. paperless-ngx/paperless-ngx

self-hosted document management. scans, OCRs, indexes and tags your paper documents into a searchable archive. handles tax records, invoices, contracts, receipts. once you have it, you never lose a document again.

  • license: GPL-3.0

  • catch: initial setup is fiddly (postgres, redis, OCR languages, mail rules). once it’s running, it just works.

18. home-assistant/core

the smart home OS. controls 3000+ device integrations from one local dashboard, no cloud required. zigbee, z-wave, matter, mqtt — all supported.

  • license: apache 2.0

  • catch: the learning curve is real. expect a weekend of YAML and automation tuning before you get anything useful working.

19. advplyr/audiobookshelf

audiobook + podcast server. apple books / audible / overcast alternative. iOS + android apps, web UI, syncs progress across devices, supports m4b/mp3/flac.

  • license: GPL-3.0

  • catch: transcoding audiobooks for some apps eats CPU. expect one-time setup pain for library metadata matching.

20. Stirling-Tools/Stirling-PDF

self-hosted PDF toolbox. merge, split, compress, OCR, convert, watermark, fill forms — all in a clean web UI. replaces ilovepdf, smallpdf, pdf24.

  • license: MIT

  • catch: runs locally so you don’t have to upload sensitive PDFs to random websites. depends on libreoffice + tesseract — your docker image will be ~3GB.

21. yt-dlp/yt-dlp

youtube-dl fork that’s actually maintained. downloads video and audio from youtube and 1500+ other sites. used as a dependency by countless other tools.

  • license: unlicense (public domain)

  • catch: youtube fights it. expect occasional breakage during cat-and-mouse with detection — usually fixed within hours.

22. syncthing/syncthing

peer-to-peer file sync. dropbox without the cloud. encrypts in transit, never touches a third-party server, runs on linux/mac/win/android.

  • license: MPL-2.0

  • catch: no native iOS app (apple’s restrictions). real-time sync depends on both peers being online — no asynchronous cloud cache like dropbox.

23. twentyhq/twenty

open-source salesforce / hubspot CRM alternative. companies, contacts, deals, pipelines, custom objects, full GraphQL API. modern stack (next.js, postgres).

  • license: AGPL-3.0

  • catch: still v1.x — fewer integrations than mature CRMs. don’t migrate a 50-person sales org to it yet. small teams: fine.

24. mfts/papermark

docsend alternative. open-source secure document sharing with built-in analytics and custom domains. shareable links, branding, page-by-page view tracking.

  • license: AGPL-3.0

  • catch: the hosted version has paid tiers; the self-hosted gives you everything. the catch is you have to host it.

25. elie222/inbox-zero

open-source AI personal assistant for gmail. auto-archives, drafts replies, runs rules, cold-email blocking. read by AI, decisions logged.

  • license: AGPL-3.0

  • catch: you give AI write access to your gmail. set the rules conservatively before flipping it to “auto-execute” mode.

26. anyproto/anytype-ts

notion / obsidian alternative. encrypted, local-first, P2P knowledge base. blocks, relations, objects, custom types. mobile + desktop.

  • license: anytype license (custom, not standard OSS)

  • catch: the license is custom, not OSI-approved. read it before assuming you can self-host commercially.

27. karakeep-app/karakeep

self-hosted bookmark manager (formerly hoarder). saves articles, images, videos, twitter threads with full-text search and AI auto-tagging. browser extensions, iOS, android.

  • license: AGPL-3.0

  • catch: AI tagging needs an LLM key (openai, ollama, etc.). free if you use ollama locally; otherwise you pay for inference.

28. presenton/presenton

open-source AI presentation generator. local-first, supports ollama and openai. generates and exports .pptx files. alternative to gamma and beautiful.ai.

  • license: apache 2.0

  • catch: the AI quality depends entirely on the model you wire it to. local llama 3 isn’t going to beat gamma’s polished templates.

29. gitroomhq/postiz-app

social media scheduler covering 14+ platforms (x, linkedin, instagram, threads, bluesky, mastodon, tiktok, youtube, pinterest, facebook, reddit, discord). AI captions, team workspaces, analytics.

  • license: AGPL-3.0

  • catch: heavier stack (postgres + redis + temporal.io). plan ~1GB RAM minimum on your VPS.

30. Moh4696/open-source-vs-saas

(disclosure: this is our own repo) curated list of 10 open-source github repos that genuinely replace specific paid SaaS tools — with the honest “what’s the catch” for each one. think of it as a smaller, deeper version of this section.

  • license: CC0

  • catch: it’s our list. read it as one curator’s opinion, not a community-graded ranking.

AI tools & LLM tooling

beyond the agentic stack, these are the LLM-focused repos that everyone serious about AI ends up using.

31. lobehub/lobe-chat

open-source AI chat framework. supports openai, claude, gemini, deepseek, ollama, openrouter — pick your provider, get a clean UI. plugins, agents, voice, vision, knowledge bases.

  • license: apache 2.0

  • catch: the hosted version has a paid tier. self-hosted is fully featured.

32. open-webui/open-webui

web UI for ollama. clean chatgpt-style interface for your local models. multi-user, RAG, voice, function calling. installs in 30 seconds.

  • license: Open WebUI License (custom, based on BSD-3 with a branding-protection clause as of v0.6.6, April 2025)

  • catch: if you deploy it to 50+ users, you can’t remove the “Open WebUI” branding without an enterprise license or contributor permission. solo/small deployments: no restrictions. anything below v0.6.5 is still BSD-3 and you can fork from there freely.

33. openai/whisper

openai’s open-source speech-to-text. 99 languages, word-level timestamps, runs locally. powers most “AI transcription” SaaS products you’re paying for.

  • license: MIT

  • catch: the original repo is slow on CPU. for production use, switch to SYSTRAN/faster-whisper or ggerganov/whisper.cpp — same model, way faster inference.

34. langchain-ai/langchain

the framework that made agent and RAG patterns popular. python + javascript. document loaders, vector stores, agents, chains.

  • license: MIT

  • catch: infamous churn — APIs break between versions. read the migration notes carefully. some people prefer LlamaIndex, Haystack, or raw SDK calls now.

35. run-llama/llama_index

RAG-focused framework. data ingestion, indexing, retrieval, response synthesis. cleaner abstraction than langchain if you’re doing pure retrieval + generation.

  • license: MIT

  • catch: less community sprawl than langchain — fewer pre-built integrations, but more consistent APIs.

36. comfyanonymous/ComfyUI

node-based stable diffusion UI. the production tool for serious image and video generation workflows. supports SD 1.5, SDXL, SD3, flux, hunyuan, wan, and most major open models.

  • license: GPL-3.0

  • catch: the learning curve is brutal at first. once you get the node graph mental model, nothing else matches its flexibility.

37. AUTOMATIC1111/stable-diffusion-webui

the original stable diffusion web UI. easier than comfy for beginners, has plugin ecosystem, gradio-based.

  • license: AGPL-3.0

  • catch: development pace has slowed significantly compared to 2022-2023. flux and newer models work but with limitations. comfyui is where the cutting-edge model support tends to land first now.

38. ggerganov/llama.cpp

the engine. C/C++ inference for llama, mistral, qwen, gemma, and basically every open LLM. powers ollama, lmstudio, jan, and dozens of other apps.

  • license: MIT

  • catch: moves fast. if you’re building on top of it, pin your versions or expect breakage.

39. mudler/LocalAI

drop-in replacement for openai API — but running locally. point any openai-compatible app at LocalAI and it just works.

  • license: MIT

  • catch: you still need a model and hardware to back it. it’s a router, not a model.

40. langfuse/langfuse

open-source LLM observability. traces, evals, prompt management, datasets. the langsmith alternative everyone moves to when they want to self-host.

  • license: MIT (core), enterprise features under commercial license

  • catch: open core. SSO, RBAC, and some enterprise features are paid.

41. firecrawl/firecrawl

LLM-friendly web crawler. takes any website, returns clean markdown for RAG / agent use. handles javascript, screenshots, pagination. (formerly mendableai/firecrawl — org rebranded; old URLs auto-redirect.)

  • license: AGPL-3.0

  • catch: the hosted version has paid tiers. self-host is full-featured.

42. D4Vinci/Scrapling

adaptive web scraping that handles everything from a single request to a full crawl. parser relocates elements when pages change. fetchers bypass cloudflare turnstile. has an MCP server for AI integration.

  • license: BSD-3-Clause

  • catch: anti-bot bypass is rolling cat-and-mouse — works today, might need updates tomorrow when targets change.

43. YouMind-OpenLab/awesome-gpt-image-2

the largest curated GPT image 2 prompt library — 2000+ prompts with preview images, 16 languages, updated daily. essential if you use openai’s image model seriously.

  • license: CC BY 4.0

  • catch: prompt libraries decay fast. by 2027 a chunk of these will be tuned for an old model version.

44. Moh4696/free-google-ai-tools

(disclosure: our own repo) curated list of 10 free google AI tools (notebookLM, AI studio, gemini CLI, jules, stitch, gemma, illuminate, learn about, labs FX, colab) with the honest “what’s actually free vs paid” for each one.

  • license: CC0

  • catch: our own list. directional, not exhaustive.

dev essentials

the boring repos. tools you use weekly without thinking. nothing flashy — just the foundations.

45. ohmyzsh/ohmyzsh

zsh shell framework. themes, plugins, autocompletion. if you’ve ever copied “install oh-my-zsh” off a stack overflow answer, this is it.

  • license: MIT

  • catch: can slow down shell startup if you pile on plugins. profile your .zshrc if cold-start gets painful.

46. starship/starship

fast cross-shell prompt. shows git status, language versions, kubernetes context, AWS profile — but only when relevant. zero config to get started.

  • license: ISC

  • catch: none. just install it.

47. nvm-sh/nvm

node version manager. switch between node versions per-project with a .nvmrc file. if you do any node work, this is mandatory.

  • license: MIT

  • catch: windows users need nvm-windows (separate project) — nvm-sh doesn’t work natively on windows.

48. pyenv/pyenv

the python equivalent of nvm. install and switch between python versions cleanly.

  • license: MIT

  • catch: doesn’t manage packages — pair with poetry, uv, or pip.

49. astral-sh/uv

the fastest python package manager that exists. drop-in replacement for pip and pip-tools. 10-100x faster. written in rust.

  • license: apache 2.0 + MIT

  • catch: still young. most workflows work, but some niche tools (older legacy packaging stuff) might trip it up.

50. astral-sh/ruff

python linter and formatter, also in rust. replaces flake8, black, isort, pyupgrade — all of them combined, faster than any single one.

  • license: MIT

  • catch: if your team has very custom black/flake8 configs, expect a migration session.

51. junegunn/fzf

the fuzzy finder. ctrl+R for shell history, file search, git branch picking, anything you can pipe. once you have it, you can’t go back.

  • license: MIT

  • catch: none. installs cleanly via every package manager.

52. tldr-pages/tldr

simplified man pages with practical examples. forgot the tar syntax? tldr tar gives you the 5 commands you actually use, no novel.

  • license: CC-BY 4.0 (content) + MIT (clients)

  • catch: none. genuine quality-of-life tool.

53. BurntSushi/ripgrep

faster grep. searches respect .gitignore by default, parses huge codebases in milliseconds. used by every code editor’s “find in project” you like.

  • license: unlicense + MIT

  • catch: none. just install it.

54. sharkdp/bat

cat with syntax highlighting, line numbers, and git integration. once you see a file with bat, you don’t want to use cat again.

  • license: apache 2.0 + MIT

  • catch: none.

  1. sharkdp/fd

faster find. parallel, respects .gitignore, intuitive syntax. another small joy.

  • license: apache 2.0 + MIT

  • catch: none.

learning, roadmaps & reference

the meta-resources. if you’re learning, building toward, or curious about a field — these are the references.

56. Developer-Y/cs-video-courses

76k+ stars. every university-level CS course with video lectures, organized by topic. MIT, stanford, CMU, harvard, berkeley, IIT, ETH zurich. genuinely the best free CS education aggregator on github.

  • license: unspecified (treat as fair-use links)

  • catch: it’s links — some old courses 404 over time. fix-PRs are welcome there.

57. kamranahmedse/developer-roadmap

the visual roadmaps for frontend, backend, devops, AI engineer, blockchain, mobile, qa, and more. 300k+ stars. shows you exactly what to learn in what order.

  • license: various (read each roadmap)

  • catch: roadmaps are opinionated — there’s no single right path. use as a checklist, not a religion.

58. donnemartin/system-design-primer

how to design large-scale systems. interview prep for senior engineering roles. caching, sharding, consistency, queues — all explained with diagrams.

  • license: CC-BY 4.0 + others

  • catch: focused on FAANG-style interview patterns. real-world architecture is messier.

59. EbookFoundation/free-programming-books

500+ programming books, freely available, organized by language and topic. PDFs, online HTML, courses.

  • license: CC-BY 4.0

  • catch: some links rot. PRs fix them fast.

60. TheAlgorithms/Python

every common algorithm implemented in clean python. sorting, graph algorithms, ML basics, ciphers, neural nets from scratch. great for learning.

  • license: MIT

  • catch: these are teaching implementations. don’t ship bubble_sort.py to production.

61. trekhleb/javascript-algorithms

same idea but javascript. plus explanations and complexity analysis for each algo.

  • license: MIT

  • catch: same as above — read, don’t ship.

62. f/awesome-chatgpt-prompts

the original “awesome prompts” repo. 100+ role-prompting examples (“act as a linux terminal”, “act as an interviewer”, etc.). still useful as a baseline.

  • license: CC0

  • catch: the “act as a…” pattern is dated by 2026 standards. modern prompting uses skill files, MCP, and structured instructions. read for ideas, not gospel.

63. dair-ai/Prompt-Engineering-Guide

academic-leaning prompt engineering guide. chain-of-thought, few-shot, RAG, self-consistency. proper citations.

  • license: MIT

  • catch: mostly written for completion-era models. some patterns are less critical with frontier reasoning models.

64. anthropics/courses

official anthropic courses. covers prompt engineering, API fundamentals, real-world prompting, prompt evaluations, tool use. read the jupyter notebooks.

  • license: various

  • catch: anthropic-specific. great patterns transfer, but expect claude-shaped examples.

65. binhnguyennus/awesome-scalability

the bible for systems engineers. case studies from netflix, uber, airbnb, stripe — how they actually scaled. patterns + war stories.

  • license: CC-BY 4.0

  • catch: none. essential reading if you want to design beyond a single server.

design & frontend

design assets, component libraries, UI kits. some you’ll fork into your projects, some you’ll use directly.

66. penpot/penpot

figma alternative. open-source design and prototyping platform. native design tokens, CSS grid layout, components, variants. browser-based or self-hosted.

  • license: MPL-2.0

  • catch: smaller plugin ecosystem than figma. still a serious tool, especially if you self-host.

67. shadcn-ui/ui

the component library that took over react in 2024-2025. copy-paste components built on radix + tailwind. you own the code, no npm install hell.

  • license: MIT

  • catch: “you own the code” means you also maintain it. dependencies still need updates.

68. tailwindlabs/tailwindcss

utility-first CSS framework. love it or hate it — half the web is built on it now. v4.0 dropped the postCSS dependency.

  • license: MIT

  • catch: the only valid one ever: HTML gets noisy. component extraction (or just shadcn/ui) solves it.

69. tailwindlabs/heroicons

the icon set that tailwind ships with. 300+ MIT icons, react + vue components, SVG.

  • license: MIT

  • catch: opinionated style. mix carefully with other icon sets.

70. lucide-icons/lucide

the icon set everyone migrates to. fork of feather icons with 1000+ additional icons. react, vue, svelte, angular, flutter — every framework has a binding.

  • license: ISC

  • catch: none.

71. radix-ui/primitives

unstyled, accessible component primitives for react. what shadcn/ui is built on. dialog, popover, dropdown, accordion, etc. — the things that are hard to get right.

  • license: MIT

  • catch: unstyled by design. you bring your own CSS — that’s the point.

72. magicuidesign/magicui

animated react components. landing-page hero sections, marquees, particles, beams. ready-to-copy.

  • license: MIT

  • catch: heavy on animation libraries. don’t ship all of them or your bundle size explodes.

73. unjs/h3

minimal HTTP framework built for portability — runs in node, bun, deno, and edge runtimes (cloudflare workers, vercel edge). it’s the HTTP layer inside nitro, which powers nuxt.

  • license: MIT

  • catch: smaller ecosystem than express. compatible with connect/express middleware but not every middleware works out of the box.

74. withastro/astro

static-first web framework with island architecture. fast by default, ships zero JS to the browser unless you ask for it. great for content sites.

  • license: MIT

  • catch: if you need a heavy app-style SPA, use next or remix instead. astro shines on content + light interactivity.

75. vercel/next.js

react meta-framework. server components, app router, edge runtime. powers a huge chunk of modern web apps.

  • license: MIT

  • catch: moves fast. major-version migrations have caused pain (app router, server actions). pin your version and read the upgrade guides.

data, scraping & analytics

handling data — collection, processing, visualization, monitoring.

76. apache/superset

open-source BI platform. dashboards, charts, SQL lab. replaces tableau and looker for many teams.

  • license: apache 2.0

  • catch: the setup is real. python, redis, postgres, celery — full infra. once it’s running, it’s solid.

77. metabase/metabase

easier BI tool. clean UI, less infra, faster onboarding than superset. for teams that want dashboards without a data engineer.

  • license: AGPL-3.0 (core) + commercial enterprise features

  • catch: open core. some enterprise features (SSO, sandboxing) require paid plans.

78. duckdb/duckdb

in-process SQL OLAP database. blazing fast on local files (parquet, csv, json). pandas killer for tabular analytics.

  • license: MIT

  • catch: in-process means single-machine. if your data is bigger than one machine, you’ll outgrow it.

79. pola-rs/polars

the pandas alternative. written in rust, lazy evaluation, parallel by default. 10-100x faster on big dataframes.

  • license: MIT

  • catch: different API from pandas. expect a learning bump if you’re a heavy pandas user.

80. PostHog/posthog

open-source product analytics. session replays, feature flags, experimentation, surveys, A/B tests. replaces amplitude, mixpanel, hotjar combined.

  • license: MIT (with cloud features under separate license)

  • catch: the self-hosted version is heavy (kafka, clickhouse, redis). small teams use the hosted free tier (1M events/mo).

81. grafana/grafana

the dashboard for everything observable. metrics, logs, traces. if you’ve ever seen a server metrics dashboard, you’ve probably seen grafana.

  • license: AGPL-3.0 (since 2021)

  • catch: the license switch in 2021 was controversial. fine for self-hosting, but be aware of the AGPL implications.

82. prometheus/prometheus

time-series metrics database. pairs with grafana for dashboards. the de-facto open-source monitoring stack.

  • license: apache 2.0

  • catch: pull-based by design — your services need to expose /metrics endpoints. push-based monitoring needs the pushgateway sidecar.

83. apache/airflow

workflow orchestration platform. data pipelines as code (python DAGs). standard at most data teams.

  • license: apache 2.0

  • catch: the operations overhead is real. simpler workflow tools (prefect, dagster) exist for smaller teams.

84. Fincept-Corporation/FinceptTerminal

open-source bloomberg terminal alternative. 50+ screens for equity, portfolio, derivatives, fixed income, alternatives. native C++20 + qt6, embedded python.

  • license: Dual: AGPL-3.0 (open source) + Fincept Commercial License

  • catch: commercial use requires a paid commercial license. personal use, learning, and academic research stay free. read the license clause carefully if you’re a business — including for forks that replace fincept’s data sources with your own.

85. HKUDS/Vibe-Trading

natural-language finance research agent. 7 backtest engines, 75 specialist skills, 29 multi-agent presets. exposes 22 MCP tools to claude desktop / cursor.

  • license: MIT

  • catch: research infrastructure, not financial advice. the multi-agent swarm tool needs an LLM API key. and never blow up your portfolio on a backtest.

security & privacy

password managers, secret scanners, hardening tools.

86. trufflesecurity/trufflehog

finds secrets that leaked into git history. AWS keys, API tokens, private keys — over 800 secret types detected. essential pre-commit.

  • license: AGPL-3.0

  • catch: false positives happen. tune the regex packs for your codebase.

87. gitleaks/gitleaks

similar to trufflehog. detects secrets in git history. lighter, less complete, faster.

  • license: MIT

  • catch: less thorough than trufflehog. pair them, don’t pick.

88. OWASP/CheatSheetSeries

the OWASP cheat sheets. authentication, session management, SQL injection prevention, XSS, secure coding. read this before you launch a public API.

  • license: CC-BY 4.0

  • catch: none — just read.

89. anchore/syft

software bill of materials (SBOM) generator. lists every package in your container or binary. mandatory if you ship software to enterprises that ask for SBOMs.

  • license: apache 2.0

  • catch: none.

90. aquasecurity/trivy

all-in-one security scanner. containers, filesystem, git repos, kubernetes. finds vulnerabilities, misconfigs, secrets. industry standard.

  • license: apache 2.0

  • catch: false positive triage takes time. set baselines so devs don’t drown in noise.

web3, crypto & onchain

verified picks for builders, traders, and analysts. open-source, no paid gating in the core.

91. ethereum/go-ethereum

the reference ethereum client (geth). if you run an ethereum node, you probably run geth.

  • license: GPL-3.0 / LGPL-3.0

  • catch: running a mainnet node needs ~1.5TB+ SSD and 24/7 uptime. not for casual experimentation.

92. foundry-rs/foundry

the smart contract development toolkit that replaced hardhat for most serious solidity devs. fast, written in rust, scriptable with solidity.

  • license: apache 2.0 + MIT

  • catch: if you started on hardhat, expect a few hours to relearn the tooling. worth it.

93. OpenZeppelin/openzeppelin-contracts

the audited smart contract library. ERC-20, ERC-721, ERC-1155, governance, access control, upgrades. every reputable contract pulls from here.

  • license: MIT

  • catch: none — but pin your version. major releases between v4 and v5 changed APIs significantly.

94. duneanalytics/spellbook

the open dune analytics SQL spellbook. every public query, every spell, every dashboard contributed by the community. learn dune by reading other people’s queries.

  • license: apache 2.0

  • catch: the SQL is dialect-specific (dune SQL / DuneSQL v2). docs help. expect a learning curve.

95. crytic/slither

solidity static analyzer. finds common smart contract bugs before audit. used by every serious security firm.

  • license: AGPL-3.0

  • catch: false positives — tune your detectors. doesn’t replace a manual audit for production-bound code.

creator & content tools

the leftovers — repos that don’t fit cleanly elsewhere but matter to creators, content builders, and indie hackers.

96. AntonOsika/gpt-engineer

codebase scaffolding from a single prompt. less relevant since claude code and codex exist, but historically important.

  • license: MIT

  • catch: mostly historical interest now. use claude code or codex for real work.

97. firecrawl/firecrawl-mcp-server

the firecrawl MCP. drop it into claude desktop or cursor and your agent gains web research powers. (formerly under mendableai/, the org rebranded to firecrawl/ — old URLs auto-redirect.)

  • license: MIT

  • catch: firecrawl’s API key. free tier exists, heavy use bills.

98. browser-use/browser-use

let your AI agent control a real browser. python-first, works with playwright. essentially the open-source version of what perplexity-comet does.

  • license: MIT

  • catch: runs locally — same trade-off as playwright-MCP. local = fast and free, but no IP rotation or anti-detection at scale.

99. Anil-matcha/Open-Higgsfield-AI

open-source AI image and cinema studio. 20+ models (flux, SDXL, midjourney via proxy, ideogram). self-host, MIT.

  • license: MIT

  • catch: requires a muapi.ai API key to actually generate anything — that’s a paid third-party gateway. the wrapper is free; the inference isn’t.

100. obra/superpowers-marketplace

the marketplace where claude code skills live. browse, install, fork. if you’re going to use anything from this section, start here.

  • license: MIT

  • catch: as a marketplace, quality varies. read each skill’s repo before installing into your agent.

a few honest notes before you go

on licenses. “open-source” covers a lot. MIT lets you fork and ship commercial products freely. AGPL says if you serve it over a network, you have to share your modifications. apache adds patent grants. unlicense / CC0 are basically public domain. read the license on anything you fork.

on maintenance status. software rots. by the time you read this, some of the picks above may be in maintenance mode (or worse). check the “last commit” date before adopting anything new.

on stars. stars are an unreliable signal of quality — they show attention, not value. a 1k-star tool you use every day beats a 50k-star tool nobody actually uses.

on the “free” part. every tool here is free to start using. some have a hosted SaaS upsell. some have enterprise-feature paywalls. that’s not a contradiction with “open-source” — it’s just the dominant funding model now. if a free tier covers your needs, no guilt — that’s how open core is supposed to work.

100 repos is a lot. nobody uses all of them. pick the 5 from your category, star them, install two this week. that’s the real win.

found a repo that belongs here? reply @exploraX_ — open to swapping in better picks. honest “what’s the catch” reasoning required.

Similar Articles

@neil_xbt: https://x.com/neil_xbt/status/2056386359376396458

X AI KOLs Timeline

A comprehensive curated list of 100 repositories across 11 categories that extend Claude Code's capabilities, from awesome lists to memory systems and orchestrators, enabling developers to build a system rather than just use a tool.