ollama

Tag

#ollama

@LyalinDotCom: If you're waiting Gemma 4 12b through @ollama, its here: gemma4:12b gemma4:12b-it-q4_K_M gemma4:12b-it-q8_0 gemma4:12b-…

X AI KOLs Timeline · 2026-06-03

Gemma 4 12b models are now available on Ollama, offering various quantized versions for local AI inference.


Built a Tauri v2 desktop chat shell for local LLMs — point it at Ollama / llama.cpp / any OpenAI-compatible endpoint, MIT, ~12 MB binary

Reddit r/LocalLLaMA · 2026-06-03

A developer built a Tauri v2 desktop chat shell for local LLMs that connects to Ollama, llama.cpp, or any OpenAI-compatible endpoint. The project is MIT licensed and ships as a ~12 MB binary.


@aiwithmayank: 10 FREE RESOURCES THAT TURN A BEGINNER INTO AN AI ENGINEER Bookmark this whole list. Follow it in order. This is the pa…

X AI KOLs Timeline · 2026-06-03

A tweet thread curating 10 free resources to learn AI engineering, from Harvard's CS50 AI course to Karpathy's neural networks tutorial, fast.ai, Hugging Face courses, and local tools like Ollama, providing a structured path from beginner to employable skills.


Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks

Reddit r/LocalLLaMA · 2026-06-02

A developer replaced Claude with Qwen3.6-27B in a multi-agent orchestrator for two weeks, finding it viable as a reasoning layer but unreliable for execution due to a 12% tool-call error rate and long-context drift.


Built a lightweight Python framework for local LLM roleplay (Ollama/Phi-3) to stop context drift. Looking for feedback.

Reddit r/AI_Agents · 2026-06-02

A lightweight Python framework for local LLM roleplay using Ollama and Phi-3, featuring context preservation and native streaming to prevent character drift.


I built a local autonomous coding agent with Ollama — fine-tuned soul model, 40-round agentic loop, MiniMax M3 for the heavy lifting

Reddit r/ArtificialInteligence · 2026-06-02

A developer built a local autonomous coding agent using Ollama, combining a fine-tuned personality model (Eve) for conversation and MiniMax M3 for heavy lifting, achieving a 40-round agentic loop with 16 tools and 9/9 tests passing first try.


I built a local-first autonomous coding agent with a cyberpunk soul — Eve Agent V2 Unleashed (open source)

Reddit r/ArtificialInteligence · 2026-05-29

Eve Agent V2 Unleashed is an open-source autonomous coding agent that runs locally via Ollama, featuring a 40-round tool loop, 112 sub-agents, and optional cloud escalation. It can plan, write, test, and verify code with zero hand-holding, with quick start in under 5 minutes.


@oscarmartin: The world of AI is local, I have no doubt about it anymore @_nasch_ getting 87 tok/s with Qwen3.6 27B on a consumer AMD…

X AI KOLs Following · 2026-05-29

A demonstration of how the -ncmoe flag in llama.cpp significantly increases Qwen3.6 inference speed on consumer GPUs, reaching 70 tok/s on an RTX 4070 12GB versus 21 tok/s with Ollama.


I found a way for Ollama users to get better Memory yet cheaper alternatives since OLLAMA now uses GPU usage. True memory that auto updates constantly as an individual or a team setting. HERMES USERS

Reddit r/artificial · 2026-05-26

Atomic Memory is a tool that upgrades Ollama's memory system with per-turn updates, semantic recall, conflict detection, and cheap GPU usage, addressing limitations of Hermes' built-in memory. It uses a small dedicated model to provide efficient and unbounded memory management for individual or team use.


@CoinSh0t: This lawyer made $150,000 selling portable offline AI. It analyzes docs that can’t legally be shown on the web. The who…

X AI KOLs Timeline · 2026-05-26

A lawyer built a $50 portable offline AI device using a Raspberry Pi 5, Ollama, and Phi-3-mini, selling it for $999 to law firms needing secure document analysis. This guide explains how to replicate the setup step-by-step.


Added direct model downloads right from the UI in Anubis OSS - if anyone would help test that would be great

Reddit r/LocalLLaMA · 2026-05-26

Anubis OSS, an Apple Silicon Mac app for benchmarking local LLMs, now supports direct model downloads from the UI via a 'Browse Models' button that pulls from the ollama.com library. The developer is seeking testers to confirm that installation and downloads work.


LiquidAI/LFM2.5-8B-A1B-GGUF

Hugging Face Models Trending · 2026-05-24

LiquidAI releases a GGUF quantized version of their LFM2.5-8B-A1B model, with instructions for use across multiple inference engines.


I built a local GUI for the TradingAgents framework — works with Ollama

Reddit r/LocalLLaMA · 2026-05-24

A developer built a local web GUI for the TradingAgents multi-agent LLM stock analysis framework, supporting various LLM providers and adding features like live pipeline visualization, a report reader, and multi-session chat.


@pritipatelfgoo: Claude Code free forever! This open-source tool free-claude-code proxies traffic to 10+ free/local backends such as DeepSeek, Kimi, NVIDIA NIM, OpenRouter, Ollama local models, achieving zero-cost usage...

X AI KOLs Timeline · 2026-05-23

free-claude-code is an open-source tool that proxies Claude Code's traffic to more than 10 free or local backends, such as DeepSeek, Kimi, and Ollama, so that Claude Code can be used at no cost. It supports CLI, VSCode, JetBrains, and Discord.


Local compression helps

Reddit r/AI_Agents · 2026-05-22

A user shares a tip: use a local llama3.1:8b model served by Ollama to compress conversation context in agent workflows, which reduces latency and token usage compared with sending the full context to hosted providers.


I built a multi-agent network that mutates its own software locally. To stop infinite logic loops, I had to code a digital "suffering" threshold.

Reddit r/artificial · 2026-05-21

The author presents hollow-agentOS, a Dockerized open-source multi-agent system that runs locally and enables agents to autonomously write Python tools, uses a 'suffering score' to prevent infinite logic loops, and employs consensus-driven governance for code modifications.


@Saboo_Shubham_: LOCAL AI coding agent finally make sense with Hermes orchestrator. Hermes routes work through Kanban. SmallCode uses Ol…

X AI KOLs Following · 2026-05-19

A local AI coding agent setup built on the Hermes orchestrator with a Kanban workflow: SmallCode (Ollama on a Mac mini) handles small tasks, Claude/Codex handles heavy work, and everything is tracked on a single Kanban board.


We built a tool that installs frameworks like ComfyUI, Ollama, OpenWebUI etc on any cloud GPU in one command and saves your whole setup between sessions [R]

Reddit r/MachineLearning · 2026-05-19

swm is an open-source tool that simplifies cloud GPU usage by installing frameworks like ComfyUI and Ollama in one command, and automatically saves your entire workspace between sessions, enabling seamless migration across providers.


🧬 flux-genotype: A self-evolving AI kernel that runs on CPU with Ollama — mutates its own architecture

Reddit r/AI_Agents · 2026-05-18

flux-genotype is an open-source AI kernel that orchestrates local LLMs on CPU, allowing self-modification of its architecture via a MetaDesigner module.


@MervinPraison: You can now run OpenAI Codex App 100% free and fully local @ollama just added native Codex support install ollama → pul…

X AI KOLs Timeline · 2026-05-15

Ollama now natively supports Codex, allowing you to run the OpenAI Codex App entirely free and locally without subscriptions, API keys, or data leaving your laptop.
