Lemonade v10.8: auto memory management, cloud offload, Omni improvements, and call your local models as MCP tools

Reddit r/LocalLLaMA Tools

Summary

Lemonade v10.8 introduces auto memory management, cloud offload, improvements to Omni, and the ability to call local AI models as MCP tools.

No content available
Original Article

Similar Articles

Lemonade v10.7 release and project organization update

Reddit r/LocalLLaMA

Lemonade v10.7 release introduces LMX-Omni virtual models for omni-modal chat, a bench CLI tool for LLM performance comparison across backends, and expanded GPU support on AMD, Apple Silicon, Nvidia, and Intel systems.

macOS support in Lemonade has graduated out of beta!

Reddit r/LocalLLaMA

Lemonade, an open-source local AI solution, has graduated macOS support from beta, now offering all major capabilities including OmniRouter, coding, image/speech generation and transcription on macOS.

AMD's Lemonade SDK for local AI adds NVIDIA CUDA support

Reddit r/artificial

AMD's Lemonade SDK for local AI adds NVIDIA CUDA support in version 10.7, enabling the same local AI server experience on competitor GPUs. The release also introduces lemonade bench for cross-backend LLM benchmarking and broader Vulkan support.