software-stack

#software-stack

@TheAhmadOsman: Why do I focus on Inference Engines/Software Stacks for your hardware? - 2x RTX 3090s: ~14.5 tok/s → ~64 tok/s moving t…

X AI KOLs Following ↗ · 2d ago Cached

Comparison of inference engine performance on different hardware: moving from baseline to vLLM with TP=2 on 2x RTX 3090s improves from ~14.5 tok/s to ~64 tok/s, and on RTX PRO 6000 moving to Sglang improves from ~32 tok/s to ~110 tok/s. Recommends vLLM/Sglang for CUDA/multi-GPU and llama.cpp for edge devices.

0 favorites 0 likes

#software-stack

@heyshrutimishra: I analyzed the software stack behind autonomous robots, and here's what actually makes them work: It's 50+ tools workin…

X AI KOLs Following ↗ · 4d ago Cached

An analysis of the software stack behind autonomous robots, breaking down the components from perception to cloud support, and highlighting that most tools are open-source.

0 favorites 0 likes

#software-stack

@JulianGoldieSEO: OpenClaw = the employee. Hermes = the memory. Paperclip = the company. That’s the simplest way to understand the crazie…

X AI KOLs Timeline ↗ · 2026-05-09 Cached

The article introduces an open-source AI agent stack comprising OpenClaw, Hermes, and Paperclip, describing it as a comprehensive setup that functions like an automated AI business.

0 favorites 0 likes

software-stack

@TheAhmadOsman: Why do I focus on Inference Engines/Software Stacks for your hardware? - 2x RTX 3090s: ~14.5 tok/s → ~64 tok/s moving t…

@heyshrutimishra: I analyzed the software stack behind autonomous robots, and here's what actually makes them work: It's 50+ tools workin…

@JulianGoldieSEO: OpenClaw = the employee. Hermes = the memory. Paperclip = the company. That’s the simplest way to understand the crazie…

Submit Feedback