hardware-efficiency

#hardware-efficiency

@Snixtp: More efficiency tests on a single 3090 TL;DR: - I tested 8 local LLMs on a single RTX 3090, power limit from 100W to 45…

X AI KOLs Following ↗ · yesterday

The article presents benchmark results for 8 local LLMs on an RTX 3090, showing that power efficiency peaks around 225W, with diminishing returns at maximum power.

0 favorites 0 likes

#hardware-efficiency

@no_stp_on_snek: small update from the long-context experiments: I got MRCR v2 running out to 1M on a single MI300X droplet with an open…

X AI KOLs Following ↗ · 2d ago

The author reports successful experiments running MRCR v2 with 1M context length on a single MI300X using Qwen2.5-32B and FAISS, achieving competitive scores at low cost.

0 favorites 0 likes

#hardware-efficiency

KernelBench-X: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

Hugging Face Daily Papers ↗ · 3d ago Cached

KernelBench-X is a new benchmark for evaluating LLM-generated GPU kernels, revealing that task structure impacts correctness more than method design and that correctness does not guarantee hardware efficiency.

0 favorites 0 likes

hardware-efficiency

@Snixtp: More efficiency tests on a single 3090 TL;DR: - I tested 8 local LLMs on a single RTX 3090, power limit from 100W to 45…

@no_stp_on_snek: small update from the long-context experiments: I got MRCR v2 running out to 1M on a single MI300X droplet with an open…

KernelBench-X: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

Submit Feedback