ai-accelerators

Tag

Cards List
#ai-accelerators

7 Chinese companies are already shipping H100/H200-class AI chips, most IPO'd in the last 6 months. I mapped all of them.

Reddit r/LocalLLaMA · 4d ago

At least seven Chinese companies are shipping H100/H200-class AI accelerators, most having recently IPO'd, with several founded by former NVIDIA/AMD architects. Huawei's Ascend 950 targets H200-class performance, and China's domestic market share is rising as NVIDIA's declines.

0 favorites 0 likes
#ai-accelerators

Buying AI accelerators/GPUs in China...

Reddit r/LocalLLaMA · 2026-06-15

A user asks about buying Chinese AI accelerators/GPUs for inference, specifically looking for Huawei alternatives to Nvidia, with support for vLLM or Llama.cpp.

0 favorites 0 likes
#ai-accelerators

KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators

arXiv cs.LG · 2026-06-03 Cached

KForge is a cross-platform framework that uses two collaborating LLM-based agents to automatically generate and optimize high-performance compute kernels for diverse AI accelerators, achieving significant speedups on NVIDIA B200 and Intel Arc B580 hardware.

0 favorites 0 likes
#ai-accelerators

TRAM: Training Approximate Multiplier Structures for Low-Power AI Accelerators

arXiv cs.LG · 2026-05-12 Cached

This paper introduces TRAM, a method that jointly optimizes approximate multiplier structures and AI model parameters to reduce power consumption in AI accelerators while maintaining accuracy.

0 favorites 0 likes
#ai-accelerators

AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

Hugging Face Daily Papers · 2026-04-15 Cached

AccelOpt is a self-improving LLM agentic system that autonomously optimizes AI accelerator kernels through iterative generation and optimization memory, achieving 49-61% peak throughput improvements on AWS Trainium while being 26x cheaper than Claude Sonnet 4.

0 favorites 0 likes
← Back to home

Submit Feedback