ai-accelerators

Tag

Cards List
#ai-accelerators

KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators

arXiv cs.LG · 4d ago Cached

KForge is a cross-platform framework that uses two collaborating LLM-based agents to automatically generate and optimize high-performance compute kernels for diverse AI accelerators, achieving significant speedups on NVIDIA B200 and Intel Arc B580 hardware.

0 favorites 0 likes
#ai-accelerators

TRAM: Training Approximate Multiplier Structures for Low-Power AI Accelerators

arXiv cs.LG · 2026-05-12 Cached

This paper introduces TRAM, a method that jointly optimizes approximate multiplier structures and AI model parameters to reduce power consumption in AI accelerators while maintaining accuracy.

0 favorites 0 likes
#ai-accelerators

AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

Hugging Face Daily Papers · 2026-04-15 Cached

AccelOpt is a self-improving LLM agentic system that autonomously optimizes AI accelerator kernels through iterative generation and optimization memory, achieving 49-61% peak throughput improvements on AWS Trainium while being 26x cheaper than Claude Sonnet 4.

0 favorites 0 likes
← Back to home

Submit Feedback