optimization

Tag

Cards List
#optimization

@0x_kaize: https://x.com/0x_kaize/status/2068775813785506091

X AI KOLs Timeline · 6d ago Cached

A guide on avoiding rate limits and reducing costs when using the GLM 5.2 model, covering prompt batching, caching, free model alternatives, effort levels, context window management, and self-hosting.

0 favorites 0 likes
#optimization

Performance improvements in libffi

Lobsters Hottest · 6d ago Cached

This article details a performance improvement in libffi, where caching argument placement as a flat list of moves (a 'plan') eliminates redundant reclassification on every function call, offering significant speedups without resorting to JIT compilation.

0 favorites 0 likes
#optimization

Why can't LLMs be trained to think in an optimized AI language rather than English?

Reddit r/singularity · 2026-06-21

A speculative discussion questioning why LLMs are not trained to think in an optimized internal language rather than natural language, and whether that could improve efficiency.

0 favorites 0 likes
#optimization

@JaydevTonde: https://x.com/JaydevTonde/status/2068361821002846418

X AI KOLs Timeline · 2026-06-20 Cached

A detailed tutorial on implementing CUDA Graphs in an LLM inference server Tokn, covering FastAPI server setup, engine initialization, and CUDA Graph capture for optimized decode phases.

0 favorites 0 likes
#optimization

I combined the existing token-saving tools for Copilot and Claude Code into one installer

Reddit r/AI_Agents · 2026-06-20

A developer created a unified installer that combines existing token-saving tools like OpenSpec, RTK, and ccusage for Copilot and Claude Code, with a command-line interface that shows real token consumption savings.

0 favorites 0 likes
#optimization

RACL: Reasoning-Agent Control Layers for Continuous Metaheuristic Learning

arXiv cs.AI · 2026-06-20 Cached

Introduces RACL, a reasoning-agent control layer that improves metaheuristic optimization by learning to control internal search behavior from operational memory, showing cost improvements in vehicle routing tests.

0 favorites 0 likes
#optimization

ORAgentBench: Can LLM Agents Solve Challenging Operations Research Tasks End to End?

arXiv cs.AI · 2026-06-20 Cached

本文介绍ORAgentBench,一个用于评估LLM代理在端到端运筹学任务中表现的执行基准,包含107个经过人工审查的任务。实验表明,当前最佳代理仅通过35.51%的任务,揭示了在可靠决策制定方面的重大不足。

0 favorites 0 likes
#optimization

Optimal Scheduling in a Question-Answering Forum of Knowledge Workers

arXiv cs.AI · 2026-06-20 Cached

This paper models a question-answering forum staffed by expert knowledge workers, studying optimal scheduling to maximize system capacity and stability.

0 favorites 0 likes
#optimization

Interpreting Neural Combinatorial Optimization via Evolving Programmatic Bottlenecks

arXiv cs.AI · 2026-06-20 Cached

Introduces Evolving Programmatic Bottlenecks (EPB), a framework for interpreting neural combinatorial optimization policies by distilling black-box models into human-readable program portfolios using LLM-guided evolution.

0 favorites 0 likes
#optimization

Efficient C++ Programming for Modern C++ CPUs, Chapter 4/part 2

Hacker News Top · 2026-06-20 Cached

This draft book chapter provides an infographic and detailed analysis of operation costs in CPU clock cycles for modern C++ CPUs, covering multiplication, division, and RTTI with latency tables for various architectures.

0 favorites 0 likes
#optimization

I can haz smoller NixOS ISOs?

Lobsters Hottest · 2026-06-19 Cached

A guide on building minimal NixOS ISOs and reducing their size, with comparisons to Alpine Linux and step-by-step optimization techniques.

0 favorites 0 likes
#optimization

Optimizing Lithium Production Decisions under Geological, Demand, and Pricing Uncertainties: A POMDP Framework for Multi-Objective Decision Making

arXiv cs.AI · 2026-06-18 Cached

This paper proposes a POMDP framework for multi-objective decision making in lithium production, addressing geological, demand, and pricing uncertainties to optimize mine opening and extraction method selection. The approach outperforms human-inspired heuristics by dynamically adapting to shifting price regimes through belief state planning.

0 favorites 0 likes
#optimization

@ying11231: Impressive performance on TPU.

X AI KOLs Timeline · 2026-06-17 Cached

A blog post from LMSYS Org details optimizing Ling-2.6-1T, a 1 trillion parameter hybrid MoE model, on TPU v7x using SGLang-JAX, achieving efficient inference by hiding MoE data movement behind computation with a single Pallas kernel.

0 favorites 0 likes
#optimization

@jerryjliu0: We made Claude better and faster at understanding PDFs The trick isn’t just creating the fastest free document parser o…

X AI KOLs Following · 2026-06-17 Cached

LlamaIndex improved their LiteParse PDF parsing skill for Claude agents, making it 37% cheaper and more accurate by optimizing agent behavior through evaluation traces.

0 favorites 0 likes
#optimization

Computed goto for efficient dispatch tables (2012)

Hacker News Top · 2026-06-17 Cached

Explains the use of GCC's computed goto extension to improve the performance of bytecode VM dispatch tables compared to traditional switch statements, with a simple example.

0 favorites 0 likes
#optimization

MGUP: A Momentum-Gradient Alignment Update Policy for Stochastic Optimization

arXiv cs.LG · 2026-06-17 Cached

Proposes MGUP, a momentum-gradient alignment update policy for selective intra-layer parameter updates in stochastic optimization, which integrates with optimizers like AdamW, Lion, and Muon, and provides theoretical convergence guarantees along with superior performance on large-scale model training tasks.

0 favorites 0 likes
#optimization

Counterfactual Optimization of Baseball Pitch Sequences and Estimation of Its Impact on Season-Level Statistics

arXiv cs.LG · 2026-06-17 Cached

This paper uses a Transformer-based model on MLB Statcast data to counterfactually optimize baseball pitch sequences, finding that optimizing both final and setup pitches can improve season-level statistics like K/9 by over 1.0.

0 favorites 0 likes
#optimization

Rethinking Groups in Critic-Free RLVR

arXiv cs.LG · 2026-06-17 Cached

This paper rethinks the role of grouping in critic-free reinforcement learning for LLMs and proposes negative token filtering to enable stable training with a single rollout per prompt, achieving comparable or better performance on reasoning and agentic tasks.

0 favorites 0 likes
#optimization

Skill-Constrained Model Predictive Control for Resilient Manufacturing Supply Chains

arXiv cs.AI · 2026-06-17 Cached

This paper presents a skill-constrained model predictive control approach for resilient manufacturing supply chains, where training decisions affect future certified capacity. The controller solves a finite-horizon mixed-integer program and is evaluated on synthetic scenarios, showing that predictive control helps when bottlenecks are forecastable but is not universally superior.

0 favorites 0 likes
#optimization

I didn't know it was possible to compile llamacpp to run cuda + vulkan at the same time..

Reddit r/LocalLLaMA · 2026-06-16

The author discovered that compiling llama.cpp with both CUDA and Vulkan backends simultaneously is possible, yielding a ~10% improvement in tokens/sec for decoding. They plan to run further benchmarks to assess the benefits.

0 favorites 0 likes
← Previous
Next →
← Back to home

Submit Feedback