optimization

#optimization

@rohanpaul_ai: LLMs can learn better coding behavior from problems with no known answers. Many real problems do not have a gold soluti…

X AI KOLs Following ↗ · yesterday Cached

The paper introduces RiVER, a reinforcement learning method that improves LLMs' coding performance on problems without known gold solutions by ranking programs on hidden test cases and providing graded feedback.

0 favorites 0 likes

#optimization

Reflecting to optimise

Hacker News Top ↗ · yesterday Cached

A blog post discussing optimization techniques for constrained categorical probability distributions, using softmax reparameterization and log barrier methods, applied to protein binder design.

0 favorites 0 likes

#optimization

Help optimizing llama.cpp + Qwen 27B on RTX PRO 6000 Blackwell for coding agents

Reddit r/LocalLLaMA ↗ · 2d ago

A user details their setup running Qwen 27B with llama.cpp on an RTX PRO 6000 Blackwell for local coding agents, compares performance to Claude models, and asks for help resolving frequent crashes and malformed response issues.

0 favorites 0 likes

#optimization

Sketched Linear Contrastive Learning: Approximation, Optimization, and Statistical Scaling

arXiv cs.LG ↗ · 2d ago Cached

This paper derives a scaling law for sketched linear contrastive learning under a Gaussian latent-variable model, analyzing how risk decomposes into approximation, optimization, and statistical terms, and provides theoretical guidance for balancing model size, data, and compute in contrastive learning.

0 favorites 0 likes

#optimization

Context-Aware Synthesis of Optimization Pipelines for Warehouse Optimization

arXiv cs.AI ↗ · 2d ago Cached

This paper presents CASOP, a framework for context-aware synthesis and evaluation of optimization pipelines for warehouse order fulfillment, enabling automatic construction of valid algorithmic pipelines from a modular repository.

0 favorites 0 likes

#optimization

High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence

arXiv cs.LG ↗ · 2d ago Cached

This paper provides optimal high-probability bounds for stochastic gradient descent under Markovian noise for PL-smooth objectives, closing gaps between expectation and high-probability guarantees and extending to heavy-tailed settings with matching lower bounds.

0 favorites 0 likes

#optimization

When Agents Meet Electric Bus Fleet Operations: Pricing Behavior, Trade-offs, and Policy Implications in an Aggregator Framework

arXiv cs.AI ↗ · 2d ago Cached

This paper proposes an agentic aggregator framework for coordinating electric bus fleet operations, integrating optimization-based scheduling with supervisory AI agents to handle disturbances, tariff adaptation, and value allocation, revealing trade-offs between operational efficiency and profit-oriented pricing.

0 favorites 0 likes

#optimization

@BunnyxStudio: After spending 3 weeks saying goodbye to SwiftData, Hive's startup speed has significantly improved. A library of 66,000 images is basically ready to use with no waiting. Reducing cold start time is really important for the experience.

X AI KOLs Following ↗ · 2d ago Cached

BunnyxStudio spent 3 weeks removing SwiftData, resulting in a significant improvement in Hive's startup speed. A library of 66,000 images is almost instantly usable without waiting.

0 favorites 0 likes

#optimization

LFM2.5 230M running in-browser at 1,400 tok/s using custom WebGPU kernels

Reddit r/LocalLLaMA ↗ · 2d ago

LFM2.5 230M model achieves 1,400 tokens per second in-browser using custom WebGPU kernels, demonstrating efficient local inference.

0 favorites 0 likes

#optimization

Structured Primary Keys

Lobsters Hottest ↗ · 2d ago Cached

This article discusses how traditional primary key designs can isolate tables, and introduces structured primary keys as an alternative approach to improve SQL query performance and maintain relational integrity.

0 favorites 0 likes

#optimization

How we made an AI agent faster by moving stable context out of the prompt

Reddit r/AI_Agents ↗ · 2d ago

Describes a technique to improve AI agent speed by moving stable context out of the prompt, reducing token usage and latency.

0 favorites 0 likes

#optimization

What I'm Finding About LLM Code Style and Token Costs

Hacker News Top ↗ · 3d ago Cached

The article discusses how LLM code style choices affect token consumption and costs, offering optimizations such as using Web API standards and simpler indentation to reduce output tokens.

0 favorites 0 likes

#optimization

Agentic AI for Bilevel Long-Term Optimization of Policy-Driven Physical Layer Systems

arXiv cs.AI ↗ · 4d ago Cached

This paper presents Agentic-LTPO, a nested bilevel optimization framework that uses agentic AI to adapt physical layer configurations under dynamic operator policies, achieving 57.2% long-term performance improvement in cell-free MIMO beamforming.

0 favorites 0 likes

#optimization

The Teensy Executable Revisited

Hacker News Top ↗ · 4d ago Cached

This article revisits techniques for creating extremely small ELF executables on Linux, exploring how to reduce size to 45 bytes by abusing header fields and overlapping structures while maintaining ELF specification conformance.

0 favorites 0 likes

#optimization

What are you actually evaluating these days: prompts, context, or the whole harness?

Reddit r/AI_Agents ↗ · 4d ago

A discussion about the focus of AI evaluations, questioning whether practitioners are optimizing prompts, context, or the entire harness, and noting a shift toward holistic optimization.

0 favorites 0 likes

#optimization

@Ex0byt: Update: the road to GLM-5.2: we're getting there, folks! non-quantized, non-pruned DeepSeek-v4-Flash. 11tok/s on a sing…

X AI KOLs Timeline ↗ · 4d ago Cached

Update on running a non-quantized DeepSeek-v4-Flash model at 11 tok/s on a single DGX Spark using sglang inference and a custom mega-kernel, progressing towards GLM-5.2.

0 favorites 0 likes

#optimization

Show HN: RLM-based local debugger for AI agent traces

Hacker News Top ↗ · 4d ago Cached

HALO is an open-source desktop app that uses reinforcement learning from model-based (RLM) techniques to debug and optimize AI agent traces locally, providing analysis and actionable recommendations.

0 favorites 0 likes

#optimization

I measured where my AI coding agents waste tokens, 42% was avoidable. Built a tool to catch it (Claude Code / Cursor / Codex)

Reddit r/AI_Agents ↗ · 5d ago

The author measured token waste in AI coding agents and found 42% avoidable, then built a tool to catch it. The tool works with Claude Code, Cursor, and Codex.

0 favorites 0 likes

#optimization

Hellishly Slow Level 13 Deflate Compression

Hacker News Top ↗ · 5d ago Cached

The article describes libdeflate's new level 13, a deliberately slow DEFLATE compression level that achieves marginally better compression (0.134% on Silesia) at the cost of being 56x slower than level 12, designed for scenarios where data is compressed once and decompressed many times.

0 favorites 0 likes

#optimization

p99 0ms* autocomplete for 240 million domain names

Lobsters Hottest ↗ · 5d ago Cached

The article explains how the author achieved p99 zero-millisecond perceived latency for autocomplete on 240 million domain names by prefetching suggestions on keyDown and caching, with a fast API built on Tranco and CZDS data.

0 favorites 0 likes

optimization

Submit Feedback