optimization

#optimization

Token minimization is not the same as context discipline

Reddit r/AI_Agents ↗ · 2h ago

The article distinguishes between token minimization and context discipline in AI usage, highlighting that efficient prompt optimization is not the same as maintaining proper context awareness.

0 favorites 0 likes

#optimization

Optimizing LLVM's bump allocator

Lobsters Hottest ↗ · 5h ago Cached

This blog post details three recent optimizations to LLVM's BumpPtrAllocator, reducing fast-path overhead by removing redundant alignment, null pointer checks, and per-allocation accounting, resulting in improved performance for Clang, lld, and other LLVM components.

0 favorites 0 likes

#optimization

Difference of Convex Programming in the Wasserstein Space with Applications to MMD Optimization

arXiv cs.LG ↗ · 5h ago Cached

This paper introduces a difference-of-convex programming framework in Wasserstein space for optimizing non-convex functionals over probability measures, with explicit decompositions for Maximum Mean Discrepancy and Energy Distance, and proves convergence of the lifted convex-concave procedure.

0 favorites 0 likes

#optimization

COOPA: A Modular LLM Agent Architecture for Operations Research Problems

arXiv cs.LG ↗ · 5h ago Cached

This paper introduces COOPA, a modular LLM agent architecture for operations research problems that combines iterative confidence-based modeling, element-level provenance, and multi-solver routing. Evaluated across eight LLM backbones and four baselines, COOPA achieves the best macro-average accuracy on six backbones and improves over the strongest baseline by up to 6.7 percentage points.

0 favorites 0 likes

#optimization

huff12 - a 12-stream Huffman decoder for Apple Silicon

Lobsters Hottest ↗ · 16h ago

huff12 is a 12-stream Huffman decoder optimized for Apple Silicon processors, aiming to improve decoding performance through parallel stream processing.

0 favorites 0 likes

#optimization

@antoniolupetti: "Mathematics of Neural Networks" is an excellent set of lecture notes for anyone who wants to study modern neural netwo…

X AI KOLs Timeline ↗ · yesterday Cached

A set of lecture notes covering the mathematics of neural networks, from basic activation functions to geometric concepts like group convolutions and equivariance.

0 favorites 0 likes

#optimization

@rohanpaul_ai: LLMs can learn better coding behavior from problems with no known answers. Many real problems do not have a gold soluti…

X AI KOLs Following ↗ · yesterday Cached

The paper introduces RiVER, a reinforcement learning method that improves LLMs' coding performance on problems without known gold solutions by ranking programs on hidden test cases and providing graded feedback.

0 favorites 0 likes

#optimization

Reflecting to optimise

Hacker News Top ↗ · 2d ago Cached

A blog post discussing optimization techniques for constrained categorical probability distributions, using softmax reparameterization and log barrier methods, applied to protein binder design.

0 favorites 0 likes

#optimization

Help optimizing llama.cpp + Qwen 27B on RTX PRO 6000 Blackwell for coding agents

Reddit r/LocalLLaMA ↗ · 3d ago

A user details their setup running Qwen 27B with llama.cpp on an RTX PRO 6000 Blackwell for local coding agents, compares performance to Claude models, and asks for help resolving frequent crashes and malformed response issues.

0 favorites 0 likes

#optimization

Sketched Linear Contrastive Learning: Approximation, Optimization, and Statistical Scaling

arXiv cs.LG ↗ · 3d ago Cached

This paper derives a scaling law for sketched linear contrastive learning under a Gaussian latent-variable model, analyzing how risk decomposes into approximation, optimization, and statistical terms, and provides theoretical guidance for balancing model size, data, and compute in contrastive learning.

0 favorites 0 likes

#optimization

Context-Aware Synthesis of Optimization Pipelines for Warehouse Optimization

arXiv cs.AI ↗ · 3d ago Cached

This paper presents CASOP, a framework for context-aware synthesis and evaluation of optimization pipelines for warehouse order fulfillment, enabling automatic construction of valid algorithmic pipelines from a modular repository.

0 favorites 0 likes

#optimization

High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence

arXiv cs.LG ↗ · 3d ago Cached

This paper provides optimal high-probability bounds for stochastic gradient descent under Markovian noise for PL-smooth objectives, closing gaps between expectation and high-probability guarantees and extending to heavy-tailed settings with matching lower bounds.

0 favorites 0 likes

#optimization

When Agents Meet Electric Bus Fleet Operations: Pricing Behavior, Trade-offs, and Policy Implications in an Aggregator Framework

arXiv cs.AI ↗ · 3d ago Cached

This paper proposes an agentic aggregator framework for coordinating electric bus fleet operations, integrating optimization-based scheduling with supervisory AI agents to handle disturbances, tariff adaptation, and value allocation, revealing trade-offs between operational efficiency and profit-oriented pricing.

0 favorites 0 likes

#optimization

@BunnyxStudio: After spending 3 weeks saying goodbye to SwiftData, Hive's startup speed has significantly improved. A library of 66,000 images is basically ready to use with no waiting. Reducing cold start time is really important for the experience.

X AI KOLs Following ↗ · 3d ago Cached

BunnyxStudio spent 3 weeks removing SwiftData, resulting in a significant improvement in Hive's startup speed. A library of 66,000 images is almost instantly usable without waiting.

0 favorites 0 likes

#optimization

LFM2.5 230M running in-browser at 1,400 tok/s using custom WebGPU kernels

Reddit r/LocalLLaMA ↗ · 3d ago

LFM2.5 230M model achieves 1,400 tokens per second in-browser using custom WebGPU kernels, demonstrating efficient local inference.

0 favorites 0 likes

#optimization

Structured Primary Keys

Lobsters Hottest ↗ · 3d ago Cached

This article discusses how traditional primary key designs can isolate tables, and introduces structured primary keys as an alternative approach to improve SQL query performance and maintain relational integrity.

0 favorites 0 likes

#optimization

How we made an AI agent faster by moving stable context out of the prompt

Reddit r/AI_Agents ↗ · 3d ago

Describes a technique to improve AI agent speed by moving stable context out of the prompt, reducing token usage and latency.

0 favorites 0 likes

#optimization

What I'm Finding About LLM Code Style and Token Costs

Hacker News Top ↗ · 4d ago Cached

The article discusses how LLM code style choices affect token consumption and costs, offering optimizations such as using Web API standards and simpler indentation to reduce output tokens.

0 favorites 0 likes

#optimization

Agentic AI for Bilevel Long-Term Optimization of Policy-Driven Physical Layer Systems

arXiv cs.AI ↗ · 5d ago Cached

This paper presents Agentic-LTPO, a nested bilevel optimization framework that uses agentic AI to adapt physical layer configurations under dynamic operator policies, achieving 57.2% long-term performance improvement in cell-free MIMO beamforming.

0 favorites 0 likes

#optimization

The Teensy Executable Revisited

Hacker News Top ↗ · 5d ago Cached

This article revisits techniques for creating extremely small ELF executables on Linux, exploring how to reduce size to 45 bytes by abusing header fields and overlapping structures while maintaining ELF specification conformance.

0 favorites 0 likes

optimization

Submit Feedback