machine-learning

#machine-learning

How are you evaluating AI features in production?

Reddit r/AI_Agents ↗ · 4h ago

A discussion on the methodologies and challenges involved in evaluating AI features once they are deployed in production environments.

0 favorites 0 likes

#machine-learning

Show HN: Neural Particle Automata

Hacker News Top ↗ · 13h ago Cached

Introduces Neural Particle Automata, a method for learning self-organizing particle dynamics using smooth particle hydrodynamics perception, enabling particles to have local perception vectors for an update rule, analogous to Neural Cellular Automata but on continuous particle positions.

0 favorites 0 likes

#machine-learning

Experimenting with the proposed Cross-Origin Storage API in Transformers.js

Hugging Face Blog ↗ · 21h ago Cached

This guest post explores the proposed Cross-Origin Storage API to improve caching of AI model resources in Transformers.js, enabling efficient reuse across origins while maintaining privacy and integrity for in-browser inference.

0 favorites 0 likes

#machine-learning

@loganthorneloe: This is a excellent explanation of JAX. Understanding how ML frameworks work internally gives you a massive advantage w…

X AI KOLs Timeline ↗ · yesterday Cached

This article explains in detail the core ideas of JAX, including function purity, immutability, explicit state management, and JIT compilation, helping readers shift from object-oriented thinking to functional programming to optimize machine learning performance.

0 favorites 0 likes

#machine-learning

Finding the Best Dog Treat with Statistics

Hacker News Top ↗ · yesterday Cached

Uses the Bradley-Terry model and Elo rating system to statistically determine a dog's favorite treat through pairwise comparison experiments.

0 favorites 0 likes

#machine-learning

@sumitdotml: week 25, 2026: cpu tensor core basics (add/mul, reduce, stride, 2d matmul, etc.) in c, reading some arcee

X AI KOLs Timeline ↗ · yesterday Cached

The author shares progress on building a CPU-only tensor library in C, covering basics like add/mul, reduce, strides, and 2D matmul, along with insights from reading Arcee's technical blogs on foundation models.

0 favorites 0 likes

#machine-learning

Petition against Meta's employee training data collection for ML models

Hacker News Top ↗ · yesterday Cached

Meta employees are petitioning against the Model Capability Initiative (MCI), which collects computer-use data like keystrokes, mouse movements, and screen content for AI training, raising serious privacy and regulatory concerns.

0 favorites 0 likes

#machine-learning

@PandaTalk8: The Most Worth-Following YouTube Channels for Learning AI in 2026, No-Nonsense Edition. Bookmark them, study in this order: 1. 3Blue1Brown AI / Math Foundation. Uses visualizations to clearly explain linear algebra, neural networks, and underlying mathematical intuition. https://youtube.c…

X AI KOLs Timeline ↗ · yesterday Cached

Recommends 15 YouTube channels for learning AI in 2026, categorized by learning stage, with study path advice for beginners, engineering projects, and cutting-edge trends.

1 favorites 1 likes

#machine-learning

An Update on Matrix Recurrent Units, an Attention Alternative [R]

Reddit r/MachineLearning ↗ · 2d ago

An update on Matrix Recurrent Units (MRU), a linear-time attention alternative. The author explores methods to stabilize training, finding that orthogonal matrices underperform while LDU factorization works best, and shows MRU underperforms transformers on larger datasets like TinyStories.

0 favorites 0 likes

#machine-learning

@amitiitbhu: Research papers every LLM engineer must read: - Attention Is All You Need - BERT - GPT-3: Language Models are Few-Shot …

X AI KOLs Timeline ↗ · 2d ago Cached

A list of essential research papers for LLM engineers, including key works on transformers, scaling laws, and fine-tuning techniques.

0 favorites 0 likes

#machine-learning

@PierceZhang34: A Machine Learning Systems Notes Repo on GitHub — The author has deeply studied machine learning systems over the past few months, mainly focusing on training and inference of large language models. This notes collection covers distributed computing, parallelization, quantization, and PyTorch internals, with most content derived from the author's experiments. 1. Distributed Technologies - covering distributed training…

X AI KOLs Timeline ↗ · 3d ago Cached

Sharing a machine learning systems notes repo on GitHub, covering distributed computing, parallelization, quantization, and PyTorch internals related to LLM training and inference. Suitable for learners interested in ML systems.

0 favorites 0 likes

#machine-learning

A better way to model the behavior of metal alloys

MIT News — Artificial Intelligence ↗ · 4d ago Cached

MIT researchers have developed a machine-learning-based approach to accurately model the behavior of metal alloys, regardless of chemical complexity, enabling faster and cheaper materials innovation.

0 favorites 0 likes

#machine-learning

@FinanceYF5: Loop Engineering——The True Source of Alpha for Quantitative Traders 1/ Backtest perfect, goes live for two weeks and starts losing. Every quant has experienced this. The problem isn't that the model isn't good enough; it's that you only have one guess, no iteration. Loop Engineering is the solution.

X AI KOLs Timeline ↗ · 4d ago Cached

A thread introducing Loop Engineering as a solution to the common problem of quant strategies that backtest perfectly but fail in live trading, emphasizing the need for iterative optimization.

0 favorites 0 likes

#machine-learning

Shape Suffixes – Good Coding Style

Hacker News Top ↗ · 4d ago Cached

Noam Shazeer describes a coding convention for naming tensors with dimension suffixes to improve code readability and sanity, used at Character.AI since 2022.

0 favorites 0 likes

#machine-learning

Multivariate Probability Models in Machine Learning [D]

Reddit r/MachineLearning ↗ · 5d ago

A discussion thread on multivariate probability models in machine learning.

0 favorites 0 likes

#machine-learning

TS-Fault: Benchmarking Time Series Forecasters Against Structural Faults

arXiv cs.LG ↗ · 5d ago Cached

This paper introduces TS-Fault, a benchmark for evaluating time series forecasting models under structured fault scenarios like broken dependencies and regime changes, finding that clean-data accuracy often anti-correlates with robustness and that foundation models are especially fragile.

0 favorites 0 likes

#machine-learning

The Illusion of Improvement: Reject Inference Strategies in Credit Scoring

arXiv cs.LG ↗ · 5d ago Cached

This paper systematically evaluates reject inference methods in credit scoring and identifies a failure mode where accuracy improves while recall collapses, creating an illusion of improvement while rejection quality deteriorates. It proposes a controlled exploration strategy that breaks the feedback loop and shows that even minimal exploration rates are sufficient to diagnose the problem.

0 favorites 0 likes

#machine-learning

Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

arXiv cs.LG ↗ · 5d ago Cached

This paper argues that measurement noise, not model inadequacy, explains why nonlinear models often fail to outperform linear regression in biomedical prediction, as noise attenuates nonlinear structure faster than linear structure, a limitation that cannot be overcome by more data or model complexity.

0 favorites 0 likes

#machine-learning

P$^2$CE: Model-Agnostic Plausible Pareto-Optimal Counterfactual Explanations

arXiv cs.LG ↗ · 5d ago Cached

Introduces P²CE, a model-agnostic algorithm for generating plausible Pareto-optimal counterfactual explanations that balances feasibility, plausibility, and computational efficiency using an isolation forest outlier detector and SHAP values.

0 favorites 0 likes

#machine-learning

TxBench-PP: Analyzing AI Agent Performance on Small-Molecule Preclinical Pharmacology

arXiv cs.AI ↗ · 5d ago Cached

TxBench-PP is a benchmark for evaluating AI agents on small-molecule preclinical pharmacology tasks. Across 16 model-harness configurations, the best system achieved only 59.3% accuracy, indicating significant room for improvement.

0 favorites 0 likes

machine-learning

Submit Feedback