neural-network

#neural-network

Learning to Distributedly Estimate under Partially Known Dynamics: A Covariance-Agnostic Neural Kalman Consensus Filter

arXiv cs.LG ↗ · 3h ago Cached

This paper presents CA-NKCF, a novel distributed latent state estimator combining partial domain knowledge with deep neural networks, achieving robust performance without noise statistics knowledge, outperforming traditional filters in linear, chaotic, and wireless tracking environments.

0 favorites 0 likes

#neural-network

The Simulacrum: Decision-Theoretic Pretraining for Near-Optimal Time-Series Forecasting and Inference

arXiv cs.LG ↗ · yesterday Cached

This paper introduces a unified decision-theoretic pretraining framework for neural network-based time series estimators, trained on stratified simulations to approximate near-optimal decision rules. Experiments show that the resulting estimators outperform traditional methods like maximum likelihood estimation on both synthetic and real-world benchmarks.

0 favorites 0 likes

#neural-network

@chessMan786: How to Build a Neural Network from Scratch in C++ Are you looking for a clear and practical explanation of how neural n…

X AI KOLs Timeline ↗ · yesterday Cached

A tweet promoting a guide on building a neural network from scratch in C++, aimed at providing a clear and practical explanation of how neural networks work.

0 favorites 0 likes

#neural-network

@FinanceYF5: The Platonic representation hypothesis is mostly a statistical illusion. New research shows that the apparent 'global convergence' in scaled AI models is actually a mathematical artifact caused by selection bias in model width and depth. Once calibrated, global convergence disappears.

X AI KOLs Following ↗ · 2d ago Cached

New research indicates that the apparent 'global convergence' in scaled AI models is actually a statistical illusion caused by selection bias in model width and depth, and disappears once calibrated.

0 favorites 0 likes

#neural-network

@himanshutwtxs: A complete architecture of GBrain by Garry Tan

X AI KOLs Timeline ↗ · 3d ago Cached

Garry Tan shares the complete architecture of GBrain, an AI model.

0 favorites 0 likes

#neural-network

@tetsuoai: The entire core of a neural network on four cards. Neuron, forward pass, activations, backprop. Learn these four and yo…

X AI KOLs Timeline ↗ · 6d ago Cached

A set of four cards covering the core concepts of neural networks: neuron, forward pass, activations, and backpropagation, aimed at helping learners understand how models from perceptrons to transformers work.

0 favorites 0 likes

#neural-network

Age of Empires II goat-based neural network highlights limits of AI consciousness claims.

Reddit r/ArtificialInteligence ↗ · 2026-06-21 Cached

A Microsoft AI researcher built a simple neural network using goats in Age of Empires II to argue that if such a system can be considered conscious, then claims of AI consciousness in chatbots are equally absurd.

0 favorites 0 likes

#neural-network

Neural Network Implementation of the Renormalization Group for Fault Diagnosis with Class Imbalance

arXiv cs.LG ↗ · 2026-06-18 Cached

Proposes RGNet, a neural network architecture based on renormalization group theory for hierarchical coarse-graining of feature space to address class imbalance and noise in fault diagnosis. Experimental results on the AI4I dataset show RGNet provides interpretable and competitive performance.

0 favorites 0 likes

#neural-network

Show HN: Microcrad – Micrograd Reimplemented in C

Hacker News Top ↗ · 2026-06-17 Cached

Microcrad reimplements Karpathy's micrograd autograd engine in C, providing an educational scalar-valued automatic differentiation library with reference counting and a small neural network, aimed at understanding backpropagation at the scalar level.

0 favorites 0 likes

#neural-network

I coded the biologically possible network training algorithm by nobel prize winner - Jeff Hinton

Reddit r/artificial ↗ · 2026-06-17

The author describes implementing a biologically plausible neural network training algorithm proposed by Geoffrey Hinton.

0 favorites 0 likes

#neural-network

Improving Neural Network Training by Decoupling the Magnitude and Direction of Weight Vectors | Alexander Hägele

Reddit r/LocalLLaMA ↗ · 2026-06-15 Cached

This blog post introduces Magnitude-Direction (MD) Decoupling, a method that separates neural network weight matrices into direction and magnitude components optimized with separate learning rates. Experiments show improved performance across Adam and Muon optimizers, automatic learning rate transfer across model widths, and scaling benefits in large Mixture-of-Experts models.

0 favorites 0 likes

#neural-network

A Longitudinal Attribute-Conditioned Neural Network for Modeling Health-State Transition Probabilities in Temporally Irregular Data: The LANTERN Framework

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper introduces LANTERN, a neural network framework for estimating health-state transition probabilities from irregular longitudinal data, with applications to long-term care insurance. It outperforms traditional methods in discrimination and calibration for severe disability and mortality prediction.

0 favorites 0 likes

#neural-network

@7mood10061: A Chinese trader wanted to show off his AI neural network visualization on TikTok, but accidentally exposed his money printer to the world. He recorded a 2-minute video showing a glowing neural network: nodes flashing, hidden layers interweaving, blue synapses flowing like nerves, looking like an ordinary AI tech demo. The video...

X AI KOLs Timeline ↗ · 2026-06-11 Cached

A Chinese trader, while showcasing an AI neural network visualization on TikTok, accidentally exposed his cryptocurrency trading wallet used for arbitrage. The wallet had profited $367,000 in the past 30 days, triggering widespread tracking and discussion.

0 favorites 0 likes

#neural-network

Physics-Distilled Neural Network enabled by Large Language Models for Manufacturing Process-Property Predictive Modeling

arXiv cs.LG ↗ · 2026-06-11 Cached

This paper proposes a novel framework that uses LLMs to extract analytical physics priors from scientific literature and distills them into a lightweight neural network for high-accuracy, real-time manufacturing process-property prediction, even with limited data.

0 favorites 0 likes

#neural-network

The Smallest Brain You Can Build: A Perceptron in Python

Hacker News Top ↗ · 2026-06-08 Cached

A perceptron is the simplest neural network building block. This tutorial implements one from scratch in Python, explaining weights, bias, and learning through a clear example.

0 favorites 0 likes

#neural-network

MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh Generation

Hugging Face Daily Papers ↗ · 2026-06-03 Cached

MeshWeaver presents an autoregressive mesh generation framework that directly predicts vertices using a multi-level sparse-voxel encoder, achieving state-of-the-art compression and geometric fidelity for high-poly meshes.

0 favorites 0 likes

#neural-network

Amortized Factor Inference Networks for Posterior Inference

arXiv cs.LG ↗ · 2026-05-27 Cached

Introduces Amortized Factor Inference Networks (AFINs), a family of encode-merge-decode inference networks that generalize across varying priors, likelihoods, and dimensionality, achieving posterior accuracy comparable to NUTS with much less compute.

0 favorites 0 likes

#neural-network

Thermocompute constant time inference [P]

Reddit r/MachineLearning ↗ · 2026-05-24 Cached

Thermocompute is a PyTorch emulator for thermodynamic probabilistic computing that enables neural network layers to achieve constant modeled physical time inference by exploiting parallel thermodynamic substrate, with immediate GPU-usable stochastic layers.

0 favorites 0 likes

#neural-network

AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems

arXiv cs.LG ↗ · 2026-05-22 Cached

AutoMCU is a multi-agent system leveraging LLMs to automate neural network design for microcontroller units, significantly reducing customization time while ensuring feasibility under hardware constraints.

0 favorites 0 likes

#neural-network

A mini-computer you run from a folder on your computer that can train small LLMS

Reddit r/artificial ↗ · 2026-05-17

VirtualPC is an open-source 8-bit computer simulator that can train small neural networks from assembly code, demonstrating machine learning at the bare-metal level.

0 favorites 0 likes

neural-network

Submit Feedback