rnn

#rnn

RWKV is an RNN with great LLM performance and parallelizable like a Transformer.

Reddit r/LocalLLaMA ↗ · 6d ago Cached

RWKV is a novel language model architecture that combines the efficiency of RNNs with the parallelizability of Transformers, achieving strong LLM performance with linear time complexity and constant memory footprint.

0 favorites 0 likes

#rnn

Letter Lemmatization: One-to-one and Banded RNNs for Reversing Character-Set Simplification and Abbreviation in Medieval Text

arXiv cs.CL ↗ · 2026-07-13 Cached

This paper introduces neural approaches for reversing character-set simplification and abbreviation in medieval text, using one-to-one and banded RNNs trained with self-supervision or parallel corpora, and presents a Python library for letter lemmatization.

0 favorites 0 likes

#rnn

FLYNN: Robust Neural Network for Robot Navigation using Fly Brain Topology

arXiv cs.AI ↗ · 2026-07-02 Cached

FLYNN is a recurrent neural network derived from the fruit fly brain connectome for vision-based robot navigation. It achieves comparable performance to handcrafted networks while showing superior robustness to out-of-distribution data and sensory loss.

0 favorites 0 likes

#rnn

RNNs vs Transformers vs SSMs: where should AI memory live for continual learning?

Reddit r/artificial ↗ · 2026-06-18

A technical analysis comparing memory designs in RNNs, Transformers, and SSMs, arguing that the key question is where to store sequence state rather than which architecture is better. Discusses trade-offs between compressed hidden states, growing KV caches, and synaptic-like memory in model connectivity.

0 favorites 0 likes

#rnn

@Phoenixyin13: If the full score is 10, I would honestly give this MIT paper's SMT idea and writing an 8. The paper proposes Supervised Memory Training, using Transformer as a super teacher to first distill in parallel the most important things to remember at each moment…

X AI KOLs Timeline ↗ · 2026-06-16 Cached

This paper proposes Supervised Memory Training (SMT), which uses Transformer as a super teacher to distill memory states in parallel, then trains RNN with one-step supervised learning, achieving fully parallel training and reducing gradient path from O(T) to O(1), significantly improving long-range dependency learning.

0 favorites 0 likes

#rnn

@Phoenixyin13: AI has fallen into an either-or trap. On one side is the world-dominating Transformer architecture — excellent memory, but its quadratic computational explosion makes long contexts increasingly expensive, a real resource hog. On the other is the classic RNN architecture — lightning fast and cheap, but a total scatterbrain that forgets earlier content after a few more lines.

X AI KOLs Timeline ↗ · 2026-06-07 Cached

This article introduces a new method proposed by Google Research, Cornell, and USC that takes snapshots of RNN memory and caches them, enabling RNNs to efficiently handle long contexts. It combines Transformer-like strong memory with RNN-like low cost, offering a new direction for long-context AI.

0 favorites 0 likes

#rnn

@BlinkDL_AI: RWKV-7 G1g is here: the world's best pure RNN LLM, and a competitive LLM in general. Try https://huggingface.co/spaces/…

X AI KOLs Following ↗ · 2026-05-23 Cached

BlinkDL announces RWKV-7 G1g, a pure RNN LLM that claims to be the best in its class and competitive with general LLMs, with high-speed inference on a single RTX 5090.

0 favorites 0 likes

#rnn

@ekzhang1: Homage to karpathy joining anthropic: made an in-browser (webgpu) version of the famous char-rnn "Unreasonable effectiv…

X AI KOLs Following ↗ · 2026-05-20 Cached

Created an in-browser WebGPU version of the classic char-rnn demo training on Shakespeare, paying homage to Karpathy joining Anthropic.

0 favorites 0 likes

#rnn

PDRNN: Modular Data-driven Pedestrian Dead Reckoning on Loosely Coupled Radio- and Inertial-Signalstreams

arXiv cs.LG ↗ · 2026-05-18 Cached

Proposes PDRNN, a modular hybrid AI-assisted pedestrian dead reckoning system that combines a recurrent neural network with separate ML models for orientation, velocity, and distance estimation, with optional radio-based stabilization. Experiments on dynamic sports movement data show superior accuracy and precision compared to classic and ML-based methods.

0 favorites 0 likes

#rnn

Key-Value Means

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

Key-Value Means (KVM) is a novel attention mechanism that combines the strengths of transformers and RNNs with controllable computational complexity and memory usage. It supports fixed-size or growing state, offers subquadratic prefill time and sublinear state growth, and can be implemented without custom kernels.

0 favorites 0 likes

#rnn

RL²: Fast reinforcement learning via slow reinforcement learning

OpenAI Blog ↗ · 2016-11-09 Cached

RL² proposes encoding a fast reinforcement learning algorithm as the weights of a recurrent neural network, learned through slow general-purpose RL, enabling agents to adapt to new tasks with few trials similar to biological learning. The method demonstrates strong performance on both small-scale bandit problems and large-scale vision-based navigation tasks.

0 favorites 0 likes

rnn

Submit Feedback