Gated QKAN-FWP: Scalable Quantum-inspired Sequence Learning
摘要
# Paper page - Gated QKAN-FWP: Scalable Quantum-inspired Sequence Learning Source: [https://huggingface.co/papers/2605.06734](https://huggingface.co/papers/2605.06734) Authors: , , , , , , , , , , , , , , , , , ## Abstract Quantum\-inspired fast\-weight programming framework using single\-qubit circuits achieves superior forecasting performance with reduced parameters compared to classical recurrent models while maintaining NISQ device compatibility\. [Fast Weight Programmers](https://huggingfac
查看缓存全文
缓存时间: 2026/05/11 10:44
Paper page - Gated QKAN-FWP: Scalable Quantum-inspired Sequence Learning
Source: https://huggingface.co/papers/2605.06734 Authors:
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Abstract
Quantum-inspired fast-weight programming framework using single-qubit circuits achieves superior forecasting performance with reduced parameters compared to classical recurrent models while maintaining NISQ device compatibility.
Fast Weight Programmers(FWPs) encode temporal dependencies through dynamically updated parameters rather than recurrent hidden states.Quantum FWPs(QFWPs) extend this idea withvariational quantum circuits(VQCs), but existing implementations rely on multi-qubit architectures that are difficult to scale on noisy intermediate-scale quantum (NISQ) devices and expensive to simulate classically. We propose gated QKAN-FWP, a fast-weight framework that integrates FWP withQuantum-inspired Kolmogorov-Arnold Network(QKAN) using single-qubitdata re-uploading circuitsas learnable nonlinear activation, known as DatA Re-Uploading ActivatioN (DARUAN). We further introduce ascalar-gated fast-weight update rulethat stabilizes parameter evolution, supported by a theoretical analysis of itsadaptive memory kernel,geometric boundedness, andparallelizable gradient paths. We evaluate the framework acrosstime-series benchmarks,MiniGridreinforcement learning, and highlight real-worldsolar cycle forecastingas our main practical result. In the long-horizon setting with 528-month input window and 132-month forecast horizon, our 12.5k-parameter model achieves lowerscaled Mean Square Error(MSE), peak amplitude error, and peak timing error than a suite of classical recurrent baselines with up to 13x more parameters, including Long Short-Term Memory (LSTM) networks (25.9k-89.1k parameters),WaveNet-LSTM(167k), Vanilla recurrent neural network (11.5k), and aModified Echo State Network(132k). To validate NISQ compatibility, we further deploy the trained fast programmer onIonQandIBM Quantumprocessors, recovering forecasting accuracy within 0.1% relative MSE of the noiseless simulator at 1024 shots. These results position gated QKAN-FWP as a scalable, parameter-efficient, and NISQ-compatible approach to quantum-inspired sequence modeling.
View arXiv pageView PDFAdd to collection
Get this paper in your agent:
hf papers read 2605\.06734
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2605.06734 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2605.06734 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2605.06734 in a Space README.md to link it from this page.
Collections including this paper1
相似文章
Gated QKAN-FWP:可扩展的量子启发序列学习
本文提出了 Gated QKAN-FWP,这是一个可扩展的量子启发序列学习框架,它通过单量子比特数据重新加载电路,将快速权重程序员(Fast Weight Programmers)与柯尔莫哥洛夫-阿诺德网络(Kolmogorov-Arnold Networks)相结合。
QuantFPFlow:连续强化学习中的Fokker-Planck策略优化的量子振幅估计
介绍QuantFPFlow,一种强化学习框架,利用量子振幅估计在连续控制的Fokker-Planck配分函数估计中实现二次加速,从而改善探索并避免局部最优。
生成式量子启发柯尔莫哥洛夫-阿诺德本征求解器
本文介绍生成式量子启发柯尔莫哥洛夫-阿诺德本征求解器(GQKAE),一种参数高效架构,用柯尔莫哥洛夫-阿诺德模块替代传统神经网络组件,显著降低内存使用并改善量子化学模拟中的收敛性。
在量子退火器上通过基于QUBO的客户端选择的拜占庭鲁棒联邦学习
本文提出了一种量子退火方法,将联邦学习中的客户端选择重新表述为QUBO问题以防御拜占庭攻击。实验表明,在复杂攻击上,该方法相比经典MultiKrum具有更高的检测准确性,尤其是与MultiSignal集成结合时。
Qift: 移位友好的无零点W2训练后量化,用于旋转W2A4/KV4大语言模型推理
本文介绍了Qift,一种固定的无零点两位权重量化层级集,专为Hadamard旋转的大语言模型设计,通过利用旋转权重的近零中心高斯类分布,实现了改进的W2A4/KV4推理。在LLaMA-2-7B和LLaMA-3.1-8B上的实验显示,相比于标准W2量化,困惑度持续提升。