deep-learning

Tag

Cards List
#deep-learning

Spam and Sentiment Detection in Arabic Tweets Using MARBERT Model

arXiv cs.CL · 3h ago Cached

This paper presents a sentiment analysis and spam detection system for Arabic tweets using the MARBERT model, trained on a dataset of 24,513 tweets to improve customer service for Saudi Telecom Company.

0 favorites 0 likes
#deep-learning

Multi-Stream Temporal Fusion for Financial Fraud Detection

arXiv cs.LG · 3h ago Cached

Proposes the Multi-Stream Fraud Transformer (MSFT) for financial fraud detection, which independently encodes transaction, login, and risk event streams using Transformers and fuses them with time-aware positional encoding and gated fusion, achieving 0.9961 AUROC on a large dataset.

0 favorites 0 likes
#deep-learning

A Zeroth-Order Deep Learning Method for Fully Nonlinear Parabolic Partial Differential Equations with Unknown Coefficients

arXiv cs.LG · 3h ago Cached

This paper introduces a model-free deep learning method for solving high-dimensional nonlinear partial differential equations with unknown coefficients, using zeroth-order derivative estimators derived from perturbed Monte Carlo trajectories. The approach avoids automatic differentiation, provides theoretical error bounds, and demonstrates competitive performance in numerical experiments.

0 favorites 0 likes
#deep-learning

Quantifying Explainable AI-introduced signal noise on ECG data with Spectral Entropy

arXiv cs.LG · 3h ago Cached

The paper proposes using spectral entropy as a metric to quantify noise introduced by explainability techniques in ECG arrhythmia classification, helping to distinguish true model signal from XAI-generated artifacts.

0 favorites 0 likes
#deep-learning

@0xLinehigher: I strongly recommend every college student majoring in Computer Science to thoroughly study CS336 during their university years, without Chinese subtitles, only English subtitles. After finishing it, your understanding of LLMs and English proficiency will be at least in the top 1% in China. This course surpasses any computer science course in any domestic university. 《Stanford CS336: La…

X AI KOLs Timeline · 11h ago Cached

Recommend computer science students to study the Stanford CS336 course (Language Modeling from Scratch) to improve LLM understanding and English ability.

0 favorites 0 likes
#deep-learning

AI trained on hundreds of thousands of EKGs, improves prediction of sudden cardiac death risk

Reddit r/ArtificialInteligence · 13h ago Cached

UC Berkeley researchers trained an AI model on hundreds of thousands of EKGs to detect a previously unrecognized signal that predicts sudden cardiac death risk more accurately than current methods, potentially saving thousands of lives annually.

0 favorites 0 likes
#deep-learning

@DanKornas: Learn deep learning with a structured MIT course. What you will learn: - Build the foundations before jumping into adva…

X AI KOLs Timeline · 16h ago Cached

Promotes a structured MIT deep learning course that covers foundations, generative models, agents, and sequence problems. The course aims to build practical understanding before advanced topics.

0 favorites 0 likes
#deep-learning

@bookwormengr: It is a shame this account has only 5k followers. Zhihu is pinnacle of Deep Learning blogging . This is Less Wrong of C…

X AI KOLs Timeline · 22h ago Cached

Discusses why GLM-5.2 moved away from GRPO, suggesting that GRPO's assumptions may not hold for long-horizon agentic tasks.

0 favorites 0 likes
#deep-learning

@heynavtoor: 10 free resources that teach you more about AI in 30 days than a $15,000 bootcamp. Bookmark this list. 1. 3Blue1Brown G…

X AI KOLs Timeline · 23h ago Cached

A curated list of 10 free AI learning resources including courses, newsletters, podcasts, and interactive books from experts like 3Blue1Brown, Andrej Karpathy, and Andrew Ng.

0 favorites 0 likes
#deep-learning

RAVEN: A Regime-Aware Variable-context Expert Network for Financial Time Series Forecasting

arXiv cs.LG · yesterday Cached

The paper proposes RAVEN, a Mixture-of-Experts framework that adaptively determines temporal context windows for each input sample to handle non-stationary financial time series. It achieves state-of-the-art performance on financial and traffic benchmarks.

0 favorites 0 likes
#deep-learning

Fast and Slow Variational Continual Learning

arXiv cs.LG · yesterday Cached

This paper introduces the Continual IVON (CoVON) optimizer, which integrates fast and slow adaptation into variational continual learning to balance stability and plasticity, outperforming existing methods in domain-incremental learning, continual pre-training, and fine-tuning of large language models.

0 favorites 0 likes
#deep-learning

DREG: A Layer-Wise Jacobian Regularization as a General-Purpose Penalty

arXiv cs.LG · yesterday Cached

This paper presents a large-scale empirical study of the Derivative Regularization (DREG) penalty, showing it achieves high accuracy and noise robustness, particularly with GELU activation and data-scarce regimes, positioning it as a general-purpose plug-and-play regularizer for neural networks.

0 favorites 0 likes
#deep-learning

ARIA: Adaptive Region-Based Importance Allocation for Conditional Diffusion Distillation

arXiv cs.LG · yesterday Cached

This paper introduces ARIA, a framework that adaptively allocates training effort across regions of the conditioning space for distilling conditional diffusion models, improving performance on unseen and underrepresented conditions.

0 favorites 0 likes
#deep-learning

Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America

arXiv cs.LG · yesterday Cached

This paper presents a deep learning approach using a spatio-temporal graph neural network (MTGNN) to reconstruct GRACE terrestrial water storage anomalies back to 1940 for South America, achieving high accuracy and outperforming previous methods with fewer predictors.

0 favorites 0 likes
#deep-learning

Exploring Dualistic Meta-Learning to Enhance Domain Generalization in Open Set Scenarios

arXiv cs.LG · yesterday Cached

Proposes a novel meta-learning strategy called MEDIC for open set domain generalization, which uses implicit gradient matching across domain and class splits to achieve better boundaries. Experiments show state-of-the-art performance.

0 favorites 0 likes
#deep-learning

Uncertainty-Aware Longitudinal Forecasting of Alzheimer's Disease Progression Using Deep Learning

arXiv cs.AI · yesterday Cached

This paper proposes a probabilistic framework for Alzheimer's disease progression forecasting that combines ordinal diagnosis prediction, multi-horizon trajectory generation, and decomposed uncertainty estimation using a Temporal Fusion Transformer encoder and an autoregressive Mixture Density Network. The model outperforms baselines on ADNI data, achieving near-nominal 90% credible interval coverage with clinically meaningful uncertainty signals.

0 favorites 0 likes
#deep-learning

MVG-KAN: Multi-View Geo-Wind Guided KAN for PM$_{2.5}$ Forecasting

arXiv cs.AI · yesterday Cached

This paper proposes MVG-KAN, a multi-view model integrating periodic-residual decomposition, a Geo-Wind Graph for wind-aware spatial dependencies, and a temporal KAN head for PM2.5 forecasting, achieving MAE 14.09 on Beijing data.

0 favorites 0 likes
#deep-learning

Aspect-Based Sentiment Evolution and its Correlation with Review Rounds in Multi-Round Peer Reviews: A Deep Learning Approach

arXiv cs.CL · yesterday Cached

This paper investigates the distribution and evolution of aspect-level sentiments in multi-round peer reviews from Nature Communications, using a deep learning approach (LCF-BERT-CDM) to achieve 82.65% Macro-F1, and finds that positive sentiment increases while negative sentiment decreases with more review rounds.

0 favorites 0 likes
#deep-learning

@tetsuoai: The entire core of a neural network on four cards. Neuron, forward pass, activations, backprop. Learn these four and yo…

X AI KOLs Timeline · yesterday Cached

A set of four cards covering the core concepts of neural networks: neuron, forward pass, activations, and backpropagation, aimed at helping learners understand how models from perceptrons to transformers work.

0 favorites 0 likes
#deep-learning

@JustinAngel: https://x.com/JustinAngel/status/2069482255312195980

X AI KOLs Timeline · yesterday Cached

Release of free workshop recordings and materials (23 videos, 250 slides, 50 exercises) for building your own LLM from fundamentals to transformer architecture, with no math or ML prerequisites.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback