Kronos: A Foundation Model for the Language of Financial Markets
Summary
Kronos is a new foundation model for financial K-line data that uses a specialized tokenizer and autoregressive pre-training to outperform existing models in forecasting and synthetic data generation.
View Cached Full Text
Cached at: 05/08/26, 08:33 AM
Paper page - Kronos: A Foundation Model for the Language of Financial Markets
Source: https://huggingface.co/papers/2508.02739
Abstract
Kronos, a specialized pre-training framework for financial K-line data, outperforms existing models in forecasting and synthetic data generation through a unique tokenizer and autoregressive pre-training on a large dataset.
The success of large-scale pre-training paradigm, exemplified by Large Language Models (LLMs), has inspired the development of Time Series Foundation Models (TSFMs). However, their application tofinancial candlestick(K-line) data remains limited, often underperforming non-pre-trained architectures. Moreover, existing TSFMs often overlook crucial downstream tasks such asvolatility predictionandsynthetic data generation. To address these limitations, we propose Kronos, a unified, scalable pre-training framework tailored to financial K-line modeling. Kronos introduces a specialized tokenizer that discretizes continuous market information intotoken sequences, preserving bothprice dynamicsandtrade activity patterns. We pre-train Kronos using anautoregressive objectiveon a massive, multi-market corpus of over 12 billion K-line records from 45 global exchanges, enabling it to learn nuanced temporal and cross-asset representations. Kronos excels in azero-shot settingacross a diverse set of financial tasks. On benchmark datasets, Kronos boostsprice series forecastingRankICby 93% over the leading TSFM and 87% over the best non-pre-trained baseline. It also achieves a 9% lowerMAEin volatility forecasting and a 22% improvement ingenerative fidelityfor synthetic K-line sequences. These results establish Kronos as a robust, versatile foundation model for end-to-end financial time series analysis. Our pre-trained model is publicly available at https://github.com/shiyu-coder/Kronos.
View arXiv pageView PDFGitHub23.5kautoAdd to collection
Get this paper in your agent:
hf papers read 2508\.02739
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper33
#### NeoQuasar/Kronos-base Time Series Forecasting• UpdatedSep 9, 2025 • 831k • 147
#### NeoQuasar/Kronos-Tokenizer-base Time Series Forecasting• UpdatedSep 9, 2025 • 2.6M • 51
#### NeoQuasar/Kronos-mini Time Series Forecasting• UpdatedSep 9, 2025 • 691k • 19
#### NeoQuasar/Kronos-small Time Series Forecasting• UpdatedSep 9, 2025 • 1.14M • 18
Browse 33 models citing this paper## Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2508.02739 in a dataset README.md to link it from this page.
Spaces citing this paper42
Collections including this paper16
Similar Articles
shiyu-coder/Kronos
Kronos is an open-source foundation model for financial K-line sequences, trained on data from over 45 global exchanges. It uses a specialized tokenizer and a decoder-only Transformer, and has been accepted at AAAI 2026.
@Huanusa: This is absolutely mind-blowing! Someone actually built an AI that can directly read candlestick trading, and its performance is through the roof! It's called Kronos — the world's first open-source foundational large model designed specifically for financial markets! Trained from scratch on 12 billion real candlestick data points from 45 exchanges, not a repurposed general AI. It can: price prediction + volatility prediction and more.
Kronos is the world's first open-source foundational large model for financial markets, trained from scratch on 12 billion real candlestick data points, supporting price prediction and volatility forecasting, far outperforming general models, and completely free and open-source.
CRONOS: Benchmarking Counterfactual Physical Consistency in Video Models
CRONOS is a benchmark that evaluates counterfactual physical consistency in video prediction models by intervening on viewpoint, scene, object category, and appearance while keeping physical event types fixed. It reveals substantial failures in current video generators.
Chronicle: A Multimodal Foundation Model for Joint Language and Time Series Understanding
Chronicle is a 324M-parameter decoder-only transformer pretrained from scratch on both natural language and time series, achieving competitive performance on NLU and time series classification tasks, and setting new state-of-the-art for frozen-embedding time series classification on UCR/UEA datasets.
@ForecastEng: Amazon quietly open-sourced a time-series model that forecasts out of the box. No training. No feature engineering. Poi…
Amazon open-sourced Chronos, a time-series forecasting model that predicts out of the box without training or feature engineering, treating forecasting like language models treat text.