DeepSpec - a deepseek-ai Collection
Summary
DeepSeek AI released the DeepSpec collection on Hugging Face: twelve speculative decoding models (four each of dspark, dflash, and eagle3) based on Qwen3 and Gemma4, with listed sizes from 0.9B to 3B.
Cached at: 06/28/26, 04:09 PM
Source: https://huggingface.co/collections/deepseek-ai/deepspec (updated about 3 hours ago)
#### deepseek-ai/dspark_qwen3_4b_block7
1B • Updated about 4 hours ago • 4
Note: dspark_qwen3_4b_block7

#### deepseek-ai/dspark_qwen3_8b_block7
2B • Updated about 4 hours ago • 1
Note: dspark_qwen3_8b_block7

#### deepseek-ai/dspark_qwen3_14b_block7
3B • Updated about 4 hours ago • 2
Note: dspark_qwen3_14b_block7

#### deepseek-ai/dspark_gemma4_12b_block7
3B • Updated about 4 hours ago • 3
Note: dspark_gemma4_12b_block7

#### deepseek-ai/dflash_qwen3_4b_block7
1B • Updated about 4 hours ago
Note: dflash_qwen3_4b_block7

#### deepseek-ai/dflash_qwen3_8b_block7
2B • Updated about 4 hours ago
Note: dflash_qwen3_8b_block7

#### deepseek-ai/dflash_qwen3_14b_block7
3B • Updated about 4 hours ago • 1
Note: dflash_qwen3_14b_block7

#### deepseek-ai/dflash_gemma4_12b_block7
3B • Updated about 4 hours ago • 2
Note: dflash_gemma4_12b_block7

#### deepseek-ai/eagle3_qwen3_4b_ttt7
0.9B • Updated about 4 hours ago
Note: eagle3_qwen3_4b_ttt7

#### deepseek-ai/eagle3_qwen3_8b_ttt7
2B • Updated about 4 hours ago • 1
Note: eagle3_qwen3_8b_ttt7

#### deepseek-ai/eagle3_qwen3_14b_ttt7
2B • Updated about 4 hours ago
Note: eagle3_qwen3_14b_ttt7

#### deepseek-ai/eagle3_gemma4_12b_ttt7
2B • Updated about 4 hours ago • 1
Note: eagle3_gemma4_12b_ttt7
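The repository names in this collection appear to follow one pattern: method, model family, a size tag, and a trailing setting. The sketch below splits a name into those four parts so the twelve entries can be grouped or filtered. The field names (`method`, `family`, `size_tag`, `setting`) and the helper `parse_draft_name` are hypothetical, and what `block7` and `ttt7` actually mean is not stated in the listing, so the code only records them without interpreting them.

```python
import re
from typing import NamedTuple


class DraftName(NamedTuple):
    """Parts of a repo name; field meanings are assumptions, not documented."""
    method: str    # e.g. "dspark", "dflash", "eagle3"
    family: str    # e.g. "qwen3", "gemma4"
    size_tag: str  # e.g. "4b", "12b" (size tag as written in the name)
    setting: str   # e.g. "block7", "ttt7" (meaning not given in the listing)


# Optional "deepseek-ai/" prefix, then four underscore-separated fields.
_NAME_RE = re.compile(
    r"^(?:deepseek-ai/)?([a-z0-9]+)_([a-z0-9]+)_(\d+b)_([a-z]+\d+)$"
)


def parse_draft_name(repo_id: str) -> DraftName:
    """Split a repo id such as 'deepseek-ai/dspark_qwen3_4b_block7'."""
    match = _NAME_RE.match(repo_id)
    if match is None:
        raise ValueError(f"unrecognized repo name: {repo_id!r}")
    return DraftName(*match.groups())


if __name__ == "__main__":
    for repo in (
        "deepseek-ai/dspark_qwen3_4b_block7",
        "deepseek-ai/eagle3_gemma4_12b_ttt7",
    ):
        print(parse_draft_name(repo))
```

Note that the size tag in the name differs from the size shown in the listing (for example, `dspark_qwen3_4b_block7` is listed as 1B), so the two should be kept as separate fields.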
Similar Articles
deepseek-ai/DeepSeek-V4-Flash-DSpark
DeepSeek releases V4 series of Mixture-of-Experts language models (Pro 1.6T/49B activated, Flash 284B/13B activated) supporting one-million-token context with hybrid attention and speculative decoding, claiming best open-source model performance.
DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]
DeepSeek open-sourced DeepSpec, a full-stack codebase for training and evaluating draft models for speculative decoding, enabling 60-85% faster generation. It includes data preparation, training, and evaluation scripts with support for multiple draft model algorithms (DSpark, DFlash, Eagle3).
@danielhanchen: DeepSeek just released DSpark for V4 Flash & Pro, a new speculative decoding method boosting throughput by 51% to 400%!…
DeepSeek released DSpark, a speculative decoding method that boosts throughput by 51% to 400% for V4 Flash & Pro, along with the open-source DeepSpec codebase for training and evaluating draft models.
@charles_irl: it’s hot spec summer
DeepSeek has open-sourced DeepSpec, a full-stack codebase for training and evaluating speculative decoding models.
deepseek-ai/DeepSeek-V4-Pro-DSpark
DeepSeek releases preview versions of its V4 series, including DeepSeek-V4-Pro (1.6T parameters, 49B activated) and DeepSeek-V4-Flash (284B parameters, 13B activated), both supporting a one-million-token context and featuring hybrid attention, manifold-constrained hyper-connections, and a Muon optimizer.