MOOSE-Star (ICML 2026): 7B model + 108K-paper dataset for scientific hypothesis discovery
Summary
MOOSE-Star presents a 7B model fine-tuned from DeepSeek-R1-Distill-Qwen-7B for scientific hypothesis discovery, along with a dataset of 108K NCBI papers. The model achieves state-of-the-art inspiration retrieval accuracy, outperforming larger models like GPT-5.4 and Gemini-3 Pro.
Similar Articles
deepseek-ai/DeepSeek-V4-Pro
DeepSeek releases V4-Pro and V4-Flash, Mixture-of-Experts models supporting million-token context with hybrid attention and Muon optimizer.
@AlphaSignalAI: A 4B model can now anticipate scientific breakthroughs before scientists do. Researchers often build breakthroughs by c…
A new paper introduces GIANTS-4B, a 4-billion-parameter model trained with reinforcement learning to predict scientific insights by combining ideas from foundational papers, achieving higher similarity and citation potential than larger models like Gemini 3 Pro.
Open source battle: GLM vs Kimi vs MiMo vs DeepSeek
This article tests four open-source Chinese AI models — Zhipu GLM 5.1, Moonshot Kimi K2.6, Stepfun MIMO 2.5 Pro, and DeepSeek V4 Pro — on programming tasks. It finds that GLM leads overall in most tasks but not absolutely; each model has its own strengths and weaknesses.
@tom_doerr: Fully open sources training data for 30B scale search agents https://github.com/PolarSeeker/OpenSeeker…
OpenSeeker fully open-sources training data and models for 30B-scale ReAct-based search agents, achieving state-of-the-art performance on multiple benchmarks including BrowseComp and Humanity's Last Exam. It is the first purely academic project to reach frontier search benchmark performance while releasing complete training data.
@eliebakouch: one of my favorite projects is Marin from the stanford folks, they have a scientific approach to training, are ready to…
Marin is an open-source framework from Stanford for reproducible foundation model research, covering data curation, tokenization, training, and evaluation; it was used to train an 8B parameter model that outperforms Llama 3.1 8B.