SpaceX is partnering with Cursor to build advanced AI for coding and knowledge work, with an option to acquire Cursor for $60 billion later this year.
SpaceXAI and Cursor are collaborating to build advanced AI for coding and knowledge work, combining Cursor’s developer reach with SpaceX’s massive H100-equivalent Colossus supercomputer.
KWBench is a benchmark of 223 professional tasks that evaluates whether LLMs can recognize the underlying game-theoretic structure of a situation without being prompted to look for it. Even the best model succeeds on only 27.9% of tasks. The benchmark targets unprompted problem recognition, a step that precedes task execution, across domains such as acquisitions, clinical pharmacy, and fraud analysis.
STADLER, a 230-year-old waste sorting technology company, has embedded ChatGPT across its organization to improve knowledge-work productivity. It reports 30-40% time savings on documentation tasks, 2.5x faster drafting, and more than 85% daily active usage across 125 custom GPTs.
GPT-5.5 brings a 19-percentage-point improvement in multi-step reasoning and financial modeling, significantly reducing the burden of knowledge work, a result the Box team is excited about.