microsoft-research

#microsoft-research

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069424192274252094

X AI KOLs Timeline ↗ · 6h ago Cached

Microsoft's NextLat introduces a training objective that rewards belief-state representations instead of relying solely on next-token prediction, pushing models toward compact world models for better generalization.

0 favorites 0 likes

#microsoft-research

Next-Latent Prediction Transformers [R]

Reddit r/MachineLearning ↗ · 6d ago

Microsoft Research introduces Next-Latent Prediction (NextLat), a self-supervised method that trains transformers to predict their own next latent state, enabling compact world models for reasoning and planning and achieving up to 3.3x faster inference via self-speculative decoding.

0 favorites 0 likes

#microsoft-research

@MSFTResearch: 30x faster analytics, GPU kernels generated automatically from SQL, AI matched to lab-grown tumor models for cancer tre…

X AI KOLs Following ↗ · 2026-06-15 Cached

Microsoft Research highlights multiple advances including 30x faster analytics with CoddSpeed, AI wildlife re-identification, and LLMs that learn across tasks without retraining in the latest Research Focus newsletter.

0 favorites 0 likes

#microsoft-research

@MSFTResearch: Project Ire examined a timely malware sample and determined its intent through reverse engineering—identifying LOTUSLIT…

X AI KOLs Following ↗ · 2026-06-12 Cached

Microsoft's Project Ire, an autonomous malware-classification agent, successfully identified a LOTUSLITE variant that evaded major EDR tools through behavioral reverse engineering without relying on IOC signatures.

0 favorites 0 likes

#microsoft-research

@HuggingPapers: Microsoft Research introduces Arbor A generalist autonomous research agent that uses persistent hypothesis-tree refinem…

X AI KOLs Following ↗ · 2026-06-11 Cached

Microsoft Research introduces Arbor, a generalist autonomous research agent that uses persistent hypothesis-tree refinement for cumulative learning, outperforming Codex and Claude Code across six research tasks and achieving 86% Any-Medal on MLE-Bench Lite.

0 favorites 0 likes

#microsoft-research

@HuggingPapers: Microsoft Research introduces Mirage Latent spatial memory stores 3D scenes directly as latent tokens, skipping the cos…

X AI KOLs Following ↗ · 2026-06-09 Cached

Microsoft Research introduces Mirage, a latent spatial memory that stores 3D scenes as latent tokens, achieving up to 10.57x faster video generation and 55x lower memory use with state-of-the-art consistency.

0 favorites 0 likes

#microsoft-research

@MSFTResearch: Evaluating agentic behaviors at scale, making the case for repositories over documents, and inviting researchers worldw…

X AI KOLs Following ↗ · 2026-06-01 Cached

Microsoft Research's latest newsletter highlights AgentPex, an open-source system for automated evaluation of agentic behaviors; new theoretical work on variance reduction for ranking systems; a call to shift from documents to repositories for human-agent collaboration; and a global challenge on AI value alignment.

0 favorites 0 likes

#microsoft-research

@dair_ai: https://x.com/dair_ai/status/2061104052818108476

X AI KOLs Following ↗ · 2026-05-31 Cached

A roundup of three notable AI papers: SkillOpt treats skill documents as trainable parameters to optimize frozen agents; a new method compiles agentic workflows into model weights for 100x cost reduction; and AutoScientists introduces a decentralized agent team for long-running science without a central planner.

0 favorites 0 likes

#microsoft-research

@omarsar0: New research from Microsoft Research I see a lot of AI engineers handwriting agent skill docs and hope they generalize.…

X AI KOLs Following ↗ · 2026-05-25 Cached

Microsoft Research introduces SkillOpt, a method that treats agent skill documents as trainable external state, using an optimizer model to make bounded edits validated by a held-out set. The approach achieves best or tied results across 52 evaluation cells and improves accuracy by over 23 points on GPT-5.5, with zero extra inference cost and transferable skills.

0 favorites 0 likes

#microsoft-research

@Yif_Yang: Introducing SkillOpt — an optimizer for agent skills. Instead of finetuning model weights, we treat a natural-language …

X AI KOLs Timeline ↗ · 2026-05-25 Cached

Introducing SkillOpt, an optimizer that treats natural-language skills as trainable external parameters instead of finetuning model weights. It uses bounded edits and validation gating to enable stable, controllable skill updates, achieving best or tied-best results across 52 settings on 6 benchmarks with 7 models.

0 favorites 0 likes

#microsoft-research

@DanKornas: Most AI agents still split vision, language, and action across separate systems. Magma is a Microsoft Research foundati…

X AI KOLs Timeline ↗ · 2026-05-23 Cached

Magma is an open-source repository from Microsoft Research for building multimodal AI agents that integrate vision, language, and action, providing model links, inference examples, training instructions, and demos.

0 favorites 0 likes

#microsoft-research

@KengGuangLong: https://x.com/KengGuangLong/status/2057311636348944738

X AI KOLs Timeline ↗ · 2026-05-21 Cached

Microsoft's 2026 Future of Work report indicates that generative AI is reshaping the workplace at an unprecedented pace, but the benefits are highly unevenly distributed, with junior roles hit hardest; AI is evolving from an acceleration tool to a collaboration partner, making human professional judgment even more crucial.

0 favorites 0 likes

#microsoft-research

@MSFTResearch: New tools, models, repos, and papers out of Microsoft Research are here. Use AI and agents? It's worth watching: • Mage…

X AI KOLs Following ↗ · 2026-05-15 Cached

Microsoft Research announced new tools, models, repositories, and papers, including MagenticLite, agentic GitHub workflows, verification-first agents, and meaning-matching fine-tuning, during the Microsoft Research Forum virtual series.

0 favorites 0 likes

#microsoft-research

@MSFTResearch: Introducing GridSFM, a small foundation model that can predict AC optimal power flow in milliseconds, boosting efficien…

X AI KOLs Following ↗ · 2026-05-13 Cached

Microsoft introduces GridSFM, a small foundation model that can predict AC optimal power flow in milliseconds, significantly improving grid efficiency and reducing congestion costs.

0 favorites 0 likes

#microsoft-research

@MSFTResearch: MatterSim is expanding what AI can do for materials science—from faster large-scale simulations to MatterSim-MT, a new …

X AI KOLs Following ↗ · 2026-05-12 Cached

Microsoft Research announces MatterSim updates including MatterSim-MT, a multi-task foundation model for materials characterization, faster simulation (3-5x speedup), and experimental validation of thermal conductivity predictions for a new material.

0 favorites 0 likes

#microsoft-research

Switchcraft: AI Model Router for Agentic Tool Calling

arXiv cs.AI ↗ · 2026-05-11 Cached

This paper introduces Switchcraft, the first AI model router specifically optimized for agentic tool calling to reduce inference costs. By using a lightweight DistilBERT classifier, it achieves significant cost savings while maintaining high accuracy in tool-use tasks.

0 favorites 0 likes

#microsoft-research

@HowToAI_: Microsoft Research + Salesforce has published a paper that should scare every single AI builder right now. It’s called …

X AI KOLs Timeline ↗ · 2026-05-09

A new paper by Microsoft Research and Salesforce reveals that LLM performance drops significantly in multi-turn conversations due to a 'Lost in Conversation' phenomenon, challenging the reliability of current single-turn benchmarks.

0 favorites 0 likes

#microsoft-research

A Randomized Scheduler with Probabilistic Guarantees of Finding Bugs

Lobsters Hottest ↗ · 2026-05-09 Cached

This Microsoft Research paper introduces a randomized scheduling technique designed to provide probabilistic guarantees for uncovering bugs in software systems. Published for the ASPLOS conference, it focuses on systematic fault detection through algorithmic randomness.

0 favorites 0 likes

#microsoft-research

DataDignity: Training Data Attribution for Large Language Models

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces DataDignity, a framework and benchmark (FakeWiki) for pinpoint provenance, aiming to identify the specific training data sources that support an LLM's response. It proposes ScoringModel and SteerFuse methods to improve attribution accuracy over standard retrieval baselines.

0 favorites 0 likes

#microsoft-research

AgenticRAG: Agentic Retrieval for Enterprise Knowledge Bases

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces AgenticRAG, a framework from Microsoft that enhances enterprise knowledge base retrieval by equipping LLMs with tools for iterative search, document navigation, and analysis. It demonstrates significant improvements in recall and factuality over standard RAG pipelines on multiple benchmarks.

0 favorites 0 likes

microsoft-research

Submit Feedback