foundation-models

#foundation-models

BehaviorBench: Benchmarking Foundation Models for Behavioral Science Tasks

arXiv cs.CL ↗ · yesterday Cached

This paper introduces BehaviorBench, a comprehensive benchmark for evaluating foundation models on behavioral science tasks including behavior prediction, strategic decision-making, subject-trait inference, and behavioral knowledge application. It also presents Be.FM-1.5, a fine-tuned model that achieves strong distributional alignment, highlighting the gap between general-purpose and behaviorally adapted models.

0 favorites 0 likes

#foundation-models

PORTER: Language-Grounded Event Representations for Portable Structured EHR Foundation Models

arXiv cs.CL ↗ · yesterday Cached

PORTER is a language-grounded structured EHR foundation model that represents clinical events through text descriptions and numeric values, enabling vocabulary-independent transfer across institutions without retraining. On pediatric prediction tasks, PORTER matches fixed-vocabulary models and recovers 97.1% of AUROC when transferred to unseen event descriptions.

0 favorites 0 likes

#foundation-models

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

NVIDIA Blog ↗ · 2d ago Cached

The NAIRR pilot program, powered by NVIDIA AI infrastructure, has supported over 700 research projects, including the development of the Walrus foundation model for fluid simulations and the MIST molecular foundation models for energy storage.

0 favorites 0 likes

#foundation-models

The data black hole at the center of AI

Reddit r/artificial ↗ · 3d ago Cached

This article deeply analyzes the problem that AI's sample efficiency is far lower than that of humans, pointing out that frontier models require massive amounts of domain-specific data, while humans can learn from just a few examples. This data black hole is a core bottleneck in current AI development. Through multiple comparisons (annotation volume, robot manipulation, driving) and refuting common objections, the article demonstrates the severity of this gap and explores its impact on the goals of AI automation.

0 favorites 0 likes

#foundation-models

Laguna by Poolside

Product Hunt ↗ · 4d ago

Poolside introduces Laguna, a foundation model for agentic coding and long-horizon work.

0 favorites 0 likes

#foundation-models

@KempeLab: I am excited to share that I am joining @amilabs as Director of Research, Paris, working with @ylecun and an exceptiona…

X AI KOLs Following ↗ · 6d ago Cached

An AI researcher announces joining AmiLabs as Director of Research in Paris, working with Yann LeCun and a team focused on world modeling and foundation models.

0 favorites 0 likes

#foundation-models

Do Time Series Foundation Model Benchmarks Hide Regime-Dependent Failures? Evidence from Traffic Speed Forecasting

arXiv cs.LG ↗ · 2026-06-18 Cached

This paper introduces regime-stratified evaluation for time series foundation models, revealing that aggregate metrics hide severe failures during traffic regime transitions, and proposes bimodal mixture augmentation to improve coverage while preserving overall accuracy.

0 favorites 0 likes

#foundation-models

DeFAb: A Verifiable Benchmark for Defeasible Abduction in Foundation Models

arXiv cs.AI ↗ · 2026-06-18 Cached

Introduces DeFAb, a verifiable benchmark for defeasible abduction in foundation models, comprising over 372K instances and revealing that current frontier models perform poorly on this form of logical reasoning, with accuracy as low as 23.5% under robust evaluation.

0 favorites 0 likes

#foundation-models

HumanScale: Egocentric Human Video Can Outperform Real-Robot Data for Embodied Pretraining

Hugging Face Daily Papers ↗ · 2026-06-18 Cached

This paper finds that egocentric human video, when processed with a filtering and labeling pipeline, can outperform teleoperated real-robot data for pretraining embodied foundation models, achieving lower validation loss and higher success rates on real-robot tasks.

0 favorites 0 likes

#foundation-models

DeepInsight: A Unified Evaluation Infrastructure Across the Physical AI Stack

arXiv cs.AI ↗ · 2026-06-17 Cached

This paper introduces DeepInsight, a unified evaluation infrastructure for Physical AI stacks that spans from foundation model decoding to whole-body control, preserving heterogeneity through three narrow abstractions to enable cross-layer diagnostics.

0 favorites 0 likes

#foundation-models

Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis

arXiv cs.LG ↗ · 2026-06-17 Cached

This paper systematically evaluates foundation model representations for multimodal cancer analysis, benchmarking unimodal and multimodal fusion strategies on real-world cohorts, and assessing trustworthiness via conformal prediction.

0 favorites 0 likes

#foundation-models

@gregbarbosa: Apple didn't, so I did: I made it dead simple to run macOS 27's local and Private Cloud Compute Foundation models in an…

X AI KOLs Following ↗ · 2026-06-16 Cached

fm-proxy is a drop-in proxy that lets any app accepting an OpenAI API URL run macOS 27's local and Private Cloud Compute Foundation models, with no extra servers or keys.

0 favorites 0 likes

#foundation-models

Overcoming the Impedance Mismatch: A Theoretical Roadmap for Fusing Foundation Models and Knowledge Graphs

arXiv cs.AI ↗ · 2026-06-16 Cached

This paper formalizes the 'Impedance Mismatch' between foundation models and knowledge graphs, and proposes a theoretical roadmap for neuro-symbolic fusion using structured residual streams, vector symbolic architectures, and orthogonal subspace editing.

0 favorites 0 likes

#foundation-models

Towards Next-Generation Healthcare: A Survey of Medical Embodied AI for Perception, Decision-Making, and Action

arXiv cs.AI ↗ · 2026-06-16 Cached

This paper systematically surveys the core components of medical embodied AI, emphasizing the coordinated integration of perception, decision-making, and action in clinical environments, and reviews representative applications, datasets, and future research directions.

0 favorites 0 likes

#foundation-models

Towards End-to-End Automation of AI Research

arXiv cs.AI ↗ · 2026-06-16 Cached

A paper presenting The AI Scientist, a system that automates the entire research lifecycle from idea generation to peer review, demonstrating AI's growing capacity for scientific contribution.

0 favorites 0 likes

#foundation-models

Hierarchical Modeling of ICD Codes in EHR Foundation Models

arXiv cs.AI ↗ · 2026-06-16 Cached

This paper investigates explicit encoding of ICD-10-CM hierarchy in EHR foundation models, using hierarchical token augmentation and graph-based code representations. Experiments on MIMIC-IV and eICU show improvements over flat code representations for in-domain and cross-dataset prediction tasks.

0 favorites 0 likes

#foundation-models

Apple Foundation Models

Hacker News Top ↗ · 2026-06-15

Apple has developed its own foundation models for AI, signaling its entry into the large language model space with proprietary technology.

0 favorites 0 likes

#foundation-models

Learning the Context of Errors: Black-Box Online Adaptation of Time Series Foundation Models

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper proposes ORCA, a method for black-box online adaptation of time series foundation models by learning the context of predictive errors. It demonstrates effectiveness across five TSFMs and eight datasets, addressing the challenge of adapting closed-source API-based models.

0 favorites 0 likes

#foundation-models

Multi-Modal Agents for Power Distribution Defect Detection: An Evaluation of Foundation Models

arXiv cs.AI ↗ · 2026-06-12 Cached

This paper introduces a Multi-Modal Agent framework for power distribution defect detection, evaluating foundation models on perception, reasoning, and tool usage capabilities, with a new domain-specific dataset and benchmark.

0 favorites 0 likes

#foundation-models

A Tutorial on World Models and Physical AI

arXiv cs.AI ↗ · 2026-06-12 Cached

This tutorial presents a coherent framework unifying diverse world modeling approaches for physical AI, covering explicit and implicit world models and their role in prediction, reasoning, and planning.

0 favorites 0 likes

foundation-models

Submit Feedback