Tag
A new study demonstrates that AI-assisted peer review is vulnerable to low-cost manipulation via superficial rephrasing of paper abstracts, significantly inflating AI-generated review scores and potentially biasing human editorial decisions, highlighting the need for safeguards.
World Pilot enhances Vision-Language-Action models by incorporating dynamic scene evolution and trajectory priors from a World-Action Model, achieving state-of-the-art zero-shot performance on manipulation tasks.
A new paper co-authored by 30 experts examines epistemic risks from AI—threats to our ability to form accurate beliefs and reason well—including mechanisms like persuasion, cognitive offloading, and feedback loops, and outlines directions to mitigate these risks.
This paper introduces the FactualOpinionEditing with Evidence (FOE) benchmark to assess the ability to edit factual opinions in LLMs, and proposes a Self-Generated Evidence-Aligned method to improve opinion-evidence alignment.
RoboWits is a bi-manual robotic benchmark that systematically evaluates cognitive reasoning, creative tool use, and robustness to unexpected conditions, revealing significant performance gaps in current robot policies and pre-trained VLAs on mutated tasks.
DynaFLIP is a dynamics-aware multimodal pre-training framework that integrates motion understanding into visual perception for robot manipulation. It uses image-language-3D flow triplets and geometric regularization to improve representation learning, achieving significant gains in out-of-distribution scenarios.
Boston Dynamics demonstrates how the new Atlas humanoid robot learns complex manipulation tasks like lifting heavy refrigerators through simulation training, enabling rapid iteration from design to real-world execution with minimal simulation-to-reality gap.
The article argues that dexterous hands, rather than walking locomotion, will be the defining advancement in the next era of robotics.
Google updated its spam policies to explicitly prohibit attempts to manipulate its generative AI search results, with penalties for violators.
Elon Musk claims that only X is transparent, while other social media companies manipulate results behind closed doors.
Researchers from CMU and Bosch Center for AI introduced the Humanoid Transformer with Touch Dreaming (HTD) model, which uses tactile signal prediction to improve humanoid robot manipulation, achieving a 90.9% higher average success rate over the ACT baseline across five real-world tasks.
Introduces WarmPrior, a method that replaces the standard Gaussian source in flow-matching policies with a temporally grounded prior from recent action history, consistently improving success rates on robotic manipulation tasks by producing straighter probability paths.
Genesis AI highlights advancements in robotic manipulation using Gene 2.6.5, aiming to achieve human-level dexterity. The article discusses progress in training robots to perform complex physical tasks.
Researcher shares video clips of π0.7 model performing a shirt-folding manipulation task.
Google DeepMind releases new research and a toolkit for empirically measuring AI's potential to engage in harmful manipulation, based on studies with over 10,000 participants.