knowledge-bases

#knowledge-bases

AFFORDANCE20Q: Evaluating Affordance Reasoning from Physical Properties

arXiv cs.AI ↗ · 2026-06-15 Cached

Affordance20Q is a benchmark that evaluates LLMs' ability to reason about object affordances from physical properties without revealing object identity, using a 20-Questions format. Experiments show a ~20 point gap between LLMs and humans, and a proposed pipeline KARI improves open-source LLMs by up to 15.2 points.

0 favorites 0 likes

#knowledge-bases

Deliberative Curation: A Protocol for Multi-Agent Knowledge Bases

arXiv cs.AI ↗ · 2026-06-02 Cached

This paper introduces a deliberative curation protocol for multi-agent knowledge bases, addressing governance gaps such as agent statelessness and sycophancy. It evaluates the protocol via simulation, showing improved resilience under adversarial conditions.

0 favorites 0 likes

#knowledge-bases

@omarsar0: I did a talk on LLM Wikis and HTML artifacts recently, if you are curious to learn more on the topic: https://academy.d…

X AI KOLs Timeline ↗ · 2026-05-30 Cached

DAIR Academy announces a free live session on building visual LLM artifacts to make LLM knowledge bases more actionable, with updates on new tools and releases for Pro members.

0 favorites 0 likes

#knowledge-bases

DeepRefine: Agent-Compiled Knowledge Refinement via Reinforcement Learning

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

DeepRefine is a research paper introducing an LLM-based reasoning model that refines agent-compiled knowledge bases using reinforcement learning and multi-turn interactions to improve downstream task performance.

0 favorites 0 likes

#knowledge-bases

@omarsar0: For those interested, I will be doing a live session on this topic soon: https://academy.dair.ai/events/cmovobp97000904…

X AI KOLs Following ↗ · 2026-05-08 Cached

DAIR Academy is hosting a free live session on May 21, 2026, demonstrating a framework for building visual LLM artifacts to enhance knowledge bases.

0 favorites 0 likes

knowledge-bases

AFFORDANCE20Q: Evaluating Affordance Reasoning from Physical Properties

Deliberative Curation: A Protocol for Multi-Agent Knowledge Bases

@omarsar0: I did a talk on LLM Wikis and HTML artifacts recently, if you are curious to learn more on the topic: https://academy.d…

DeepRefine: Agent-Compiled Knowledge Refinement via Reinforcement Learning

@omarsar0: For those interested, I will be doing a live session on this topic soon: https://academy.dair.ai/events/cmovobp97000904…

Submit Feedback