Tag
A developer tested how small edge models (LFM2.5, Gemma variants) retain a single fact across conversation turns, finding that models often confidently deny knowing information that remains in context, posing a trust issue for agent architectures and suggesting a trade-off between memory and format discipline.
Lantern introduces a lightweight memory layer that archives conversation turns and retrieves relevant details after compaction, recovering 78.3% of lost facts with zero LLM calls and outperforming MemGPT-based methods.
Jeff Dean thanks the host of Two Minute Papers for a conversation about AI research, with a link to the full video in the thread.
Terence Tao and Mark Chen discuss how AI is changing mathematical research, from literature search to code generation, and the need to adapt workflows.
A user shares their firsthand experience and impressions from talking to Opus 4.8, an AI language model.
The paper introduces SeDT, a training-free inference-time method that improves LLM reliability in multi-turn conversations by annotating conversation history with cumulative relevance scores from three signals, achieving up to +37.7% performance gains on the Lost-in-Conversation benchmark.
This paper introduces Found in Conversation (FiC), a training framework using View-Asymmetric Self-Distillation to close the multi-turn performance gap in LLMs. The method teaches models to recover single-turn competence from underspecified multi-turn prompts, achieving 92-100% recovery across model families and sizes.
This paper introduces the eJSL Dialog dataset for emotion recognition in sign language conversations, addressing the lack of conversational context in existing datasets. Benchmarking shows a domain gap when applying generic multimodal models, highlighting the need for context-aware visual extractors for sign language.