academic-paper

#academic-paper

TabClaw: An Interactive and Self-Evolving Agent for Spreadsheet Manipulation and Table Reasoning

arXiv cs.CL ↗ · 5d ago Cached

TabClaw is an open-source interactive AI agent for spreadsheet manipulation and table reasoning that uses LLMs to automate data analysis, support multi-table reasoning, and adapt to user preferences through memory and skill extraction.

#academic-paper

FLaG: Fine-Grained Latent Grouping for Hallucination Detection

arXiv cs.LG ↗ · 2026-06-02 Cached

FLaG is a lightweight framework for hallucination detection in LLMs that models correctness via latent evidence groups and energy-based routing, achieving state-of-the-art performance across benchmarks.

#academic-paper

The Critical State of Cyberspace

Lobsters Hottest ↗ · 2026-05-30 Cached

This academic paper addresses the current critical state of cyberspace, likely discussing vulnerabilities, threats, and governance challenges.

#academic-paper

Alignment Tuning for Large Language Models: A Data-Centric Lens on Alignment Data Pipelines

arXiv cs.CL ↗ · 2026-05-27 Cached

This survey reframes the alignment tuning of large language models as a data pipeline design problem, decomposing it into three stages: response synthesis, preference evaluation, and preference instantiation. It identifies design trade-offs and failure modes, and outlines open challenges such as prompt-level alignment and agentic settings.

#academic-paper

@sheriyuo: Actually, I have been writing papers purely with AI from the very beginning. Previously I used DeepSeek R1; now I use V4. I don't have English academic writing ability, but I can tell by eye whether a sentence or passage is appropriate. As for Chinese writing, I am fairly confident. So almost 9...

X AI KOLs Timeline ↗ · 2026-05-23 Cached

The user describes writing academic papers entirely with AI (DeepSeek R1, then V4), emphasizing that a Chinese outline and fine-grained prompt tuning are key, and noting that manually editing AI-generated text is more tiring than writing the text themselves.

#academic-paper

AgentAtlas: Beyond Outcome Leaderboards for LLM Agents

arXiv cs.AI ↗ · 2026-05-22 Cached

This paper introduces AgentAtlas, a framework that goes beyond outcome-only leaderboards for LLM agents by proposing a six-state control-decision taxonomy and a nine-category trajectory-failure taxonomy to evaluate agent behavior more comprehensively.

#academic-paper

Evaluation of Chunking Strategies for Effective Text Embedding in Low-Resource Language on Agricultural Documents

arXiv cs.CL ↗ · 2026-05-22 Cached

This paper evaluates four text chunking strategies for Retrieval-Augmented Generation on Khmer agricultural documents, finding that character-based recursive chunking with a chunk size of 300 characters yields the best retrieval and relevance performance.

#academic-paper

DeepSlide: From Artifacts to Presentation Delivery

arXiv cs.AI ↗ · 2026-05-18 Cached

DeepSlide is a human-in-the-loop multi-agent system for the full presentation process, from requirement elicitation and time-budgeted narrative planning to evidence-grounded slide-script generation and rehearsal support. It introduces a dual-scoreboard benchmark separating static artifact quality from dynamic delivery excellence, and achieves gains in narrative flow, pacing precision, and slide-script synergy.

#academic-paper

SAT: Sequential Agent Tuning for Coordinator Free Plug and Play Multi-LLM Training with Monotonic Improvement Guarantees

arXiv cs.LG ↗ · 2026-05-08 Cached

This paper introduces Sequential Agent Tuning (SAT), a coordinator-free training paradigm for multi-LLM teams that provides monotonic improvement guarantees and plug-and-play invariance, enabling smaller models to outperform larger ones.
