enterprise-agents

#enterprise-agents

A Single Rewrite Suffices: Empirical Lessons from Production Skill Description Optimization

arXiv cs.CL ↗ · 19h ago Cached

This paper presents an automated pipeline for optimizing natural language skill descriptions in enterprise AI agents to resolve skill collisions, achieving performance matching manual tuning with a 32× speedup. Ablation studies show that a single LLM rewrite using error cases captures most improvements, while other design choices have minimal impact.

0 favorites 0 likes

#enterprise-agents

@swyx: LOTS of alpha in this pod: - Why Databricks beat Snowflake (! a straight answer!) - Why everyone is building a metaharn…

X AI KOLs Following ↗ · 2026-06-24 Cached

A Twitter thread highlights key takeaways from a Latent.Space podcast episode with Databricks co-founders, covering why Databricks beat Snowflake, the rise of metaharners, Neon's success, HTAP via LTAP, MosaicML's fate, and maintaining startup culture in a large company.

0 favorites 0 likes

#enterprise-agents

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Hugging Face Daily Papers ↗ · 2026-06-22 Cached

EnterpriseClawBench presents a benchmark for enterprise agents based on real-world workplace sessions, offering 852 reproducible tasks and comprehensive evaluation metrics beyond single performance scores.

0 favorites 0 likes

#enterprise-agents

Queen-Bee Agents: A BeeSpec-Centered Architecture for Governed Enterprise MCP Orchestration

arXiv cs.AI ↗ · 2026-06-08 Cached

This paper introduces Queen-Bee, a governed multi-agent architecture for enterprise MCP orchestration that separates planning and execution via a BeeSpec intermediate representation, achieving high task success rates with zero governance failures in prototype evaluations.

0 favorites 0 likes

#enterprise-agents

@tavilyai: Berlin was geht ab, Tavily ist jetzt in town! We're here with @GradiumAI showing off our new voice integration and host…

X AI KOLs Timeline ↗ · 2026-05-26 Cached

Tavily, Gradium, Nebius, and Cursor are hosting a full-day hackathon in Berlin on May 29th focused on building autonomous AI agents that can transact and execute. The event includes tech talks, building sessions, and prizes.

0 favorites 0 likes

#enterprise-agents

[N] LangChain Interrupt 2026 announcements [N]

Reddit r/MachineLearning ↗ · 2026-05-14

LangChain announced SmithDB, a distributed database for agent observability, Context Hub for managing agent context with an open memory standard, and Deep Agents v0.6 at Interrupt 2026, alongside enterprise case studies and keynotes by Andrew Ng and Harrison Chase.

0 favorites 0 likes

#enterprise-agents

Are we wasting time building enterprise agents on open-source models? (My experience with Ling 1T 2.6)

Reddit r/AI_Agents ↗ · 2026-05-07

An enterprise agent developer discusses the trade-offs of using open-source models like Ling 1T 2.6, highlighting the high overhead of optimization and benchmarking compared to proprietary APIs.

0 favorites 0 likes

enterprise-agents

A Single Rewrite Suffices: Empirical Lessons from Production Skill Description Optimization

@swyx: LOTS of alpha in this pod: - Why Databricks beat Snowflake (! a straight answer!) - Why everyone is building a metaharn…

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Queen-Bee Agents: A BeeSpec-Centered Architecture for Governed Enterprise MCP Orchestration

@tavilyai: Berlin was geht ab, Tavily ist jetzt in town! We're here with @GradiumAI showing off our new voice integration and host…

[N] LangChain Interrupt 2026 announcements [N]

Are we wasting time building enterprise agents on open-source models? (My experience with Ling 1T 2.6)

Submit Feedback