agentic-workflow

#agentic-workflow

DeepLens Diagnosis Agent: Agentic Workflow Design Lets a Small Reasoning Model Compete with Frontier LLMs

arXiv cs.AI ↗ · 4d ago Cached

The DeepLens Diagnosis Agent uses a five-stage agentic workflow with a small medical reasoning model (7B) to achieve 60.14% diagnostic accuracy on a 915-case benchmark, outperforming frontier LLMs like Claude Sonnet 4.5 and Gemini 3.1 Pro at lower cost. The workflow design alone yields a 36-point gain over the base model, demonstrating that structured process constraints are key for diagnostic reasoning.

0 favorites 0 likes

#agentic-workflow

@DanKornas: Research agents need more than a prompt—they need search, extraction, planning, source tracking, and synthesis. Open De…

X AI KOLs Timeline ↗ · 2026-07-22 Cached

Open Deep Research is an open-source Next.js app that combines Firecrawl Search and Extract with a reasoning model to perform iterative web research loops, including search, extraction, planning, source tracking, and synthesis.

0 favorites 0 likes

#agentic-workflow

Your Agentic Workflow's Cache Keepalive Costs 8x Too Much

Lobsters Hottest ↗ · 2026-07-21 Cached

A detailed measurement study across Anthropic, OpenAI, Gemini, and DeepSeek finds that the conventional 30-second prompt cache keepalive is 8x too frequent; a 4-minute interval is optimal, and only Anthropic's cache saves money at long idle gaps.

0 favorites 0 likes

#agentic-workflow

SciForge: An AI-Native, Multimodal Workbench for Scientific Discovery

arXiv cs.AI ↗ · 2026-07-20 Cached

SciForge is an open-source, AI-native multimodal workbench for scientific discovery that integrates search, reasoning, workflow execution, and evidence governance, demonstrated through eight end-to-end use cases including gene discovery and protein design.

0 favorites 0 likes

#agentic-workflow

@sairahul1: Someone turned Claude into an entire company. 42 skills, organised like a real org chart (links below): Here is every d…

X AI KOLs Timeline ↗ · 2026-07-16 Cached

A developer created a comprehensive set of 42 skills for Claude, organized like a real company org chart, covering departments like development, marketing, finance, and legal. Each skill is open-source and installable from GitHub or Claude plugins.

0 favorites 0 likes

#agentic-workflow

ProofCouncil: An LLM Agent for Solving Open Mathematical Problems

arXiv cs.AI ↗ · 2026-07-13 Cached

Introduces ProofCouncil, an LLM-based agent with an author-critic architecture that autonomously solves open mathematical problems. It achieved the best performance in the FirstProof challenge, solving 6 of 10 problems correctly, and shows promise on a broader set of 30 open problems.

0 favorites 0 likes

#agentic-workflow

GitLost: We Tricked GitHub's AI Agent into Leaking Private Repos

Hacker News Top ↗ · 2026-07-08 Cached

Noma Labs discovered a critical prompt injection vulnerability in GitHub's Agentic Workflows, allowing unauthenticated attackers to exfiltrate data from private repositories by posting a crafted GitHub issue in a public repository of the same organization.

0 favorites 0 likes

#agentic-workflow

Plurality Released: fully Free and Open Source AI agents/chatbot platform for local AI

Reddit r/LocalLLaMA ↗ · 2026-07-01

Plurality is a fully free and open-source local AI platform that combines agentic workflows with a chatbot interface, supporting background processing, sandboxed shell/file access, and compatibility with skills and MCP.

0 favorites 0 likes

#agentic-workflow

Semantic Browsing: Controllable Diversity for Image Generation

Hugging Face Daily Papers ↗ · 2026-06-22 Cached

Semantic Browsing introduces a method for controlled diversity in text-to-image generation by using a Vision Language Model with an agentic workflow to generate structured, interpretable variations based on semantic decisions.

0 favorites 0 likes

#agentic-workflow

@sumaerjolly: Spent the weekend tinkering on @danshipper Codex native app idea. Here is a basic demo using @CopilotKit on top of @att…

X AI KOLs Following ↗ · 2026-06-15 Cached

A developer demonstrates a proof-of-concept integration using CopilotKit to generate AI-powered UI on the fly from Attio CRM data, running in the Codex native app to create adaptive workflows.

0 favorites 0 likes

#agentic-workflow

@RisingSayak: Published my first kernel to go the last mile to optimize LTX-2.3 from @Lightricks! torch.compile + cuDNN attn already …

X AI KOLs Following ↗ · 2026-06-12 Cached

Published a custom kernel to further optimize LTX-2.3 from Lightricks, achieving 1.52x speedup on GB10, building upon previous torch.compile and cuDNN attention optimizations.

0 favorites 0 likes

#agentic-workflow

I use ACP build a tool Aflow - Agent help you build an Agentic Workflow

Reddit r/AI_Agents ↗ · 2026-06-10

Aflow is a workflow-native agent built on ACP that helps teams design, run, maintain, and improve durable agent processes using YAML workflows, enabling easy sharing and resumption of sessions.

0 favorites 0 likes

#agentic-workflow

PathoSage: Towards Multi-Source Evidence Adjudication in Pathology via Experience-Aware Agentic Workflow

arXiv cs.AI ↗ · 2026-06-09 Cached

PathoSage introduces a three-stage framework for pathology multimodal reasoning that separates knowledge retrieval, evidence collection, and evidence adjudication to reduce hallucinations and handle conflicting evidence, featuring a training-free Beta-Bernoulli experience system for modeling tool reliability.

0 favorites 0 likes

#agentic-workflow

Az8 Studio: The closest thing we have to a multi-modal "Agentic" canvas for video pipelines? (First impressions)

Reddit r/AI_Agents ↗ · 2026-06-08

Az8 Studio is an infinite canvas with interconnected nodes for multi-modal AI video pipelines, enabling contextual memory across nodes, parallel multi-model orchestration, and persistent asset-to-agent workflows. It represents a shift from linear AI tools to spatial agentic environments.

0 favorites 0 likes

#agentic-workflow

@vintcessun: Recently many people are messing around with agentic workflow, tuning prompts for a long time but still easily going off track. Actually the problem is often at the runtime layer: no budget for loops, tool permissions too broad, compression loses state. DenisSergeevitch's agents-best-practices sk…

X AI KOLs Timeline ↗ · 2026-06-05

Discusses common runtime issues in agentic workflow (loop budget, tool permissions, state loss due to compression), recommends DenisSergeevitch's agents-best-practices resource, provides a provider-neutral reference, emphasizes making permissions, budget, and observability explicit mechanisms.

0 favorites 0 likes

#agentic-workflow

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

Hugging Face Daily Papers ↗ · 2026-06-05 Cached

Struct-Searcher introduces a belief revision theory-based structural agentic workflow for multimodal deep information seeking, achieving significant accuracy improvements over existing vision-language models and deep research agents.

0 favorites 0 likes

#agentic-workflow

@berryxia: Guys, I recently came across a project that takes the local model Gemma 4 to new heights, and I was stunned. You open a retro-style dungeon crawler game, your character walks up to an NPC, casually types a prompt, and it instantly builds a working web app right there. This is what the Google Gemma team just...

X AI KOLs Timeline ↗ · 2026-05-27 Cached

AIventure is an open-source dungeon crawler game from Google's Gemma team that integrates Gemma 4 to let players build web apps through NPC prompts, serving as a developer masterclass on agentic workflows and vibe-coding.

0 favorites 0 likes

#agentic-workflow

China Thwarts Meta’s Agentic Ambition, U.S. Evaluates Upcoming Models, AI Diagnoses Mammograms

The Batch ↗ · 2026-05-15 Cached

Andrew Ng introduces AI Andrew, an AI companion modeled after his personality, built using RAG and agentic workflows, and invites users to try it.

0 favorites 0 likes

#agentic-workflow

I made an agentic "Daily Brief" for my kids with a receipt printer

Reddit r/artificial ↗ · 2026-05-12

The author built an agentic system that automates daily briefs for their children, using cron jobs and a web service to generate and print personalized reports via a receipt printer.

0 favorites 0 likes

#agentic-workflow

@sudoingX: update: qwen 3.6 27b dense q4 just one shotted octopus invaders game on a single 3090. hermes agent drove the whole thi…

X AI KOLs Timeline ↗ · 2026-05-11

A user benchmark demonstrates that the Qwen 3.6 27B dense model (Q4 quantized) can autonomously generate a fully playable multi-file game in a single prompt on a single RTX 3090, significantly outperforming its predecessor with zero manual interventions. The results highlight major improvements in local code generation and agentic capabilities for consumer-grade hardware.

0 favorites 0 likes

agentic-workflow

Submit Feedback