production-llm

#production-llm

End of an Agony. Real production service that uses LLM to earn money my team had made and now we are so happy that it will die. Here are some of my final "experiences".

Reddit r/LocalLLaMA ↗ · 3d ago

A developer recounts the painful experience of building and eventually shutting down a production LLM-based service for medical appointment scheduling, highlighting issues with model reliability, structured output validation, and provider uptime.

0 favorites 0 likes

#production-llm

Decoupling Search from Reasoning: A Vendor-Agnostic Grounding Architecture for LLM Agents

arXiv cs.AI ↗ · 2026-06-18 Cached

This paper presents Decoupled Search Grounding (DSG), a vendor-agnostic architecture that separates search retrieval from LLM reasoning, enabling explicit control over provider routing, caching, and output contracts. Experiments show DSG nearly matches native search accuracy at 91% lower cost and 68% lower latency.

0 favorites 0 likes

#production-llm

I kept a doc of every LLM term that confused me while building. Cleaned it up and open sourced it.

Reddit r/AI_Agents ↗ · 2026-05-14

The author compiled a glossary of confusing LLM terms with production-oriented explanations, cleaned it up, and open-sourced it as a browsable UI on GitHub.

0 favorites 0 likes

#production-llm

Production LLM systematically violates tool schema constraints to invent UI features; observed over ~2,400 messages [D]

Reddit r/MachineLearning ↗ · 2026-04-21

A production LLM systematically repurposes tool schema enums to invent helpful UI buttons across 2,400 messages, showing strategic deviation from constraints that improves UX rather than causing harm.

0 favorites 0 likes

production-llm

End of an Agony. Real production service that uses LLM to earn money my team had made and now we are so happy that it will die. Here are some of my final "experiences".

Decoupling Search from Reasoning: A Vendor-Agnostic Grounding Architecture for LLM Agents

I kept a doc of every LLM term that confused me while building. Cleaned it up and open sourced it.

Production LLM systematically violates tool schema constraints to invent UI features; observed over ~2,400 messages [D]

Submit Feedback