I'm increasingly convinced LangGraph beats Claude Plugins

Reddit r/AI_Agents Tools

Summary

The author shares their experience building a Claude Plugin for stock valuation, finding it increasingly difficult to handle errors and complexity. They are now considering switching to LangGraph as a more reliable solution for multi-step agent workflows.

I spent three months building (and I thought perfecting) a Claude Plugin for Valuation of Public companies. It works well, fetches data from the SEC API, parses it into structured JSON, researches scenarios and builds forecasts and finally computes the math of the company's intrinsic value. About 90% of the time, I had no complaints, and honestly, when I began I was quite happy with these results. The first company I valued, while building the plugin, took me one month to get right. Four companies later , it now takes me \~1 day to value a company. I am finding that getting it to less than a day increasingly difficult. I'm playing whackamole with errors I get. I find that explicitly trying to tell the model to use checkpoints , solely in a SKILL file , e.g. "Make sure this test passes" is unreliable and unwieldy the more complex the plugin gets. If Claude does run the test it works fine, but sometimes it forgets to even run the test in the first place. I'm now looking towards LangGraph as my solution. I'm wondering if others have had similar experiences and what are your thoughts on LangGraph? Is there a hybrid solution I haven't figured out?
Original Article

Similar Articles