Tag
The article argues that building LLM-driven AI tools still requires capturing domain knowledge, though this is easier than with previous generations of AI because the knowledge does not need to be strictly structured.
The author replies to comments on their viral post about LLMs eroding their software engineering career, discussing how AI automation is reducing the need for deep domain knowledge in fintech and the challenges of maintaining diligence in a vibecoding culture.
A tweet comparing AI adoption to replacing employees with geniuses who lack domain knowledge, with chaos as the result.
BODHI is a domain knowledge prompting method that improves LLM-based generation of formal OS kernel specifications by augmenting few-shot prompts with a structured C-to-Python translation guide, achieving up to 96.73% Pass@1 on the OSV-Bench benchmark.
YC General Partner @t_blom presented a talk on building self-improving, AI-native companies, emphasizing recursive AI loops and reducing headcount through AI automation.
This paper introduces FINESSE-Bench, a suite of eight specialized benchmarks with 3,993 questions for hierarchical evaluation of financial competencies in large language models, covering professional certification topics and applied trading tasks.