medical-llm


HiMed: Incentivizing Hindi Reasoning in Medical LLMs

arXiv cs.CL · 2026-05-26

Introduces HiMed, a Hindi medical reasoning corpus and benchmark suite, and HiMed-8B, a Hindi-form medical reasoning LLM that uses a decaying scaffolding reward. The summary reports improved Hindi medical reasoning and a smaller English–Hindi accuracy gap.


When Cases Get Rare: A Retrieval Benchmark for Off-Guideline Clinical Question Answering

arXiv cs.CL · 2026-05-22

Introduces OGCaReBench, a free-form retrieval benchmark for evaluating LLMs on clinical questions that require reasoning beyond standard guidelines. Experiments show that even the best model achieves only 56% accuracy, but retrieval augmentation boosts performance to 82%.


Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

arXiv cs.CL · 2026-05-21

This paper presents a large-scale assessment of medical LLMs, including custom MedGPTs and open-source models. It finds that 25-30% exhibit low factual accuracy and 33.6-54.3% violate operational thresholds, pointing to systemic safety risks.

0 favorites 0 likes
#medical-llm

How NOT to fine-tune your medical LLM; a look into Mark Kaplan's healtthruth.ai - "override and reframe foundational training"

Reddit r/ArtificialInteligence · 2026-05-15

This post critiques Mark Kaplan's approach to fine-tuning medical LLMs on his platform healtthruth.ai, highlighting the pitfalls of overriding foundational training in healthcare AI.
