medical

#medical

FaithMed: Training LLMs For Faithful Evidence-Based Medical Reasoning

arXiv cs.CL ↗ · yesterday

FaithMed is a framework that trains LLMs for faithful evidence-based medical reasoning by integrating clinician-designed rubrics with reinforcement learning using step-level process reward assignment, achieving significant improvements over baselines on multiple medical benchmarks.

Cross-Domain Feature Expansion for Tabular Medical Data via Knowledge Graphs Injection

arXiv cs.AI ↗ · 3d ago

This paper introduces MedKGTab, a knowledge-injected framework that uses biomedical knowledge graphs to expand cross-domain features in tabular medical data, addressing data scarcity by generating high-fidelity biomedical profiles.

Discrete Diffusion Language Models for Interactive Radiology Report Drafting

Hugging Face Daily Papers ↗ · 3d ago

This paper adapts a mixture-of-experts diffusion language model, DiffusionGemma-26B, for interactive radiology report drafting, showing it matches or exceeds autoregressive models in medical VQA with 3.5-4.4x faster decoding and bidirectional infill capabilities.

IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations

arXiv cs.AI ↗ · 4d ago

IMCBench is a new benchmark for evaluating multimodal LLMs on image-grounded medical conversations, pairing clinical images with synthetic patient profiles. Evaluations across safety, accuracy, and uncertainty show that even strong models like Claude Opus 4.6 have safety issues, highlighting the need for multi-dimensional evaluation.

TriageRA-CCF: Source-Side Clinical Confidence and Coverage Signals for Adaptive Rank Budgeting in Medical LLMs

arXiv cs.CL ↗ · 4d ago

This paper proposes TriageRA-CCF, a method for adaptive rank budgeting in LoRA for medical question answering. It uses source-side signals (base-model confidence, clinical coverage, counterfactual proxy) to dynamically choose rank budgets, achieving modest accuracy gains on Qwen3-8B and Llama3.1-8B.

@MaziyarPanahi: A year ago, OpenMed didn't exist. Today: 340M model downloads. 1,500+ open medical models, all Apache 2.0. 650+ run on …

X AI KOLs Following ↗ · 5d ago

A year after its inception, OpenMed has achieved 340 million model downloads, offering over 1,500 open medical models under Apache 2.0, with 650+ capable of running on-device on iPhones.

Streaming medical STT running locally on a MacBook

Reddit r/LocalLLaMA ↗ · 2026-06-26

Describes a medical speech-to-text system that runs locally on a MacBook, enabling streaming transcription without cloud dependency.

Explainable Ensemble-Based Machine Learning Models for Detecting the Presence of Cirrhosis in Hepatitis C Patients

arXiv cs.AI ↗ · 2026-06-26

This paper applies ensemble machine learning models (Random Forest, Gradient Boosting, XGBoost, Extra Trees) to detect cirrhosis in hepatitis C patients using 28 features from 2038 Egyptian patients. The Extra Trees model achieved 96.92% accuracy with only 16 features, outperforming other models.

Fast medical RAG API to give your local LLMs access to facts

Reddit r/LocalLLaMA ↗ · 2026-06-25

A free RAG API using medical Wikipedia articles is now available to provide local LLMs with accurate medical facts, as demonstrated by correcting hallucinations about Lhermitte sign.

MedGuards: Multi-Agent System for Reliable Medical Error Detection and Correction

arXiv cs.CL ↗ · 2026-06-25

MedGuards proposes a multi-agent framework for detecting and correcting errors in medical text using specialized agents and confidence-guided arbitration, improving reliability without additional training. Experiments on multilingual clinical notes show significant improvements.

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

arXiv cs.CL ↗ · 2026-06-24

MMed-Bench-IR is a heterogeneous benchmark for multilingual medical information retrieval across six languages, evaluating cross-lingual alignment, concept discrimination, and evidence retrieval. It reveals severe performance drops for non-English queries, highlighting gaps in existing English-only evaluations.

@OpenAI: Many of these cases had evaded years of expert analysis. This study suggests AI could make expert-led periodic reanalys…

X AI KOLs ↗ · 2026-06-18

This study suggests that AI can make expert-led periodic reanalysis of old medical cases more scalable, helping clinicians revisit cases as medical knowledge advances and potentially bring answers to more cases that previously evaded analysis.

Towards Next-Generation Healthcare: A Survey of Medical Embodied AI for Perception, Decision-Making, and Action

arXiv cs.AI ↗ · 2026-06-16

This paper systematically surveys the core components of medical embodied AI, emphasizing the coordinated integration of perception, decision-making, and action in clinical environments, and reviews representative applications, datasets, and future research directions.

MSAIC-Net: A Multi-Scale Attention and Imbalance-Aware Contrastive Network for ECG-Based Myocardial Substrate Abnormality Detection

arXiv cs.LG ↗ · 2026-06-08

Proposes MSAIC-Net, a multi-scale attention-enhanced convolutional network for detecting myocardial substrate abnormalities from ECG signals, using imbalance-aware contrastive learning and lead-wise permutation importance for interpretability.

A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models

arXiv cs.CL ↗ · 2026-06-02

This paper presents a multi-domain red teaming framework for evaluating safety, robustness, and fairness of medical LLMs across 690 clinically grounded scenarios. Results show that high aggregate accuracy can mask critical failures, and hybrid evaluation with clinician oversight is necessary for credible safety assessment.

Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG

arXiv cs.CL ↗ · 2026-05-29

This paper introduces a framework for auditing source-dependence in medical multi-source RAG systems, releasing the TransplantQA benchmark, HERO-QA retrieval strategy, and a structured-output judge to measure inter-source answer relationships. It demonstrates that better retrieval reveals more disagreement than previously estimated, and argues for shifting NLP evaluation from answer correctness to inter-source relationship analysis.

Augmented Equivariant Mesh Networks for Anatomical Mesh Segmentation (ICML 2026 Workshops) [R]

Reddit r/MachineLearning ↗ · 2026-05-26

Presents EAMS, a lightweight equivariant mesh segmentation framework that generalizes across anatomical tasks, showing a trade-off between equivariance and accuracy on subtle features.

MedicalBench: Evaluating Large Language Models Toward Improved Medical Concept Extraction

arXiv cs.CL ↗ · 2026-05-21

MedicalBench is a new benchmark for evaluating large language models on medical concept extraction from electronic health records, focusing on implicit reasoning and evidence grounding. It includes 823 expert-annotated examples and shows that current models perform modestly, highlighting the difficulty of extracting implicitly stated medical concepts.

COTCAgent: Preventive Consultation via Probabilistic Chain-of-Thought Completion

arXiv cs.CL ↗ · 2026-05-15

COTCAgent is a hierarchical reasoning framework for longitudinal electronic health records that uses a probabilistic chain-of-thought completion approach, achieving 90.47% Top-1 accuracy on a self-built dataset and outperforming existing medical agents.

OpenAI for Healthcare

OpenAI Blog ↗ · 2026-01-08

OpenAI launches OpenAI for Healthcare, a suite of enterprise products including ChatGPT for Healthcare and API solutions designed to support HIPAA-compliant AI adoption across healthcare organizations. The offering features healthcare-optimized GPT-5 models, evidence-based retrieval with citations, policy integration, and workflow automation tools already deployed at major institutions like Stanford Medicine and UCSF.
