Mass General Brigham Study Finds 21 Frontier LLMs Fail Early Clinical Reasoning More Than 80 Percent of the Time
A JAMA Network Open study using the new PrIME-LLM framework finds top AI models excel at final diagnoses with full data but collapse on differential diagnosis when patient information is incomplete.
3 min read2 sources