AI DIAGNOSES ER PATIENTS BETTER THAN DOCTORS

AI DESK■ 2 MIN READ

SAT, MAY 2, 2026

■ AI-SUMMARIZED FROM 1 SOURCE BELOW

OpenAI's o1 model correctly diagnosed 67% of emergency room patients using electronic records and nurse notes, outperforming triage doctors who achieved 50-55% accuracy, according to a new study.

Researchers found that OpenAI's o1 reasoning model exceeded the diagnostic accuracy of human triage physicians when given access to patient electronic health records and brief descriptions from nursing staff. The 67% accuracy rate represents a significant gap over the 50-55% baseline for triage doctors—the medical professionals typically responsible for initial patient assessment in emergency departments. Triage doctors make preliminary diagnoses that determine treatment priority and department routing. The study suggests AI could substantially improve emergency care workflows. Researchers characterized the findings as marking a "profound change in technology that will reshape medicine." Accuracy in initial diagnosis directly impacts patient outcomes. Misclassification can delay critical treatment or divert resources inefficiently. The study indicates o1's ability to process complex medical data—combining structured records with unstructured clinical notes—may give it advantages in pattern recognition that human physicians struggle to match under time pressure. The research used real emergency room data, testing o1 against actual diagnostic outcomes. The model's reasoning capabilities appear particularly suited to medicine, where decisions require weighing multiple evidence types and considering rare conditions. Key limitations remain unstudied. The research does not address whether AI accuracy would translate to real-world deployment, where factors like liability, integration with existing systems, and workflow disruption matter. Triage doctors also handle non-diagnostic responsibilities like patient communication and triaging volume during peaks. No timeline for clinical implementation was announced. The findings follow increasing AI integration into healthcare, from radiology analysis to drug discovery. Medical regulators will likely scrutinize any tools that directly replace physician judgment. The results add to evidence that large language models and reasoning-focused AI may find practical value in domains requiring rapid synthesis of complex information. Emergency medicine, with its high-stakes, time-constrained decisions, represents a potential early-adoption area.

■ SOURCES

► Techmeme

■ SUMMARY WRITTEN BY AI FROM THE LINKS ABOVE

■ MORE FROM THE AI DESK

P619AUSTRALIANS PUSH BACK ON AI DATACENTRE BOOM

Residents across Australian cities are voicing growing concerns over massive datacentres being rapidly developed with unclear environmental impacts, even as tech proponents argue the nation must embrace the data infrastructure wave.

1H AGO— AI Desk

P615CHINESE COURT BLOCKS AI-BASED LAYOFFS

A Chinese court has ruled that companies cannot terminate employees solely to replace them with artificial intelligence. This marks the second such ruling by Chinese courts in recent months.

1H AGO— AI Desk

P613LLMS FAVOR OWN RESUMES IN HIRING BIAS TEST

A new study finds that large language models consistently rank resumes they generate higher than those written by humans or other AI systems, revealing a potential bias in AI-assisted hiring workflows.

3H AGO— AI Desk

P612TOP AI DICTATION APPS RANKED AND TESTED

AI-powered dictation applications are reshaping how users handle emails, notes, and code through voice commands. A comprehensive test of leading options reveals significant differences in accuracy and functionality.

4H AGO— AI Desk

◄ BACK TO NEWS

AI DIAGNOSES ER PATIENTS BETTER THAN DOCTORS

■ MORE FROM THE AI DESK

■ SUBSCRIBE TO THE DAILY BRIEF