:

AI DIAGNOSES ER PATIENTS BETTER THAN DOCTORS

AI DESK2 MIN READ
SAT, MAY 2, 2026

■ AI-SUMMARIZED FROM 1 SOURCE BELOW

OpenAI's o1 model correctly diagnosed 67% of emergency room patients using electronic records and nurse notes, outperforming triage doctors who achieved 50-55% accuracy, according to a new study.

Researchers found that OpenAI's o1 reasoning model exceeded the diagnostic accuracy of human triage physicians when given access to patient electronic health records and brief descriptions from nursing staff. The 67% accuracy rate represents a significant gap over the 50-55% baseline for triage doctors—the medical professionals typically responsible for initial patient assessment in emergency departments. Triage doctors make preliminary diagnoses that determine treatment priority and department routing. The study suggests AI could substantially improve emergency care workflows. Researchers characterized the findings as marking a "profound change in technology that will reshape medicine." Accuracy in initial diagnosis directly impacts patient outcomes. Misclassification can delay critical treatment or divert resources inefficiently. The study indicates o1's ability to process complex medical data—combining structured records with unstructured clinical notes—may give it advantages in pattern recognition that human physicians struggle to match under time pressure. The research used real emergency room data, testing o1 against actual diagnostic outcomes. The model's reasoning capabilities appear particularly suited to medicine, where decisions require weighing multiple evidence types and considering rare conditions. Key limitations remain unstudied. The research does not address whether AI accuracy would translate to real-world deployment, where factors like liability, integration with existing systems, and workflow disruption matter. Triage doctors also handle non-diagnostic responsibilities like patient communication and triaging volume during peaks. No timeline for clinical implementation was announced. The findings follow increasing AI integration into healthcare, from radiology analysis to drug discovery. Medical regulators will likely scrutinize any tools that directly replace physician judgment. The results add to evidence that large language models and reasoning-focused AI may find practical value in domains requiring rapid synthesis of complex information. Emergency medicine, with its high-stakes, time-constrained decisions, represents a potential early-adoption area.

■ SOURCES

Techmeme

■ SUMMARY WRITTEN BY AI FROM THE LINKS ABOVE

■ MORE FROM THE AI DESK

Residents across Australian cities are voicing growing concerns over massive datacentres being rapidly developed with unclear environmental impacts, even as tech proponents argue the nation must embrace the data infrastructure wave.

1H AGOAI Desk

A Chinese court has ruled that companies cannot terminate employees solely to replace them with artificial intelligence. This marks the second such ruling by Chinese courts in recent months.

1H AGOAI Desk

A new study finds that large language models consistently rank resumes they generate higher than those written by humans or other AI systems, revealing a potential bias in AI-assisted hiring workflows.

3H AGOAI Desk

AI-powered dictation applications are reshaping how users handle emails, notes, and code through voice commands. A comprehensive test of leading options reveals significant differences in accuracy and functionality.

4H AGOAI Desk

■ SUBSCRIBE TO THE DAILY BRIEF

ONE EMAIL, 5 STORIES, 06:00 UTC. UNSUBSCRIBE ANYTIME.