ChatGPT passes interventional cardiology certification exam

Michael Walter | May 03, 2024 | Cardiovascular Business | Interventional Cardiology

ChatGPT, OpenAI’s massively popular artificial intelligence (AI) chatbot, passed an interventional cardiology certification exam provided by the American College of Cardiology (ACC) and Society of Cardiovascular Angiography and Interventions (SCAI), according to a new research letter in JACC: Cardiovascular Interventions.[1]

However, fellows taking the same exam consistently achieved better scores.

“This study demonstrates AI’s capacity to answer multiple-choice questions from the field of interventional cardiology. However, it cautions against relying on ChatGPT-4 for decision-making within the field, as it often provides incorrect answers,” wrote senior author Emmanouil S. Brilakis, MD, PhD, director of the Center for Complex Coronary Interventions at the Minneapolis Heart Institute, and colleagues.

Brilakis et al. asked ChatGPT-4 to answer the simulated exam questions in two formats: multiple choice with forced justification (MC-J), which required answers to be explained, and multiple choice without forced justification (MC-NJ), which did not. The large language model’s score was better when answering in the MC-J format (76.7%) than when answering in the MC-NJ format (61.7%). It answered five out of six image-based questions correctly, but was unable to handle video-based questions.

When faced with more challenging questions that fewer than 75% of fellows answered correctly, ChatGPT did quite well, providing the right answer 75% of the time. On the questions that were easier for fellows, leading to a correct answer more than 75% of the time, ChatGPT got 77.3% correct. All in all, the average number of fellows answering questions correctly was 82.2%, higher than ChatGPT’s MC-J performance of 76.7%.

Brilakis and colleagues also emphasized the fact that the AI chatbot appeared to do worse when asked to answer the ACC/SCAI questions again. Its MC-J performance dropped from 76.7% to 65%.

“Specifically, three previously incorrect answers were corrected, whereas 10 previously correct answers were answered incorrectly upon retesting,” the authors wrote.

ChatGPT update promises key improvements

In April, OpenAI announced the launch of a new GPT-4 Turbo update with “improved capabilities in writing, math, logical reasoning and coding.” Would this latest version have performed even better than ChatGPT? Researchers are likely already working toward an answer.

Paid ChatGPT users now have access to GPT-4 Turbo.

Click here to read the full research letter.

Find more cardiology AI news

ChatGPT struggles to evaluate heart risk—but it could still help cardiologists

TAVR or SAVR? ChatGPT could help cardiologists decide

AI helps cardiologists deliver personalized healthcare—but there is still plenty of work to do

ChatGPT struggles with echocardiography, but still shows potential to help cardiology trainees

Cardiologists ask popular AI model ChatGPT to answer questions about cardiology

Michael Walter, Managing Editor

Michael has more than 18 years of experience as a professional writer and editor. He has written at length about cardiology, radiology, artificial intelligence and other key healthcare topics.

Around the web

Radiology Business

The impact of Trump tariffs on iodine contrast media costs

GE HealthCare said the price of iodine contrast increased by more than 200% between 2017 to 2023. Will new Chinese tariffs drive costs even higher?

Radiology Business

Key takeaways from RSNA 2024 for AI, CT, MRI and the radiology staffing shortage

Several key trends were evident at the Radiological Society of North America 2024 meeting, including new CT and MR technology and evolving adoption of artificial intelligence.

Health Imaging

Use of AI opportunistic screening in CT for cardiovascular disease

Ron Blankstein, MD, professor of radiology, Harvard Medical School, explains the use of artificial intelligence to detect heart disease in non-cardiac CT exams.