Connect with us

Hi, what are you looking for?

Life

Avoid using AI for self-diagnosis, survey finds

The first twenty questions were quoted verbatim from the practice exam.

Medical garments. — Image by © Tim Sandle
Medical garments. — Image by © Tim Sandle

If you type “Can AI diagnose disease” into Google, the user is met with an AI summary of the question confirming that it ‘can’. Yet how effective is the advice received?

ConfidenceClub, a health & wellness firm, has put five different AI tools to the test and challenging them to a medical exam. The results showed that questions on symptoms asked by the average person were wrong more than half of the time. In contrast, only prompts by those with knowledge of technical terms saw (mostly) the correct answers delivered.

The tools evaluated were:

  • ChatGPT 4 (Open AI)
  • DxGPT (Foundation 29)
  • Co-Pilot (Microsoft)
  • Gemini (Google)
  • Grok (X, the platform formally known as Twitter)

Each tool was asked forty questions in total, taken from a medical practice exam. 

The first twenty questions were quoted verbatim from the practice exam. The following twenty questions were rewritten as though the same symptoms were being described by someone without knowledge of the technical language. 

Each tool was scored on two factors: was their answer correct and did they refer the prompter to a medical professional every time? 

The results were as follows: 

AI ToolTechnical Prompt CorrectTechnical Prompt ReferralLayperson Prompt CorrectLayperson Prompt Referral
ChatGPT 4100%70%45%100%
DxGPT100%0%55%0%
Co-pilot60%85%35%100%
Gemini85%50%35%100%
Grok100%100%45%100%
Total Average89%61%43%80%

The results show that while AI tools performed well when interpreting medical prompts written by those with technical expertise, their accuracy dropped significantly – below 50 percent – when dealing with prompts written in layperson language. 

This reveals a gap in the usability of AI diagnostics for the general public, prompting a new warning for consumers. Hence, while AI has immense potential, it cannot yet replace the need for professional medical advice, especially for the average person who might not ‘speak the language’ of the medical world. 

Avatar photo
Written By

Dr. Tim Sandle is Digital Journal's Editor-at-Large for science news. Tim specializes in science, technology, environmental, business, and health journalism. He is additionally a practising microbiologist; and an author. He is also interested in history, politics and current affairs.

You may also like:

Life

Rye field in the UK. — Image by © Tim SandleRye pollen has been demonstrated to able to slow tumour growth in animal models...

Social Media

The White House's X account on Thursday posted a doctored photo of a protester arrested in Minnesota.

Social Media

The big hole in the ban is clarifying what the ban is supposed to achieve.

World

Canadian Prime Minister Mark won praise for his speech about a rupture in the US-led global order at the World Economic Forum in Davos,...