AI for Health Queries: How Accurate Are Chatbots? Study Reveals Risks

A recent study has raised concerns about the reliability of artificial intelligence (AI) chatbots for health-related queries. Researchers found that while chatbots can provide general information, their accuracy varies significantly across medical fields, with some areas showing dangerously low performance.

Study Highlights Poor Performance in Key Specialties

The study, conducted by a team of medical and AI researchers, evaluated the responses of popular AI chatbots to a range of health questions. The findings revealed that internal medicine, neurology, and dermatology were among the worst-performing specialties. In these fields, chatbots achieved low validity scores and higher harm scores, indicating that their responses could potentially mislead users or lead to adverse health outcomes.

What the Harm Score Means

The harm score measures the potential negative impact of a chatbot's advice if followed by a user. A high harm score suggests that the information provided could result in delayed diagnosis, incorrect treatment, or other serious consequences. The study underscores the importance of caution when relying on AI for medical advice.

—

Wide Pickt banner — collaborative shopping lists app for Telegram, phone mockup with grocery list

Broader Implications for AI in Healthcare

While AI has shown promise in assisting healthcare professionals, the study highlights the risks of direct consumer use. Chatbots lack the ability to understand context, ask follow-up questions, or verify symptoms through physical examination—capabilities essential for accurate diagnosis.

Recommendations for Users

Experts advise that individuals should not replace professional medical consultation with AI chatbots. Instead, these tools should be used as a starting point for gathering information, but any health concerns should be discussed with a qualified healthcare provider. The study also calls for improved regulation and transparency in AI health applications.

Future Directions

Researchers emphasize the need for ongoing evaluation and improvement of AI models to enhance their safety and reliability. As technology evolves, integrating medical expertise into AI training could help reduce errors and harm. Until then, users are urged to exercise caution.