A recent study has raised concerns about the reliability of artificial intelligence (AI) chatbots for health-related queries. Researchers found that while chatbots can provide general information, their accuracy varies significantly across medical fields, with some areas showing dangerously low performance.
Study Highlights Poor Performance in Key Specialties
The study, conducted by a team of medical and AI researchers, evaluated the responses of popular AI chatbots to a range of health questions. The findings revealed that internal medicine, neurology, and dermatology were among the worst-performing specialties. In these fields, chatbots achieved low validity scores and higher harm scores, indicating that their responses could potentially mislead users or lead to adverse health outcomes.
What the Harm Score Means
The harm score measures the potential negative impact of a chatbot's advice if followed by a user. A high harm score suggests that the information provided could result in delayed diagnosis, incorrect treatment, or other serious consequences. The study underscores the importance of caution when relying on AI for medical advice.
Broader Implications for AI in Healthcare
While AI has shown promise in assisting healthcare professionals, the study highlights the risks of direct consumer use. Chatbots lack the ability to understand context, ask follow-up questions, or verify symptoms through physical examination—capabilities essential for accurate diagnosis.
Recommendations for Users
Experts advise that individuals should not replace professional medical consultation with AI chatbots. Instead, these tools should be used as a starting point for gathering information, but any health concerns should be discussed with a qualified healthcare provider. The study also calls for improved regulation and transparency in AI health applications.
Future Directions
Researchers emphasize the need for ongoing evaluation and improvement of AI models to enhance their safety and reliability. As technology evolves, integrating medical expertise into AI training could help reduce errors and harm. Until then, users are urged to exercise caution.



