Urology Articles

Evaluation of ChatGPT for Patient Counseling in Kidney Stone Clinic: A Prospective Study

Mohamed Javid
Mahendra Bhandari, Henry Ford HealthFollow
P. Parameshwari
Madhu Reddiboina
Srikala Prasad

Recommended Citation

Javid M, Bhandari M, Parameshwari P, Reddiboina M, and Prasad S. Evaluation of ChatGPT for Patient Counseling in Kidney Stone Clinic: A Prospective Study. J Endourol 2024.

Document Type

Article

Publication Date

4-1-2024

Publication Title

Journal of endourology

Abstract

Introduction: The potential of large language models (LLMs) is to improve the clinical workflow and to make patient care efficient. We prospectively evaluated the performance of the LLM ChatGPT as a patient counseling tool in the urology stone clinic and validated the generated responses with those of urologists.

Methods: We collected 61 questions from 12 kidney stone patients and prompted those to ChatGPT and a panel of experienced urologists (Level 1). Subsequently, the blinded responses of urologists and ChatGPT were presented to two expert urologists (Level 2) for comparative evaluation on preset domains: accuracy, relevance, empathy, completeness, and practicality. All responses were rated on a Likert scale of 1 to 10 for psychometric response evaluation. The mean difference in the scores given by the urologists (Level 2) was analyzed and interrater reliability (IRR) for the level of agreement in the responses between the urologists (Level 2) was analyzed by Cohen's kappa.

Results: The mean differences in average scores between the responses from ChatGPT and urologists showed significant differences in accuracy (p < 0.001), empathy (p < 0.001), completeness (p < 0.001), and practicality (p < 0.001), except for the relevance domain (p = 0.051), with ChatGPT's responses being rated higher. The IRR analysis revealed significant agreement only in the empathy domain [k = 0.163, (0.059-0.266)].

Conclusion: We believe the introduction of ChatGPT in the clinical workflow could further optimize the information provided to patients in a busy stone clinic. In this preliminary study, ChatGPT supplemented the answers provided by the urologists, adding value to the conversation. However, in its current state, it is still not ready to be a direct source of authentic information for patients. We recommend its use as a source to build a comprehensive Frequently Asked Questions bank as a prelude to developing an LLM Chatbot for patient counseling.

Medical Subject Headings

Humans; Prospective Studies; Reproducibility of Results; Kidney Calculi; Dietary Supplements; Counseling; ChatGPT; large language models; patient queries; renal stone disease

PubMed ID

38411835

ePublication

ePub ahead of print

Volume

Issue

First Page

377

Last Page

383

Find It @ Sladen

COinS

Urology Articles

Evaluation of ChatGPT for Patient Counseling in Kidney Stone Clinic: A Prospective Study

Recommended Citation

Document Type

Publication Date

Publication Title

Abstract

Medical Subject Headings

PubMed ID

ePublication

Volume

Issue

First Page

Last Page

Browse

Author Corner

Urology Articles

Evaluation of ChatGPT for Patient Counseling in Kidney Stone Clinic: A Prospective Study

Authors

Recommended Citation

Document Type

Publication Date

Publication Title

Abstract

Medical Subject Headings

PubMed ID

ePublication

Volume

Issue

First Page

Last Page

Share

Browse

Author Corner