Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations
Recommended Citation
Tran BD, Mangu R, Tai-Seale M, Lafata JE, Zheng K. Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations. AMIA Annu Symp Proc. 2022;2022:1072-1080.
Document Type
Article
Publication Date
4-1-2023
Publication Title
AMIA Annual Symposium Proceedings
Abstract
One promising solution to physicians' data entry burden is the development of so-called "digital scribes": tools that aim to automate clinical documentation via automatic speech recognition (ASR) of patient-clinician conversations. Evaluating specialized ASR models in this domain, which is useful for understanding feasibility and development opportunities, has been difficult because most such models were still under development. Following the commercial release of such models, we report an independent evaluation of four models, two general-purpose and two tuned for medical conversation, on a corpus of 36 primary care conversations. We observe word error rates (WER) of 8.8%-10.5% and word-level diarization error rates (WDER) of 1.8%-13.9%, which are generally lower than previously reported. The findings indicate that, while there is room for improvement, the performance of these specialized models, at least under ideal recording conditions, may be sufficient to support the development of downstream applications that rely on ASR of patient-clinician conversations.
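For context on the primary metric reported above: the paper's scoring tooling is not specified here, but WER is conventionally defined as (substitutions + deletions + insertions) divided by the number of reference words, computed via word-level Levenshtein distance; WDER additionally penalizes words attributed to the wrong speaker. The sketch below is a minimal, illustrative WER implementation under that standard definition; the function name and example sentences are hypothetical, not from the paper.

```python
# Minimal sketch of word error rate (WER), assuming the standard definition
#   WER = (substitutions + deletions + insertions) / reference word count,
# computed via word-level Levenshtein distance. Illustrative only; the
# paper does not specify its scoring tooling.

def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must be non-empty")
    # d[i][j] = edit distance between the first i reference words
    # and the first j hypothesis words
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # delete all i reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j  # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + sub)  # match or substitution
    return d[len(ref)][len(hyp)] / len(ref)

# Example: one substitution ("meds" for "medication") in a 6-word reference
print(wer("please take your medication twice daily",
          "please take your meds twice daily"))  # 1/6, i.e. about 16.7%
```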
Medical Subject Headings
Humans; Speech Recognition Software; Speech Perception; Communication; Speech; Documentation
PubMed ID
37128439
Volume
2022
First Page
1072
Last Page
1080