Towards a reliable French speech recognition tool for an automated diagnosis of learning disabilities

Jihene Rezgui, Félix Jobin, Younes Kechout, Christine Turgeon and Foutse Khomh

Paper (2024)

Abstract

Dyslexia, characterized by severe challenges in reading and spelling acquisition, presents a substantial barrier to proficient literacy, resulting in significantly reduced reading speed (2 to 3 times slower) and diminished text comprehension. With a prevalence ranging from 5% to 10% in the population, early intervention by speech and language pathologists (SLPs) can mitigate dyslexia's effects, but the diagnostic bottleneck impedes timely support. To address this, we propose leveraging machine learning tools to expedite the diagnosis process, focusing on automating phonetic transcription, a critical step in dyslexia assessment. We investigated the practicality of two model configurations that use Google's speech-to-text API on children's speech in evaluation scenarios, and compared their results against transcriptions produced by experts. The first configuration relies on Google's speech-to-text API alone, while the second adds Phonemizer, a dictionary-based text-to-phonemes tool. Analysis of the results indicates that our Google-Phonemizer model yields reading accuracies comparable to those computed from human-made transcriptions, offering promise for clinical application. These findings underscore the potential of AI-driven solutions to enhance dyslexia diagnosis efficiency, paving the way for improved accessibility to vital SLP services.
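
The abstract does not say how reading accuracy is computed from a transcription. As an illustration only, the sketch below scores a phoneme sequence against a reference using edit distance, which is one common way to compare an automatic transcription with an expert one. The function names, the scoring formula, and the sample phonemes are assumptions made for this example and are not taken from the paper.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # phoneme missing from hyp (deletion)
                curr[j - 1] + 1,     # extra phoneme in hyp (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def phoneme_accuracy(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Assumed metric: 1 - (edit distance / reference length), floored at 0."""
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - edit_distance(ref, hyp) / len(ref))


# Hypothetical example: target word "bonjour" as IPA phonemes,
# and a reading in which one phoneme was dropped.
target = ["b", "ɔ̃", "ʒ", "u", "ʁ"]
spoken = ["b", "ɔ̃", "u", "ʁ"]
score = phoneme_accuracy(target, spoken)
```

Under this assumed metric, the same function could be applied once to the automatic transcription and once to the expert transcription, so that the two resulting accuracies can be compared for each reading sample.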

Department: Department of Computer Engineering and Software Engineering
Funders: FRQ-Inno
ISBN: 9798350385328
PolyPublie URL: https://publications.polymtl.ca/58796/
Conference Title: 2024 International Conference on Smart Applications, Communications and Networking (SmartNets 2024)
Conference Location: Harrisonburg, VA, USA
Conference Date(s): 2024-05-28 - 2024-05-30
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/smartnets61466.2024.10577676
Official URL: https://doi.org/10.1109/smartnets61466.2024.105776...
Date Deposited: 21 Aug 2024 00:09
Last Modified: 25 Sep 2024 16:51
Cite in APA 7: Rezgui, J., Jobin, F., Kechout, Y., Turgeon, C., & Khomh, F. (2024, May). Towards a reliable French speech recognition tool for an automated diagnosis of learning disabilities [Paper]. 2024 International Conference on Smart Applications, Communications and Networking (SmartNets 2024), Harrisonburg, VA, USA (6 pages). https://doi.org/10.1109/smartnets61466.2024.10577676
