Personalized ASR system for accurate recognition of dysarthric speech

Background

Dysarthria is a neurological speech disorder that affects the clarity and intelligibility of spoken language, often resulting from conditions such as stroke, ALS, Parkinson’s disease, or cerebral palsy. Individuals with dysarthria experience slurred or slowed speech, making verbal communication difficult and impacting quality of life.

While augmentative and alternative communication (AAC) devices can assist, they often require fine motor control and offer much slower communication rates compared to natural speech. As digital voice interfaces become more central to everyday interactions, there is a growing need for speech recognition systems that can accurately interpret dysarthric speech and bridge this accessibility gap.

Technology overview

This technology introduces a personalized automatic speech recognition (ASR) system specifically designed for individuals with severe dysarthria. The core model is built using a uniquely large dataset comprising over 50 hours of speech from a single speaker with dysarthria, totaling more than 40,000 words and 187,000 phonemes.

The system employs hidden Markov models with extended state durations to capture the distinct acoustic characteristics of dysarthric speech and can be further enhanced using Gaussian Mixture Models, Deep Neural Networks, or Long Short-Term Memory architectures.

It is implemented in C++ using the Kaldi ASR toolkit and is deployable as a standalone application, web service, or embedded component across platforms. The system can be personalized with minimal user data, achieving over 85 percent word recognition accuracy, even in cases of low intelligibility.

Benefits

Accurately recognizes speech from users with severe dysarthria
Personalizes to individual users with minimal additional data
Supports open-vocabulary recognition beyond fixed phrase sets
Operates across platforms as a standalone or embedded system
Significantly outperforms commercial ASR systems in accuracy

Applications

Augmentative and alternative communication (AAC)
Voice-controlled assistive technologies
Speech accessibility for neurological conditions
Rehabilitation and speech therapy tools
Integration into commercial ASR platforms

Opportunity

Addresses a major unmet need in speech accessibility for millions with neurological disorders
Provides scalable, personalized, and accurate speech recognition
Compatible with existing assistive and commercial voice technologies
Available for exclusive licensing

Intellectual property

U.S. Provisional serial no. 63/658,764 filed on 06/11/2024

PCT application serial no. PCT/US2025/032777 06/06/2025. Published as https://patents.google.com/patent/WO2025259567A1/en?oq=WO+2025%2f259567+A1

Direct Link:

https://canberra-ip.technologypublisher.com/tech/Personalized_ASR_system_for_ accurate_recognition_of_dysarthric_speech

Bookmark this page

Download as PDF

For Information, Contact:

Gazell Call

Senior Intellectual Property Specialist

University of Texas at Austin

gazell.call@austin.utexas.edu