Skip to main content

Technologies

You are here

Speaker Independent Speech Inversion System

Print Save as PDF

Overview

Background
Speech inversion is the process of mapping acoustic signals into articulatory parameters. This is a well-known ill-posed problem and the addition of individual speaker differences make it even harder to solve. Normalizing the speaker differences is essential to effectively using multi-speaker articulatory data for training a speaker independent speech inversion system. Currently, there is an essential need to develop a speech inversion system that is speaker independent and can accurately estimate articulatory features for any speaker. Such a system would make a real impact in improving speech accent conversion, speech therapy, language learning, automatic speech recognition, and detection of mental health issues from speech.

Innovative Technology
Researchers at the University of Maryland have developed a robust and speaker independent speech inversion system. This system is able to perform acoustic-to-articulatory inversion to estimate the 6 Tract Variables, namely – Lip Aperture (LA), Lip Protrusion (LP), Tongue Body Constriction Location (TBCL), Tongue Body Constriction Degree (TBCD), Tongue Tip Constriction Location (TTCL), and Tongue Tip Constriction Degree (TTCD). The software is trained on multi-speaker data in a speaker independent manner to robustly estimate the Tract Variables from speech audio.

Speech Inversion is available for Academic/Non-profit use licensing. Please contact [email protected] for more info.

Applications

·Speech Conversion
·Speech Therapy
·Language Learning
·Speech Recognition
·Mental Health Diagnosis

Advantages

·Perform acoustic-to articulatory inversion
·Minimizes speaker variability

Contact Info

UM Ventures
0134 Lee Building
7809 Regents Drive
College Park, MD 20742
Email: [email protected]
Phone: (301) 405-3947 | Fax: (301) 314-9502