Never miss an update from Georgetown University
Create your free account to connect with Georgetown University and thousands of other innovative organizations and professionals worldwide
- High-quality speech synthesis across multiple languages and accents with a single voice model
- Personalized speech synthesis capabilities, which enable users to utilize their own voices
- Requires less training data and allows for voice transformation
OVERVIEW
Researchers at Georgetown University have developed an innovative system to synthesize speech in any voice, in any language, and in any accent. The Text-to-Speech(TTS) system trained on numerous speakers and various languages using neural networks could use any voice to generate speech in any of the languages using any accent. Developed to address the growing demand for natural and versatile speech synthesis, this system is a significant advancement in the field of both artificial intelligence and linguistic processing.
The technical process involves three distinct steps: training Cross-Language TTS, enrolling a new speaker, and synthesizing speech. Step (1) is directed by system builders such as language engineers, while steps (2) and (3) are directed by end-users. Users have the freedom to utilize any of the built-in voices trained and skip step (2).
The application of high quality speech synthesis by the TTS is wide and associated with the proliferation of speech output in computer applications. A practical example of applying the TTS is in language learning applications. By integrating this technology, the platform can dynamically generate speech in various languages and accents, allowing learners to engage with conversations equivalent to those with native speakers. Whether practicing French with a Parisian accent or conversing in Spanish with a Latin American dialect, users can benefit from immersive conversational experiences and realistic interactions that enhance language acquisition and cultural understanding.
BACKGROUND
TTS is in the field of speech output, an unlimited market with high business value. TTS would give any technology giants a competitive edge in the competition for every market in every language.
Traditionally, speech synthesis has relied on parametric synthesis, which analyzes speech parameters dynamically or concatenative synthesis, which involves piecing together pre-recorded segments of speech. While concatenative synthesis has been favored for its higher quality to produce speech indistinguishable from human speech, it necessitates a significant investment in recording speech from a single speaker and restricts the synthesized voice to the domain of the recorded speech, limiting its versatility, especially across languages. In contrast, parametric synthesis offers advantages in terms of requiring less speech for training, and enabling transformation for speech in languages other than the recorded one.
While existing systems excel in single speech synthesis, the challenge lies in extending this capability to multiple languages with consistent voice quality. Finding multilingual voice talent is impractical. TTS used for personalized speech synthesis, allows users to utilize their own voices across various applications.
Benefit
Market Application
Publications
US Patent n.11,605,371
Our mission is to advance GU’s innovations through strategic alliances and new venture creation, to facilitate the translation of research breakthroughs into tangible solutions, and to cultivate a dynamic and inclusive environment for entrepreneurship. We advance this mission in support of the GU community and for the benefit of society.
Create your free account to connect with Georgetown University and thousands of other innovative organizations and professionals worldwide
Send a request for information
to Georgetown University
Technology Offers on Innoget are directly posted
and managed by its members as well as evaluation of requests for information. Innoget is the trusted open innovation and science network aimed at directly connect industry needs with professionals online.
Need help requesting additional information or have questions regarding this Technology Offer?
Contact Innoget support