LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices

Colaboradores:: Ing. Marvin Coto Jiménez, PhD.
Autores:: Marvin Coto-Jiménez and John Goddard-Close
Revista:: N/A
Editor:: Springer, Cham
URL:: https://link.springer.com/chapter/10.1007/978-3-319-39393-3_28

Resumen:

Recent developments in speech synthesis have produced systems capable of providing intelligible speech, and researchers now strive to create models that more accurately mimic human voices. One such development is the incorporation of multiple linguistic styles in various languages and accents. HMM-based speech synthesis is of great interest to researchers, due to its ability to produce sophisticated features with a small footprint. Despite such progress, its quality has not yet reached the level of the current predominant unit-selection approaches, that select and concatenate recordings of real speech. Recent efforts have been made in the direction of improving HMM-based systems. In this paper, we present the application of long short-term memory deep neural networks as a postfiltering step in HMM-based speech synthesis. Our motivation stems from a desire to obtain spectral characteristics closer …

Anuncios
- julio 05, 2024
  
  Horarios del II-2024
- Ene. 26, 2024
  
  Calendario de graduaciones 2024
- abril 12, 2016
  
  RADIO201
- abril 12, 2016
  
  Bolsa de Empleo
- abril 12, 2016
  
  ¡Nuevo sistema de información!

LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices

Resumen:

Anuncios

Horarios del II-2024

Calendario de graduaciones 2024

RADIO201

Bolsa de Empleo

¡Nuevo sistema de información!

Información de Contacto

Escuela de Ingeniería Eléctrica

Enlaces Útiles