Improving automatic speech recognition containing additive noise using deep denoising autoencoders of LSTM networks

Colaboradores:: Ing. Marvin Coto Jiménez, PhD.
Autores:: Marvin Coto-Jiménez and John Goddard-Close and Fabiola Martínez-Licona
Revista:: N/A
Editor:: Springer, Cham
URL:: https://link.springer.com/chapter/10.1007/978-3-319-43958-7_42

Resumen:

Automatic speech recognition systems (ASR) suffer from performance degradation under noisy conditions. Recent work, using deep neural networks to denoise spectral input features for robust ASR, have proved to be successful. In particular, Long Short-Term Memory (LSTM) autoencoders have outperformed other state of the art denoising systems when applied to the mfcc’s of a speech signal. In this paper we also consider denoising LSTM autoencoders (DLSTMA), but instead use three different DLSTMAs and apply each to the mfcc’s, fundamental frequency, and energy features, respectively. Results are given using several kinds of additive noise at different intensity levels, and show how this collection of DLSTMA’s improves the performance of the ASR in comparison with the LSTM autoencoder.

Improving automatic speech recognition containing additive noise using deep denoising autoencoders of LSTM networks

Resumen:

Anuncios

Fecha límite de recepción de solicitudes de matrícula por suficiencia y tutoría para I-2024

Horarios del I-2024

Calendario de graduaciones 2024

RADIO201

Bolsa de Empleo

Información de Contacto

Escuela de Ingeniería Eléctrica

Enlaces Útiles