Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments

1 Edición - 4 de septiembre de 2024
Última edición
Autor: Xiao-Lei Zhang
Idioma: Inglés

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing and its applic… Leer más

Descripción

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing and its applications. The book begins by looking at the basics of deep learning and common deep network models, followed by front-end algorithms for deep learning-based speech denoising, speech detection, single-channel speech enhancement multi-channel speech enhancement, multi-speaker speech separation, and the applications of deep learning-based speech denoising in speaker verification and speech recognition.

Puntos claves

Provides a comprehensive introduction to the development of deep learning-based robust speech processing
Covers speech detection, speech enhancement, dereverberation, multi-speaker speech separation, robust speaker verification, and robust speech recognition
Focuses on a historical overview and then covers methods that demonstrate outstanding performance in practical applications

De interès para

Senior undergraduate students, graduate students, and professionals with a solid foundation in speech signal processing and machine learning who are engaged in intelligent speech processing

Índice

1. Introduction

2. Fundamentals of Deep Learning

3. Voice Activity Detection

4. Single-Channel Speech Enhancement

5. Multi-Channel Speech Enhancement

6. Multi-Speaker Speech Separation

7. Speaker Recognition

8. Speech Recognition

Detalles del producto

Edición: 1
Última edición
Publicado: 9 de septiembre de 2024
Idioma: Inglés

Sobre el autor

Xiao-Lei Zhang

Xiao-Lei Zhang received his Ph.D. degree with honors from Tsinghua University, China, in 2012. He was a postdoctoral researcher with the Department of Electronic Engineering at Tsinghua University from 2012 to 2014. He was a visiting scholar at The Ohio State University, USA, from 2013 to 2014 and a postdoctoral researcher with the Department of Computer Science and Engineering, The Ohio State University, from 2014 to 2016. Since 2016 he has been a full professor at the Northwestern Polytechnical University, Xi'an, China.

His research interests are the topics in speech processing, machine learning, statistical signal processing, and artificial intelligence.

Afiliaciones y experiencia

Northwestern Polytechnical University, Xi'an, China

Ver libro en ScienceDirect

Lee Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments en ScienceDirect

Descubrar libros e ebooks

Áreas temáticas

Títulos en inglés