Speech_enhancement Knowpia

Speech enhancement aims to improve speech quality by using various algorithms. The objective of enhancement is improvement in intelligibility and/or overall perceptual quality of degraded speech signal using audio signal processing techniques.

Enhancing of speech degraded by noise, or noise reduction, is the most important field of speech enhancement, and used for many applications such as mobile phones, VoIP, teleconferencing systems, speech recognition, speaker diarization, and hearing aids.^[1]^[2]

Algorithms edit

The algorithms of speech enhancement for noise reduction can be categorized into three fundamental classes: filtering techniques, spectral restoration, and model-based methods.^[3]

Filtering Techniques

Spectral Subtraction Method
Wiener Filtering
Signal subspace approach (SSA)

Spectral Restoration

Minimum Mean-Square-Error Short-Time, Spectral Amplitude Estimator (MMSE-STSA)

Speech-Model-Based

References edit

^ J. Benesty, S. Makino, J. Chen (ed). Speech Enhancement. pp.1-8. Springer, 2005. ISBN 978-3-540-24039-6.
^ Sahidullah, Md; Patino, Jose; Cornell, Samuele; Yin, Ruiking; Sivasankaran, Sunit; Bredin, Herve; Korshunov, Pavel; Brutti, Alessio; Serizel, Romain; Vincent, Emmanuel; Evans, Nicholas; Marcel, Sebastien; Squartini, Stefano; Barras, Claude (2019-11-06). "The Speed Submission to DIHARD II: Contributions & Lessons Learned". arXiv:1911.02388 [eess.AS].
^ J. Benesty, M. M. Sondhi, Y. Huang (ed). Springer Handbook of Speech Processing. pp.843-869. Springer, 2007. ISBN 978-3-540-49125-5.

J. Benesty, M. M. Sondhi, Y. Huang (ed). Springer Handbook of Speech Processing. Springer, 2007. ISBN 978-3-540-49125-5.
J. Benesty, S. Makino, J. Chen (ed). Speech Enhancement. Springer, 2005. ISBN 978-3-540-24039-6.
P. C. Loizou. Speech Enhancement: Theory and Practice. CRC Press, 2013. ISBN 978-1-466-50421-9.

[1] J. Benesty, S. Makino, J. Chen (ed). Speech Enhancement. pp.1-8. Springer, 2005. ISBN 978-3-540-24039-6.

[2] Sahidullah, Md; Patino, Jose; Cornell, Samuele; Yin, Ruiking; Sivasankaran, Sunit; Bredin, Herve; Korshunov, Pavel; Brutti, Alessio; Serizel, Romain; Vincent, Emmanuel; Evans, Nicholas; Marcel, Sebastien; Squartini, Stefano; Barras, Claude (2019-11-06). "The Speed Submission to DIHARD II: Contributions & Lessons Learned". arXiv:1911.02388 [eess.AS].

[3] J. Benesty, M. M. Sondhi, Y. Huang (ed). Springer Handbook of Speech Processing. pp.843-869. Springer, 2007. ISBN 978-3-540-49125-5.

[1]

[2]

[3]

Summary

Algorithms edit

See also edit

References edit