Voice Activity Detection for Speech Enhancement Applications

Authors

  • E. Verteletskaya
  • K. Sakhnov

DOI:

https://doi.org/10.14311/1251

Keywords:

voice activity detection, periodicity measurement, voiced/unvoiced classification, speech analysis

Abstract

This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicity of the signal, full band signal energy and high band to low band signal energy ratio. Conventional VADs are sensitive to a variably noisy environment especially with low SNR, and also result in cutting off unvoiced regions of speech as well as random oscillating of output VAD decisions. To overcome these problems, the proposed algorithm first identifies voiced regions of speech and then differentiates unvoiced regions from silence or background noise using the energy ratio and total signal energy. The performance of the proposed VAD algorithm is tested on real speech signals. Comparisons confirm that the proposed VAD algorithm outperforms the conventional VAD algorithms, especially in the presence of background noise.

Downloads

Download data is not yet available.

Author Biographies

  • E. Verteletskaya
  • K. Sakhnov

Downloads

Published

2010-01-04

Issue

Section

Articles

How to Cite

Verteletskaya, E., & Sakhnov, K. (2010). Voice Activity Detection for Speech Enhancement Applications. Acta Polytechnica, 50(4). https://doi.org/10.14311/1251