The aim of this work is to design and create a robust speech activity detector that is able to detect speech in different languages, in a noise environment and with music on background. I decided to solve this problem by using a neural network as a classification model that assigns one of the four possible classes - silence, speech, music, or noise to the input of audio recording. The resulting tool is able to detect the speech in at least 12 languages. Speech with musical background up to 88 % accuracy and system success on noisy data reaches from 84 % (5 dB SNR) to 88 % (20 dB SNR). This tool can be used for speech activity detection in various research areas of speech processing. The main contribution is the elimination of music, which w...
Voice activity detection (VAD) aims at identifying presence of speech in a noisy signal. For this pu...
Feed-forward multi-layer perceptrons (MLP) and recurrent neural networks (RNN) fed with different se...
This paper proposes a voice activity detection (VAD) method based on time and spectral domain featur...
This thesis follows the trend of last decades in using neural networks in order to detect speech in ...
Cílem této práce je navrhnout a vytvořit robustní detektor řečové aktivity, který je schopen detekov...
This thesis describes techniques for voice activity detection in audio recordings. It is necessary t...
###EgeUn###This paper proposes a voice activity detection (VAD) method based on time and spectral do...
This paper proposes a voice activity detection algorithm to be used in the presence of breathing noi...
The paper investigates the problem of voice activity detection from a noisy sound signal. An extreme...
This paper presents and compares two algorithms based on artificial neural networks (ANNs) for sound...
This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicit...
Voice activity detection (VAD) is a fundamental task in various speech-related applications, such as...
In this paper, we present a speech activity detection (SAD) tech-nique for speaker verification in n...
This research work aims at designing both text-dependent and text-independent speaker recognition sy...
In this paper, we present a database with speech in different types of background noises. The speech...
Voice activity detection (VAD) aims at identifying presence of speech in a noisy signal. For this pu...
Feed-forward multi-layer perceptrons (MLP) and recurrent neural networks (RNN) fed with different se...
This paper proposes a voice activity detection (VAD) method based on time and spectral domain featur...
This thesis follows the trend of last decades in using neural networks in order to detect speech in ...
Cílem této práce je navrhnout a vytvořit robustní detektor řečové aktivity, který je schopen detekov...
This thesis describes techniques for voice activity detection in audio recordings. It is necessary t...
###EgeUn###This paper proposes a voice activity detection (VAD) method based on time and spectral do...
This paper proposes a voice activity detection algorithm to be used in the presence of breathing noi...
The paper investigates the problem of voice activity detection from a noisy sound signal. An extreme...
This paper presents and compares two algorithms based on artificial neural networks (ANNs) for sound...
This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicit...
Voice activity detection (VAD) is a fundamental task in various speech-related applications, such as...
In this paper, we present a speech activity detection (SAD) tech-nique for speaker verification in n...
This research work aims at designing both text-dependent and text-independent speaker recognition sy...
In this paper, we present a database with speech in different types of background noises. The speech...
Voice activity detection (VAD) aims at identifying presence of speech in a noisy signal. For this pu...
Feed-forward multi-layer perceptrons (MLP) and recurrent neural networks (RNN) fed with different se...
This paper proposes a voice activity detection (VAD) method based on time and spectral domain featur...