The DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (recorded in professional studios) and many low-quality versions of the same speech, totaling approximately 2,000 hours of speech data. DDS is built on top of two datasets: DAPS and VCTK. We play clean speech recordings (4 hours from DAPS and 8 hours from VCTK) and re-record the waveforms in nine environments (two offices, two conference rooms, three studios, one living room, one waiting room) on three different devices (one MEMS and two condenser microphones), producing 27 different recording conditions. Moreover, each condition consists of multiple recordings made at six different microphone positions to simulate various signal-to...
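The figures in the description above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that every clean hour is re-recorded once under each environment, device, and microphone position, which the description implies but does not state outright. The names `CLEAN_HOURS` and `dds_size` are hypothetical and not part of any DDS tooling.

```python
# Rough consistency check of the DDS size figures quoted in the description.
# Assumption (not confirmed by the source): every clean hour is re-recorded
# once for each environment x device x microphone-position combination.

CLEAN_HOURS = {"DAPS": 4, "VCTK": 8}  # hours of clean source speech
ENVIRONMENTS = 9  # 2 offices, 2 conference rooms, 3 studios, 1 living room, 1 waiting room
DEVICES = 3       # 1 MEMS and 2 condenser microphones
POSITIONS = 6     # microphone positions per recording condition


def dds_size():
    """Return (recording conditions, degraded versions, estimated total hours)."""
    conditions = ENVIRONMENTS * DEVICES   # environment x device pairs
    versions = conditions * POSITIONS     # one version per microphone position
    total_hours = sum(CLEAN_HOURS.values()) * versions
    return conditions, versions, total_hours


conditions, versions, total_hours = dds_size()
print(f"{conditions} conditions, {versions} versions, about {total_hours} hours")
```

Under that assumption the estimate comes to 1,944 hours of degraded speech, which is consistent with the "approximately 2,000 hours" quoted in the description.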
There are many types of degradation which can occur in Voice over IP calls. Degradations which occur...
The oral diadochokinesia (DDK) task is an established tool for assessing speech motor control that h...
This dataset consists of 25,921 recorded Vietnamese speeches (with their transcripts and the labelle...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
DR-VCTK is a device-recorded version of the VCTK dataset on common consumer devices (laptop, tablet and sm...
The DAPS (Device and Produced Speech) dataset is a collection of aligned versions of professionally ...
This is a modified version of a subset of the Device and Produced Speech (DAPS) dataset. The origina...
This is FOSD-based (extracted from approximately 30 hours of FPT Open Speech Data, released publicly ...
This is FOSD-based (extracted from approximately 30 hours of FPT Open Speech Data, released publicly ...
This is a modified version of the speech audio contained within the Ryerson Audio-Visual Database of...
Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms" This dep...
Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms" This dep...
Purpose: Oral diadochokinesis is a useful task in assessment of speech motor function in the context...
Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms" This dep...