This is FOSD-based (extracted from approximately 30-hour of FPT Open Speech Data, released publicly in 2018 by FPT Corporation, under FPT Public License) Male Speech Dataset which is useful for creating text-to-speech model. It comprises of 9474 audio files totalling more than 10.5 recording hours. All files are in *.wav format (16 kHz sampling rate, 32-bit float, mono). This dataset is useful for various TTS-related applications.THE DATA OR SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WH...
Code and data to reproduce figure files in "Design and Evaluation of Personal Audio Systems bas...
This dataset includes 24,000 5-seconds-long polyphonic stereo soundscapes composed of sounds taken f...
The processed audio files included in the Free Open-Access Misophonia Stimuli (FOAMS) project to cur...
This is FOSD-based (extracted from approximately 30-hour of FPT Open Speech Data, released publicly ...
This dataset consists of 25,921 recorded Vietnamese speeches (with their transcripts and the labelle...
This is the 1st FPT Open Speech Data (FOSD) and Tacotron-2 -based Text-to-Speech Model Dataset for V...
FSD-FS is a publicly-available database of human labelled sound events for few-shot learning. It spa...
The DAPS (Device and Produced Speech) dataset is a collection of aligned versions of professionally ...
Dataset accompanying the paper "Objective speech outcomes after surgical treatment for oral cancer: ...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
USPDATRO ========== Underrepresented Speech Dataset from Open Data: Case Study on the Romanian Lang...
This is a speech dataset for Fongbe language spoken mostly in Benin. The folder contains the follow...
The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided ...
Code and data to reproduce figure files in "Design and Evaluation of Personal Audio Systems bas...
This dataset includes 24,000 5-seconds-long polyphonic stereo soundscapes composed of sounds taken f...
The processed audio files included in the Free Open-Access Misophonia Stimuli (FOAMS) project to cur...
This is FOSD-based (extracted from approximately 30-hour of FPT Open Speech Data, released publicly ...
This dataset consists of 25,921 recorded Vietnamese speeches (with their transcripts and the labelle...
This is the 1st FPT Open Speech Data (FOSD) and Tacotron-2 -based Text-to-Speech Model Dataset for V...
FSD-FS is a publicly-available database of human labelled sound events for few-shot learning. It spa...
The DAPS (Device and Produced Speech) dataset is a collection of aligned versions of professionally ...
Dataset accompanying the paper "Objective speech outcomes after surgical treatment for oral cancer: ...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
DDS (Device-Degraded Speech) dataset provides aligned parallel recordings of high-quality speech (re...
USPDATRO ========== Underrepresented Speech Dataset from Open Data: Case Study on the Romanian Lang...
This is a speech dataset for Fongbe language spoken mostly in Benin. The folder contains the follow...
The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided ...
Code and data to reproduce figure files in "Design and Evaluation of Personal Audio Systems bas...
This dataset includes 24,000 5-seconds-long polyphonic stereo soundscapes composed of sounds taken f...
The processed audio files included in the Free Open-Access Misophonia Stimuli (FOAMS) project to cur...