ITALIC: An Italian Intent Classification Dataset ITALIC is a dataset of Italian audio recordings and contains annotation for utterance transcripts and associated intents. The ITALIC dataset was created through a custom web platform, utilizing both native and non-native Italian speakers as participants. The participants were required to record themselves while reading a randomly sampled short text from the MASSIVE dataset. ITALIC dataset containing 16,521 audio recordings collected by 70 different volunteers. The dataset is composed of: recordings: a folder containing the audio recordings in .wav format. It contains all the recordings composing the data collection. [CONFIG_NAME]_[SPLIT_NAME].json: the files containing metadata used for...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This document describes the results of an activity that was conducted at ITC-Irst for the design of ...
This paper describes IDEA a database of Italian dysarthric speech produced by 45 speakers affected b...
The AXIOM Voice Dataset has the main purpose of gathering audio recordings from Italian natural lang...
This repository contains Emozionalmente: an extensive Italian speech emotional corpus. The dataset c...
The LaMIT database consists in recordings of 100 Italian sentences. The sentences in the database we...
vocadito is a dataset of 40 short excerpts of solo, monophonic singing. The excerpts are sung in 7 d...
Europeana Sounds was a project focused on accessing digital audio files. The current dataset aims to...
In recent years, non-native speech has been a topic of continuous research interest in theoretical l...
DEMoS (Database of Elicited Mood in Speech), is a corpus of induced emotional speech in Italian. DEM...
Linguistic Miner is a project carried out at ILC whose objective is the development of an integrated...
The 3 datasets derived from the Italian (ItWiki-100), French (FrWiki-100) and English (EnWiki-100) W...
International audienceWith the emergence of neural end-to-end approaches for spoken language underst...
Modelling the process that a listener actuates in deriving the words intended by a speaker requires ...
In this paper we introduce the main features of the KIParla corpus, a new resource for the study of ...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This document describes the results of an activity that was conducted at ITC-Irst for the design of ...
This paper describes IDEA a database of Italian dysarthric speech produced by 45 speakers affected b...
The AXIOM Voice Dataset has the main purpose of gathering audio recordings from Italian natural lang...
This repository contains Emozionalmente: an extensive Italian speech emotional corpus. The dataset c...
The LaMIT database consists in recordings of 100 Italian sentences. The sentences in the database we...
vocadito is a dataset of 40 short excerpts of solo, monophonic singing. The excerpts are sung in 7 d...
Europeana Sounds was a project focused on accessing digital audio files. The current dataset aims to...
In recent years, non-native speech has been a topic of continuous research interest in theoretical l...
DEMoS (Database of Elicited Mood in Speech), is a corpus of induced emotional speech in Italian. DEM...
Linguistic Miner is a project carried out at ILC whose objective is the development of an integrated...
The 3 datasets derived from the Italian (ItWiki-100), French (FrWiki-100) and English (EnWiki-100) W...
International audienceWith the emergence of neural end-to-end approaches for spoken language underst...
Modelling the process that a listener actuates in deriving the words intended by a speaker requires ...
In this paper we introduce the main features of the KIParla corpus, a new resource for the study of ...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This document describes the results of an activity that was conducted at ITC-Irst for the design of ...
This paper describes IDEA a database of Italian dysarthric speech produced by 45 speakers affected b...