Data used in the Interspeech 2022 paper "BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model" This is a corpus of literary texts that have been annotated with prominence and boundary features using the Wavelet Prosody Toolkit (https://github.com/asuni/wavelet_prosody_toolkit). Each text is read by three separate speakers. A subcorpus of contrastively focused pronouns is also provided. Train and Test sets for prominence prediction task. -Lines starting with '' identify the utterances. Here you will find book/speaker/chapter/chapter-utterance# information. -The columns of the remaining lines: word / quantized CWT prominence features / quantized CWT boundary features / Ra...
Previous processing studies have shown that constituents that are prosodically marked as focus lead ...
While End-2-End Text-to-Speech (TTS) has made significant progresses over the past few years, these ...
We present a new methodological approach which combines both naturally-occurring speech harvested on...
International audienceSeveral recent studies have tested the use of transformer language model repre...
International audienceSeveral recent studies have tested the use of transformer language model repre...
Several recent studies have tested the use of transformer language model representations to infer pr...
Slides for presentation at workshop New Tools and Methods for Very-Large-Scale Phonetics Research, U...
Abstract This paper examines the role that linguistic and cognitive prominence play in the resolutio...
Focus is central to our control of information flow in dialogue. Spoken language understanding syste...
Focus is central to our control of information flow in dialogue. Spoken language understanding syst...
Intonational prominence, or accent, is a fundamental prosodic feature that is said to contribute to ...
According to “Centering Theory”, an entity that links to the prior discourse could receive a...
In English, certain words are perceptually more salient than other neighboring words. The perceptual...
Wagner P, Origlia A, Avesani C, et al. Disentagling and Connecting Different Perspectives on Prosodi...
Wagner P, Origlia A, Avesani C, et al. Disentagling and Connecting Different Perspectives on Prosodi...
Previous processing studies have shown that constituents that are prosodically marked as focus lead ...
While End-2-End Text-to-Speech (TTS) has made significant progresses over the past few years, these ...
We present a new methodological approach which combines both naturally-occurring speech harvested on...
International audienceSeveral recent studies have tested the use of transformer language model repre...
International audienceSeveral recent studies have tested the use of transformer language model repre...
Several recent studies have tested the use of transformer language model representations to infer pr...
Slides for presentation at workshop New Tools and Methods for Very-Large-Scale Phonetics Research, U...
Abstract This paper examines the role that linguistic and cognitive prominence play in the resolutio...
Focus is central to our control of information flow in dialogue. Spoken language understanding syste...
Focus is central to our control of information flow in dialogue. Spoken language understanding syst...
Intonational prominence, or accent, is a fundamental prosodic feature that is said to contribute to ...
According to “Centering Theory”, an entity that links to the prior discourse could receive a...
In English, certain words are perceptually more salient than other neighboring words. The perceptual...
Wagner P, Origlia A, Avesani C, et al. Disentagling and Connecting Different Perspectives on Prosodi...
Wagner P, Origlia A, Avesani C, et al. Disentagling and Connecting Different Perspectives on Prosodi...
Previous processing studies have shown that constituents that are prosodically marked as focus lead ...
While End-2-End Text-to-Speech (TTS) has made significant progresses over the past few years, these ...
We present a new methodological approach which combines both naturally-occurring speech harvested on...