Text-to-Speech (TTS) normalization is an essential component of natural language processing (NLP) that plays a crucial role in the production of natural-sounding synthesized speech. However, there are limitations to the TTS normalization procedure. Lengthy input sequences and variations in spoken language can present difficulties. The motivation behind this research is to address the challenges associated with TTS normalization by evaluating and comparing the performance of various models. The aim is to determine their effectiveness in handling language variations. The models include LSTM-GRU, Transformer, GCN-Transformer, GCNN-Transformer, Reformer, and a BERT language model that has been pre-trained. The research evaluates the performance...
The paper directly compares two versions of a medical speech translation system, one with a grammar ...
We evaluate three simple, normalization-centric changes to improve Transformer training. First, we s...
There are about 7,000 languages spoken today in the world. However, most natural language processing...
Text-to-speech synthesis (TTS) can be split into two steps: the preprocessor, which takes input text...
Text normalization methods have been commonly applied to historical language or user-generated conte...
Attention mechanism is one of the most successful techniques in deep learning based Natural Languag...
State-of-the-art text-to-speech (TTS) systems have utilized pretrained language models (PLMs) to enh...
Training language model made from conversational speech is difficult due to large variation of the w...
Collecting sufficient language model training data for good speech recognition performance in a new ...
We present BLESS, a comprehensive performance benchmark of the most recent state-of-the-art Large La...
With the emergence of Social media and its growing popularity, there has been substantial growth in ...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
In this article, three adaptation methods are compared based on how well they change the speaking st...
One of the main characteristics of social media data is the use of non-standard language. Since NLP ...
As a result of advancement in deep learning and neural network technology, end-to-end models have be...
The paper directly compares two versions of a medical speech translation system, one with a grammar ...
We evaluate three simple, normalization-centric changes to improve Transformer training. First, we s...
There are about 7,000 languages spoken today in the world. However, most natural language processing...
Text-to-speech synthesis (TTS) can be split into two steps: the preprocessor, which takes input text...
Text normalization methods have been commonly applied to historical language or user-generated conte...
Attention mechanism is one of the most successful techniques in deep learning based Natural Languag...
State-of-the-art text-to-speech (TTS) systems have utilized pretrained language models (PLMs) to enh...
Training language model made from conversational speech is difficult due to large variation of the w...
Collecting sufficient language model training data for good speech recognition performance in a new ...
We present BLESS, a comprehensive performance benchmark of the most recent state-of-the-art Large La...
With the emergence of Social media and its growing popularity, there has been substantial growth in ...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
In this article, three adaptation methods are compared based on how well they change the speaking st...
One of the main characteristics of social media data is the use of non-standard language. Since NLP ...
As a result of advancement in deep learning and neural network technology, end-to-end models have be...
The paper directly compares two versions of a medical speech translation system, one with a grammar ...
We evaluate three simple, normalization-centric changes to improve Transformer training. First, we s...
There are about 7,000 languages spoken today in the world. However, most natural language processing...