Data augmentation is a valuable tool for the design of deep learning systems to overcome data limitations and stabilize the training process. Especially in the medical domain, where the collection of large-scale data sets is challenging and expensive due to limited access to patient data, relevant environments, as well as strict regulations, community-curated large-scale public datasets, pretrained models, and advanced data augmentation methods are the main factors for developing reliable systems to improve patient care. However, for the development of medical acoustic sensing systems, an emerging field of research, the community lacks large-scale publicly available data sets and pretrained models. To address the problem of limited data, we...
Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using G...
Audio-visual emotion recognition is the research of identifying human emotional states by combining ...
In audio processing applications, the generation of expressive sounds based on high-level representa...
Data augmentation is a valuable tool for the design of deep learning systems to overcome data limita...
In this work, we propose a novel data augmentation method for clinical audio datasets based on a con...
The Generation power of Generative Adversarial Neural Networks (GANs) has shown great promise to lea...
Data augmentation has proven to be effective in training neural networks. Recently, a method called ...
One of the frontier issues that severely hamper the development of automatic snore sound classificat...
The era of machine learning has opened up groundbreaking realities and opportunities in the field of...
Computational recognition of human emotion using Deep Learning techniques requires learning from lar...
Generative Adversarial Networks (GANs) are a revolutionary innovation in machine learning that enabl...
Speech enhancement improves recorded voice utterances to eliminate noise that might be impeding thei...
Objectives: Development of accurate auditory attention decoding (AAD) algorithms, capable of identif...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Lossy audio codecs compress (and decompress) digital audio streams by removing information that tend...
Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using G...
Audio-visual emotion recognition is the research of identifying human emotional states by combining ...
In audio processing applications, the generation of expressive sounds based on high-level representa...
Data augmentation is a valuable tool for the design of deep learning systems to overcome data limita...
In this work, we propose a novel data augmentation method for clinical audio datasets based on a con...
The Generation power of Generative Adversarial Neural Networks (GANs) has shown great promise to lea...
Data augmentation has proven to be effective in training neural networks. Recently, a method called ...
One of the frontier issues that severely hamper the development of automatic snore sound classificat...
The era of machine learning has opened up groundbreaking realities and opportunities in the field of...
Computational recognition of human emotion using Deep Learning techniques requires learning from lar...
Generative Adversarial Networks (GANs) are a revolutionary innovation in machine learning that enabl...
Speech enhancement improves recorded voice utterances to eliminate noise that might be impeding thei...
Objectives: Development of accurate auditory attention decoding (AAD) algorithms, capable of identif...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Lossy audio codecs compress (and decompress) digital audio streams by removing information that tend...
Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using G...
Audio-visual emotion recognition is the research of identifying human emotional states by combining ...
In audio processing applications, the generation of expressive sounds based on high-level representa...