Convolutional neural networks (CNN) and Transformer have wildly succeeded in multimedia applications. However, more effort needs to be made to harmonize these two architectures effectively to satisfy speech enhancement. This paper aims to unify these two architectures and presents a Parallel Conformer for speech enhancement. In particular, the CNN and the self-attention (SA) in the Transformer are fully exploited for local format patterns and global structure representations. Based on the small receptive field size of CNN and the high computational complexity of SA, we specially designed a multi-branch dilated convolution (MBDC) and a self-channel-time-frequency attention (Self-CTFA) module. MBDC contains three convolutional layers with dif...
Abstract: In recent years, Convolutional Neural Network (CNN) has been widely applied in speech/imag...
Convolution-augmented transformers (Conformers) are recently proposed in various speech-domain appli...
Speech dereverberation is an important stage in many speech technology applications. Recent work in ...
Convolutional neural networks (CNN) and Transformer have wildly succeeded in multimedia applications...
Acquiring speech signal in real-world environment is always accompanied by various ambient noises, w...
Deep learning algorithm are increasingly used for speech enhancement (SE). In supervised methods, gl...
Optimization of modern ASR architectures is among the highest priority tasks since it saves many com...
The Transformer architecture model, based on self-attention and multi-head attention, has achieved r...
Speech enhancement (SE) is a critical aspect of various speech-processing applications. Recent resea...
Deep neural network solutions have emerged as a new and powerful paradigm for speech enhancement (SE...
In this paper, we present Multi-scale Feature Aggregation Conformer (MFA-Conformer), an easy-to-impl...
Convolutional encoder-decoder (CED) has emerged as a powerful architecture, particularly in speech e...
3D speech enhancement can effectively improve the auditory experience and plays a crucial role in au...
Conformer has proven to be effective in many speech processing tasks. It combines the benefits of ex...
Self-attention-based networks have obtained impressive performance in parallel training and global c...
Abstract: In recent years, Convolutional Neural Network (CNN) has been widely applied in speech/imag...
Convolution-augmented transformers (Conformers) are recently proposed in various speech-domain appli...
Speech dereverberation is an important stage in many speech technology applications. Recent work in ...
Convolutional neural networks (CNN) and Transformer have wildly succeeded in multimedia applications...
Acquiring speech signal in real-world environment is always accompanied by various ambient noises, w...
Deep learning algorithm are increasingly used for speech enhancement (SE). In supervised methods, gl...
Optimization of modern ASR architectures is among the highest priority tasks since it saves many com...
The Transformer architecture model, based on self-attention and multi-head attention, has achieved r...
Speech enhancement (SE) is a critical aspect of various speech-processing applications. Recent resea...
Deep neural network solutions have emerged as a new and powerful paradigm for speech enhancement (SE...
In this paper, we present Multi-scale Feature Aggregation Conformer (MFA-Conformer), an easy-to-impl...
Convolutional encoder-decoder (CED) has emerged as a powerful architecture, particularly in speech e...
3D speech enhancement can effectively improve the auditory experience and plays a crucial role in au...
Conformer has proven to be effective in many speech processing tasks. It combines the benefits of ex...
Self-attention-based networks have obtained impressive performance in parallel training and global c...
Abstract: In recent years, Convolutional Neural Network (CNN) has been widely applied in speech/imag...
Convolution-augmented transformers (Conformers) are recently proposed in various speech-domain appli...
Speech dereverberation is an important stage in many speech technology applications. Recent work in ...