For training a few-shot keyword spotting (FS-KWS) model, a large labeled dataset containing massive target keywords has known to be essential to generalize to arbitrary target keywords with only a few enrollment samples. To alleviate the expensive data collection with labeling, in this paper, we propose a novel FS-KWS system trained only on synthetic data. The proposed system is based on metric learning enabling target keywords to be detected using distance metrics. Exploiting the speech synthesis model that generates speech with pseudo phonemes instead of texts, we easily obtain a large collection of multi-view samples with the same semantics. These samples are sufficient for training, considering metric learning does not intrinsically nec...
This paper introduces a generalized few-shot segmentation framework with a straightforward training ...
Few-shot bioacoustic event detection is a task that detects the occurrence time of a novel sound giv...
Thesis (Master's)--University of Washington, 2021As more electronic devices have an on-device Keywor...
Recognizing a particular command or a keyword, keyword spotting has been widely used in many voice i...
User-defined keyword spotting is a task to detect new spoken terms defined by users. This can be vie...
In recent years, the development of accurate deep keyword spotting (KWS) models has resulted in KWS ...
This study presents a novel zero-shot user-defined keyword spotting model that utilizes the audio-ph...
Within the audio research community and the industry, keyword spotting (KWS) and audio tagging (AT) ...
Ons B., Gemmeke J.F., Van hamme H., ''NMF-based keyword learning from scarce data'', Automatic speec...
Building a small memory footprint keyword spotting model is important as it typically runs on mobil...
Voice trigger detection is an important task, which enables activating a voice assistant when a targ...
In this paper, we introduce a massively multilingual speech corpora with fine-grained phonemic trans...
In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnabl...
Voice assistants like Siri, Google Assistant, Alexa etc. are used widely across the globe for home a...
Few-shot Learning (FSL) is aimed to make predictions based on a limited number of samples. Structure...
This paper introduces a generalized few-shot segmentation framework with a straightforward training ...
Few-shot bioacoustic event detection is a task that detects the occurrence time of a novel sound giv...
Thesis (Master's)--University of Washington, 2021As more electronic devices have an on-device Keywor...
Recognizing a particular command or a keyword, keyword spotting has been widely used in many voice i...
User-defined keyword spotting is a task to detect new spoken terms defined by users. This can be vie...
In recent years, the development of accurate deep keyword spotting (KWS) models has resulted in KWS ...
This study presents a novel zero-shot user-defined keyword spotting model that utilizes the audio-ph...
Within the audio research community and the industry, keyword spotting (KWS) and audio tagging (AT) ...
Ons B., Gemmeke J.F., Van hamme H., ''NMF-based keyword learning from scarce data'', Automatic speec...
Building a small memory footprint keyword spotting model is important as it typically runs on mobil...
Voice trigger detection is an important task, which enables activating a voice assistant when a targ...
In this paper, we introduce a massively multilingual speech corpora with fine-grained phonemic trans...
In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnabl...
Voice assistants like Siri, Google Assistant, Alexa etc. are used widely across the globe for home a...
Few-shot Learning (FSL) is aimed to make predictions based on a limited number of samples. Structure...
This paper introduces a generalized few-shot segmentation framework with a straightforward training ...
Few-shot bioacoustic event detection is a task that detects the occurrence time of a novel sound giv...
Thesis (Master's)--University of Washington, 2021As more electronic devices have an on-device Keywor...