International audienceMuch audio content today is rendered as a static stereo mix: fundamentally a fixed single entity. Object-based audio envisages the delivery of sound content using a collection of individual sound ‘objects’ controlled by accompanying metadata. This offers potential for audio to be delivered in a dynamic manner providing enhanced audio for consumers. One example of such treatment is the concept of applying varying levels of data compression to sound objects thereby reducing the volume of data to be transmitted in limited bandwidth situations. This application motivates the ability to accurately classify objects in terms of their ‘hierarchy’. That is, whether or not an object is a foreground sound, which should be reprodu...
Labeling audio material to train classifiers comes with a large amount of human labor. In this pape...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
This paper presents a series of experiments to determine a categorization framework for broadcast au...
Much audio content today is rendered as a static stereo mix: fundamentally a fixed single entity. Ob...
With the advent of new audio delivery technologies comes opportunities and challenges for content cr...
The objective of the thesis is to develop techniques that optimize the performances of sound event d...
Advancements in broadcast technology are granting new opportunities to improve listening experiences...
1. This paper presents an active learning framework for the classification of one-minute audio-recor...
Coping with scarcity of labeled data is a common problem in sound classification tasks. Approaches f...
Audio classification tasks like speech recognition and acoustic scene analysis require substantial l...
This paper proposes a novel active learning method to save annotation effort when preparing material...
This paper proposes an active learning method to control a labeling process for efficient annotation...
There is growing recognition of the importance of data-centric methods for building machine learning...
Object-based audio presents the opportunity to optimise audio reproduction for different listening s...
<p>There are 16,930 sound instances in our database with durations ranging 242 from 1 to 10 seconds,...
Labeling audio material to train classifiers comes with a large amount of human labor. In this pape...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
This paper presents a series of experiments to determine a categorization framework for broadcast au...
Much audio content today is rendered as a static stereo mix: fundamentally a fixed single entity. Ob...
With the advent of new audio delivery technologies comes opportunities and challenges for content cr...
The objective of the thesis is to develop techniques that optimize the performances of sound event d...
Advancements in broadcast technology are granting new opportunities to improve listening experiences...
1. This paper presents an active learning framework for the classification of one-minute audio-recor...
Coping with scarcity of labeled data is a common problem in sound classification tasks. Approaches f...
Audio classification tasks like speech recognition and acoustic scene analysis require substantial l...
This paper proposes a novel active learning method to save annotation effort when preparing material...
This paper proposes an active learning method to control a labeling process for efficient annotation...
There is growing recognition of the importance of data-centric methods for building machine learning...
Object-based audio presents the opportunity to optimise audio reproduction for different listening s...
<p>There are 16,930 sound instances in our database with durations ranging 242 from 1 to 10 seconds,...
Labeling audio material to train classifiers comes with a large amount of human labor. In this pape...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
This paper presents a series of experiments to determine a categorization framework for broadcast au...