Sharing audio content on online platforms has become increasingly popular, which calls for techniques to better organize and retrieve this data. In this thesis, we aim to improve audio retrieval through content and metadata categorization in the context of Freesound. For content, we focus on organization through morphological description. In particular, we propose a taxonomy and a thresholding-based classification approach for loudness profiles. The approach can be generalized to structure information about the temporal evolution of other sound attributes. To this end, we also discuss our preliminary findings from extending this methodology to pitch profiles. On the other hand, metadata systematization has been approached ...
The rapid adoption of Internet and web technologies has created an opportunity for making music coll...
A fundamental and general representation of audio and music which integrates multi-modal data source...
Paper presented at the 2016 IEEE International Symposium on Multimedia, held from 11 to 13...
This paper presents an in-depth study of the social tagging mechanisms used in Freesound.org, an onl...
A typical content-based audio management system deals with three aspects, namely audio segmentation a...
Paper presented at the 6th Sound and Music Computing Conference, held from 23 to 25 ...
A new algorithm for content-based audio information retrieval is introduced in this work. Assuming t...
We propose a method for automatic fine-scale audio description that draws inspiration from ontologic...
We introduce a modified version of the acoustic topic model, which assumes an audio signal ...
New ways of producing information, knowledge, and culture through social, rather than proprietary re...
This paper presents an overview of audio indexing, which has emerged very recently as a research top...
We investigate the use of perceptually-modelled descriptors of timbre for browsing large collections...