Speaker tracking is the process of following who says something in an audio stream. In the case the audio stream is a recording of broadcast news, speaker identity can be an important meta-data for building digital libraries. moreover, the segmentation and classification of the audio stream in terms of acoustic contents, bandwidth and speaker gender allow to filter out portions of the signal which do not contain speech and to improve transcription accuracy through the use of condition-dependent acoustic models and adaptation techniques. In this paper, the problem of automatic speaker tracking in a corpus of Italian broadcast news is investigated. A 81.9% frame classification accuracy is achieved on a 1h:15m test set, in terms of 37 named s...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
International audienceIn this paper, we consider the issue of speaker identification within audio re...
It is often important to be able to automatically label `who spoke when' during some audio data...
This paper presents some recent improvements in automatic transcription of Italian broadcast news ob...
This work presents first results in segmenting and classifying an Italian audio broadcast news corpu...
This work reports on preliminary activity at ITC-irst on the problem of acoustic segmentation, class...
In this paper, we present a first approach to build an automatic system for broadcast news speaker-b...
This paper presents the first achievements in the development of a broadcast news transcription syst...
One rapidly expanding application area for state-of-the-art speech recognition technology is the au...
A system for speaker tracking in broadcast-news audio data is presented and the impacts of the main ...
The identity of persons in audiovisual documents represents very important semantic information for ...
La motivation de cette thèse est de développer des méthodologies et des algorithmes qui utilisent l'...
This paper addresses the problem of real time speaker change detection and speaker tracking in broad...
This paper investigates the issue of automatic segmentation of speech recordings for broadcast news ...
Abstract. This paper addresses the problem of real-time speaker segmentation and speaker tracking in...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
International audienceIn this paper, we consider the issue of speaker identification within audio re...
It is often important to be able to automatically label `who spoke when' during some audio data...
This paper presents some recent improvements in automatic transcription of Italian broadcast news ob...
This work presents first results in segmenting and classifying an Italian audio broadcast news corpu...
This work reports on preliminary activity at ITC-irst on the problem of acoustic segmentation, class...
In this paper, we present a first approach to build an automatic system for broadcast news speaker-b...
This paper presents the first achievements in the development of a broadcast news transcription syst...
One rapidly expanding application area for state-of-the-art speech recognition technology is the au...
A system for speaker tracking in broadcast-news audio data is presented and the impacts of the main ...
The identity of persons in audiovisual documents represents very important semantic information for ...
La motivation de cette thèse est de développer des méthodologies et des algorithmes qui utilisent l'...
This paper addresses the problem of real time speaker change detection and speaker tracking in broad...
This paper investigates the issue of automatic segmentation of speech recordings for broadcast news ...
Abstract. This paper addresses the problem of real-time speaker segmentation and speaker tracking in...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
International audienceIn this paper, we consider the issue of speaker identification within audio re...
It is often important to be able to automatically label `who spoke when' during some audio data...