This paper reports on two experiments with a probabilistic part-of-speech tagger, trained on a tagged corpus of written Swedish, being used to tag a corpus of (transcribed) spoken Swedish. The results indicate that with very little adaptations an accuracy rate of 85 % can be achieved, with an accuracy rate for known words of 90%. In addition, two dierent treatments of pauses were explored but with no signi cant gain in accuracy under either condition.
In this paper we present a part-of-speech tagging system based on a structural language model learnt...
International audienceThe aim of our paper is to study the interest of part of speech (POS) tagging ...
The aim of the thesis is to propose a tagging system for a learner corpus of spoken English which wo...
This paper describes an evaluation of five data-driven part-of-speech (PoS) taggers for spoken Norwe...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
State-of-the-art statistical part-of-speech taggers mainly use information on tag bi- or trigrams, d...
In this paper a data-driven method for Part-of-Speech tagging not using any n-grams of tags is prese...
The Talko corpus of Swedish spoken in Finland is a new research tool consisting of audio files li...
In this paper we present some experiments on the use of a probabilistic model to tag English text, i...
This paper reports the ongoing work of producing a state of the art part of speech tagger for unedit...
Part-Of-Speech (POS) tagging is the process of marking-up the words in a text with their correspondi...
The field of Part of Speech (POS) tagging has made slow but steady progress during the last decade, ...
Statistical n-gram taggers like that of [Church 1988] or [Foster 1991] assign a part-of-speech label...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
Part of Speech (POS) tagging is an essential part of text processing applications. A POS tagger assi...
In this paper we present a part-of-speech tagging system based on a structural language model learnt...
International audienceThe aim of our paper is to study the interest of part of speech (POS) tagging ...
The aim of the thesis is to propose a tagging system for a learner corpus of spoken English which wo...
This paper describes an evaluation of five data-driven part-of-speech (PoS) taggers for spoken Norwe...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
State-of-the-art statistical part-of-speech taggers mainly use information on tag bi- or trigrams, d...
In this paper a data-driven method for Part-of-Speech tagging not using any n-grams of tags is prese...
The Talko corpus of Swedish spoken in Finland is a new research tool consisting of audio files li...
In this paper we present some experiments on the use of a probabilistic model to tag English text, i...
This paper reports the ongoing work of producing a state of the art part of speech tagger for unedit...
Part-Of-Speech (POS) tagging is the process of marking-up the words in a text with their correspondi...
The field of Part of Speech (POS) tagging has made slow but steady progress during the last decade, ...
Statistical n-gram taggers like that of [Church 1988] or [Foster 1991] assign a part-of-speech label...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
Part of Speech (POS) tagging is an essential part of text processing applications. A POS tagger assi...
In this paper we present a part-of-speech tagging system based on a structural language model learnt...
International audienceThe aim of our paper is to study the interest of part of speech (POS) tagging ...
The aim of the thesis is to propose a tagging system for a learner corpus of spoken English which wo...