In this thesis we look at how we can develop automated analysis tools for Norwegian text. We look at 3 different tasks: Part-of-Speech (PoS) tagging, Named-Entity Chunking (NEC), and Named-Entity Recognition (NER). For our work on PoS tagging, we extend the work done on the OBT+Stat tagger by training a new model to allow it to also do disambiguation of Nynorsk. We work with Googles SyntaxNet and train it for PoS tagging of Bokmål and Nynorsk, showing state of the art results at the time of the research. We train a Support Vector Machine for NEC of Bokmål. The task of extracting names from text. Next, we develop a NER model using deep learning and provide a NER sequence tagger for Bokmål and Nynorsk. The Nynorsk tagger is the first NER mode...
In some languages, Named Entity Recognition (NER) is severely hindered by complex linguistic structu...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
This paper describes how a preexisting Constraint Grammar based parser for Danish (DanGram, Bick 200...
Named-Entity Chunking is part of the Named-Entity Recognition (NER) process and is the task of ident...
This paper presents NorNE, a manually annotated corpus of named entities which extends the annotatio...
We use Google’s open source neural network framework, SyntaxNet, to train a fully automatic part-of-...
We use Google’s open source neural network framework, SyntaxNet, to train a fully automatic part-of-...
This paper describes an evaluation of five data-driven part-of-speech (PoS) taggers for spoken Norwe...
Named entity recognition is a complex but rewarding task with a number of obvious applications- sema...
We use Google’s open source neural network framework, SyntaxNet, to train a fully automatic part-of-...
This thesis presents a systematic, empirical investigation of how an existing PoS tag set can be mod...
We analyze neural network architectures that yield state of the art results on named entity recognit...
1. Corpora and annotation tools Named entity recognition is a complex but rewarding task with a numb...
This paper describes the classifier part of a named entity recogniser for Norwegian which uses memor...
In some languages, Named Entity Recognition (NER) is severely hindered by complex linguistic structu...
In some languages, Named Entity Recognition (NER) is severely hindered by complex linguistic structu...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
This paper describes how a preexisting Constraint Grammar based parser for Danish (DanGram, Bick 200...
Named-Entity Chunking is part of the Named-Entity Recognition (NER) process and is the task of ident...
This paper presents NorNE, a manually annotated corpus of named entities which extends the annotatio...
We use Google’s open source neural network framework, SyntaxNet, to train a fully automatic part-of-...
We use Google’s open source neural network framework, SyntaxNet, to train a fully automatic part-of-...
This paper describes an evaluation of five data-driven part-of-speech (PoS) taggers for spoken Norwe...
Named entity recognition is a complex but rewarding task with a number of obvious applications- sema...
We use Google’s open source neural network framework, SyntaxNet, to train a fully automatic part-of-...
This thesis presents a systematic, empirical investigation of how an existing PoS tag set can be mod...
We analyze neural network architectures that yield state of the art results on named entity recognit...
1. Corpora and annotation tools Named entity recognition is a complex but rewarding task with a numb...
This paper describes the classifier part of a named entity recogniser for Norwegian which uses memor...
In some languages, Named Entity Recognition (NER) is severely hindered by complex linguistic structu...
In some languages, Named Entity Recognition (NER) is severely hindered by complex linguistic structu...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
This paper describes how a preexisting Constraint Grammar based parser for Danish (DanGram, Bick 200...