We examine the role of character patterns in three tasks: morphological analysis, lemmatization and copy. We use a modified version of the standard sequence-to-sequence model, where the encoder is a pattern matching network. Each pattern scores all possible N character long subwords (substrings) on the source side, and the highest scoring subword’s score is used to initialize the decoder as well as the input to the attention mechanism. This method allows learning which subwords of the input are important for generating the output. By training the models on the same source but different target, we can compare what subwords are important for different tasks and how they relate to each other. We define a similarity metric, a generalized form o...
The problem of morphological ambiguity is central to many natural language processing tasks. In part...
The persistent efforts to make valuable annotated corpora in more diverse, morphologically rich lang...
�� 2018 The Authors. Published by Association for Computational Linguistics. This is an open access ...
Morphology is the study of how words are composed of smaller units of meaning (morphemes). It allow...
This paper describes our submission to SIGMORPHON 2019 Task 2: Morphological analysis and lemmatizat...
This paper describes our submission to SIGMORPHON 2019 Task 2: Morphological analysis and lemmatizat...
A core issue that hampers development and use of language technology for underresourced and morpholo...
This paper presents the submissions by the University of Zurich to the SIGMORPHON 2017 shared task o...
International audienceAnalogical proportions are statements expressed in the form "A is to B as C is...
Morphological segmentation, which aims to break words into meaning-bearing morphemes, is an importan...
We present a neural transition-based model that uses a simple set of edit actions (copy, delete, ins...
This thesis addresses some of the challenges of translating morphologically rich languages (MRLs). W...
In this paper, we present a novel lemmatization method based on a sequence-to-sequence neural networ...
This paper describes the Stockholm University/University of Groningen (SU-RUG) system for the SIGMOR...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
The problem of morphological ambiguity is central to many natural language processing tasks. In part...
The persistent efforts to make valuable annotated corpora in more diverse, morphologically rich lang...
�� 2018 The Authors. Published by Association for Computational Linguistics. This is an open access ...
Morphology is the study of how words are composed of smaller units of meaning (morphemes). It allow...
This paper describes our submission to SIGMORPHON 2019 Task 2: Morphological analysis and lemmatizat...
This paper describes our submission to SIGMORPHON 2019 Task 2: Morphological analysis and lemmatizat...
A core issue that hampers development and use of language technology for underresourced and morpholo...
This paper presents the submissions by the University of Zurich to the SIGMORPHON 2017 shared task o...
International audienceAnalogical proportions are statements expressed in the form "A is to B as C is...
Morphological segmentation, which aims to break words into meaning-bearing morphemes, is an importan...
We present a neural transition-based model that uses a simple set of edit actions (copy, delete, ins...
This thesis addresses some of the challenges of translating morphologically rich languages (MRLs). W...
In this paper, we present a novel lemmatization method based on a sequence-to-sequence neural networ...
This paper describes the Stockholm University/University of Groningen (SU-RUG) system for the SIGMOR...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
The problem of morphological ambiguity is central to many natural language processing tasks. In part...
The persistent efforts to make valuable annotated corpora in more diverse, morphologically rich lang...
�� 2018 The Authors. Published by Association for Computational Linguistics. This is an open access ...