Using semi-supervised EM, we learn fine-grained but sparse lexical parameters of a generative parsing model (a PCFG) initially estimated over the Penn Treebank. Our lexical parameters employ supertags, which encode complex structural information at the pre-terminal level, and are particularly sparse in labeled data; our goal is to learn these for words that are unseen or rare in the labeled data. In order to guide estimation from unlabeled data, we incorporate both structural and lexical priors from the labeled data. We get a large error reduction in parsing ambiguous structures associated with unseen verbs, the most important case of learning lexico-structural dependencies. We also obtain a statistically significant improvement in labeled ...
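The abstract describes re-estimating sparse word-to-supertag parameters with semi-supervised EM, starting from treebank counts and smoothing estimation from unlabeled text toward labeled-data priors. The sketch below illustrates that loop under strong simplifying assumptions: the structural grammar is held fixed, only P(word | supertag) is re-estimated, and a uniform token-level posterior stands in for the parser's inside-outside scores. All names here (semisupervised_em, labeled_counts, candidate_tags, prior_weight) are hypothetical and not taken from the paper.

# Minimal sketch of semi-supervised EM for lexical (word -> supertag) parameters.
from collections import defaultdict

def normalize(counts):
    """Turn a {tag: {word: count}} table into conditional probabilities P(word | tag)."""
    probs = {}
    for tag, words in counts.items():
        total = sum(words.values())
        probs[tag] = {w: c / total for w, c in words.items()} if total > 0 else {}
    return probs

def semisupervised_em(labeled_counts, unlabeled_sentences, candidate_tags,
                      prior_weight=1.0, iterations=5):
    """labeled_counts: {tag: {word: count}} from the treebank (the lexical prior).
    unlabeled_sentences: iterable of token lists.
    candidate_tags: {word: [tags]} giving the supertags each word may take;
                    unseen words are allowed every tag (open-class assumption)."""
    all_tags = list(labeled_counts.keys())
    probs = normalize(labeled_counts)          # initial model estimated from labeled data

    for _ in range(iterations):
        # E-step: expected tag counts for each token under the current model.
        expected = defaultdict(lambda: defaultdict(float))
        for sentence in unlabeled_sentences:
            for word in sentence:
                tags = candidate_tags.get(word, all_tags)
                # Posterior over tags for this token; a full system would use
                # inside-outside scores from the parser instead of this local score.
                scores = {t: probs.get(t, {}).get(word, 1e-6) for t in tags}
                z = sum(scores.values())
                for t, s in scores.items():
                    expected[t][word] += s / z

        # M-step: re-estimate P(word | tag), smoothing toward the labeled-data counts.
        new_counts = defaultdict(lambda: defaultdict(float))
        for tag in all_tags:
            for word, c in labeled_counts[tag].items():
                new_counts[tag][word] += prior_weight * c
            for word, c in expected[tag].items():
                new_counts[tag][word] += c
        probs = normalize(new_counts)
    return probs

In this simplified form, prior_weight controls how strongly the labeled-data counts anchor the re-estimated lexicon, which is one way to keep EM from drifting away from the treebank distribution on frequent words while still learning entries for rare and unseen ones.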
There are many methods to improve performance of statistical parsers. Resolving structural ambiguiti...
This paper demonstrates how unsupervised techniques can be used to learn models of deep linguistic s...
Finding the right representations for words is critical for building accurate NLP systems when domai...
This article uses semi-supervised Expectation Maximization (EM) to learn lexico-syntactic dependenci...
Statistical parsers trained on labeled data suffer from sparsity, both grammatical and lexical. For ...
In this dissertation, we have proposed novel methods for robust parsing that integrate the flexibili...
This thesis focuses on robust analysis of natural language semantics. A primary bottleneck for seman...
Treebank parsing can be seen as the search for an optimally refined grammar consistent with a coarse...
Statistical approaches to language learning typically focus on either short-range syntactic dependen...
A probabilistic model of the structural preferences of open-class words is important for accurate pa...
In this work, we apply statistical learning algorithms to Lexicalized Tree Adjoining Grammar (LTAG) ...
After presenting a novel O(n^3) parsing algorithm for dependency grammar, we develop three contrasti...
This paper proposes a method of constructing an accurate probabilist...
State-of-the-art LSTM language models trained on la...
We first show how a structural locality bias can improve the accuracy of state-of-the-art dependency...