By statistical analysis of text in a given language, it is possible to represent each word in the language's vocabulary as an m-dimensional word vector (also known as a word embedding) such that the vector captures semantic and syntactic information. Word embeddings derived from unannotated corpora can be divided into (1) counting methods, which factorize the word-context co-occurrence matrix, and (2) predictive methods, in which neural networks are trained to predict word distributions given a context; the latter generally outperform counting methods. In this thesis, we hypothesize that the performance gap is due to how counting methods account for – or completely ignore – negative information: word-context pairs where observing...
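As a concrete illustration of the counting branch mentioned above, the following is a minimal sketch (assuming an invented toy corpus, window size, and dimensionality, none of which come from the thesis) of a typical pipeline: build a word-context co-occurrence matrix, reweight it with positive PMI, and factorize it with SVD. Note how zero-count pairs, the negative information the abstract refers to, are simply left at zero here.

```python
import numpy as np
from itertools import chain

# Toy corpus and hyperparameters (illustrative assumptions, not from the thesis).
corpus = [["the", "cat", "sat", "on", "the", "mat"],
          ["the", "dog", "sat", "on", "the", "rug"]]
window = 2   # symmetric context window
m = 2        # embedding dimensionality

vocab = sorted(set(chain.from_iterable(corpus)))
idx = {w: i for i, w in enumerate(vocab)}

# Word-context co-occurrence counts within the window.
counts = np.zeros((len(vocab), len(vocab)))
for sent in corpus:
    for i, w in enumerate(sent):
        for j in range(max(0, i - window), min(len(sent), i + window + 1)):
            if j != i:
                counts[idx[w], idx[sent[j]]] += 1.0

# Positive PMI reweighting; unseen (zero-count) pairs, i.e. the negative
# information, simply stay at zero.
total = counts.sum()
p_w = counts.sum(axis=1, keepdims=True) / total
p_c = counts.sum(axis=0, keepdims=True) / total
with np.errstate(divide="ignore", invalid="ignore"):
    pmi = np.log((counts / total) / (p_w * p_c))
ppmi = np.where(counts > 0, np.maximum(pmi, 0.0), 0.0)

# Factorize the reweighted matrix; rows of `vectors` are m-dimensional word vectors.
u, s, _ = np.linalg.svd(ppmi)
vectors = u[:, :m] * s[:m]
print(vectors.shape)   # (|V|, m)
```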
One of the most famous authors of the method is Tomas Mikolov. His software and method of theoretica...
Representing words with semantic distributions to create ML models is a widely used technique to per...
We propose a new word embedding model, inspired by GloVe, which is formulated as a feasible least sq...
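The abstract's own least-squares formulation is truncated here, so the sketch below only illustrates the standard GloVe weighted least-squares objective that the proposed model is said to be inspired by; the toy matrices and the x_max and alpha values are illustrative assumptions, not the paper's settings.

```python
import numpy as np

def glove_loss(X, W, W_tilde, b, b_tilde, x_max=100.0, alpha=0.75):
    """Sum of f(X_ij) * (w_i . w~_j + b_i + b~_j - log X_ij)^2 over observed pairs."""
    loss = 0.0
    rows, cols = np.nonzero(X)                     # only non-zero co-occurrences
    for i, j in zip(rows, cols):
        f = min((X[i, j] / x_max) ** alpha, 1.0)   # GloVe weighting function
        err = W[i] @ W_tilde[j] + b[i] + b_tilde[j] - np.log(X[i, j])
        loss += f * err ** 2
    return loss

rng = np.random.default_rng(0)
V, m = 5, 3
X = rng.integers(0, 4, size=(V, V)).astype(float)   # toy co-occurrence counts
W, W_t = rng.normal(size=(V, m)), rng.normal(size=(V, m))
b, b_t = np.zeros(V), np.zeros(V)
print(glove_loss(X, W, W_t, b, b_t))
```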
Despite the recent popularity of contextual word embeddings, static word embeddings still dominate l...
The digital era floods us with an excessive amount of text data. To make sense of such data automati...
Natural Language Processing has gone through substantial changes over time. It was only recently tha...
Distributional models of semantics learn word meanings from contextual co‐occurrence patterns across...
The evolution of the Internet and the Web has given rise to a vast amount of text messages containin...
Word embedding algorithms produce very reliable feature representations of words that are used by ne...
Paper presented at: 2017 Conference on Empirical Methods in Natural Language Processing, held...
Distributional semantics has been revolutionized by neural-based word embeddings methods such as wor...
Traditional natural language processing has been shown to have excessive reliance on human-annotated...
Word embedding is a feature learning technique which aims at mapping words from a vocabulary into ve...
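As a minimal, purely illustrative sketch of that mapping, the snippet below pairs an invented toy vocabulary with randomly initialized vectors and compares two words by cosine similarity; in practice the vectors would be learned from a corpus rather than drawn at random.

```python
import numpy as np

vocab = ["king", "queen", "apple", "orange"]
rng = np.random.default_rng(42)
E = rng.normal(size=(len(vocab), 4))        # embedding matrix: one row per word
word_to_row = {w: i for i, w in enumerate(vocab)}

def embed(word):
    """Look up the vector for a word in the vocabulary."""
    return E[word_to_row[word]]

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Similarity between two words is the cosine similarity of their vectors.
print(cosine(embed("king"), embed("queen")))
```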
Numerical vector representations are able to represent anything from words to meanings in a low-dimensional ...