Neural network language models (NNLMs) have attracted considerable attention in recent years. In this paper, we present a training method that incrementally trains the hierarchical softmax function for NNLMs. We split the cost function to model the old corpus and the update corpus separately, and factorize the objective function for the hierarchical softmax. We then provide a new stochastic-gradient-based method that updates all the word vectors and parameters by comparing the old tree, built from the old corpus, against the new tree, built from the combined (old plus update) corpus. Theoretical analysis shows that the mean square error of the parameter vectors can be bounded by a function of the number of changed words related to the parameter node. Exper...
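As context for the tree-based updates described above, the following is a minimal sketch of a hierarchical-softmax output layer and its stochastic-gradient step over a fixed binary Huffman tree. It is not the paper's incremental algorithm; the class and method names (`HierarchicalSoftmax`, `inner_vecs`, `log_prob`, `sgd_step`) and the hyperparameters are illustrative assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class HierarchicalSoftmax:
    """Sketch of a hierarchical-softmax output layer over a binary
    Huffman tree; one parameter vector per inner node of the tree."""

    def __init__(self, n_inner_nodes, dim, rng=None):
        if rng is None:
            rng = np.random.default_rng(0)
        self.inner_vecs = rng.normal(0.0, 0.01, size=(n_inner_nodes, dim))

    def log_prob(self, h, path, codes):
        """log p(w | h): product of branch sigmoids along w's root-to-leaf path.

        h     -- hidden/context vector, shape (dim,)
        path  -- indices of the inner nodes on the path to word w
        codes -- 0/1 branch decisions at each node (Huffman code of w)
        """
        scores = self.inner_vecs[path] @ h
        # code 1 -> sigma(s); code 0 -> 1 - sigma(s) = sigma(-s)
        signs = np.where(np.asarray(codes) == 1, 1.0, -1.0)
        return np.sum(np.log(sigmoid(signs * scores)))

    def sgd_step(self, h, path, codes, lr=0.025):
        """One stochastic-gradient ascent step on log p(w | h) for the
        inner-node vectors; returns the gradient w.r.t. h for lower layers."""
        scores = self.inner_vecs[path] @ h
        signs = np.where(np.asarray(codes) == 1, 1.0, -1.0)
        # d/dx log sigma(s * x) = s * (1 - sigma(s * x))
        g = signs * (1.0 - sigmoid(signs * scores))
        grad_h = g @ self.inner_vecs[path]
        self.inner_vecs[path] += lr * g[:, None] * h
        return grad_h

if __name__ == "__main__":
    # Tiny demo with a hypothetical 3-node path through the tree.
    hs = HierarchicalSoftmax(n_inner_nodes=7, dim=16)
    h = np.random.default_rng(1).normal(size=16)
    print(hs.log_prob(h, path=[0, 1, 4], codes=[1, 0, 1]))
    hs.sgd_step(h, path=[0, 1, 4], codes=[1, 0, 1])
```

The point of the decomposition is that each word's probability factors into O(log |V|) branch decisions, so an update touches only the inner-node vectors on that word's path; these path vectors are exactly what an incremental method must reconcile when the old and new Huffman trees disagree.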
We present several modifications of the original recurrent neural network language model (R...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Conventional Neural Network (NN) training is done by introducing training patterns in the full input...
This paper presents a new method to reduce the computational cost when using Neural Networks as Lang...
Recent research has pointed to a limitation of word-level neural language models with softmax output...
It is now widely acknowledged that neural network language models outperform backoff language models in a...
Deep neural networks of sizes commonly encountered in practice are proven to c...
In spite of their superior performance, neural probabilistic language models (NPLMs) remain far less...
This work addresses the topic of neural language model acceleration. The aim of this work is to optim...
Virtually any modern speech recognition system relies on count-based language models. In this thesis...
For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs) to...
In this paper, based on an asymptotic analysis of the Softmax layer, we show that when training neur...
This paper presents a framework in which hierarchical softmax is used to create a global hierarchica...
Neural network language models (NNLMs) have achieved ever-improving accuracy due to more sophisticat...