Distilling knowledge from a well-trained cumbersome network into a small one has recently become a new research topic, as lightweight neural networks with high performance are in particularly high demand in resource-restricted systems. This paper addresses the problem of distilling word embeddings for NLP tasks. We propose an encoding approach to distill task-specific knowledge from a set of high-dimensional embeddings, so that we can reduce model complexity by a large margin while retaining high accuracy, achieving a good compromise between efficiency and performance. Experiments show that distilling knowledge from cumbersome embeddings outperforms directly training neural networks with small embeddings.
Feature representation has been one of the most important factors for the success of machine learnin...
We describe a novel approach to generate high-quality lexical word embeddings from an Enhanced Neura...
Word embeddings are the interface between the world of discrete units of text processing and the con...
What is a word embedding? Suppose you have a dictionary of words. The i th word in the dictionary is...
Word embeddings have advanced the state of the art in NLP across numerous tasks. Understanding the c...
Prediction without justification has limited utility. Much of the success of neural models can be at...
Embedding matrices are key components in neural natural language processing (NLP) models that are re...
Research on word representation has always been an important area of interest in the antiquity of Na...
Pre-trained word embeddings encode general word semantics and lexical regularities of natural langua...
Introduction Word embeddings, which are distributed word representations learned by neural language ...
Combining structured information with language models is a standing problem in NLP. Building on prev...
Word embedding is a feature learning technique which aims at mapping words from a vocabulary into ve...
We propose a new neural model for word embeddings, which uses Unitary Matrices as the primary device...
Neural language models learn word representations that capture rich linguistic and conceptual inform...
Many natural language processing applications rely on word representations (also called word embeddi...