We propose a framework to modularize the training of neural language models that use diverse forms of sentence-external context (including metadata) by eliminating the need to jointly train sentence-external and within-sentence encoders. Our approach, contextual universal embeddings (CUE), trains LMs on one set of contextual signals, such as date and author, and adapts to novel metadata types, such as article title or previous sentence. The model consists of a pretrained neural sentence LM, a BERT-based context encoder, and a masked transformer decoder that estimates LM probabilities using sentence-internal and sentence-external information. When context or metadata are unavailable, our model learns to combine contextual and sentence-internal information...
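To make the described architecture concrete, the following is a minimal, hypothetical sketch of how a masked (causal) transformer decoder can combine sentence-internal token history with a sentence-external context vector via cross-attention. The stand-in modules, the dimensions, and the small `ctx_proj` adapter are illustrative assumptions, not the authors' implementation; in the paper the context vector would come from a BERT-based encoder over metadata such as date, author, or the previous sentence.

```python
# Minimal sketch of the CUE idea (illustrative only, not the authors' code).
# Assumptions: the pretrained sentence LM and BERT-style context encoder are
# replaced by small randomly initialized stand-ins; sizes are arbitrary.
import torch
import torch.nn as nn

class CUEDecoder(nn.Module):
    def __init__(self, vocab_size=1000, d_model=256, d_ctx=768, n_layers=2, n_heads=4):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        # Hypothetical adapter: maps an external context embedding into the
        # decoder's space; only a small module like this would need adapting
        # when a novel metadata type is introduced.
        self.ctx_proj = nn.Linear(d_ctx, d_model)
        layer = nn.TransformerDecoderLayer(d_model, n_heads, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, n_layers)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, token_ids, ctx_embedding):
        # token_ids: (batch, seq) word history; ctx_embedding: (batch, d_ctx)
        x = self.tok_emb(token_ids)
        seq_len = token_ids.size(1)
        causal_mask = nn.Transformer.generate_square_subsequent_mask(seq_len)
        # Sentence-external information enters through cross-attention ("memory").
        memory = self.ctx_proj(ctx_embedding).unsqueeze(1)
        h = self.decoder(x, memory, tgt_mask=causal_mask)
        return self.lm_head(h)  # next-word logits

# Toy usage with a random stand-in context vector.
model = CUEDecoder()
tokens = torch.randint(0, 1000, (2, 8))
ctx = torch.randn(2, 768)
print(model(tokens, ctx).shape)  # torch.Size([2, 8, 1000])
```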
A salient feature of Neural Machine Translation (NMT) is the end-to-end nature of training employed,...
In-context learning is a recent paradigm in natural language understanding, where a large pre-traine...
Causal language modeling (LM) uses word history to predict the next word. BERT, on the other hand, m...
An end-to-end (E2E) ASR model implicitly learns a prior Internal Language Model (ILM) from the train...
Language models (LMs) have been used in cognitive modeling as well as engineering studies -- they co...
A variety of contextualised language models have been proposed in the NLP community, which are train...
Vector representation of sentences is important for many text processing tasks that involve classify...
When pre-trained on large unsupervised textual corpora, language models are able to store and retri...
Self-supervised large language models (LMs) have become a highly-influential and foundational tool f...
State-of-the-art encoder-decoder models (e.g. for machine translation (MT) or speech recognition (AS...
Language models significantly benefit from context tokens, such as prompts or scratchpads. They perf...
Multi-encoder models are a broad family of context-aware Neural Machine Translation (NMT) systems th...
Recent research has made impressive progress in large-scale multimodal pre-training. In the context ...
In this paper we present a comparison between the linguistic knowledge encoded in the internal repre...
Though achieving impressive results on many NLP tasks, the BERT-like masked language models (MLM) en...