The current era of natural language processing (NLP) has been defined by the prominence of pre-trained language models since the advent of BERT. A defining feature of BERT and similarly architected models is the masked language modeling objective, in which part of the input is intentionally masked and the model is trained to predict the masked content. Data augmentation is a data-driven technique widely used across machine learning, including computer vision and natural language processing, to improve model performance by artificially expanding the training data set through designated techniques. Masked language models (MLMs), an essential training feature of BERT, have introduced a novel approach to perform effective...
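As a minimal illustration of the masking objective described above, the sketch below applies BERT's conventional scheme: roughly 15% of input positions are selected as prediction targets, of which 80% are replaced with a [MASK] token, 10% with a random token, and 10% left unchanged. The function name, the use of -100 as an ignored label, and the integer token IDs are illustrative assumptions, not any particular library's API.

```python
import random

def mask_tokens(token_ids, mask_id, vocab_size, mlm_prob=0.15):
    """BERT-style masking sketch: select ~15% of positions as targets.
    Of the selected positions, 80% become [MASK], 10% a random token,
    and 10% keep the original token; labels mark only selected positions."""
    labels = [-100] * len(token_ids)           # -100 = ignore in the MLM loss
    corrupted = list(token_ids)
    for i, tok in enumerate(token_ids):
        if random.random() < mlm_prob:
            labels[i] = tok                    # model must recover this token
            r = random.random()
            if r < 0.8:
                corrupted[i] = mask_id                       # 80%: [MASK]
            elif r < 0.9:
                corrupted[i] = random.randrange(vocab_size)  # 10%: random token
            # else: 10% of targets stay unchanged
    return corrupted, labels
```

During pretraining, the corrupted sequence is fed to the model and the cross-entropy loss is computed only at positions whose label is not the ignored value, so the model learns to reconstruct exactly the masked information.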
Combining structured information with language models is a standing problem in NLP. Building on prev...
Transfer learning applies knowledge or patterns learned in a particular field or task to differe...
Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NM...
This repository contains BERT-based models which were trained as part of the experiments described i...
Transformer-based autoregressive (AR) methods have achieved appealing performance for varied sequenc...
Masked language modeling (MLM), a self-supervised pretraining objective, is widely used in natural l...
Masked Language Models (MLMs) have shown superior performance in numerous downstream Natural Langua...
Masked language models conventionally use a masking rate of 15% due to the belief that more masking ...
In recent years, the field of language modelling has witnessed exciting developments. In particular, th...
Pre-training a language model and then fine-tuning it for downstream tasks has demonstrated state-of...
Natural language processing (NLP) techniques have significantly improved with the introduction of pre-trained l...
This paper presents an enhanced approach for adapting a Language Model (LM) to...
In this paper, we study how to use masked signal modeling in vision and language (V+L) representatio...
In many machine learning settings, research suggests that the development of training data might hav...