Effectively scaling large Transformer models is a main driver of recent advances in natural language processing. Dynamic neural networks, as an emerging research direction, are capable of scaling up neural networks with sub-linear increases in computation and time by dynamically adjusting their computational path based on the input. Dynamic neural networks could be a promising solution to the growing parameter numbers of pretrained language models, allowing both model pretraining with trillions of parameters and faster inference on mobile devices. In this survey, we summarize progress of three types of dynamic neural networks in NLP: skimming, mixture of experts, and early exit. We also highlight current challenges in dynamic neural network...
The rapid increase of information available on the Internet, much of it textual, calls for computati...
In recent years, the field of language modelling has witnessed exciting developments. Especially, th...
rba @ bellcore.com Recent developments in neural algorithms provide a new approach to natural langua...
Thesis (Ph.D.)--University of Washington, 2020In the past decade deep learning has revolutionized ma...
This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitabl...
Natural language processing (NLP) is a rapidly growing field with a wide range of applications, such...
Recent advances in deep neural language models combined with the capacity of large scale datasets ha...
The deep learning approach to machine learning emphasizes high-capacity, scalable models that learn ...
The field of Natural Language Processing (NLP) has been undergoing a revolution in recent years. Lar...
In 2017, Vaswani et al. proposed a new neural network architecture named Transformer. That modern ar...
Thesis (Ph.D.)--University of Washington, 2022Natural language processing (NLP) is having a paradigm...
One approach used to develop computer systems for natu- identifying phrases, e.g.<the boy> is ...
Recently, the development of pre-trained language models has brought natural language processing (NL...
A major challenge in modern neural networks is the utilization of previous knowledge for new tasks i...
The ability to accurately represent sentences is central to language understanding. We describe a co...
The rapid increase of information available on the Internet, much of it textual, calls for computati...
In recent years, the field of language modelling has witnessed exciting developments. Especially, th...
rba @ bellcore.com Recent developments in neural algorithms provide a new approach to natural langua...
Thesis (Ph.D.)--University of Washington, 2020In the past decade deep learning has revolutionized ma...
This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitabl...
Natural language processing (NLP) is a rapidly growing field with a wide range of applications, such...
Recent advances in deep neural language models combined with the capacity of large scale datasets ha...
The deep learning approach to machine learning emphasizes high-capacity, scalable models that learn ...
The field of Natural Language Processing (NLP) has been undergoing a revolution in recent years. Lar...
In 2017, Vaswani et al. proposed a new neural network architecture named Transformer. That modern ar...
Thesis (Ph.D.)--University of Washington, 2022Natural language processing (NLP) is having a paradigm...
One approach used to develop computer systems for natu- identifying phrases, e.g.<the boy> is ...
Recently, the development of pre-trained language models has brought natural language processing (NL...
A major challenge in modern neural networks is the utilization of previous knowledge for new tasks i...
The ability to accurately represent sentences is central to language understanding. We describe a co...
The rapid increase of information available on the Internet, much of it textual, calls for computati...
In recent years, the field of language modelling has witnessed exciting developments. Especially, th...
rba @ bellcore.com Recent developments in neural algorithms provide a new approach to natural langua...