The field of Natural Language Processing has experienced a dramatic leap in capabilities with the recent introduction of huge Language Models. Despite this success, natural language problems that involve several compounded steps are still practically unlearnable, even by the largest LMs. This complies with experimental failures for end-to-end learning of composite problems that were demonstrated in a variety of domains. An effective mitigation is to introduce intermediate supervision for solving sub-tasks of the compounded problem. Recently, several works have demonstrated high gains by taking a straightforward approach for incorporating intermediate supervision in compounded natural language problems: the sequence-to-sequence LM is fed wit...
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract...
State-of-the-art NLP systems are generally based on the assumption that the underlying models are pr...
Neural sequence models have been applied with great success to a variety of tasks in natural languag...
Language acquisition in both natural and artificial language learning settings crucially depends on ...
Language acquisition in both natural and artificial language learning settings crucially depends on ...
Recent work on large language models relies on the intuition that most natural language processing t...
The human ability to understand the world in terms of reusable ``building blocks\u27\u27 allows us t...
Pre-trained large language models have shown successful progress in many language understanding benc...
Deep Neural Networks (DNNs) are powerful models that have achieved excel-lent performance on difficu...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In Natural Language Processing (NLP), it is important to detect the relationship between two sequenc...
Many of the Natural Language Processing tasks that we would like to model with machine learning tech...
One of the most powerful ideas in natural language processing is the distributional hypothesis which...
We use reinforcement learning to learn tree-structured neural networks for computing representations...
Structured tasks, which often involve many interdependent decisions for each example, are the backbo...
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract...
State-of-the-art NLP systems are generally based on the assumption that the underlying models are pr...
Neural sequence models have been applied with great success to a variety of tasks in natural languag...
Language acquisition in both natural and artificial language learning settings crucially depends on ...
Language acquisition in both natural and artificial language learning settings crucially depends on ...
Recent work on large language models relies on the intuition that most natural language processing t...
The human ability to understand the world in terms of reusable ``building blocks\u27\u27 allows us t...
Pre-trained large language models have shown successful progress in many language understanding benc...
Deep Neural Networks (DNNs) are powerful models that have achieved excel-lent performance on difficu...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In Natural Language Processing (NLP), it is important to detect the relationship between two sequenc...
Many of the Natural Language Processing tasks that we would like to model with machine learning tech...
One of the most powerful ideas in natural language processing is the distributional hypothesis which...
We use reinforcement learning to learn tree-structured neural networks for computing representations...
Structured tasks, which often involve many interdependent decisions for each example, are the backbo...
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract...
State-of-the-art NLP systems are generally based on the assumption that the underlying models are pr...
Neural sequence models have been applied with great success to a variety of tasks in natural languag...