To augment language models with the ability to reason, researchers usually prompt or finetune them to produce chain of thought reasoning steps before producing the final answer. However, although people use natural language to reason effectively, it may be that LMs could reason more effectively with some intermediate computation that is not in natural language. In this work, we explore an alternative reasoning approach: instead of explicitly producing the chain of thought reasoning steps, we use the language model's internal hidden states to perform implicit reasoning. The implicit reasoning steps are distilled from a teacher model trained on explicit chain-of-thought reasoning, and instead of doing reasoning "horizontally" by producing int...
Generating step-by-step "chain-of-thought" rationales improves language model performance on complex...
Inductive reasoning is a core problem-solving capacity: humans can identify underlying principles fr...
Large language models (LLMs) have achieved remarkable advancements in the field of natural language ...
Large language models (LLMs) have shown remarkable reasoning capabilities given chain-of-thought pro...
We explore how generating a chain of thought -- a series of intermediate reasoning steps -- signific...
Logical reasoning remains a pivotal component within the realm of artificial intelligence. The recen...
Large language models (LMs) beyond a certain scale, demonstrate the emergent capability of generatin...
Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning step...
Chain-of-Thought (CoT) is a technique that guides Large Language Models (LLMs) to decompose complex ...
The knowledge-augmented deep learning paradigm refers to a paradigm in which domain knowledge is ide...
Chain-of-Thought(CoT) prompting and its variants explore equipping large language models (LLMs) with...
Implicit knowledge, such as common sense, is key to fluid human conversations. Current neural respon...
Recently, there has been significant progress in teaching language models to perform step-by-step re...
Reasoning is a distinctive human capacity, enabling us to address complex problems by breaking them ...
Abstract Large language models (LLMs) such as GPT-4 have recently demonstrated impressive results ac...
Generating step-by-step "chain-of-thought" rationales improves language model performance on complex...
Inductive reasoning is a core problem-solving capacity: humans can identify underlying principles fr...
Large language models (LLMs) have achieved remarkable advancements in the field of natural language ...
Large language models (LLMs) have shown remarkable reasoning capabilities given chain-of-thought pro...
We explore how generating a chain of thought -- a series of intermediate reasoning steps -- signific...
Logical reasoning remains a pivotal component within the realm of artificial intelligence. The recen...
Large language models (LMs) beyond a certain scale, demonstrate the emergent capability of generatin...
Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning step...
Chain-of-Thought (CoT) is a technique that guides Large Language Models (LLMs) to decompose complex ...
The knowledge-augmented deep learning paradigm refers to a paradigm in which domain knowledge is ide...
Chain-of-Thought(CoT) prompting and its variants explore equipping large language models (LLMs) with...
Implicit knowledge, such as common sense, is key to fluid human conversations. Current neural respon...
Recently, there has been significant progress in teaching language models to perform step-by-step re...
Reasoning is a distinctive human capacity, enabling us to address complex problems by breaking them ...
Abstract Large language models (LLMs) such as GPT-4 have recently demonstrated impressive results ac...
Generating step-by-step "chain-of-thought" rationales improves language model performance on complex...
Inductive reasoning is a core problem-solving capacity: humans can identify underlying principles fr...
Large language models (LLMs) have achieved remarkable advancements in the field of natural language ...