The development of highly fluent large language models (LLMs) has prompted increased interest in assessing their reasoning and problem-solving capabilities. We investigate whether several LLMs can solve a classic type of deductive reasoning problem from the cognitive science literature. The tested LLMs have limited abilities to solve these problems in their conventional form. We performed follow-up experiments to investigate whether changes to the presentation format and content improve model performance. We do find performance differences between conditions; however, they do not improve overall performance. Moreover, we find that performance interacts with presentation format and content in unexpected ways that differ from human performance. Ov...
Large language models (LLMs), such as GPT-3.5 and GPT-4, have greatly advanced the performance of ar...
Large language models (LLMs) have exploded in popularity in the past few years and have achieved und...
Large language models (LMs) beyond a certain scale demonstrate the emergent capability of generatin...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
Large Language Models (LLMs) have not only exhibited exceptional performance across various tasks, b...
In the present study, we investigate and compare reasoning in large language models (LLMs) and humans...
Large language models have exhibited emergent abilities, demonstrating exceptional performance acros...
We present an empirical evaluation of various outputs generated by nine of the most widely available...
The impressive recent performance of large language models has led many to wonder to what extent the...
Abstract reasoning is a key ability for an intelligent system. Large language models achieve above-c...
Large language models (LLMs) have achieved remarkable advancements in the field of natural language ...
The research described investigates why subjects frequently give logically wrong answers to problems...
Human language offers a powerful window into our thoughts -- we tell stories, give explanations, and...
Recent developments in large language models (LLMs) have shown promise in enhancing the capabilities...
Natural Language Inference (NLI) is considered a representative task to test natural language unders...