Prompts have been the center of progress in advancing language models' zero-shot and few-shot performance. However, recent work finds that models can perform surprisingly well when given intentionally irrelevant or misleading prompts. Such results may be interpreted as evidence that model behavior is not "human like". In this study, we challenge a central assumption in such work: that humans would perform badly when given pathological instructions. We find that humans are able to reliably ignore irrelevant instructions and thus, like models, perform well on the underlying task despite an apparent lack of signal regarding the task they are being asked to do. However, when given deliberately misleading instructions, humans follow the instruct...
Distinct model-free and model-based learning processes are thought to drive both typical and dysfunc...
Test suites assess natural language processing models' performance on specific functionalities: case...
Recently, Instruction fine-tuning has risen to prominence as a potential method for enhancing the ze...
Large language models (LLMs) have exploded in popularity in the past few years and have achieved und...
Abstract reasoning is a key ability for an intelligent system. Large language models achieve above-c...
Despite widespread use of LLMs as conversational agents, evaluations of performance fail to capture ...
In the present study, we investigate and compare reasoning in large language models (LLM) and humans...
Distinct model-free and model-based learning processes are thought to drive both typical and dysfunc...
Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, ex...
Large language models generate complex, open-ended outputs: instead of outputting a class label they...
Large language models have exhibited emergent abilities, demonstrating exceptional performance acros...
Comunicació presentada a la Conference on Empirical Methods in Natural Language Processing (EMNLP 20...
Language models (LMs) can be directed to perform target tasks by using labeled examples or natural l...
Language models (LMs) have been used in cognitive modeling as well as engineering studies -- they co...
Inferring reward functions from human behavior is at the center of value alignment - aligning AI obj...
Distinct model-free and model-based learning processes are thought to drive both typical and dysfunc...
Test suites assess natural language processing models' performance on specific functionalities: case...
Recently, Instruction fine-tuning has risen to prominence as a potential method for enhancing the ze...
Large language models (LLMs) have exploded in popularity in the past few years and have achieved und...
Abstract reasoning is a key ability for an intelligent system. Large language models achieve above-c...
Despite widespread use of LLMs as conversational agents, evaluations of performance fail to capture ...
In the present study, we investigate and compare reasoning in large language models (LLM) and humans...
Distinct model-free and model-based learning processes are thought to drive both typical and dysfunc...
Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, ex...
Large language models generate complex, open-ended outputs: instead of outputting a class label they...
Large language models have exhibited emergent abilities, demonstrating exceptional performance acros...
Comunicació presentada a la Conference on Empirical Methods in Natural Language Processing (EMNLP 20...
Language models (LMs) can be directed to perform target tasks by using labeled examples or natural l...
Language models (LMs) have been used in cognitive modeling as well as engineering studies -- they co...
Inferring reward functions from human behavior is at the center of value alignment - aligning AI obj...
Distinct model-free and model-based learning processes are thought to drive both typical and dysfunc...
Test suites assess natural language processing models' performance on specific functionalities: case...
Recently, Instruction fine-tuning has risen to prominence as a potential method for enhancing the ze...