Recent developments in large language models (LLMs) have shown promise in enhancing the capabilities of natural language processing (NLP). Despite these successes, there remains a dearth of research dedicated to the NLP problem-solving abilities of LLMs. To fill the gap in this area, we present a unique benchmarking dataset, NLPBench, comprising 378 college-level NLP questions spanning various NLP topics sourced from Yale University's prior final exams. NLPBench includes questions with context, in which multiple sub-questions share the same public information, and diverse question types, including multiple choice, short answer, and math. Our evaluation, centered on LLMs such as GPT-3.5/4, PaLM-2, and LLAMA-2, incorporates advanced prompting...
Computational argumentation has become an essential tool in various fields, including artificial int...
Realizing the recent advances in Natural Language Processing (NLP) to the legal sector poses challen...
Large Language Models (LLMs) have not only exhibited exceptional performance across various tasks, b...
Large Language Models (LLMs) have achieved significant success across various natural language proce...
Pretrained large language models (LLMs) are widely used in many sub-fields of natural language proce...
Large language models (LLMs) have garnered significant attention, but the definition of "large" lack...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
Large language models (LLMs) have significantly advanced the field of natural language processing, w...
Large language models (LLMs) have significantly advanced the field of natural language processing, w...
Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing ...
As the performance of large language models rapidly improves, benchmarks are getting larger and more...
The development of large language models (LLMs) such as ChatGPT has brought a lot of attention recen...
Recent strides in Large Language Models (LLMs) have saturated many NLP benchmarks (even professional...
Recently, large language models (LLMs), including notable models such as GPT-4 and burgeoning commun...
While recent advancements in large language models (LLMs) bring us closer to achieving artificial ge...
Computational argumentation has become an essential tool in various fields, including artificial int...
Realizing the recent advances in Natural Language Processing (NLP) to the legal sector poses challen...
Large Language Models (LLMs) have not only exhibited exceptional performance across various tasks, b...
Large Language Models (LLMs) have achieved significant success across various natural language proce...
Pretrained large language models (LLMs) are widely used in many sub-fields of natural language proce...
Large language models (LLMs) have garnered significant attention, but the definition of "large" lack...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
Large language models (LLMs) have significantly advanced the field of natural language processing, w...
Large language models (LLMs) have significantly advanced the field of natural language processing, w...
Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing ...
As the performance of large language models rapidly improves, benchmarks are getting larger and more...
The development of large language models (LLMs) such as ChatGPT has brought a lot of attention recen...
Recent strides in Large Language Models (LLMs) have saturated many NLP benchmarks (even professional...
Recently, large language models (LLMs), including notable models such as GPT-4 and burgeoning commun...
While recent advancements in large language models (LLMs) bring us closer to achieving artificial ge...
Computational argumentation has become an essential tool in various fields, including artificial int...
Realizing the recent advances in Natural Language Processing (NLP) to the legal sector poses challen...
Large Language Models (LLMs) have not only exhibited exceptional performance across various tasks, b...