Reasoning is a cognitive process of using evidence to reach a sound conclusion. The reasoning capability is essential for large language models (LLMs) to serve as the brain of the artificial general intelligence agent. Recent studies reveal that fine-tuning LLMs on data with the chain of thought (COT) reasoning process can significantly enhance their reasoning capabilities. However, we find that the fine-tuned LLMs suffer from an \textit{Assessment Misalignment} problem, i.e., they frequently assign higher scores to subpar COTs, leading to potential limitations in their reasoning abilities. To address this problem, we introduce an \textit{Alignment Fine-Tuning (AFT)} paradigm, which involves three steps: 1) fine-tuning LLMs with COT trainin...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Many applications of large language models (LLMs), ranging from chatbots to creative writing, requir...
Recent years have witnessed remarkable progress made in large language models (LLMs). Such advanceme...
In this paper, we uncover a systematic bias in the evaluation paradigm of adopting large language mo...
Large language models (LMs) beyond a certain scale, demonstrate the emergent capability of generatin...
Large language models (LLMs), typically designed as a function of next-word prediction, have excelle...
Generative foundation models are susceptible to implicit biases that can arise from extensive unsupe...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
The growing awareness of safety concerns in large language models (LLMs) has sparked considerable in...
The development of highly fluent large language models (LLMs) has prompted increased interest in ass...
Large language models (LLMs), such as GPT-3.5 and GPT-4, have greatly advanced the performance of ar...
Language models (LMs) with less than 100B parameters are known to perform poorly on chain-of-thought...
Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalitie...
Large Language Models (LLMs) are central to a multitude of applications but struggle with significan...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Many applications of large language models (LLMs), ranging from chatbots to creative writing, requir...
Recent years have witnessed remarkable progress made in large language models (LLMs). Such advanceme...
In this paper, we uncover a systematic bias in the evaluation paradigm of adopting large language mo...
Large language models (LMs) beyond a certain scale, demonstrate the emergent capability of generatin...
Large language models (LLMs), typically designed as a function of next-word prediction, have excelle...
Generative foundation models are susceptible to implicit biases that can arise from extensive unsupe...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
The growing awareness of safety concerns in large language models (LLMs) has sparked considerable in...
The development of highly fluent large language models (LLMs) has prompted increased interest in ass...
Large language models (LLMs), such as GPT-3.5 and GPT-4, have greatly advanced the performance of ar...
Language models (LMs) with less than 100B parameters are known to perform poorly on chain-of-thought...
Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalitie...
Large Language Models (LLMs) are central to a multitude of applications but struggle with significan...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Many applications of large language models (LLMs), ranging from chatbots to creative writing, requir...