A key technology for the development of large language models (LLMs) involves instruction tuning that helps align the models' responses with human expectations to realize impressive learning abilities. Two major approaches for instruction tuning characterize supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), which are currently applied to produce the best commercial LLMs (e.g., ChatGPT). To improve the accessibility of LLMs for research and development efforts, various instruction-tuned open-source LLMs have also been introduced recently, e.g., Alpaca, Vicuna, to name a few. However, existing open-source LLMs have only been instruction-tuned for English and a few popular languages, thus hindering their impac...
Widely used language models (LMs) are typically built by scaling up a two-stage training pipeline: a...
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
Recently, various studies have leveraged Large Language Models (LLMs) to help decision-making and pl...
Large language models (LLMs), such as GPT-4, PaLM, and LLaMa, have been shown to achieve remarkable ...
In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension a...
Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks wit...
Instruction tuning has remarkably advanced large language models (LLMs) in understanding and respond...
Large Language Models (LLMs), trained predominantly on extensive English data, often exhibit limitat...
Recent advancements in Large Language Models (LLMs) have expanded the horizons of natural language u...
Recent benchmarks for Large Language Models (LLMs) have mostly focused on application-driven tasks s...
Large Language Models (LLMs) present strong general capabilities, and a current compelling challenge...
Reinforcement learning (RL) has emerged as a powerful paradigm for fine-tuning Large Language Models...
Large Language Models (LLMs) have been a significant landmark of Artificial Intelligence (AI) advanc...
Reinforcement learning from human feedback (RLHF) is effective at aligning large language models (LL...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Widely used language models (LMs) are typically built by scaling up a two-stage training pipeline: a...
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
Recently, various studies have leveraged Large Language Models (LLMs) to help decision-making and pl...
Large language models (LLMs), such as GPT-4, PaLM, and LLaMa, have been shown to achieve remarkable ...
In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension a...
Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks wit...
Instruction tuning has remarkably advanced large language models (LLMs) in understanding and respond...
Large Language Models (LLMs), trained predominantly on extensive English data, often exhibit limitat...
Recent advancements in Large Language Models (LLMs) have expanded the horizons of natural language u...
Recent benchmarks for Large Language Models (LLMs) have mostly focused on application-driven tasks s...
Large Language Models (LLMs) present strong general capabilities, and a current compelling challenge...
Reinforcement learning (RL) has emerged as a powerful paradigm for fine-tuning Large Language Models...
Large Language Models (LLMs) have been a significant landmark of Artificial Intelligence (AI) advanc...
Reinforcement learning from human feedback (RLHF) is effective at aligning large language models (LL...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Widely used language models (LMs) are typically built by scaling up a two-stage training pipeline: a...
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
Recently, various studies have leveraged Large Language Models (LLMs) to help decision-making and pl...