Many NLP tasks benefit from using large language models (LLMs) that often have more than 100 billion parameters. With the release of BLOOM-176B and OPT-175B, everyone can download pretrained models of this scale. Still, using these models requires high-end hardware unavailable to many researchers. In some cases, LLMs can be used more affordably via RAM offloading or hosted APIs. However, these techniques have innate limitations: offloading is too slow for interactive inference, while APIs are not flexible enough for research. In this work, we propose Petals, a system for inference and fine-tuning of large models collaboratively by joining the resources of multiple parties trusted to process client's data. We demonstrate that this strategy...
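As a rough illustration of the collaborative-inference workflow described above, the sketch below shows how a client might generate text with the open-source Petals Python library. The class name, checkpoint identifier, and prompt are assumptions and may differ across Petals releases; the idea is that only the embeddings and sampling logic run locally, while the transformer blocks execute on servers contributed by other parties.

```python
# Minimal sketch of client-side inference over a Petals swarm.
# Assumes `pip install petals`; class and checkpoint names may vary by release.
import torch
from transformers import AutoTokenizer
from petals import AutoDistributedModelForCausalLM  # assumed client entry point

MODEL_NAME = "bigscience/bloom-petals"  # hypothetical swarm checkpoint identifier

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
# The client keeps only the input embeddings and LM head locally;
# the transformer blocks are served remotely by swarm participants.
model = AutoDistributedModelForCausalLM.from_pretrained(MODEL_NAME)

prompt = "Collaborative inference lets consumer GPUs serve a very large model by"
inputs = tokenizer(prompt, return_tensors="pt")["input_ids"]

with torch.inference_mode():
    # Each new token requires one pass through the chain of remote blocks.
    outputs = model.generate(inputs, max_new_tokens=16)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```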
Large language models (LLMs) are now available in various sizes and configurations from cloud API pr...
Language model fine-tuning is essential for modern natural language processing, but is computational...
Language models, given their black-box nature, often exhibit sensitivity to input perturbations, lea...
Scaling language models with more data, compute and parameters has driven significant progress in na...
The recent advance of self-supervised learning associated with the Transformer architecture enables ...
Parameter-shared pre-trained language models (PLMs) have emerged as a successful approach in resourc...
Inspired by Federated Learning, in this paper, we propose personal large models that are distilled f...
When building large-scale machine learning (ML) programs, such as big topic models or deep neural...
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstr...
Language models (LMs) such as BERT and GPT have revolutionized natural language processing (NLP). Ho...
When scaled to hundreds of billions of parameters, pretrained language models such as GPT-3 (Brown e...
Deploying large language models (LLMs) is challenging because they are memory inefficient and comput...
Large Language Models (LLMs) have significantly advanced the field of Natural Language Processing (N...
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their gen...
Distilling state-of-the-art transformer models into lightweight student models is an effective way t...