The BigScience Workshop was a value-driven initiative that spanned one and half years of interdisciplinary research and culminated in the creation of ROOTS, a 1.6TB multilingual dataset that was used to train BLOOM, one of the largest multilingual language models to date. In addition to the technical outcomes and artifacts, the workshop fostered multidisciplinary collaborations around large models, datasets, and their analysis. This in turn led to a wide range of research publications spanning topics from ethics to law, data governance, modeling choices and distributed training. This paper focuses on the collaborative research aspects of BigScience and takes a step back to look at the challenges of large-scale participatory research, with r...
This paper discusses some of the issues that arise when small scientific projects make the transitio...
This paper discusses some of the issues that arise when small scientific projects make the transitio...
\u2018Big data is here to stay.\u2019 This key statement has a double value: is an assumption as wel...
International audienceAs language models grow ever larger, the need for large-scale high-quality tex...
The use of language models in Web applications and other areas of computing and business have grown ...
The use of language models in Web applications and other areas of computing and business have grown ...
The emergence of Large Language Models (LLMs) has brought both excitement and concerns to social com...
The rise of Big Data in the social realm poses significant questions at the intersection of science,...
Language models demonstrate both quantitative improvement and new qualitative capabilities with incr...
Rapid advances in the capabilities of Large Language Models (LLM) as a basis for Artificial Intellig...
This paper argues that analyses of the ways in which Big Data has been enacted in other academic dis...
In this paper, we reflect on the disciplinary contours of contemporary sociology, and social science...
Thanks to rapid progress in artificial intelligence, we have entered an era when technology and phil...
Sample description: Our target audience consisted of researchers working in the fields of science, t...
Large language models (LLMs)—machine learning algorithms that can recognize, summarize, translate,...
This paper discusses some of the issues that arise when small scientific projects make the transitio...
This paper discusses some of the issues that arise when small scientific projects make the transitio...
\u2018Big data is here to stay.\u2019 This key statement has a double value: is an assumption as wel...
International audienceAs language models grow ever larger, the need for large-scale high-quality tex...
The use of language models in Web applications and other areas of computing and business have grown ...
The use of language models in Web applications and other areas of computing and business have grown ...
The emergence of Large Language Models (LLMs) has brought both excitement and concerns to social com...
The rise of Big Data in the social realm poses significant questions at the intersection of science,...
Language models demonstrate both quantitative improvement and new qualitative capabilities with incr...
Rapid advances in the capabilities of Large Language Models (LLM) as a basis for Artificial Intellig...
This paper argues that analyses of the ways in which Big Data has been enacted in other academic dis...
In this paper, we reflect on the disciplinary contours of contemporary sociology, and social science...
Thanks to rapid progress in artificial intelligence, we have entered an era when technology and phil...
Sample description: Our target audience consisted of researchers working in the fields of science, t...
Large language models (LLMs)—machine learning algorithms that can recognize, summarize, translate,...
This paper discusses some of the issues that arise when small scientific projects make the transitio...
This paper discusses some of the issues that arise when small scientific projects make the transitio...
\u2018Big data is here to stay.\u2019 This key statement has a double value: is an assumption as wel...