We introduce a large-scale dataset of the complete texts of free/open source software (FOSS) license variants. To assemble it we have collected from the Software Heritage archive—the largest publicly available archive of FOSS source code with accompanying development history—all versions of files whose names are commonly used to convey licensing terms to software users and developers. The dataset consists of 6.5 million unique license files that can be used to conduct empirical studies on open source licensing, training of automated license classifiers, natural language processing (NLP) analyses of legal texts, as well as historical and phylogenetic studies on FOSS licensing. Additional metadata about shipped license files are also provided...
Abstract. Software licensing is a complex issue in free and open source software (FOSS), specially w...
Workshop on open licenses: Data licencing and policiesInternational audienceThe main goal of this pr...
We aim to determine the features of four popular FOSS scanning tools, FOSSology,FOSSA, FOSSID(SCAS),...
We introduce a large-scale dataset of the complete texts of free/open source software (FOSS) license...
This dataset consists of a number of GitHub repositories that cover the following programming langua...
This is the Debsources Dataset: source code and related metadata spanning two decades of Free and Op...
Free and open source software systems (FOSS) are distribu-ted and made available to users under diff...
The race to train language models on vast, diverse, and inconsistently documented datasets has raise...
Open source software nowadays has become an important trend for software technology innovation and s...
Part 2: Open Source in Business ModelingInternational audienceFree and Open Source Software (FOSS) i...
FOSS (Free and Open Source System) is repeatedly modied and reused by other FOSS or proprietary soft...
Free and open source software (FOSS) is distributed and made available to users under different soft...
Organizations across the globe are creating and distributing products that include open source softw...
Software Heritage is the largest existing public archive of software source code and accompanying de...
We present a dataset of open source software developed mainly by enterprises rather than volunteers....
Abstract. Software licensing is a complex issue in free and open source software (FOSS), specially w...
Workshop on open licenses: Data licencing and policiesInternational audienceThe main goal of this pr...
We aim to determine the features of four popular FOSS scanning tools, FOSSology,FOSSA, FOSSID(SCAS),...
We introduce a large-scale dataset of the complete texts of free/open source software (FOSS) license...
This dataset consists of a number of GitHub repositories that cover the following programming langua...
This is the Debsources Dataset: source code and related metadata spanning two decades of Free and Op...
Free and open source software systems (FOSS) are distribu-ted and made available to users under diff...
The race to train language models on vast, diverse, and inconsistently documented datasets has raise...
Open source software nowadays has become an important trend for software technology innovation and s...
Part 2: Open Source in Business ModelingInternational audienceFree and Open Source Software (FOSS) i...
FOSS (Free and Open Source System) is repeatedly modied and reused by other FOSS or proprietary soft...
Free and open source software (FOSS) is distributed and made available to users under different soft...
Organizations across the globe are creating and distributing products that include open source softw...
Software Heritage is the largest existing public archive of software source code and accompanying de...
We present a dataset of open source software developed mainly by enterprises rather than volunteers....
Abstract. Software licensing is a complex issue in free and open source software (FOSS), specially w...
Workshop on open licenses: Data licencing and policiesInternational audienceThe main goal of this pr...
We aim to determine the features of four popular FOSS scanning tools, FOSSology,FOSSA, FOSSID(SCAS),...