This dataset contains Python scripts that apply repository mining and social network analysis (SNA) techniques to investigate the transparency and workload distribution of open source hardware (OSH) product development projects hosted on GitHub. Starting from a list of projects and references to their corresponding repositories, the scripts extract file versioning metadata from the GitHub API and compute GraphML graphs depicting the full history of commit information for each project. Three types of graphs are computed: commit graphs, file co-edition graphs, and file change graphs. The scripts then apply SNA indicators (size, centrality and clustering index) to characterize the topology of the file co-edition graphs. Finally, they apply a k-means c...
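The graph-building and indicator steps described above can be illustrated with a minimal sketch. It is not the dataset's own code: the commit records are hypothetical stand-ins for metadata that would come from the GitHub API, the function names are invented for illustration, and it assumes that a co-edition graph links two contributors whenever they have edited the same file, which may differ from the definition used in the actual scripts. It uses only the standard library and leaves out GraphML export and the clustering step.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical commit records; the real scripts obtain these from the GitHub API.
commits = [
    {"author": "alice", "files": ["frame.scad", "README.md"]},
    {"author": "bob", "files": ["frame.scad"]},
    {"author": "carol", "files": ["README.md", "bom.csv"]},
    {"author": "dave", "files": ["bom.csv"]},
]


def co_edition_graph(commits):
    """Return adjacency sets linking contributors who edited a common file.

    This linking rule is an assumption made for illustration.
    """
    editors = defaultdict(set)
    for commit in commits:
        for path in commit["files"]:
            editors[path].add(commit["author"])
    graph = {commit["author"]: set() for commit in commits}
    for authors in editors.values():
        for a, b in combinations(sorted(authors), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def degree_centrality(graph):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(graph)
    return {v: (len(nb) / (n - 1) if n > 1 else 0.0) for v, nb in graph.items()}


def clustering(graph):
    """Local clustering coefficient: share of a node's neighbour pairs that are linked."""
    result = {}
    for v, neighbours in graph.items():
        k = len(neighbours)
        if k < 2:
            result[v] = 0.0
            continue
        links = sum(1 for a, b in combinations(neighbours, 2) if b in graph[a])
        result[v] = 2 * links / (k * (k - 1))
    return result


graph = co_edition_graph(commits)
n_nodes = len(graph)                                   # graph size: nodes
n_edges = sum(len(nb) for nb in graph.values()) // 2   # graph size: edges
centrality = degree_centrality(graph)
local_clustering = clustering(graph)
```

In this toy project the four contributors form a chain (bob, alice, carol, dave), so no node's neighbours are connected to each other and every clustering coefficient is zero. A project where many contributors touch the same files would instead produce a dense graph with high clustering.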
Software projects under version control grow with each commit, accumulating up to hundreds of thousa...
GitHub has become the central online platform for much of open source, hosting most open source code...
GitHub is the most popular repository for open source code. It has more than 3.5 million users, as t...
Data from software repositories have become an important foundation for the empirical study of softw...
This dataset contains the scripts and dataset used in the study reported at Unveiling the Technical ...
We present a dataset of open source software developed mainly by enterprises rather than volunteers....
Multi-repository software projects are becoming more and more popular, thanks ...
Research software is vital for academia, yet reliable figures are rare. In an attempt to better unde...
This dataset accompanies the submission "Generating representative, live network traffic out of mill...
This dataset contains the scripts and dataset used in the study reported at Mining the Technical Rol...
Web 2.0 technologies have not only raised microblogs, but also social software development and colla...
Software repositories contain historical and valuable information about the overall development of s...
Nowadays, the increasing need for software products in the market is changing the way of developing s...