This dataset contains the scripts and dataset used in the study reported at Unveiling the Technical Roles of GitHub Users paper. The files are described in more detailed below: processed_ground_truth.csv: A CSV file with the information of the developers considered in the study. Due to privacy issues, we already preprocessed the dataset to remove identification clues. Please contact the authors in case you need the original one. script.ipynb: A jupyter notebook with the scripts used in our study. Pipfile: The list of dependencies used to execute our script. This way you can replicate our study in an easier way (i.e., using virtual environments).Authors dataset are currently anonymous due to DBR policy
With over 10 million git repositories, GitHub is becoming one of the most important source of softwa...
This replication package contains datasets and scripts related to the paper: "*The Transparency of P...
Using GitHub APIs, we construct an unbiased dataset of over 10 million GitHub users. The data was co...
This dataset contains the scripts and dataset used in the study reported at Unveiling the Technical ...
This dataset contains the scripts and dataset used in the study reported at Mining the Technical Rol...
This dataset includes the data used for the thematic analysis of the paper in the title. all_proj...
Supplementary website containing result plots, pseudonymized input data, resulting pseudonymized cla...
This dataset collected from Stack Overflow (SO) and GitHub Discussions was used to conduct an empiri...
This dataset contains the SQL tables of the training and test datasets used in our experimentation. ...
Research software is vital for academia, yet reliable figures are rare. In an attempt to better unde...
This dataset collected from Stack Overflow (SO) and GitHub was used to conduct an empirical study on...
This replication package contains all the material required to replicate the analyses we made for ou...
This dataset contains the SQL tables of the training and test datasets used in our experimentation. ...
This dataset comprises of the raw data that we used for analyzing the automotive software landscape ...
This dataset contains Python scripts applying repository mining and social network analysis (SNA) te...
With over 10 million git repositories, GitHub is becoming one of the most important source of softwa...
This replication package contains datasets and scripts related to the paper: "*The Transparency of P...
Using GitHub APIs, we construct an unbiased dataset of over 10 million GitHub users. The data was co...
This dataset contains the scripts and dataset used in the study reported at Unveiling the Technical ...
This dataset contains the scripts and dataset used in the study reported at Mining the Technical Rol...
This dataset includes the data used for the thematic analysis of the paper in the title. all_proj...
Supplementary website containing result plots, pseudonymized input data, resulting pseudonymized cla...
This dataset collected from Stack Overflow (SO) and GitHub Discussions was used to conduct an empiri...
This dataset contains the SQL tables of the training and test datasets used in our experimentation. ...
Research software is vital for academia, yet reliable figures are rare. In an attempt to better unde...
This dataset collected from Stack Overflow (SO) and GitHub was used to conduct an empirical study on...
This replication package contains all the material required to replicate the analyses we made for ou...
This dataset contains the SQL tables of the training and test datasets used in our experimentation. ...
This dataset comprises of the raw data that we used for analyzing the automotive software landscape ...
This dataset contains Python scripts applying repository mining and social network analysis (SNA) te...
With over 10 million git repositories, GitHub is becoming one of the most important source of softwa...
This replication package contains datasets and scripts related to the paper: "*The Transparency of P...
Using GitHub APIs, we construct an unbiased dataset of over 10 million GitHub users. The data was co...