Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin numerous process and project management activities, including the estimation of development effort and the prediction of the likely location and severity of defects in code. Serious questions have been raised, however, over the quality of the data used in ESE. Data quality problems caused by noise, outliers, and incompleteness have been noted as being especially prevalent. Other quality issues, although also potentially important, have received less attention. In this study, we assess the quality of 13 datasets that have been used extensively in research on software effort estimation. The quality issues considered in this article draw on...
The dataset contains the following files: (1) readme.txt, it is a text file including details about ...
Abstract. Data collection and analysis are key artifacts in any software engi-neering experiment. Ho...
Considering the complex nature of software projects, we have to use historical data and past experie...
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin n...
Since its inception as a recognized sub-discipline, empirical software engineering (ESE) has been pl...
Context: We revisit our review of data quality within the context of empirical software engineering ...
Context: The utility of prediction models in empirical software engineering (ESE) is heavily reliant...
OBJECTIVE - to assess the extent and types of techniques used to manage quality within software engi...
The quality of data is important in research working with data sets because poor data quality may le...
Reliable empirical models such as those used in software effort estimation or defect prediction are ...
The quality of data is important in research working with data sets because poor data quality may le...
Software is playing a crucial role in modern societies. The demand for software quality is increasin...
Agile software development has interested researchers for the last decade. Agile software developmen...
Abstract In the last decade, modern data analytics technologies have enabled the creation of softwa...
Software data sets derived from actual software products and their development processes are widely ...
The dataset contains the following files: (1) readme.txt, it is a text file including details about ...
Abstract. Data collection and analysis are key artifacts in any software engi-neering experiment. Ho...
Considering the complex nature of software projects, we have to use historical data and past experie...
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin n...
Since its inception as a recognized sub-discipline, empirical software engineering (ESE) has been pl...
Context: We revisit our review of data quality within the context of empirical software engineering ...
Context: The utility of prediction models in empirical software engineering (ESE) is heavily reliant...
OBJECTIVE - to assess the extent and types of techniques used to manage quality within software engi...
The quality of data is important in research working with data sets because poor data quality may le...
Reliable empirical models such as those used in software effort estimation or defect prediction are ...
The quality of data is important in research working with data sets because poor data quality may le...
Software is playing a crucial role in modern societies. The demand for software quality is increasin...
Agile software development has interested researchers for the last decade. Agile software developmen...
Abstract In the last decade, modern data analytics technologies have enabled the creation of softwa...
Software data sets derived from actual software products and their development processes are widely ...
The dataset contains the following files: (1) readme.txt, it is a text file including details about ...
Abstract. Data collection and analysis are key artifacts in any software engi-neering experiment. Ho...
Considering the complex nature of software projects, we have to use historical data and past experie...