Reliable empirical models such as those used in software effort estimation or defect prediction are inherently dependent on the data from which they are built. As demands for process and product improvement continue to grow, the quality of the data used in measurement and prediction systems warrants increasingly close scrutiny. In this paper we propose a taxonomy of data quality challenges in empirical software engineering, based on an extensive review of prior research. We consider current assessment techniques for each quality issue and proposed mechanisms to address these issues, where available. Our taxonomy classifies data quality issues into three broad areas: first, characteristics of data that mean they are not fit for modeling; sec...
Abstract—In the last decade, a large number of software repositories have been created for different...
Data quality has emerged as an important and challenging topic in recent years. This article address...
Although contemporary research relies to a large extent on data, data quality in Information Systems...
Reliable empirical models such as those used in software effort estimation or defect prediction are ...
Context: The utility of prediction models in empirical software engineering (ESE) is heavily reliant...
Abstract. Data collection and analysis are key artifacts in any software engi-neering experiment. Ho...
Since its inception as a recognized sub-discipline, empirical software engineering (ESE) has been pl...
The quality of data is important in research working with data sets because poor data quality may le...
OBJECTIVE - to assess the extent and types of techniques used to manage quality within software engi...
The quality of data is important in research working with data sets because poor data quality may le...
Context: We revisit our review of data quality within the context of empirical software engineering ...
Software is playing a crucial role in modern societies. The demand for software quality is increasin...
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin n...
Software quality is explicit property which determines what sort of standards software ought to have...
Context. Quality assurance plays a vital role in the software engineering development process. It ca...
Abstract—In the last decade, a large number of software repositories have been created for different...
Data quality has emerged as an important and challenging topic in recent years. This article address...
Although contemporary research relies to a large extent on data, data quality in Information Systems...
Reliable empirical models such as those used in software effort estimation or defect prediction are ...
Context: The utility of prediction models in empirical software engineering (ESE) is heavily reliant...
Abstract. Data collection and analysis are key artifacts in any software engi-neering experiment. Ho...
Since its inception as a recognized sub-discipline, empirical software engineering (ESE) has been pl...
The quality of data is important in research working with data sets because poor data quality may le...
OBJECTIVE - to assess the extent and types of techniques used to manage quality within software engi...
The quality of data is important in research working with data sets because poor data quality may le...
Context: We revisit our review of data quality within the context of empirical software engineering ...
Software is playing a crucial role in modern societies. The demand for software quality is increasin...
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin n...
Software quality is explicit property which determines what sort of standards software ought to have...
Context. Quality assurance plays a vital role in the software engineering development process. It ca...
Abstract—In the last decade, a large number of software repositories have been created for different...
Data quality has emerged as an important and challenging topic in recent years. This article address...
Although contemporary research relies to a large extent on data, data quality in Information Systems...