Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/17355
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLiebchen, G-
dc.contributor.authorShepperd, M-
dc.date.accessioned2019-01-15T14:25:08Z-
dc.date.available2016-09-09-
dc.date.available2019-01-15T14:25:08Z-
dc.date.issued2016-
dc.identifier.citationACM International Conference Proceeding Series, 2016en_US
dc.identifier.isbn9781450347723-
dc.identifier.issnhttp://dx.doi.org/10.1145/2972958.2972967-
dc.identifier.urihttp://bura.brunel.ac.uk/handle/2438/17355-
dc.description.abstractContext: We revisit our review of data quality within the context of empirical software engineering eight years on from our PROMISE 2008 article. Objective: To assess the extent and types of techniques used to manage quality within data sets. We consider this a particularly interesting question in the context of initiatives to promote sharing and secondary analysis of data sets. Method: We update the 2008 mapping study through four subsequently published reviews and a snowballing exercise. Results: The original study located only 23 articles explicitly considering data quality. This picture has changed substantially as our updated review now finds 283 articles, however, our estimate is that this still represents perhaps 1% of the total empirical software engineering literature. Conclusions: It appears the community is now taking the issue of data quality more seriously and there is more work exploring techniques to automatically detect (and sometimes repair) noise problems. However, there is still little systematic work to evaluate the various data sets that are widely used for secondary analysis; addressing this would be of considerable benefit. It should also be a priority to work collaboratively with practitioners to add new, higher quality data to the existing corpora.en_US
dc.language.isoenen_US
dc.publisherACMen_US
dc.subjectEmpirical software engineeringen_US
dc.subjectData qualityen_US
dc.subjectMapping studyen_US
dc.titleData sets and data quality in software engineering: Eight years onen_US
dc.typeConference Paperen_US
dc.identifier.doihttp://dx.doi.org/10.1145/2972958.2972967-
dc.relation.isPartOfACM International Conference Proceeding Series-
pubs.publication-statusPublished-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdf175.17 kBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.