Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/29248
Full metadata record
DC FieldValueLanguage
dc.contributor.authorYao, J-
dc.contributor.authorShepperd, M-
dc.date.accessioned2024-06-22T19:08:39Z-
dc.date.available2021-06-17-
dc.date.available2024-06-22T19:08:39Z-
dc.date.issued2021-06-17-
dc.identifierORCiD: Martin Shepperd https://orcid.org/0000-0003-1874-6145-
dc.identifier106664-
dc.identifier.citationYao, J. and . (2021) 'The impact of using biased performance metrics on software defect prediction research', Information and Software Technology, 139, 106664, pp. 1 - 14. doi: 10.1016/j.infsof.2021.106664.en_US
dc.identifier.issn0950-5849-
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/29248-
dc.descriptionThe article archived on this institutional repository is the accepted manuscript [v4], made available at arXiv:2103.10201v4 [cs.SE], https://arxiv.org/abs/2103.10201v4 under a CC BY-NC-SA license on Tue, 22 Jun 2021 09:50:19 UTC (3,233 KB). Comments: Accepted by the journal Information & Software Technology. It is a greatly extended version of "Assessing Software Defection Prediction Performance: Why Using the Matthews Correlation Coefficient Matters" presented at EASE 2020.-
dc.description.abstractContext: Software engineering researchers have undertaken many experiments investigating the potential of software defect prediction algorithms. Unfortunately some widely used performance metrics are known to be problematic, most notably F1, but nevertheless F1 is widely used. Objective: To investigate the potential impact of using F1 on the validity of this large body of research. Method: We undertook a systematic review to locate relevant experiments and then extract all pairwise comparisons of defect prediction performance using F1 and the unbiased Matthews correlation coefficient (MCC). Results: We found a total of 38 primary studies. These contain 12,471 pairs of results. Of these comparisons, 21.95% changed direction when the MCC metric is used instead of the biased F1 metric. Unfortunately, we also found evidence suggesting that F1 remains widely used in software defect prediction research. Conclusion: We reiterate the concerns of statisticians that the F1 is a problematic metric outside of an information retrieval context, since we are concerned about both classes (defect-prone and not defect-prone units). This inappropriate usage has led to a substantial number (more than one fifth) of erroneous (in terms of direction) results. Therefore we urge researchers to (i) use an unbiased metric and (ii) publish detailed results including confusion matrices such that alternative analyses become possible.en_US
dc.description.sponsorshipJingxiu Yao wishes to acknowledge the support of the China Scholarship Council .en_US
dc.format.extent1 - 14-
dc.format.mediumPrint-Electronic-
dc.languageEnglish-
dc.language.isoen_USen_US
dc.publisherElsevieren_US
dc.relation.urihttps://arxiv.org/abs/2103.10201v4-
dc.rightsCopyright © 2021 The Author(s). This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (https://creativecommons.org/licenses/by-nc-sa/4.0/).-
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/-
dc.subjectsoftware engineeringen_US
dc.subjectmachine learningen_US
dc.subjectsoftware defect predictionen_US
dc.subjectcomputational experimenten_US
dc.subjectclassification metricsen_US
dc.titleThe impact of using biased performance metrics on software defect prediction researchen_US
dc.typeArticleen_US
dc.identifier.doihttps://doi.org/10.1016/j.infsof.2021.106664-
dc.relation.isPartOfInformation and Software Technology-
pubs.publication-statusPublished-
pubs.volume139-
dc.identifier.eissn1873-6025-
dc.rights.licensehttps://creativecommons.org/licenses/by-nc-sa/4.0/legalcode.en-
dc.rights.holderThe Author(s)-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
Preprint.pdf2.88 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons