DeepHistoClass: A novel strategy for confident classification of immunohistochemistry images using Deep Learning

Ghoshal, B; Hikmet, F; Pineau, C; Tucker, A; Lindskog, C

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/23161

Title:	DeepHistoClass: A novel strategy for confident classification of immunohistochemistry images using Deep Learning
Authors:	Ghoshal, B Hikmet, F Pineau, C Tucker, A Lindskog, C
Keywords:	Testis;Immunohistochemistry;Artificial intelligence;Machine learning;Histology
Issue Date:	21-Aug-2021
Publisher:	Elsevier
Citation:	Ghoshal B, et al. (2021) 'DeepHistoClass: A novel strategy for confident classification of immunohistochemistry images using Deep Learning', Molecular & Cellular Proteomics, 20, 100140, pp. 1 - 21. doi: 10.1016/j.mcpro.2021.100140.
Abstract:	A multitude of efforts worldwide aim to create a single cell reference map of the human body, for fundamental understanding of human health, molecular medicine and targeted treatment. Antibody-based proteomics using immunohistochemistry (IHC) has proven to be an excellent technology for integration with large-scale single cell transcriptomics datasets. The golden standard for evaluation of IHC staining patterns is manual annotation, which is expensive and may lead to subjective errors. Artificial intelligence holds much promise for efficient and accurate pattern recognition, but confidence in prediction needs to be addressed. Here, the aim was to present a reliable and comprehensive framework for automated annotation of IHC images. We developed a multi-label classification of 7,848 complex IHC images of human testis corresponding to 2,794 unique proteins, generated as part of the Human Protein Atlas (HPA) project. Manual annotation data for eight different cell types was generated as a basis for training and testing a proposed Hybrid Bayesian Neural Network. By combining the deep learning model with a novel uncertainty metric; DeepHistoClass (DHC) confidence score; the average diagnostic performance improved from 86.9% to 96.3%. This metric not only reveals which images are reliably classified by the model, but can also be utilized for identification of manual annotation errors. The proposed streamlined workflow can be developed further for other tissue types in health and disease, and has important implications for digital pathology initiatives or large-scale protein mapping efforts such as the HPA project.
Description:	Data Availability: JPEG files of all 7848 images of the HPA dataset used in the present investigation, as well as the manually annotated protein expression in eight different cell types are available on v20.proteinatlas.org. Manual errors identified as part of this study have been corrected, which means that some of the presented protein expression data on the HPA will differ from the input data used for model training. All images from the independent dataset from another laboratory have been uploaded to the BioStudies repository (https://www.ebi.ac.uk/biostudies) under the accession S-BSST554. All codes are available in GitHub (https://github.com/birajaghoshal/DeepHistoClass). Supplemental data are available online at: https://www.sciencedirect.com/science/article/pii/S1535947621001122#appsec1 .
URI:	https://bura.brunel.ac.uk/handle/2438/23161
DOI:	https://doi.org/10.1016/j.mcpro.2021.100140
ISSN:	1535-9476
Other Identifiers:	ORCiD: Biraja Ghoshal https://orcid.org/0000-0001-5456-2197 ORCiD: Allan Tucker https://orcid.org/0000-0001-5105-3506 ORCiD: Cecilia Lindskog https://orcid.org/0000-0001-5611-1015 Article number: 100140
Appears in Collections:	Dept of Computer Science Research Papers

Files in This Item:

File	Description	Size	Format
FullText.pdf	Copyright © 2021 The Authors. Published by Elsevier Inc on behalf of American Society for Biochemistry and Molecular Biology. This is an open access article under the CC BY license (https://creativecommons.org/ licenses/by/4.0/).	4.64 MB	Adobe PDF	View/Open

Show full item record

This item is licensed under a Creative Commons License