Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/33310
Title: Empirical validation of race-neutral normative brain morphometry models across ethnoracially diverse populations
Authors: Ge, R
Yu, Y
New, F
Haas, SS
Sanford, N
Yu, K
Allen, P
Arslan, S
Avram, M
Borgwardt, S
Crossley, NA
de la Fuente-Sandoval, C
Fukunaga, M
Gao, J-H
Gonzalez-Valderrama, A
Hashimoto, R
Iasevoli, F
Keeser, D
Kubat, K
Kumari, V
Matsumoto, J
Mehta, UM
Nemoto, K
Pontillo, G
Raabe, FJ
Reyes-Madrigal, F
Roy, N
Şahin-Çevik, D
Sahin-Ilikoglu, T
Toulopoulou, T
Wagner, E
Yang, G
Zurita, M
Thompson, PM
Frangou, S
Keywords: normative models;human;neuroimaging;brain morphometry
Issue Date: 12-May-2026
Publisher: National Academy of Sciences
Citation: Ge, R. et al. (2026) 'Empirical validation of race-neutral normative brain morphometry models across ethnoracially diverse populations', Proceedings of the National Academy of Sciences, 123 (20), e2521055123, pp. 1-8. doi: 10.1073/pnas.2521055123.
Abstract: Normative models of brain morphometry quantify individual deviations from typical anatomical patterns and hold promise for enhancing clinical decision-making. However, their clinical utility depends critically on demonstrating generalizability across diverse ethnoracial populations. We previously developed sex-specific, race-neutral normative models for cortical thickness, surface area, and subcortical volumes using brain scans from a large international sample of healthy individuals, as part of the CentileBrain Project, a global initiative to provide open-access, neuroimaging reference models. The primary aim of the present study was to empirically evaluate the generalizability and accuracy of these pretrained models across multiple ethnoracial groups. To this end, we tested model performance in independent samples of healthy individuals from Africa, Asia, Europe, and the Americas, with ethnoracial classification defined either by self-identification or genetic ancestry (N = 4,862). We further compared performance against normative models developed exclusively from a single-population Chinese cohort. Across all groups, as well as in the pooled sample, the pretrained CentileBrain models demonstrated consistently high accuracy, with relative mean absolute error values below 10% for subcortical volume and surface area and below 5% for cortical thickness. Model performance was highly concordant across self-identified and ancestry-defined groups. In a separate analysis, the CentileBrain models performed comparably to a population-specific model when applied to an independent ancestry-matched sample. These findings provide empirical support for the generalizability of race-neutral normative models developed on large and diverse samples and underscore their potential utility for individualized neuroimaging assessment across ethnoracially diverse populations.
Description: Data, Materials, and Software Availability: The pretrained CentileBrain Models are freely available at https://centilebrain.org/ while the deviation Z-scores of all samples used here can be access through https://doi.org/10.6084/m9.figshare.31100953. The original imaging data can be accessed through a number of repositories with a range of licensing conditions. Specifically, the ABCD dataset can be accessed through the US National Data Archive (https://nda.nih.gov/); the CHCP dataset can be accessed through the Science Data Bank website (https://doi.org/10.11922/sciencedb.01374); the Psy-ShareD dataset can be accessed by request at https://psyshared.com/; the SALD and SLIM datasets can be accessed through the International Data-sharing Initiative (https://fcon_1000.projects.nitrc.org/indi/retro/sald.html and https://fcon_1000.projects.nitrc.org/indi/retro/southwestuni_qiu_index.html); the UKB dataset can be access through the UK Biobank data-access protocol (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access). Data from BRAID-TWINS cohort (39), the Cognitive Genetics Collaborative Research Organization (25) and the Clinical Deep Phenotyping Working Group (https://www.psych.mpg.de/2948741/cdp-working-group) and South American datasets (38) can be made available through the cited original sources.
URI: https://bura.brunel.ac.uk/handle/2438/33310
DOI: https://doi.org/10.1073/pnas.2521055123
ISSN: 0027-8424
Other Identifiers: ORCiD: Nicole Sanford https://orcid.org/0000-0002-4915-2537
ORCiD: Seda Arslan https://orcid.org/0000-0001-6094-1417
ORCiD: Stefan Borgwardt https://orcid.org/0000-0002-5792-3987
ORCiD: Nicolas A. Crossley https://orcid.org/0000-0002-3060-656X
ORCiD: Camilo de la Fuente-Sandoval https://orcid.org/0000-0003-0773-1642
ORCiD: Masaki Fukunaga https://orcid.org/0000-0003-1010-2644
ORCiD: Jia-Hong Gao https://orcid.org/0000-0002-9311-0297
ORCiD: Kader Kubat https://orcid.org/0009-0007-5908-368X
ORCiD: Veena Kumari https://orcid.org/0000-0002-9635-5505
ORCiD: Junya Matsumoto https://orcid.org/0000-0003-4228-3208
ORCiD: Urvakhsh M. Mehta https://orcid.org/0000-0002-2252-9189
ORCiD: Kiyotaka Nemoto https://orcid.org/0000-0001-8623-9829
ORCiD: Francisco Reyes-Madrigal https://orcid.org/0000-0003-0772-4119
ORCiD: Neelabja Roy https://orcid.org/0000-0001-7016-9256
ORCiD: Didenur Şahin-Çevik https://orcid.org/0000-0001-9377-3560
ORCiD: Tuba Sahin-Ilikoglu https://orcid.org/0000-0003-2920-2151
ORCiD: Guoyuan Yang https://orcid.org/0000-0002-7864-3714
ORCiD: Mariana Zurita https://orcid.org/0000-0002-4847-311X
ORCiD: Sophia Frangou https://orcid.org/0000-0002-3210-6470
Appears in Collections:Department of Life Sciences Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2026 the Author(s). Published by PNAS. This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).2.23 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons