Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/33310| Title: | Empirical validation of race-neutral normative brain morphometry models across ethnoracially diverse populations |
| Authors: | Ge, R Yu, Y New, F Haas, SS Sanford, N Yu, K Allen, P Arslan, S Avram, M Borgwardt, S Crossley, NA de la Fuente-Sandoval, C Fukunaga, M Gao, J-H Gonzalez-Valderrama, A Hashimoto, R Iasevoli, F Keeser, D Kubat, K Kumari, V Matsumoto, J Mehta, UM Nemoto, K Pontillo, G Raabe, FJ Reyes-Madrigal, F Roy, N Şahin-Çevik, D Sahin-Ilikoglu, T Toulopoulou, T Wagner, E Yang, G Zurita, M Thompson, PM Frangou, S |
| Keywords: | normative models;human;neuroimaging;brain morphometry |
| Issue Date: | 12-May-2026 |
| Publisher: | National Academy of Sciences |
| Citation: | Ge, R. et al. (2026) 'Empirical validation of race-neutral normative brain morphometry models across ethnoracially diverse populations', Proceedings of the National Academy of Sciences, 123 (20), e2521055123, pp. 1-8. doi: 10.1073/pnas.2521055123. |
| Abstract: | Normative models of brain morphometry quantify individual deviations from typical anatomical patterns and hold promise for enhancing clinical decision-making. However, their clinical utility depends critically on demonstrating generalizability across diverse ethnoracial populations. We previously developed sex-specific, race-neutral normative models for cortical thickness, surface area, and subcortical volumes using brain scans from a large international sample of healthy individuals, as part of the CentileBrain Project, a global initiative to provide open-access, neuroimaging reference models. The primary aim of the present study was to empirically evaluate the generalizability and accuracy of these pretrained models across multiple ethnoracial groups. To this end, we tested model performance in independent samples of healthy individuals from Africa, Asia, Europe, and the Americas, with ethnoracial classification defined either by self-identification or genetic ancestry (N = 4,862). We further compared performance against normative models developed exclusively from a single-population Chinese cohort. Across all groups, as well as in the pooled sample, the pretrained CentileBrain models demonstrated consistently high accuracy, with relative mean absolute error values below 10% for subcortical volume and surface area and below 5% for cortical thickness. Model performance was highly concordant across self-identified and ancestry-defined groups. In a separate analysis, the CentileBrain models performed comparably to a population-specific model when applied to an independent ancestry-matched sample. These findings provide empirical support for the generalizability of race-neutral normative models developed on large and diverse samples and underscore their potential utility for individualized neuroimaging assessment across ethnoracially diverse populations. |
| Description: | Data, Materials, and Software Availability: The pretrained CentileBrain Models are freely available at https://centilebrain.org/ while the deviation Z-scores of all samples used here can be access through https://doi.org/10.6084/m9.figshare.31100953. The original imaging data can be accessed through a number of repositories with a range of licensing conditions. Specifically, the ABCD dataset can be accessed through the US National Data Archive (https://nda.nih.gov/); the CHCP dataset can be accessed through the Science Data Bank website (https://doi.org/10.11922/sciencedb.01374); the Psy-ShareD dataset can be accessed by request at https://psyshared.com/; the SALD and SLIM datasets can be accessed through the International Data-sharing Initiative (https://fcon_1000.projects.nitrc.org/indi/retro/sald.html and https://fcon_1000.projects.nitrc.org/indi/retro/southwestuni_qiu_index.html); the UKB dataset can be access through the UK Biobank data-access protocol (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access). Data from BRAID-TWINS cohort (39), the Cognitive Genetics Collaborative Research Organization (25) and the Clinical Deep Phenotyping Working Group (https://www.psych.mpg.de/2948741/cdp-working-group) and South American datasets (38) can be made available through the cited original sources. |
| URI: | https://bura.brunel.ac.uk/handle/2438/33310 |
| DOI: | https://doi.org/10.1073/pnas.2521055123 |
| ISSN: | 0027-8424 |
| Other Identifiers: | ORCiD: Nicole Sanford https://orcid.org/0000-0002-4915-2537 ORCiD: Seda Arslan https://orcid.org/0000-0001-6094-1417 ORCiD: Stefan Borgwardt https://orcid.org/0000-0002-5792-3987 ORCiD: Nicolas A. Crossley https://orcid.org/0000-0002-3060-656X ORCiD: Camilo de la Fuente-Sandoval https://orcid.org/0000-0003-0773-1642 ORCiD: Masaki Fukunaga https://orcid.org/0000-0003-1010-2644 ORCiD: Jia-Hong Gao https://orcid.org/0000-0002-9311-0297 ORCiD: Kader Kubat https://orcid.org/0009-0007-5908-368X ORCiD: Veena Kumari https://orcid.org/0000-0002-9635-5505 ORCiD: Junya Matsumoto https://orcid.org/0000-0003-4228-3208 ORCiD: Urvakhsh M. Mehta https://orcid.org/0000-0002-2252-9189 ORCiD: Kiyotaka Nemoto https://orcid.org/0000-0001-8623-9829 ORCiD: Francisco Reyes-Madrigal https://orcid.org/0000-0003-0772-4119 ORCiD: Neelabja Roy https://orcid.org/0000-0001-7016-9256 ORCiD: Didenur Şahin-Çevik https://orcid.org/0000-0001-9377-3560 ORCiD: Tuba Sahin-Ilikoglu https://orcid.org/0000-0003-2920-2151 ORCiD: Guoyuan Yang https://orcid.org/0000-0002-7864-3714 ORCiD: Mariana Zurita https://orcid.org/0000-0002-4847-311X ORCiD: Sophia Frangou https://orcid.org/0000-0002-3210-6470 |
| Appears in Collections: | Department of Life Sciences Research Papers |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| FullText.pdf | Copyright © 2026 the Author(s). Published by PNAS. This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND). | 2.23 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License