Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/31752
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHossein Nezhad, F-
dc.contributor.authorOues, N-
dc.contributor.authorMassimiliano, M-
dc.contributor.authorPandini, A-
dc.date.accessioned2025-08-17T12:47:05Z-
dc.date.available2025-08-17T12:47:05Z-
dc.date.issued2025-07-31-
dc.identifierORCiD: Ferdoos Hossein Nezhad https://orcid.org/0009-0007-9892-7662-
dc.identifierORCiD: Namir Oues https://orcid.org/0009-0003-2001-1065-
dc.identifierORCiD: Massimiliano Meli https://orcid.org/0000-0003-3304-6104-
dc.identifierORCiD: Alessandro Pandini https://orcid.org/0000-0002-4158-233X-
dc.identifierArticle number: btaf420-
dc.identifier.citationHossein Nezhad, et al. (2025) 'MDGraphEmb: A Toolkit for Graph Embedding and Classification of Protein Conformational Ensembles', Bioinformatics , 0 (ahead of print), btaf420, pp. 1 - 10. doi: 10.1093/bioinformatics/btaf420.en_US
dc.identifier.issn1367-4803-
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/31752-
dc.descriptionAccepted manuscripts are PDF versions of the author’s final manuscript, as accepted for publication by the journal but prior to copyediting or typesetting. They can be cited using the author(s), article title, journal title, year of online publication, and DOI. They will be replaced by the final typeset articles, which may therefore contain changes. The DOI will remain the same throughout.en_US
dc.descriptionData availability: Relevant data underpinning this publication can be accessed from Brunel University London’s data repository under CC BY licence: https://doi.org/10.17633/rd.brunel.c.7664645 .-
dc.descriptionSupplementary data is available online at: https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btaf420/8220315#supplementary-data .-
dc.description.abstractMotivation: Molecular Dynamics (MD) simulations are essential for investigating protein dynamics and function. Although significant advances have been made in integrating simulation techniques and machine learning, there are still challenges in selecting the most suitable data representation for learning. Graph embedding is a powerful computational method that automatically learns low-dimensional representations of nodes in a graph while preserving graph topology and node properties, thereby bridging graph structures and machine learning methods. Graph embeddings hold great potential for efficiently representing MD simulation data and studying protein dynamics. Results: We present MDGraphEmb, a Python library built on MDAnalysis, specifically designed to convert protein MD simulation trajectories into graph-based representations and corresponding graph embeddings. This transformation enables the compression of high-dimensional, noisy trajectories from protein simulations into tabular formats suitable for machine learning. MDGraphEmb provides a framework that supports a range of graph embedding techniques and machine learning models, enabling the creation of workflows to analyse protein dynamics and identify important protein conformations. Graph embedding effectively captures and compresses structural information from protein MD simulation data, making it applicable to diverse downstream machine-learning classification tasks. We present an application for encoding and detecting important protein conformations from molecular dynamics simulations to classify functional states, using adenylate kinase (ADK) as the main case study. To assess the generalisability of the approach, two additional systems, Plantaricin E (PlnE) and HIV-1 protease are included as supplementary validation examples. A performance comparison of different graph embedding methods combined with machine learning models is also provided. Availability: MDGraphEMB GitHub Repository: https://github.com/FerdoosHN/MDGraphEMB .en_US
dc.description.sponsorshipN.O. is supported by a scholarship from Brunel University London EPSRC DTP [EP/T518116/1]. This project made use of time on HPC granted via the UK High-End Computing Consortium for Biomolecular Simulation, HECBioSim, supported by EPSRC [EP/X035603/1]. Collaborative work between F.HN., M.M., and A.P. was supported by Royal Society International Exchanges 2024 Cost Share (Italy only) [IEC\R2\242053]. This work was supported by the CINECA award under the ISCRA initiative (project HP10BKFH8P), which provided access to high-performance computing resources and technical support.en_US
dc.format.extent1 - 10-
dc.format.mediumPrint-Electronic-
dc.language.isoenen_US
dc.publisherOxford University Pressen_US
dc.relation.urihttps://github.com/FerdoosHN/MDGraphEMB-
dc.rightsCreative Commons Attribution 4.0 International-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectprotein conformationen_US
dc.subjectgraph representation learningen_US
dc.subjectgraph embeddingen_US
dc.subjectmachine learningen_US
dc.titleMDGraphEmb: A Toolkit for Graph Embedding and Classification of Protein Conformational Ensemblesen_US
dc.typeArticleen_US
dc.date.dateAccepted2025-07-27-
dc.identifier.doihttps://doi.org/10.1093/bioinformatics/btaf420-
dc.relation.isPartOfBioinformatics-
pubs.publication-statusPublished online-
pubs.volume0-
dc.identifier.eissn1367-4811-
dc.rights.licensehttps://creativecommons.org/licenses/by/4.0/legalcode.en-
dcterms.dateAccepted2025-07-27-
dc.rights.holderThe Author(s)-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © The Author(s) 2025. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.3.04 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons