Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/20109
Title: | Correlation between Multivariate Datasets, from Inter-Graph Distance computed using Graphical Models Learnt With Uncertainties |
Authors: | Wang, K Chakrabarty, D |
Keywords: | soft random geometric graphs;probabilistic metric spaces;inter-graph distance;Hellinger distance;metropolis by block update;human disease-symptom network |
Issue Date: | 6-Nov-2018 |
Publisher: | Cornell University |
Citation: | Wang, K. and Chakrabarty, D. (2018) 'Correlation between Multivariate Datasets, from Inter-Graph Distance computed using Graphical Models Learnt With Uncertainties', arXiv [preprint], arXiv:1710.11292v3 [stat.ME], pp. 1 - 43. doi: 10.48550/arXiv.1710.11292. |
Abstract: | We present a method for simultaneous Bayesian learning of the correlation matrix and graphical model of a multivariate dataset, along with uncertainties in each, to subsequently compute distance between the learnt graphical models of a pair of datasets, using a new metric that approximates an uncertainty-normalised Hellinger distance between the posterior probabilities of the graphical models given the respective dataset; correlation between the pair of datasets is then computed as a corresponding affinity measure. We achieve a closed-form likelihood of the between-columns correlation matrix by marginalising over the between-row matrices. This between-columns correlation is updated first, given the data, and the graph is then updated, given the partial correlation matrix that is computed given the updated correlation, allowing for learning of the 95% Highest Probability Density credible regions of the correlation matrix and graphical model of the data. Difference made to the learnt graphical model, by acknowledgement of measurement noise, is demonstrated on a small simulated dataset, while the large human disease-symptom network--with > 8,000 nodes--is learnt using real data. Data on vino-chemical attributes of Portuguese red and white wine samples are employed to learn with-uncertainty graphical model of each dataset, and subsequently, the distance between these learnt graphical models. |
URI: | https://bura.brunel.ac.uk/handle/2438/20109 |
DOI: | https://doi.org/10.48550/arXiv.1710.11292 |
Other Identifiers: | ORCiD:Dalia Chakrabarty https://orcid.org/0000-0003-1246-4235 arXiv:1710.11292v3 [stat.ME] |
Appears in Collections: | Dept of Mathematics Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2018 The Author(s). The URI https://arxiv.org/licenses/nonexclusive-distrib/1.0/ is used to record the fact that the submitter granted the following license to arXiv.org on submission of an article: I grant arXiv.org a perpetual, non-exclusive license to distribute this article. I certify that I have the right to grant this license. I understand that submissions cannot be completely removed once accepted. I understand that arXiv.org reserves the right to reclassify or reject any submission. | 1.19 MB | Adobe PDF | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.