Please use this identifier to cite or link to this item:
|Title:||Modelling air pollution, climate and health data using Bayesian Networks: a case study of the English regions|
|Citation:||Earth and Space Science, 2018|
|Abstract:||The link between pollution and health is commonly explored by trying to identify the dom inant cause of pollution and its most significant effect on health outcomes. The use of mul tivariate features to describe exposure is less explored because investigating a large domain of scenarios is theoretically (i.e. interpretation of results) and technically (i.e. computational effort) challenging. In this work we explore the use of Bayesian Networks with a multivari20 ate approach to identify the probabilistic dependence structure of the environment-health nexus, consisting of environmental factors (topography, climate), exposure levels (concentra tion of outdoor air pollutant) and health outcomes (mortality rates), with regard to a data-rich study area with a large spatial scale: the English regions (United Kingdom), incorporating environment types that are different in character from urban to rural. We implemented a re25 producible workflow in the the R programming language to collate environment-health data and analyze almost 50 millions of observations making use of a graphical model (Bayesian Network) and Big Data technologies. Results show that for pollution and weather variables the model tests well in sample, but also has good predictive power when tested out of sample.|
|Appears in Collections:||Dept of Computer Science Research Papers|
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.