Title: Modelling air pollution, climate and health data using Bayesian Networks: a case study of the English regions
Authors: Vitolo, C; Scutari, M; Ghalaieny, M; Tucker, A; Russell, A
Keywords: air pollution;modeling;Bayesian Networks;climate;health
Issue Date: 2018
Citation: Earth and Space Science, 2018, 5(4): 76 - 88
Abstract: ©2018. The Authors. The link between pollution and health is commonly explored by trying to identify the dominant cause of pollution and its most significant effect on health outcomes. The use of multivariate features to describe exposure is less explored because investigating a large domain of scenarios is theoretically (i.e. interpretation of results) and technically (i.e. computational effort) challenging. In this work we explore the use of Bayesian Networks with a multivariate approach to identify the probabilistic dependence structure of the environment-health nexus, consisting of environmental factors (topography, climate), exposure levels (concentration of outdoor air pollutants) and health outcomes (mortality rates), with regard to a data-rich study area with a large spatial scale: the English regions (United Kingdom), incorporating environment types that range in character from urban to rural. We implemented a reproducible workflow in the R programming language to collate environment-health data and analyze almost 50 million observations, making use of a graphical model (Bayesian Network) and Big Data technologies. Results show that for pollution and weather variables the model not only tests well in sample but also has good predictive power when tested out of sample.
Appears in Collections: Dept of Computer Science Research Papers

Files in This Item:
File: Fulltext.pdf
Size: 610.7 kB
Format: Adobe PDF

Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.