Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/10905
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKirkup, D-
dc.contributor.authorTucker, A-
dc.coverage.spatialBrussels-
dc.coverage.spatialBrussels-
dc.date.accessioned2015-05-26T11:56:34Z-
dc.date.available2014-11-01-
dc.date.available2015-05-26T11:56:34Z-
dc.date.issued2014-
dc.identifier.citationAdvances in Intelligent Data Analysis XIII, Lecture Notes in Computer Science, 8819: 309-320, ( 2014)en_US
dc.identifier.issn0302-9743-
dc.identifier.urihttp://link.springer.com/chapter/10.1007%2F978-3-319-12571-8_27-
dc.identifier.urihttp://bura.brunel.ac.uk/handle/2438/10905-
dc.description.abstractIn this paper we explore the combination of text-mining, un-supervised and supervised learning to extract predictive models from a corpus of digitised historical floras. These documents deal with the nomenclature, geographical distribution, ecology and comparative morphology of the species of a region. Here we exploit the fact that portions of text in the floras are marked up as different types of trait and habitat. We infer models from these different texts that can predict different habitat-types based upon the traits of plant species. We also integrate plant taxonomy data in order to assist in the validation of our models. We have shown that by clustering text describing the habitat of different floras we can identify a number of important and distinct habitats that are associated with particular families of species along with statistical significance scores. We have also shown that by using these discovered habitat-types as labels for supervised learning we can predict them based upon a subset of traits, identified using wrapper feature selection.en_US
dc.language.isoenen_US
dc.publisherSpringer International Publishingen_US
dc.sourceSymposium on Intelligent Data Analysis-
dc.sourceSymposium on Intelligent Data Analysis-
dc.subjectText-miningen_US
dc.subjectDigitised historical florasen_US
dc.subjectPlant taxonomy dataen_US
dc.subjectHabitat-typesen_US
dc.subjectSupervised learningen_US
dc.titleExtracting predictive models from marked-p free-text documents at the Royal Botanic Gardens, Kew, Londonen_US
dc.typeConference Paperen_US
dc.identifier.doihttp://dx.doi.org/10.1007/978-3-319-12571-8_27-
dc.relation.isPartOfLecture Notes in Computer Science-
pubs.finish-date2014-11-01-
pubs.finish-date2014-11-01-
pubs.start-date2014-10-30-
pubs.start-date2014-10-30-
pubs.organisational-data/Brunel-
pubs.organisational-data/Brunel/Brunel Staff by College/Department/Division-
pubs.organisational-data/Brunel/Brunel Staff by College/Department/Division/College of Engineering, Design and Physical Sciences-
pubs.organisational-data/Brunel/Brunel Staff by College/Department/Division/College of Engineering, Design and Physical Sciences/Dept of Computer Science-
pubs.organisational-data/Brunel/Brunel Staff by College/Department/Division/College of Engineering, Design and Physical Sciences/Dept of Computer Science/Computer Science-
pubs.organisational-data/Brunel/Brunel Staff by Institute/Theme-
pubs.organisational-data/Brunel/Brunel Staff by Institute/Theme/Institute of Environmental, Health and Societies-
pubs.organisational-data/Brunel/Brunel Staff by Institute/Theme/Institute of Environmental, Health and Societies/Synthetic Biology-
pubs.organisational-data/Brunel/Specialist Centres-
pubs.organisational-data/Brunel/Specialist Centres/IfE-
Appears in Collections:Dept of Life Sciences Research Papers

Files in This Item:
File Description SizeFormat 
Fulltext.pdf394.23 kBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.