Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/23941
Full metadata record
DC FieldValueLanguage
dc.contributor.authorWalker, BE-
dc.contributor.authorTucker, A-
dc.contributor.authorNicolson, N-
dc.date.accessioned2022-01-13T11:26:13Z-
dc.date.available2022-01-13T11:26:13Z-
dc.date.issued2022-01-13-
dc.identifier.citationWalker, B.E., Tucker, A. and Nicolson, N. (2022) 'Harnessing Large-Scale Herbarium Image Datasets Through Representation Learning', Frontiers in Plant Science, 12, 806407, pp. 1-12. doi: 10.3389/fpls.2021.806407.en_US
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/23941-
dc.description.abstractCopyright © 2022 Walker, Tucker and Nicolson. The mobilization of large-scale datasets of specimen images and metadata through herbarium digitization provide a rich environment for the application and development of machine learning techniques. However, limited access to computational resources and uneven progress in digitization, especially for small herbaria, still present barriers to the wide adoption of these new technologies. Using deep learning to extract representations of herbarium specimens useful for a wide variety of applications, so-called “representation learning,” could help remove these barriers. Despite its recent popularity for camera trap and natural world images, representation learning is not yet as popular for herbarium specimen images. We investigated the potential of representation learning with specimen images by building three neural networks using a publicly available dataset of over 2 million specimen images spanning multiple continents and institutions. We compared the extracted representations and tested their performance in application tasks relevant to research carried out with herbarium specimens. We found a triplet network, a type of neural network that learns distances between images, produced representations that transferred the best across all applications investigated. Our results demonstrate that it is possible to learn representations of specimen images useful in different applications, and we identify some further steps that we believe are necessary for representation learning to harness the rich information held in the worlds’ herbaria.en_US
dc.description.sponsorshipResearch/Scientific Computing teams at the James Hutton Institute and NIAB for providing computational resources and technical support for the “UK’s Crop Diversity Bioinformatics HPC” (BBSRC grant BB/S019669/1).en_US
dc.format.mediumElectronic-
dc.language.isoen_USen_US
dc.publisherFrontiers Media SAen_US
dc.rightsCopyright © 2022 Walker, Tucker and Nicolson. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectdeep learningen_US
dc.subjectdigitized herbarium specimensen_US
dc.subjectnatural history collectionsen_US
dc.subjectmachine learningen_US
dc.subjectcomputer visionen_US
dc.titleHarnessing Large-Scale Herbarium Image Datasets Through Representation Learningen_US
dc.typeArticleen_US
dc.identifier.doihttps://doi.org/10.3389/fpls.2021.806407-
dc.relation.isPartOfFrontiers in Plant Science-
pubs.publication-statusPublished online-
pubs.volume12-
dc.identifier.eissn1664-462X-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdf10.08 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons