Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/29266
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAyad, LAK-
dc.contributor.authorLoukides, G-
dc.contributor.authorPissis, SP-
dc.contributor.authorVerbeek, H-
dc.coverage.spatialPuerto Varas, Chile-
dc.date.accessioned2024-06-24T07:11:02Z-
dc.date.available2024-06-24T07:11:02Z-
dc.date.issued2024-03-06-
dc.identifierORCiD: Lorraine A. K. Ayad https://orcid.org/0000-0003-0846-2616-
dc.identifier.citationAyad, L.A.K. et al. (2024) 'Sparse Suffix and LCP Array: Simple, Direct, Small, and Fast', in: Soto, J.A. and Wiese, A. (eds.) LATIN 2024: Theoretical Informatics. LATIN 2024. (Lecture Notes in Computer Science Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 14578). Cham, Switzerland, Springer, pp. 162 - 177. doi: 10.1007/978-3-031-55598-5_11.en_US
dc.identifier.isbn978-3-031-55597-8 (pbk)-
dc.identifier.isbn978-3-031-55598-5 (ebk)-
dc.identifier.issn0302-9743-
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/29266-
dc.descriptionA preprint version of this article is available at arXiv:2310.09023v1 [cs.DS] (https://arxiv.org/abs/2310.09023) under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/). It has not been certified by peer review.-
dc.description.abstractSparse suffix sorting is the problem of sorting b=o(n) suffixes of a string of length n. Efficient sparse suffix sorting algorithms have existed for more than a decade. Despite the multitude of works and their justified claims for applications in text indexing, the existing algorithms have not been employed by practitioners. Arguably this is because there are no simple, direct, and efficient algorithms for sparse suffix array construction. We provide two new algorithms for constructing the sparse suffix and LCP arrays that are simultaneously simple, direct, small, and fast. In particular, our algorithms are: simple in the sense that they can be implemented using only basic data structures; direct in the sense that the output arrays are not a byproduct of constructing the sparse suffix tree or an LCE data structure; fast in the sense that they run in O(nlogb) time, in the worst case, or in O(n) time, when the total number of suffixes with an LCP value greater than 2⌊lognb⌋+1−1 is in O(b/logb), matching the time of the optimal yet much more complicated algorithms [Gawrychowski and Kociumaka, SODA 2017; Birenzwige et al., SODA 2020]; and small in the sense that they can be implemented using only 8b+o(b) machine words. Our algorithms are simplified, yet non-trivial, space-efficient adaptations of the Monte Carlo algorithm by I et al. for constructing the sparse suffix tree in O(nlogb) time [STACS 2014]. We also provide proof-of-concept experiments to justify our claims on simplicity and efficiency.en_US
dc.description.sponsorshipSPP and HV are supported by the PANGAIA project (GA 872539). SPP is supported by the ALPACA project (GA 956229). HV is supported by a Constance van Eeden Fellowship.en_US
dc.format.extent162 - 177-
dc.format.mediumPrint-Electronic-
dc.languageEnglish-
dc.language.isoen_USen_US
dc.publisherSpringer Natureen_US
dc.relation.urihttps://arxiv.org/abs/2310.09023-
dc.rightsCopyright © 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG. This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/978-3-031-55598-5_11 (see: https://www.springernature.com/gp/open-research/policies/book-policies).-
dc.rights.urihttps://www.springernature.com/gp/open-research/policies/book-policies-
dc.source16th Latin American Symposium-
dc.source16th Latin American Symposium-
dc.subjectsuffix arrayen_US
dc.subjectLCP arrayen_US
dc.subjectsuffix sortingen_US
dc.subjectsparse suffix sortingen_US
dc.subjectdata structures and algorithms (cs.DS)en_US
dc.titleSparse Suffix and LCP Array: Simple, Direct, Small, and Fasten_US
dc.typeConference Paperen_US
dc.date.dateAccepted2023-12-20-
dc.identifier.doihttps://doi.org/10.1007/978-3-031-55598-5_11-
dc.relation.isPartOfLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)-
pubs.finish-date2024-03-22-
pubs.finish-date2024-03-22-
pubs.publication-statusPublished-
pubs.start-date2024-03-18-
pubs.start-date2024-03-18-
pubs.volume14578 LNCS-
dc.identifier.eissn1611-3349-
dc.rights.holderThe Author(s), under exclusive license to Springer Nature Switzerland AG-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG. This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/978-3-031-55598-5_11 (see: https://www.springernature.com/gp/open-research/policies/book-policies).439.6 kBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.