Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/27046
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ayad, LAK | - |
dc.contributor.author | Chikhi, R | - |
dc.contributor.author | Pissis, SP | - |
dc.date.accessioned | 2023-08-24T10:41:50Z | - |
dc.date.available | 2023-08-24T10:41:50Z | - |
dc.date.issued | 2023-08-12 | - |
dc.identifier | ORCID iD: Lorraine A.K. Ayad https://orcid.org/0000-0003-0846-2616. | - |
dc.identifier | vbad108 | - |
dc.identifier.citation | Ayad, L.A.K., Chikhi, R. and Pissis, S.P. (2023) 'Seedability: optimizing alignment parameters for sensitive sequence comparison', Bioinformatics Advances, 3 (1), vbad108, pp. 1 - 9. doi: 10.1093/bioadv/vbad108. | en_US |
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/27046 | - |
dc.description | Data availability: The data underlying this article are available either in https://github.com/lorrainea/Seedability or in the ensembl database at https://www.ensembl.org, and can be accessed using the gene names ENSPTRG00000044036 and ENSG00000174236 or in the NCBI database at https://www.ncbi.nlm.nih.gov and can be found using the reference sequence NC_000001.11. | en_US |
dc.description.abstract | Motivation: Most sequence alignment techniques make use of exact k-mer hits, called seeds, as anchors to optimize alignment speed. A large number of bioinformatics tools employing seed-based alignment techniques, such as Minimap2, use a single value of k per sequencing technology, without a strong guarantee that this is the best possible value. Given the ubiquity of sequence alignment, identifying values of k that lead to more sensitive alignments is thus an important task. To aid this, we present Seedability, a seed-based alignment framework designed for estimating an optimal seed k-mer length (as well as a minimal number of shared seeds) based on a given alignment identity threshold. In particular, we were motivated to make Minimap2 more sensitive in the pairwise alignment of short sequences. Results: The experimental results herein show improved alignments of short and divergent sequences when using the parameter values determined by Seedability in comparison to the default values of Minimap2. We also show several cases of pairs of real divergent sequences, where the default parameter values of Minimap2 yield no output alignments, but the values output by Seedability produce plausible alignments. Availability and implementation: https://github.com/lorrainea/Seedability (distributed under GPL v3.0). | en_US |
dc.description.sponsorship | R.C. was supported by ANR Full-RNA, SeqDigger, Inception, and PRAIRIE grants (ANR-22-CE45-0007, ANR-19-CE45-0008, PIA/ANR16-CONV-0005, ANR-19-P3IA-0001). This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreements No. 872539 (PANGAIA) and 956229 (ALPACA). | en_US |
dc.format.medium | Electronic | - |
dc.language | English | - |
dc.language.iso | en_US | en_US |
dc.publisher | Oxford University Press (OUP) | en_US |
dc.rights | Copyright © The Author(s) 2023. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. | - |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | - |
dc.title | Seedability: optimizing alignment parameters for sensitive sequence comparison | en_US |
dc.type | Article | en_US |
dc.identifier.doi | https://doi.org/10.1093/bioadv/vbad108 | - |
dc.relation.isPartOf | Bioinformatics Advances | - |
pubs.issue | 1 | - |
pubs.publication-status | Published | - |
pubs.volume | 3 | - |
dc.identifier.eissn | 2635-0041 | - |
dc.rights.holder | The Author(s) | - |
Appears in Collections: | Dept of Computer Science Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © The Author(s) 2023. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. | 876.9 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License