Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/32677
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAyad, LAK-
dc.contributor.authorPissis, SP-
dc.date.accessioned2026-01-19T14:48:17Z-
dc.date.available2026-01-19T14:48:17Z-
dc.date.issued2017-01-14-
dc.identifierORCiD: Lorraine A. K. Ayad https://orcid.org/0000-0003-0846-2616-
dc.identifierORCiD: Solon P. Pissis https://orcid.org/0000-0002-1445-1932-
dc.identifier.citationAyad, L.A.K. and Pissis, S.P. (2017) 'MARS: Improving multiple circular sequence alignment using refined sequences', BMC Genomics, 18 (1), 86, pp. 1 - 10. doi: 10.1186/s12864-016-3477-5.en_US
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/32677-
dc.descriptionAvailability of data and materials: The datasets generated during and/or analysed during the current study are available in the GitHub repository. Project name: MARS Project home page: https://github.com/lorrainea/mars Operating system: GNU/Linux Programming language: C++ Other requirements: N/A License: GNU GPLen_US
dc.description.abstractBackground: A fundamental assumption of all widely-used multiple sequence alignment techniques is that the left- and right-most positions of the input sequences are relevant to the alignment. However, the position where a sequence starts or ends can be totally arbitrary due to a number of reasons: arbitrariness in the linearisation (sequencing) of a circular molecular structure; or inconsistencies introduced into sequence databases due to different linearisation standards. These scenarios are relevant, for instance, in the process of multiple sequence alignment of mitochondrial DNA, viroid, viral or other genomes, which have a circular molecular structure. A solution for these inconsistencies would be to identify a suitable rotation (cyclic shift) for each sequence; these refined sequences may in turn lead to improved multiple sequence alignments using the preferred multiple sequence alignment program. Results: We present MARS, a new heuristic method for improving Multiple circular sequence Alignment using Refined Sequences. MARS was implemented in the C++ programming language as a program to compute the rotations (cyclic shifts) required to best align a set of input sequences. Experimental results, using real and synthetic data, show that MARS improves the alignments, with respect to standard genetic measures and the inferred maximum-likelihood-based phylogenies, and outperforms state-of-the-art methods both in terms of accuracy and efficiency. Our results show, among others, that the average pairwise distance in the multiple sequence alignment of a dataset of widely-studied mitochondrial DNA sequences is reduced by around 5% when MARS is applied before a multiple sequence alignment is performed. Conclusions: Analysing multiple sequences simultaneously is fundamental in biological research and multiple sequence alignment has been found to be a popular method for this task. Conventional alignment techniques cannot be used effectively when the position where sequences start is arbitrary. We present here a method, which can be used in conjunction with any multiple sequence alignment program, to address this problem effectively and efficiently.en_US
dc.description.sponsorshipLAKA is supported by an EPSRC grant (Doctoral Training Grant #EP/M50788X/1).en_US
dc.format.extent1 - 10-
dc.format.mediumElectronic-
dc.language.isoenen_US
dc.publisherBioMed Central (part of Springer Nature)en_US
dc.relation.urihttps://github.com/lorrainea/mars-
dc.rightsCreative Commons Attribution 4.0 International-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectmultiple circular sequence alignmenten_US
dc.subjectcircular sequencesen_US
dc.subjectq-gramsen_US
dc.subjectprogressive alignmenten_US
dc.titleMARS: Improving multiple circular sequence alignment using refined sequencesen_US
dc.typeArticleen_US
dc.date.dateAccepted2016-12-26-
dc.identifier.doihttps://doi.org/10.1186/s12864-016-3477-5-
dc.relation.isPartOfBMC Genomics-
pubs.issue1-
pubs.publication-statusPublished-
pubs.volume18-
dc.identifier.eissn1471-2164-
dc.rights.licensehttps://creativecommons.org/licenses/by/4.0/legalcode.en-
dcterms.dateAccepted2016-12-26-
dc.rights.holderThe Author(s)-
dc.contributor.orcidAyad, Lorraine A. K. [0000-0003-0846-2616]-
dc.contributor.orcidPissis, Solon P. [0000-0002-1445-1932]-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © The Author(s). 2017. Rights and permissions: Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (https://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.446.8 kBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons