A Knowledge-based Multiple Sequence Alignment Algorithm

dc.contributor.authorKen Nguyen
dc.contributor.authorYi Pan
dc.date.accessioned2024-05-21T13:41:52Z
dc.date.available2024-05-21T13:41:52Z
dc.description.abstractA common and cost effective mechanism to identify the functionalities, structures, or relationships between species is multiple sequence alignment, in which DNA/RNA/protein sequences are arranged and aligned so that similarities between sequences are clustered together. Correctly identifying and aligning these sequence biological similarities help from unwinding the mystery of species evolution to drug design. We present our knowledge-based multiple sequence alignment (KB-MSA) technique that utilizes existing knowledge databases such as SWISSPROT, GENBANK or HOMSTRAD to provide a more realistic and reliable sequence alignment. We also provide a modified version of this algorithm (CB-MSA) that utilizes the sequence consistency information when sequence knowledge databases are not available. Our benchmark tests on BAliBASE, PREFAB, HOMSTRAD and SABMARK references show accuracy improvements up to 10% on twilight datasets against many leading alignment tools such as ISPALIGN, PADT, CLUSTALW, MAFFT, PROBCONS and T-COFFEE
dc.identifier.urihttps://hdl.handle.net/20.500.12951/1316
dc.titleA Knowledge-based Multiple Sequence Alignment Algorithm
dc.typeJournal Article, Academic Journal
dcterms.bibliographicCitationIEEE Transactions on Computational Biology and Bioinformatics 10(4), 884-896, (August 2013)
Files
Collections