multialign2ref performs multialignment to reference sequences
This function performs multialignment of set of Amino or nucleotide sequences not aligned (Seqs2align) to a set of reference sequences already aligned (refSeqs). In order to achieve this target function multialign of the BioInformatics toolbox is called several times with all the default options. Note that if the alignment of a sequence inside Seqs2align changes the reference sequences we call again function multialign using the 'Name',Value, GapOpen',100000 in order to allow the possibility of creating gaps or adding additional characters in the reference sequence. If the new call to multialign: 1) creates gap(s) in the reference sequences (refSeqs) we delete the characters corresponding to the gaps in the aligned sequence and store this information inside boolean variable usedGap of output table SeqsMultiAligned.
2) creates additional characters at the end of the reference sequences (refSeqs) we delete the additional characters and store this information inside boolean vector usedDeletion of output table SeqsMultiAligned.
As a result of this process the number of sites (characters) in the reference sequence is kept fixed and all the aligned sequences will have the same number of characters of the reference sequences. We also store the information about the sequences which could not be aligned inside second output argument WrngAlignment. So far with the hundreds of millions of sequences we have aligned this case never took place.
Example of multialign2ref without optional arguments.Seqsaligned
=multialign2ref(refSeqs
,
Seqs2align
)
Example without using the parallel computing toolbox.Seqsaligned
=multialign2ref(refSeqs
,
Seqs2align
,
Name, Value
)
[
Example of two output arguments.Seqsaligned
,
WrngAlignment
]
=multialign2ref(___)
![]() |
function-alpha.html |
function-cate.html |
![]() |
![]() |
Functions |
![]() |
• The developers of the toolbox • The forward search group • Terms of Use • Acknowledgments