ProbCons
In bioinformatics and proteomics, ProbCons is an open source software for probabilistic consistency-based multiple alignment of amino acid sequences. It is one of the most efficient protein multiple sequence alignment programs, since it has repeatedly demonstrated a statistically significant advantage in accuracy over similar tools, including Clustal and MAFFT.
Algorithm
The following describes the basic outline of the ProbCons algorithm.Step 1: Reliability of an alignment edge
For every pair of sequences compute the probability that letters and are paired in an alignment that is generated by the model.Step 2: Maximum expected accuracy
The accuracy of an alignment with respect to another alignment is defined as the number of common aligned pairs divided by the length of the shorter sequence.Calculate expected accuracy of each sequence:
This yields a maximum expected accuracy alignment:
Step 3: Probabilistic Consistency Transformation
All pairs of sequences x,y from the set of all sequences are now re-estimated using all intermediate sequences z:This step can be iterated.