SOSUI
SOSUI is a free online tool that predicts a part of the secondary structure of proteins from a given amino acid sequence. The main objective is to determine whether the protein in question is a soluble or a transmembrane protein.
History
SOSUI's algorithm was developed in 1996 at Tokyo University. The name means as much as "hydrophobic", an allusion to its molecular "clients".How SOSUI works
First of all, SOSUI looks for α helices that are relatively easy to predict, taking into account the known helical potentials of the given amino acid sequence. The much more difficult task is to differentiate between the α helices in soluble proteins and the ones in transmembrane proteins, the α helix being a very common secondary structure pattern in proteins.SOSUI uses 4 characteristics of the AAS in its prediction:
- "hydropathy index"
- weighted presence of amphiphilic amino acids and their localization: "amphiphilicity index"
- the AA's charge
- the length of the AAS
SOSUI then looks for accumulations of amphiphilic AAs at the ends of α helices, which seems to be typical for transmembrane α helices.
The AA's charge is also taken into consideration; the length is important because biological lipid membranes have a certain thickness determining the length of membrane-spanning proteins.
According to a study published by SOSUI's developers it successfully differentiated 99% of a chosen group of proteins with known structure. However, another study that had several prediction tools perform on the AAS's of 122 known proteins claimed that SOSUI was correct about the number of α helices in only about 60% of the cases. But even if the number of transmembrane domains is not always exact, the differentiation between soluble and transmembrane proteins often works, as it is only necessary to find out if a protein has such a domain at all. Of course, membrane proteins which don't have transmembrane α helices or which are fixed with a covalent bond cannot be found by SOSUI.