Sequence Alignment of the Insulin Protein (with Clustalw) a
Species Sequence Alignment
of Insulin
Human MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCGERGFFYTPKTRREAED 60
Chimpanzee MALWMRLLPLLVLLALWGPDPASAFVNQHLCGSHLVEALYLVCGERGFFYTPKTRREAED 60
Mouse MALLVHFLPLLALLALWEPKPTQAFVKQHLCGPHLVEALYLVCGERGFFYTPKSRREVED 60
Rabbit MASLAALLPLLALLVLCRLDPAQAFVNQHLCGSHLVEALYLVCGERGFFYTPKSRREVEE 60
Frog MALWMQCLPLVLVLFFSTPN-TEALVNQHLCGSHLVEALYLVCGDRGFFYYPKVKRDMEQ 59
Zebrafish MAVWLQAGALLVLLVVSSVS-TNPGTPQHLCGSHLVDALYLVCGPTGFFYNPK--RDVEP 57
** .*: :* . . : . . *****.***:******* **** ** *: *
Human LQVG---QVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN 110
Chimpanzee LQVG---QVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN 110
Mouse PQVE---QLELGGSPGD--LQTLALEVARQKRGIVDQCCTSICSLYQLENYCN 108
Rabbit LQVG---QAELGGGPGAGGLQPSALELALQKRGIVEQCCTSICSLYQLENYCN 110
Frog ALVSGPQDNELDG------MQLQPQEYQKMKRGIVEQCCHSTCSLFQLESYCN 106
Zebrafish -LLG-FLPPKSAQETEVADFAFKDHAELIRKRGIVEQCCHKPCSIFELQNYCN 108
: : : *****:*** . **:::*:.***
Accession numbers of the insulin sequences used in the protein sequence alignment: NP_000198 - Human; NP_001008996 - Chimpanzee; NP_032412
- Mouse; NP_001075804 - Rabbit; NP_001079351 - Frog; NP_571131 - Zebrafish
Predicted General Properties of the Insulin Protein (Homo sapiens) - Primary Sequence Analysis (with ProtParam)b
Number of amino acids: 110
Molecular weight: 11980.9
Theoretical pI: 5.22
Total number of negatively charged residues (Asp + Glu): 10
Total number of positively charged residues (Arg + Lys): 7
Atomic composition: Carbon C 535 Hydrogen H 841 Nitrogen N 143 Oxygen O 153 Sulfur S 8
Formula: C535H841N143O153S8
Total number of atoms: 1680
Extinction coefficients: Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water. Ext. coefficient 17335 Abs 0.1% (=1 g/l) 1.447, assuming ALL Cys residues appear as half cystines Ext. coefficient 16960 Abs 0.1% (=1 g/l) 1.416, assuming NO Cys residues appear as half cystines
Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).
Instability index: The instability index (II) is computed to be 40.33 This classifies the protein as unstable.
Aliphatic index: 102.91
Grand average of hydropathicity (GRAVY): 0.193
Bioinformatics References
a Higgins D., Thompson J., Gibson T. Thompson J. D., Higgins D. G., Gibson T. J.(1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting,position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A.; Protein Identification and Analysis Tools on the ExPASy Server;
(In) John M. Walker (ed): The Proteomics Protocols Handbook, Humana Press (2005). pp. 571-607 Full text - Copyright Humana Press.