Sequence Alignment of the Calcitonin Protein (with Clustalw)a
Species Sequence Alignment of Calcitonin
human MGFQKFSPFLALSILVLLQAGSLHAAPFRSALESSPADPATLSEDEARLLLAALVQDYVQ 60
mouse-calca MGFLKFSPFLVVSILLLYQACSLQAVPLRSILESSP-GMATLSEEEVR-LLAALVQDYMQ 58
mouse-cgrp MGFLKFSPFLVVSILLLYQACSLQAVPLRSILESSP-GMATLSEEEVR-LLAALVQDYMQ 58
dog MGLWKSSPFLAFSILVLCQAGGLQAAPFRSALEGLP-DPTALSEKEGRLLLAALVKAYVQ 59
cow MGFGKSSPFLAFSILVLCQAGSLQATPLRSALETLP-DPGALSEKEGRLLLAALVKAYVQ 59
chicken MVMLKISSFLAVYALVVCQMDSFQAAPVRPGLESIT-DRVTLSDYEARRLLNALVKEFIQ 59
fugu MVMLKISAFLLAYALVICQMYSSHAAPARPGLESMS-DRVTLTDYEARRLLNAIVKEFVQ 59
* : * *.** *:: * . :*.* *. ** . . :*:: * * ** *:*: ::*
human MKASELEQEQERE---GSSLDSPRSKRCGNLSTCMLGTYTQDFNK-----FHTFPQTAIG 112
mouse-calca MKARELEQEEEQE-AEGSSLDSPRSKRCGNLSTCMLGTYTQDLNK-----FHTFPQTSIG 112
mouse-cgrp MKARELEQEEEQE-AEGSSVTAQ--KRSCNTATCVTHRLAGLLSRSGGVVKDNFVPTNVG 115
dog RK-NELEQEQEQE-TEGSSLDSSRAKRCSNLSTCVLGTYSKDLNN-----FHTFSGIGFG 112
cow RKTNELEQEEEQEETEDSSLDGSRAKRCSNLSTCVLSAYWKDLNN-----YHRFSGMGFG 114
chicken MTAEELEQ-----ASEGNSLDRPISKRCASLSTCVLGKLSQELHK-----LQTYPRTDVG 109
fugu MTAEELEQQ----ATEGNSMDRPLTKRCSNLSTCVLGKLSQELHK-----LQTFPRTNVG 110
. **** ..*: **. . :**: : . . : .*
human VGAPGKKRDMSSDLERDHRPHVSMPQNAN 141
mouse-calca VEAPGKKRDVAKDLETNHQSHFGN----- 136
mouse-cgrp SEAFGRRR---RDLQA------------- 128
dog AETPGKKRDIASGLERGR----------- 130
cow PETPGKKRDVANSLERDHSFHFGVPQDAN 143
chicken AGTPGKKRNVLNDLDHERYANYGETLGNN 138
fugu AGTPGKKR---SAAESDSYASYGETFGRI 136
: *::* :
Accession numbers of the calcitonin sequences used in the protein sequence alignment: human - CAA25103; mouse-calca - NP_031613; mouse-cgrp - NP_001029126; dog - NP_001003266; cow - BAG72140; chicken - ABY65359; fugu - CAC81278
Predicted General Properties of the Calcitonin Protein (Homo sapiens) - Primary Sequence Analysis (with ProtParam)b
Number of amino acids: 141
Molecular weight: 15467.4
Theoretical pI: 5.77
Total number of negatively charged residues (Asp + Glu): 17
Total number of positively charged residues (Arg + Lys): 14
Atomic composition: Carbon C 672 ; Hydrogen H 1066 ; Nitrogen N 192 ; Oxygen O 213; Sulfur S 7
Formula: C672H1066N192O213S7
Total number of atoms: 2150
Extinction coefficients: This protein does not contain any Trp residues. Experience shows that this could result in more than 10% error in the computed extinction coefficient. Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water. Ext. coefficient 3105 Abs 0.1% (=1 g/l) 0.201, assuming ALL Cys residues appear as half cystines Ext. coefficient 2980 Abs 0.1% (=1 g/l) 0.193, assuming NO Cys residues appear as half cystines
Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).
Instability index: The instability index (II) is computed to be 71.34 This classifies the protein as unstable.
Aliphatic index: 72.77
Grand average of hydropathicity (GRAVY): -0.429
Bioinformatics References
a Higgins D., Thompson J., Gibson T. Thompson J. D., Higgins D. G., Gibson T. J.(1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting,position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A.; Protein Identification and Analysis Tools on the ExPASy Server;
(In) John M. Walker (ed): The Proteomics Protocols Handbook, Humana Press (2005). pp. 571-607 Full text - Copyright Humana Press.