Sequence Alignment of the Osteocalcin Protein (with ClustalW)a
Species and Isoform  Sequence Alignment of Osteocalcin
Mouse1.1             MRTIFLLTLLTLAALCLSD---LTDAKPSGPESDK--AFMSKQEGNKVVNRLRRYL---- 51
Mouse1.2             MRTIFLLTLLTLAALCLSD---LTDAKPSGPESDK--AFMSKQEGNKVVNRLRRYL---- 51
Mouse2               MRTLSLLTLLALAALCLSD---LTDAKPSGPESDK--AFMSKQEGNKVVNRLRRYL---- 51
Rat                  MRTLSLLTLLALTAFCLSD---LAGAKPSDSESDK--AFMSKQEGSKVVNRLRRYLNNGL 55
Cow                  MRTPMLLALLALATLCLAG---RADAKPGDAESGKGAAFVSKQEGSEVVKRLRRYLDHWL 57
Sheep                MRTPML-ALLGLATLCLAG---QADAKPGDAESGKGTAFVSKQEGSEVVKRLRRYLDPGL 56
Human                MRALTLLALLALAALCIAG---QAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWL 57
Chicken              MKAAALLLLAALLTFSL----CRSAPDGSDARSAK--AFISHRQRAEMVRRQKRHYAQDS 54
Frog                 MKLAIVLLLLGLAVLCLGGKDSQHSASAGDSRSSE--AFISRQDSANFARRHKRSYRYNV 58
*: : * * .:.: .. ....* : **:*::: :...* :*
Mouse1.1             GASVPSPDPLEPTREQCELNPACDELSDQYGLKTAYKRIYGITI 95
Mouse1.2             G------------------------------------------- 52
Mouse2               GASVPSPDPLEPTREQCELNPACDELSDQYGLKTAYKRIYGITI 95
Rat                  GAPAPYPDPLEPHREVCELNPNCDELADHIGFQDAYKRIYGTTV 99
Cow                  GAPAPYPDPLEPKREVCELNPDCDELADHIGFQEAYRRFYGPV- 100
Sheep                GAPAPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV- 99
Human                GAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV- 100
Chicken              GVAGAPPNPLEAQREVCELSPDCDELADQIGFQEAYRRFYGPV- 97
Frog                 ARGAAVTSPLESQREVCELNPDCDELADHIGFQEAYRRFYGPV- 101
.
Accession numbers of the osteocalcin protein sequences used in the protein sequence alignment:
NP_954642 osteocalcin [Human];
NP_031567 bone gamma-carboxyglutamate protein 1 isoform 1 [Mouse1.1];
NP_001033028 bone gamma-carboxyglutamate protein 1 isoform 2 [Mouse1.2];
NP_001027469 bone gamma-carboxyglutamate protein 2 [Mouse2];
NP_038200 osteocalcin [Rat];
NP_776674 osteocalcin [Cow];
NP_001035098 bone gamma-carboxyglutamate (gla) protein [Sheep];
NP_990718 osteocalcin [Chicken];
NP_001006689 bone gamma-carboxyglutamate (gla) protein (osteocalcin) [Frog].
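The number at the end of each alignment row is the cumulative count of residues, excluding gap characters, up to the end of that row. The following is a minimal Python sketch of that bookkeeping for three of the rows, copied from the alignment above. The names ALIGNMENT, REPORTED, ungap and cumulative_counts are illustrative and are not part of ClustalW.

```python
# Alignment blocks copied from the table above (first block, second block).
ALIGNMENT = {
    "Mouse1.1": (
        "MRTIFLLTLLTLAALCLSD---LTDAKPSGPESDK--AFMSKQEGNKVVNRLRRYL----",
        "GASVPSPDPLEPTREQCELNPACDELSDQYGLKTAYKRIYGITI",
    ),
    "Mouse1.2": (
        "MRTIFLLTLLTLAALCLSD---LTDAKPSGPESDK--AFMSKQEGNKVVNRLRRYL----",
        "G-------------------------------------------",
    ),
    "Human": (
        "MRALTLLALLALAALCIAG---QAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWL",
        "GAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV-",
    ),
}

# Cumulative residue counts printed at the end of each row in the table.
REPORTED = {
    "Mouse1.1": (51, 95),
    "Mouse1.2": (51, 52),
    "Human": (57, 100),
}


def ungap(block: str) -> str:
    """Remove alignment gap characters ('-') from one block."""
    return block.replace("-", "")


def cumulative_counts(blocks) -> tuple:
    """Return the running ungapped residue count after each block."""
    total = 0
    counts = []
    for block in blocks:
        total += len(ungap(block))
        counts.append(total)
    return tuple(counts)


def full_sequence(name: str) -> str:
    """Concatenate the ungapped blocks into the unaligned sequence."""
    return "".join(ungap(block) for block in ALIGNMENT[name])


if __name__ == "__main__":
    for name, blocks in ALIGNMENT.items():
        print(name, cumulative_counts(blocks), "reported:", REPORTED[name])
```

The same ungapping step recovers the 100-residue human sequence that the ProtParam analysis is based on, and the single-residue second block of Mouse1.2 reflects the truncated isoform 2.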
Predicted General Properties of the Osteocalcin Protein (Homo sapiens) - Primary Sequence Analysis (with ProtParam)b
Number of amino acids: 100
Molecular weight: 10962.5 Da
Theoretical pI: 6.56
Total number of negatively charged residues (Asp + Glu): 12
Total number of positively charged residues (Arg + Lys): 12
Atomic composition: Carbon C 491 Hydrogen H 770 Nitrogen N 136 Oxygen O 141 Sulfur S 4
Formula: C491H770N136O141S4
Total number of atoms: 1542
Extinction coefficients (in units of M^-1 cm^-1, at 280 nm, measured in water):
Ext. coefficient 13075, Abs 0.1% (= 1 g/l) 1.193, assuming ALL Cys residues appear as half cystines
Ext. coefficient 12950, Abs 0.1% (= 1 g/l) 1.181, assuming NO Cys residues appear as half cystines
Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).
Instability index: The instability index (II) is computed to be 54.38. This classifies the protein as unstable.
Aliphatic index: 86.00
Grand average of hydropathicity (GRAVY): -0.212
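Several of the values above follow directly from the amino acid composition. The sketch below recomputes four of them in Python, assuming the human sequence is the Human row of the alignment with gaps removed. The function names are illustrative and this is not ProtParam's own code. The constants are the commonly published ones: 5500 (Trp), 1490 (Tyr) and 125 (cystine) for the extinction coefficient at 280 nm, the weights 2.9 and 3.9 for the aliphatic index, and the Kyte-Doolittle hydropathy scale for GRAVY.

```python
from collections import Counter

# Human osteocalcin precursor: the Human alignment row with gaps removed.
HUMAN = (
    "MRALTLLALLALAALCIAGQAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWL"
    "GAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV"
)

# Kyte-Doolittle hydropathy values per residue.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def charged_counts(seq: str) -> tuple:
    """Return (Asp + Glu, Arg + Lys) residue counts."""
    c = Counter(seq)
    return c["D"] + c["E"], c["R"] + c["K"]


def extinction_coefficients(seq: str) -> tuple:
    """Return (with cystines, without cystines) in M^-1 cm^-1 at 280 nm."""
    c = Counter(seq)
    base = 5500 * c["W"] + 1490 * c["Y"]
    # Each pair of Cys residues is assumed to form one cystine.
    return base + 125 * (c["C"] // 2), base


def aliphatic_index(seq: str) -> float:
    """Mole-percent weighted sum of Ala, Val, Ile and Leu."""
    c = Counter(seq)
    return 100.0 * (c["A"] + 2.9 * c["V"] + 3.9 * (c["I"] + c["L"])) / len(seq)


def gravy(seq: str) -> float:
    """Grand average of hydropathicity: mean Kyte-Doolittle value."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print("Length:", len(HUMAN))
    print("Asp+Glu, Arg+Lys:", charged_counts(HUMAN))
    print("Extinction coefficients:", extinction_coefficients(HUMAN))
    print("Aliphatic index:", round(aliphatic_index(HUMAN), 2))
    print("GRAVY:", round(gravy(HUMAN), 3))
```

Under these assumptions the sketch reproduces the tabulated values: 12 acidic and 12 basic residues, extinction coefficients of 13075 and 12950, an aliphatic index of 86.00, and a GRAVY of -0.212. Dividing each extinction coefficient by the molecular weight (10962.5) gives the listed Abs 0.1% values of 1.193 and 1.181.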
Bioinformatics References
a Thompson J. D., Higgins D. G., Gibson T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M. R., Appel R. D., Bairoch A. (2005). Protein Identification and Analysis Tools on the ExPASy Server. In: John M. Walker (ed.), The Proteomics Protocols Handbook, Humana Press, pp. 571-607.