Sequence Alignment of the p53 Tumor Suppressor Protein (with ClustalW) [a]
Species    Sequence Alignment of p53
Mouse MTAMEESQSDISLELPLSQETFSGLWKLLPPEDILP-----SPHCMDDLLLP-QDVEEFF 54
Rat ---MEDSQSDMSIELPLSQETFSCLWKLLPPDDILPTTATGSPNSMEDLFLP-QDVAELL 56
Dog ---MEESQSELNIDPPLSQETFSELWNLLPENNVLSS--ELC-PAVDELLLP-ESVVNWL 53
Cat ---MQEPPLELTIEPPLSQETFSELWNLLPENNVLSS--ELS-SAMNELPLS-EDVANWL 53
Sheep ---MEESQAELGVEPPLSQETFSDLWNLLPENNLLSS--ELS-APVDDLLPYSEDVVTWL 54
Cow ---MEESQAELGVEPPLSQETFSDLWNLLPENNLLSS--ELS-APVDDLLPYSEDVVTWL 54
Pig ---MEESQSELGVEPPLSQETFSDLWKLLPENNLLSS--ELSLAAVNDLLLS--PVTNWL 53
Human ---MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSP--LPS-QAMDDLMLSPDDIEQWF 54
Zebrafish ---MAQND---------SQE-FAELW----EKNLIIQ--PPGGGSCWDIINDEEYLPGSF 41
* : *** *: ** .::: :: : :
Mouse EGPS---EALRVSGAPAAQDPVTETPGPVAPAPATPWPLSSFVPSQKTYQGNYGFHLGFL 111
Rat EGPE---EALQVS-APAAQEPGTEAPAPVAPASATPWPLSSSVPSQKTYQGNYGFHLGFL 112
Dog DEDS--DDAPRMPATS-----APTAPGPAP-----SWPLSSSVPSPKTYPGTYGFRLGFL 101
Cat DEAP--DDASGMSAVP-----APAAPAPATPAPAISWPLSSFVPSQKTYPGAYGFHLGFL 106
Sheep DECP--NEAPQMPEP----------PAQAALAPATSWPLSSFVPSQKTYPGNYGFRLGFL 102
Cow DECP--NEAPQMPEP----------PAQAALAPATSWPLSSFVPSQKTYPGNYGFRLGFL 102
Pig DENP--DDASRVPAPP-----AATAPAPAAPAPATSWPLSSFVPSQKTYPGSYDFRLGFL 106
Human TEDPGPDEAPRMPEAAPPVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFL 114
Zebrafish DPNFFENVLEEQPQPS-------------------TLPPTSTVPETSDYPGDHGFRLRFP 82
. . * :* **. . * * :.*:* *
Mouse QSGTAKSVMCTYSPPLNKLFCQLVKTCPVQLWVSATPPAGSRVRAMAIYKKSQHMTEVVR 171
Rat QSGTAKSVMCTYSISLNKLFCQLAKTCPVQLWVTSTPPPGTRVRAMAIYKKSQHMTEVVR 172
Dog HSGTAKSVTWTYSPLLNKLFCQLAKTCPVQLWVSSPPPPNTCVRAMAIYKKSEFVTEVVR 161
Cat QSGTAKSVTCTYSPPLNKLFCQLAKTCPVQLWVRSPPPPGTCVRAMAIYKKSEFMTEVVR 166
Sheep HSGTAKSVTCTYSPSLNKLFCQLAKTCPVQLWVDSPPPPGTRVRAMAIYKKLEHMTEVVR 162
Cow HSGTAKSVTCTYSPSLNKLFCQLAKTCPVQLWVDSPPPPGTRVRAMAIYKKLEHMTEVVR 162
Pig HSGTAKSVTCTYSPALNKLFCQLAKTCPVQLWVSSPPPPGTRVRAMAIYKKSEYMTEVVR 166
Human HSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVR 174
Zebrafish QSGTAKSVTCTYSPDLNKLFCQLAKTCPVQMVVDVAPPQGSVVRATAIYKKSEHVAEVVR 142
:******* *** ***:****.******: * .** .: *** ****: :.::****
Mouse RCPHHERCSD-GDGLAPPQHLIRVEGNLYPEYLEDRQTFRHSVVVPYEPPEAGSEYTTIH 230
Rat RWPHHERCSD-GDGLAPPQHLIRVEGNPYAEYLDDRQTFRHSVVVPYEPPEVGSDYTTIH 231
Dog RCPHHERCSDSSDGLAPPQHLIRVEGNLRAKYLDDRNTFRHSVVVPYEPPEVGSDYTTIH 221
Cat RCPHHERCPDSSDGLAPPQHLIRVEGNLHAKYLDDRNTFRHSVVVPYEPPEVGSDCTTIH 226
Sheep RSPHHERSSDYSDGLAPPQHLIRVEGNLRAEYFDDRNTFRHSVVVPYESPEIESECTTIH 222
Cow RSPHHERSSDYSDGLAPPQHLIRVEGNLRAEYFDDRNTFRHSVVVPYESPEIESECTTIH 222
Pig RCPHHERSSDYSDGLAPPQHLIRVEGNLRAEYLDDRNTFRHSVVVPYEPPEVGSDCTTIH 226
Human RCPHHERCSD-SDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIH 233
Zebrafish RCPHHERTPD-GDNLAPAGHLIRVEGNQRANYREDNITLRHSVFVPYEAPQLGAEWTTVL 201
* ***** .* .*.***. ******** :* :*. *:****.****.*: :: **:
Mouse YKYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRDSFEVRVCACPGRDRRTEEENFRKKE 290
Rat YKYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRDSFEVRVCACPGRDRRTEEENFRKKE 291
Dog YNYMCNSSCMGGMNRRPILTIITLEDSSGNVLGRNSFEVRVCACPGRDRRTEEENFHKKG 281
Cat YNFMCNSSCMGGMNRRPIITIITLEDSNGKLLGRNSFEVRVCACPGRDRRTEEENFRKKG 286
Sheep YNFMCNSSCMGGMNRRPILTIITLEDSRGNLLGRSSFEVRVCACPGRDRRTEEENFRKKG 282
Cow YNFMCNSSCMGGMNRRPILTIITLEDSRGNLLGRSSFEVRVCACPGRDRRTEEENFRKKG 282
Pig YNFMCNSSCMGGMNRRPILTIITLEDASGNLLGRNSFEVRVCACPGRDRRTEEENFLKKG 286
Human YNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKG 293
Zebrafish LNYMCNSSCMGGMNRRPILTIITLETQEGQLLGRRSFEVRVCACPGRDRKTEESNFKKDQ 261
::***************:****** *::*** **************:***.*: *.
Mouse VLCPELPPGS-AKRALPTCTSAS---PPQKKKP----LDGEYFTLKIRGRKRFEMFRELN 342
Rat EHCPELPPGS-AKRALPTSTSSS---PQQKKKP----LDGEYFTLKIRGRERFEMFRELN 343
Dog EPCPEPPPGS-TKRALPPSTSSS---PPQKKKP----LDGEYFTLQIRGRERYEMFRNLN 333
Cat EPCPEPPPGS-TKRALPPSTSST---PPQKKKP----LDGEYFTLQIRGRERFEMFRELN 338
Sheep QSCPEPPPGS-TKRALPSSTSSS---PQQKKKP----LDGEYFTLQIRGRKRFEMFRELN 334
Cow QSCPEPPPGS-TKRALPSSTSSS---PQQKKKP----LDGEYFTLQIRGRKRFEMFRELN 334
Pig QSCPEPPPGS-TKRALPTSTSSS---PVQKKKP----LDGEYFTLQIRGRERFEMFRELN 338
Human EPHHELPPGS-TKRALPNNTSSS---PQPKKKP----LDGEYFTLQIRGRERFEMFRELN 345
Zebrafish ETKTMAKTTTGTKRSLVKESSSATLRPEGSKKAKGSSSDEEIFTLQVRGRERYEILKKLN 321
. : :**:* :*:: * .**. * * ***::***:*:*::::**
Mouse EALELKDAHATEESGDSRAHSSYLKTKKGQSTS---RHKKTMVKKVG-PDSD 390
Rat EALELKDARAAEESGDSRAHSSYPKTKKGQSTS---RHKKPMIKKVG-PDSD 391
Dog EALELKDAQSGKEPGGSRAHSSHLKAKKGQSTS---RHKKLMFKREG-LDSD 381
Cat EALELKDAQSGKEPGGSRAHSSHLKAKKGQSTS---RHKKPMLKREG-LDSD 386
Sheep EALELMDAQAGREPGESRAHSSHLKSKKGPSPS---CHKKPMLKREG-PDSD 382
Cow EALELMDAQAGREPGESRAHSSHLKSKKGPSPS---CHKKPMLKREG-PDSD 382
Pig DALELKDAQTARESGENRAHSSHLKSKKGQSPS---RHKKPMFKREG-PDSD 386
Human EALELKDAQAGKEPGGSRAHSSHLKSKKGQSTS---RHKKLMFKTEG-PDSD 393
Zebrafish DSLELSDVVPASDAEKYRQKFMTKNKKENRESSEPKQGKKLMVKDEGRSDSD 373
::*** *. . :. * : : *:. ..* ** *.* * ***
Accession numbers of the p53 protein sequences used in the protein sequence alignment: Human - NP_000537; Mouse - NP_035770; Rat - NP_112251; Pig - NP_999310; Dog - NP_001003210; Sheep - NP_001009403; Cow - NP_776626; Cat - NP_001009294; Zebrafish - NP_571402
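The "*" symbols in the ClustalW consensus lines mark columns in which every species carries the same residue. The following is a minimal Python sketch of that idea, together with a simple pairwise identity measure. The function names `conserved_columns` and `percent_identity` are illustrative assumptions and are not part of ClustalW; the ":" and "." symbols (conservative substitutions) depend on residue group tables that are not reproduced here.

```python
def conserved_columns(rows):
    """Return '*' under each column where all rows share one residue, ' ' elsewhere."""
    if len({len(r) for r in rows}) != 1:
        raise ValueError("aligned rows must have equal length")
    markers = []
    for column in zip(*rows):
        residues = set(column)
        # A column counts as fully conserved only if it holds a single residue and no gap.
        if len(residues) == 1 and "-" not in residues:
            markers.append("*")
        else:
            markers.append(" ")
    return "".join(markers)


def percent_identity(a, b):
    """Percent identical residues over columns where neither row has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


# First alignment block of the Sheep and Cow rows, copied from the table above.
sheep = "---MEESQAELGVEPPLSQETFSDLWNLLPENNLLSS--ELS-APVDDLLPYSEDVVTWL"
cow = "---MEESQAELGVEPPLSQETFSDLWNLLPENNLLSS--ELS-APVDDLLPYSEDVVTWL"
print(percent_identity(sheep, cow))
```

Gap-containing columns are skipped in `percent_identity`; other conventions (for example, dividing by the full alignment length) give different percentages, so the denominator should be stated whenever such a number is reported.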
Predicted General Properties of the p53 Tumor Suppressor Protein (Homo sapiens) - Primary Sequence Analysis (with ProtParam) [b]
Number of amino acids: 393
Molecular weight: 43653.1 Da
Theoretical pI: 6.33
Total number of negatively charged residues (Asp + Glu): 50
Total number of positively charged residues (Arg + Lys): 46
Atomic composition: Carbon (C) 1898, Hydrogen (H) 2980, Nitrogen (N) 548, Oxygen (O) 592, Sulfur (S) 22
Formula: C1898H2980N548O592S22
Total number of atoms: 6040
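The total atom count follows directly from the molecular formula. As a quick consistency check, the sketch below parses the formula string and sums the per-element counts; the helper name `atom_counts` is an assumption introduced for illustration.

```python
import re


def atom_counts(formula):
    """Parse a formula such as 'C1898H2980N548O592S22' into {element: count}."""
    return {element: int(count)
            for element, count in re.findall(r"([A-Z][a-z]?)(\d+)", formula)}


counts = atom_counts("C1898H2980N548O592S22")
total_atoms = sum(counts.values())
print(counts, total_atoms)
```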
Extinction coefficients (in units of M^-1 cm^-1, at 280 nm, measured in water):
Ext. coefficient 36035, Abs 0.1% (= 1 g/l) 0.825, assuming ALL Cys residues appear as half cystines
Ext. coefficient 35410, Abs 0.1% (= 1 g/l) 0.811, assuming NO Cys residues appear as half cystines
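The two extinction coefficients can be reproduced from residue counts. The sketch below assumes the commonly used per-chromophore molar values at 280 nm (Tyr 1490, Trp 5500, cystine 125 M^-1 cm^-1) and residue counts taken from the Human row of the alignment above (9 Tyr, 4 Trp, 10 Cys). The function names are illustrative and are not ProtParam's own API.

```python
# Assumed molar extinction values at 280 nm in water, in M^-1 cm^-1.
EXT_TYR = 1490
EXT_TRP = 5500
EXT_CYSTINE = 125  # per disulfide-bonded pair of Cys residues


def extinction_coefficient(n_tyr, n_trp, n_cys, all_cys_paired):
    """Estimate the 280 nm extinction coefficient from residue counts."""
    n_cystine = n_cys // 2 if all_cys_paired else 0
    return n_tyr * EXT_TYR + n_trp * EXT_TRP + n_cystine * EXT_CYSTINE


def abs_01_percent(ext_coefficient, molecular_weight):
    """Absorbance of a 0.1% (1 g/l) solution: extinction coefficient / molecular weight."""
    return ext_coefficient / molecular_weight


MOLECULAR_WEIGHT = 43653.1  # from the table above
ext_paired = extinction_coefficient(9, 4, 10, all_cys_paired=True)
ext_reduced = extinction_coefficient(9, 4, 10, all_cys_paired=False)
print(ext_paired, round(abs_01_percent(ext_paired, MOLECULAR_WEIGHT), 3))
print(ext_reduced, round(abs_01_percent(ext_reduced, MOLECULAR_WEIGHT), 3))
```

The difference between the two reported coefficients (625) corresponds to five cystines, which is consistent with the ten Cys residues in the human sequence.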
Estimated half-life: The N-terminal residue of the sequence considered is M (Met). The estimated half-life is 30 hours (mammalian reticulocytes, in vitro), >20 hours (yeast, in vivo), and >10 hours (Escherichia coli, in vivo).
Instability index: The instability index (II) is computed to be 73.59. This classifies the protein as unstable.
Aliphatic index: 59.08
Grand average of hydropathicity (GRAVY): -0.756
Bioinformatics References
[a] Thompson J. D., Higgins D. G., Gibson T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
[b] Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M. R., Appel R. D., Bairoch A. (2005). Protein Identification and Analysis Tools on the ExPASy Server. In: John M. Walker (ed.), The Proteomics Protocols Handbook, Humana Press, pp. 571-607.