Macromolecule Insights - Protein Bioinformatics Analysis

Privacy Policy

 


The p53 Protein

Bookmark and Share

Sequence Alignment of the p53 Tumor Suppressor Protein (with Clustalw)a

Species Sequence Alignment of p53

Mouse      MTAMEESQSDISLELPLSQETFSGLWKLLPPEDILP-----SPHCMDDLLLP-QDVEEFF 54
Rat        ---MEDSQSDMSIELPLSQETFSCLWKLLPPDDILPTTATGSPNSMEDLFLP-QDVAELL 56
Dog        ---MEESQSELNIDPPLSQETFSELWNLLPENNVLSS--ELC-PAVDELLLP-ESVVNWL 53
Cat        ---MQEPPLELTIEPPLSQETFSELWNLLPENNVLSS--ELS-SAMNELPLS-EDVANWL 53
Sheep      ---MEESQAELGVEPPLSQETFSDLWNLLPENNLLSS--ELS-APVDDLLPYSEDVVTWL 54
Cow        ---MEESQAELGVEPPLSQETFSDLWNLLPENNLLSS--ELS-APVDDLLPYSEDVVTWL 54
Pig        ---MEESQSELGVEPPLSQETFSDLWKLLPENNLLSS--ELSLAAVNDLLLS--PVTNWL 53
Human      ---MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSP--LPS-QAMDDLMLSPDDIEQWF 54
Zebrafish  ---MAQND---------SQE-FAELW----EKNLIIQ--PPGGGSCWDIINDEEYLPGSF 41
              * :           *** *: **     .:::            ::      :   :

Mouse      EGPS---EALRVSGAPAAQDPVTETPGPVAPAPATPWPLSSFVPSQKTYQGNYGFHLGFL 111
Rat        EGPE---EALQVS-APAAQEPGTEAPAPVAPASATPWPLSSSVPSQKTYQGNYGFHLGFL 112
Dog        DEDS--DDAPRMPATS-----APTAPGPAP-----SWPLSSSVPSPKTYPGTYGFRLGFL 101
Cat        DEAP--DDASGMSAVP-----APAAPAPATPAPAISWPLSSFVPSQKTYPGAYGFHLGFL 106
Sheep      DECP--NEAPQMPEP----------PAQAALAPATSWPLSSFVPSQKTYPGNYGFRLGFL 102
Cow        DECP--NEAPQMPEP----------PAQAALAPATSWPLSSFVPSQKTYPGNYGFRLGFL 102
Pig        DENP--DDASRVPAPP-----AATAPAPAAPAPATSWPLSSFVPSQKTYPGSYDFRLGFL 106
Human      TEDPGPDEAPRMPEAAPPVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFL 114
Zebrafish  DPNFFENVLEEQPQPS-------------------TLPPTSTVPETSDYPGDHGFRLRFP 82
                       .                      . * :* **. . * * :.*:* * 

Mouse      QSGTAKSVMCTYSPPLNKLFCQLVKTCPVQLWVSATPPAGSRVRAMAIYKKSQHMTEVVR 171
Rat        QSGTAKSVMCTYSISLNKLFCQLAKTCPVQLWVTSTPPPGTRVRAMAIYKKSQHMTEVVR 172
Dog        HSGTAKSVTWTYSPLLNKLFCQLAKTCPVQLWVSSPPPPNTCVRAMAIYKKSEFVTEVVR 161
Cat        QSGTAKSVTCTYSPPLNKLFCQLAKTCPVQLWVRSPPPPGTCVRAMAIYKKSEFMTEVVR 166
Sheep      HSGTAKSVTCTYSPSLNKLFCQLAKTCPVQLWVDSPPPPGTRVRAMAIYKKLEHMTEVVR 162
Cow        HSGTAKSVTCTYSPSLNKLFCQLAKTCPVQLWVDSPPPPGTRVRAMAIYKKLEHMTEVVR 162
Pig        HSGTAKSVTCTYSPALNKLFCQLAKTCPVQLWVSSPPPPGTRVRAMAIYKKSEYMTEVVR 166
Human      HSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVR 174
Zebrafish  QSGTAKSVTCTYSPDLNKLFCQLAKTCPVQMVVDVAPPQGSVVRATAIYKKSEHVAEVVR 142
           :*******  ***  ***:****.******: *  .** .: *** ****: :.::****

Mouse      RCPHHERCSD-GDGLAPPQHLIRVEGNLYPEYLEDRQTFRHSVVVPYEPPEAGSEYTTIH 230
Rat        RWPHHERCSD-GDGLAPPQHLIRVEGNPYAEYLDDRQTFRHSVVVPYEPPEVGSDYTTIH 231
Dog        RCPHHERCSDSSDGLAPPQHLIRVEGNLRAKYLDDRNTFRHSVVVPYEPPEVGSDYTTIH 221
Cat        RCPHHERCPDSSDGLAPPQHLIRVEGNLHAKYLDDRNTFRHSVVVPYEPPEVGSDCTTIH 226
Sheep      RSPHHERSSDYSDGLAPPQHLIRVEGNLRAEYFDDRNTFRHSVVVPYESPEIESECTTIH 222
Cow        RSPHHERSSDYSDGLAPPQHLIRVEGNLRAEYFDDRNTFRHSVVVPYESPEIESECTTIH 222
Pig        RCPHHERSSDYSDGLAPPQHLIRVEGNLRAEYLDDRNTFRHSVVVPYEPPEVGSDCTTIH 226
Human      RCPHHERCSD-SDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIH 233
Zebrafish  RCPHHERTPD-GDNLAPAGHLIRVEGNQRANYREDNITLRHSVFVPYEAPQLGAEWTTVL 201
           * ***** .* .*.***. ********   :* :*. *:****.****.*:  :: **: 

Mouse      YKYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRDSFEVRVCACPGRDRRTEEENFRKKE 290
Rat        YKYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRDSFEVRVCACPGRDRRTEEENFRKKE 291
Dog        YNYMCNSSCMGGMNRRPILTIITLEDSSGNVLGRNSFEVRVCACPGRDRRTEEENFHKKG 281
Cat        YNFMCNSSCMGGMNRRPIITIITLEDSNGKLLGRNSFEVRVCACPGRDRRTEEENFRKKG 286
Sheep      YNFMCNSSCMGGMNRRPILTIITLEDSRGNLLGRSSFEVRVCACPGRDRRTEEENFRKKG 282
Cow        YNFMCNSSCMGGMNRRPILTIITLEDSRGNLLGRSSFEVRVCACPGRDRRTEEENFRKKG 282
Pig        YNFMCNSSCMGGMNRRPILTIITLEDASGNLLGRNSFEVRVCACPGRDRRTEEENFLKKG 286
Human      YNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKG 293
Zebrafish  LNYMCNSSCMGGMNRRPILTIITLETQEGQLLGRRSFEVRVCACPGRDRKTEESNFKKDQ 261
            ::***************:******   *::*** **************:***.*: *. 

Mouse      VLCPELPPGS-AKRALPTCTSAS---PPQKKKP----LDGEYFTLKIRGRKRFEMFRELN 342
Rat        EHCPELPPGS-AKRALPTSTSSS---PQQKKKP----LDGEYFTLKIRGRERFEMFRELN 343
Dog        EPCPEPPPGS-TKRALPPSTSSS---PPQKKKP----LDGEYFTLQIRGRERYEMFRNLN 333
Cat        EPCPEPPPGS-TKRALPPSTSST---PPQKKKP----LDGEYFTLQIRGRERFEMFRELN 338
Sheep      QSCPEPPPGS-TKRALPSSTSSS---PQQKKKP----LDGEYFTLQIRGRKRFEMFRELN 334
Cow        QSCPEPPPGS-TKRALPSSTSSS---PQQKKKP----LDGEYFTLQIRGRKRFEMFRELN 334
Pig        QSCPEPPPGS-TKRALPTSTSSS---PVQKKKP----LDGEYFTLQIRGRERFEMFRELN 338
Human      EPHHELPPGS-TKRALPNNTSSS---PQPKKKP----LDGEYFTLQIRGRERFEMFRELN 345
Zebrafish  ETKTMAKTTTGTKRSLVKESSSATLRPEGSKKAKGSSSDEEIFTLQVRGRERYEILKKLN 321
                  . : :**:*   :*::   *  .**.     * * ***::***:*:*::::**

Mouse      EALELKDAHATEESGDSRAHSSYLKTKKGQSTS---RHKKTMVKKVG-PDSD 390
Rat        EALELKDARAAEESGDSRAHSSYPKTKKGQSTS---RHKKPMIKKVG-PDSD 391
Dog        EALELKDAQSGKEPGGSRAHSSHLKAKKGQSTS---RHKKLMFKREG-LDSD 381
Cat        EALELKDAQSGKEPGGSRAHSSHLKAKKGQSTS---RHKKPMLKREG-LDSD 386
Sheep      EALELMDAQAGREPGESRAHSSHLKSKKGPSPS---CHKKPMLKREG-PDSD 382
Cow        EALELMDAQAGREPGESRAHSSHLKSKKGPSPS---CHKKPMLKREG-PDSD 382
Pig        DALELKDAQTARESGENRAHSSHLKSKKGQSPS---RHKKPMFKREG-PDSD 386
Human      EALELKDAQAGKEPGGSRAHSSHLKSKKGQSTS---RHKKLMFKTEG-PDSD 393
Zebrafish  DSLELSDVVPASDAEKYRQKFMTKNKKENRESSEPKQGKKLMVKDEGRSDSD 373
           ::*** *. .  :.   * :    : *:. ..*     ** *.*  *  ***

Accession numbers of the p53 protein sequences used in the protein sequence alignment:  Human - NP_000537; Mouse - NP_035770;  Rat - NP_112251; Pig - NP_999310; Dog - NP_001003210; Sheep - NP_001009403; Cow - NP_776626; Cat - NP_001009294; Zebrafish - NP_571402

Predicted General Properties of the p53 Tumor Suppressor Protein  (Homo sapiens) - Primary Sequence Analysis (with ProtParam)b

Number of amino acids: 393

Molecular weight: 43653.1

Theoretical pI: 6.33

Total number of negatively charged residues (Asp + Glu): 50

Total number of positively charged residues (Arg + Lys): 46

Atomic composition: Carbon C 1898 Hydrogen H 2980 Nitrogen N 548 Oxygen O 592 Sulfur S 22

Formula: C1898H2980N548O592S22

Total number of atoms: 6040

Extinction coefficients: Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water. Ext. coefficient 36035 Abs 0.1% (=1 g/l) 0.825, assuming ALL Cys residues appear as half cystines Ext. coefficient 35410 Abs 0.1% (=1 g/l) 0.811, assuming NO Cys residues appear as half cystines

Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).

Instability index: The instability index (II) is computed to be 73.59 This classifies the protein as unstable.

Aliphatic index: 59.08

Grand average of hydropathicity (GRAVY): -0.756

Bioinformatics References

a Higgins D., Thompson J., Gibson T. Thompson J. D., Higgins D. G., Gibson T. J.(1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting,position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.

b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A.; Protein Identification and Analysis Tools on the ExPASy Server;
(In) John M. Walker (ed): The Proteomics Protocols Handbook, Humana Press (2005). pp. 571-607  Full text - Copyright Humana Press.

Bookmark and Share