Macromolecule Insights - Protein Bioinformatics Analysis

The Rhodopsin Protein

Sequence Alignment of the Rhodopsin Protein (with ClustalW) (ref. a)

Species   Sequence Alignment of Rhodopsin

Human     MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY 60
Rabbit    MNGTEGPDFYIPMSNQTGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY 60
Dog       MNGTEGPNFYVPFSNKTGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY 60
Cat       MNGTEGPNFYVPFSNKTGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY 60
Mouse     MNGTEGPNFYVPFSNVTGVVRSPFEQPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY 60
Rat       MNGTEGPNFYVPFSNITGVVRSPFEQPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY 60
Pig       MNGTEGPNFYVPFSNKTGVVRSPFEYPQYYLAEPWQFSMLAAYMFMLIVLGFPINFLTLY 60
Cow       MNGTEGPNFYVPFSNKTGVVRSPFEAPQYYLAEPWQFSMLAAYMFLLIMLGFPINFLTLY 60
Zebrafish MNGTEGPAFYVPMSNATGVVRSPYEYPQYYLVAPWAYGFVAAYMFFLIITGFPVNFLTLY 60
Frog      MNGTEGPNFYIPMSNKTGVVRSPFDYPQYYLAEPWQYSALAAYMFLLILLGLPINFMTLF 60
Chicken   MNGTEGINFYVPMSNKTGVVRSPFEYPQYYLAEPWKYRLVCCYIFFLISTGLPINLLTLL 60
          ******  **:*:** *******:: *****. ** :  :..*:*:**  *:*:*::** 

Human     VTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG 120
Rabbit    VTVQHKKLRTPLNYILLNLAVADLFMVLGGFTTTLYTSLHGYFVFGPTGCNVEGFFATLG 120
Dog       VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNVEGFFATLG 120
Cat       VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 120
Mouse     VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 120
Rat       VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 120
Pig       VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 120
Cow       VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 120
Zebrafish VTIEHKKLRTPLNYILLNLAIADLFMVFGGFTTTMYTSLHGYFVFGRLGCNLEGFFATLG 120
Frog      VTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG 120
Chicken   VTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG 120
          **.:***** ******:**..*: **.  *** *:**: :***:**  ** :********

Human     GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWS---- 176
Rabbit    GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWIMALACAAPPLVGWS---- 176
Dog       GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSSLLS 180
Cat       GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLVGWS---- 176
Mouse     GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVVFTWIMALACAAPPLVGWS---- 176
Rat       GEIGLWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLVGWS---- 176
Pig       GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGLALTWVMALACAAPPLVGWS---- 176
Cow       GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLVGWS---- 176
Zebrafish GEMGLKSLVVLAIERWMVVCKPVSNFRFGENHAIMGVAFTWVMACSCAVPPLVGWS---- 176
Frog      GEVALWSLVVLAVERYMVVCKPMANFRFGENHAIMGVAFTWIMALSCAAPPLFGWS---- 176
Chicken   GQVALWSLVVLAIERYIVVCKPMGNFRFSATHAMMGIAFTWVMAFSCAAPPLFGWS---- 176
          *::.* ******:**::*****:.****. .**:**:.:**:** :**.*** ***    

Human     ------RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTV 230
Rabbit    ------RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIIIFFCYGQLVFTV 230
Dog       HSPLVLRYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFAIPMIVIFFCYGQLVFTV 240
Cat       ------RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIVIFFCYGQLVFTV 230
Mouse     ------RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIVIFFCYGQLVFTV 230
Rat       ------RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIVIFFCYGQLVFTV 230
Pig       ------RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFSIPLVIIFFCYGQLVFTV 230
Cow       ------RYIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTV 230
Zebrafish ------RYIPEGMQCSCGVDYYTRTPGVNNESFVIYMFIVHFFIPLIVIFFCYGRLVCTV 230
Frog      ------RYIPEGMQCSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLLCTV 230
Chicken   ------RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKV 230
                **:***:***** ****     :***:*:***::** **:::***.**:*: .*

Human     KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTI 290
Rabbit    KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTI 290
Dog       KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSDFGPIFMTL 300
Cat       KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTL 290
Mouse     KEAAAQQQESATTQKAEKEVTRMVIIMVIFFLICWLPYASVAFYIFTHQGSNFGPIFMTL 290
Rat       KEAAAQQQESATTQKAEKEVTRMVIIMVIFFLICWLPYASVAMYIFTHQGSNFGPIFMTL 290
Pig       KEAAAQQQESATTQKAEKEVTRMVIIMVVAFLICWLPYASVAFYIFTHQGSDFGPIFMTI 290
Cow       KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDFGPIFMTI 290
Zebrafish KEAARQQQESETTQRAEREVTRMVIIMVIAFLICWLPYAGVAWYIFTHQGSEFGPVFMTL 290
Frog      KEAAAQQQESATTQKAEKEVTRMVVIMVVFFLICWVPYAYVAFYIFTHQGSDFGPVFMTV 290
Chicken   REAAAQQQESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAV 290
          :*** ***** ***:**:******::**: *::.* *** ** :***::*::* . :*::

Human     PAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDE--ASATVSKTETSQVAPA 348
Rabbit    PAFFAKSSSIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDE--ASATASKTETSQVAPA 348
Dog       PAFFAKSSSIYNPVIYIMMNKQFRNCMITTLCCGKNPLGDDE--ASASASKTETSQVAPA 358
Cat       PAFFAKSSSIYNPVIYIMMNKQFRNCMLTTLCCGKNPLGDDE--ASTTGSKTETSQVAPA 348
Mouse     PAFFAKSSSIYNPVIYIMLNKQFRNCMLTTLCCGKNPLGDDD--ASATASKTETSQVAPA 348
Rat       PAFFAKTASIYNPIIYIMMNKQFRNCMLTTLCCGKNPLGDDE--ASATASKTETSQVAPA 348
Pig       PAFFAKSASIYNPVIYIMMNKQFRNCMLTTLCCGKNPLGDDE--ASTTTSKTETSQVAPA 348
Cow       PAFFAKTSAVYNPVIYIMMNKQFRNCMVTTLCCGKNPLGDDE--ASTTVSKTETSQVAPA 348
Zebrafish PAFFAKTSAVYNPCIYICMNKQFRHCMITTLCCGKNPFEEEEG-ASTTASKTEASSVSSS 349
Frog      PAFFAKSSAIYNPVIYIVLNKQFRNCLITTLCCGKNPFGDEDG-SSAATSKTEASSVSSS 349
Chicken   PAFFSKSSSLYNPIIYVLMNKQFRNCMITTICCGKNPFGDEDVSSTVSQSKTEVSSVSSS 350
          ****:*::::*** **: :*****:*::**:******: :::  ::.: ****.*.*:.:

Human     -----
Rabbit    -----
Dog       -----
Cat       -----
Mouse     -----
Rat       -----
Pig       -----
Cow       -----
Zebrafish SVSPA 354
Frog      QVSPA 354
Chicken   QVSPA 355

Accession numbers of the rhodopsin protein sequences used in the protein sequence alignment: human - NP_000530; cow - NP_001014890; mouse - NP_663358; rat - NP_254276; dog - NP_001008277; cat - NP_001009242; pig - NP_999386; rabbit - NP_001075818; zebrafish - NP_571159; frog - NP_001080517.
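The aligned rows can be compared pairwise by counting identical residues column by column. The following is a minimal sketch, not part of the ClustalW output: the function name identity_counts is hypothetical, and it assumes both rows come from the same alignment block and that "-" marks a gap. It is applied here to the first 60 columns of the Human and Rabbit rows.

```python
def identity_counts(row_a: str, row_b: str) -> tuple[int, int]:
    """Return (identical columns, compared columns) for two aligned rows.

    Columns where either row has a gap ("-") are skipped.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    matches = 0
    compared = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    return matches, compared


# First alignment block (columns 1-60), copied from the table above.
human = "MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY"
rabbit = "MNGTEGPDFYIPMSNQTGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY"

matches, compared = identity_counts(human, rabbit)
print(f"{matches}/{compared} identical ({100 * matches / compared:.1f}%)")
```

For this block the two rows differ at four positions, so the script reports 56 of 60 identical columns. The same function can be applied to the remaining blocks and the counts summed for a whole-sequence figure.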

Predicted General Properties of the Rhodopsin Protein (Homo sapiens) - Primary Sequence Analysis (with ProtParam) (ref. b)

Number of amino acids: 348

Molecular weight: 38892.6 Da

Theoretical pI: 6.21

Total number of negatively charged residues (Asp + Glu): 20

Total number of positively charged residues (Arg + Lys): 18
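The length and the charged-residue totals can be checked directly against the Human row of the alignment. This is a small sketch rather than ProtParam's own code: it joins the six Human alignment blocks, strips the gap characters, and counts residues with the standard library. The variable names are illustrative.

```python
from collections import Counter

# Human rows copied from the alignment above, gaps included.
HUMAN_ROWS = [
    "MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY",
    "VTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG",
    "GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWS----",
    "------RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTV",
    "KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTI",
    "PAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDE--ASATVSKTETSQVAPA",
]

# Remove alignment gaps to recover the ungapped protein sequence.
sequence = "".join(HUMAN_ROWS).replace("-", "")
counts = Counter(sequence)

negative = counts["D"] + counts["E"]  # Asp + Glu
positive = counts["R"] + counts["K"]  # Arg + Lys

print("Number of amino acids:", len(sequence))
print("Asp + Glu:", negative)
print("Arg + Lys:", positive)
```

Run on the rows as printed, this gives 348 residues, 20 negatively charged and 18 positively charged, in agreement with the values listed above.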

Atomic composition: Carbon (C) 1814; Hydrogen (H) 2725; Nitrogen (N) 423; Oxygen (O) 477; Sulfur (S) 25

Formula: C1814H2725N423O477S25

Total number of atoms: 5464
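The atom total and the molecular weight both follow from the formula. The sketch below parses the formula string, sums the atom counts, and estimates the mass from average atomic masses. The mass table is an assumption (rounded standard atomic weights); ProtParam may use slightly different values, so only approximate agreement with the reported weight should be expected.

```python
import re

# Rounded standard atomic weights (assumed values, not taken from ProtParam).
AVERAGE_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "S": 32.06}


def parse_formula(formula: str) -> dict[str, int]:
    """Split a formula such as 'C2H6O1' into {'C': 2, 'H': 6, 'O': 1}.

    Every element must carry an explicit count, as in the formula above.
    """
    return {el: int(n) for el, n in re.findall(r"([A-Z][a-z]?)(\d+)", formula)}


atoms = parse_formula("C1814H2725N423O477S25")
total_atoms = sum(atoms.values())
mass = sum(AVERAGE_MASS[el] * n for el, n in atoms.items())

print("Total number of atoms:", total_atoms)
print(f"Approximate molecular weight: {mass:.1f}")
```

The atom count comes out at 5464, and the estimated mass lands within a fraction of a mass unit of the reported 38892.6.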

Extinction coefficients: Extinction coefficients are in units of M^-1 cm^-1, at 280 nm, measured in water.
Ext. coefficient 56435, Abs 0.1% (=1 g/l) 1.451, assuming ALL Cys residues appear as half cystines.
Ext. coefficient 55810, Abs 0.1% (=1 g/l) 1.435, assuming NO Cys residues appear as half cystines.
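Both extinction coefficients can be reproduced from residue counts. ProtParam estimates the coefficient at 280 nm as a weighted sum over Tyr, Trp and cystine, with per-residue values of 1490, 5500 and 125 M^-1 cm^-1. The sketch below assumes the counts obtained from the Human row of the alignment (19 Tyr, 5 Trp, 10 Cys) and the molecular weight listed above. The function name is hypothetical.

```python
# Molar extinction contributions at 280 nm in water (M^-1 cm^-1).
EXT_TYR = 1490
EXT_TRP = 5500
EXT_CYSTINE = 125


def extinction_280(n_tyr: int, n_trp: int, n_cystine: int) -> int:
    """Estimated molar extinction coefficient at 280 nm."""
    return n_tyr * EXT_TYR + n_trp * EXT_TRP + n_cystine * EXT_CYSTINE


# Residue counts tallied from the Human row of the alignment.
N_TYR, N_TRP, N_CYS = 19, 5, 10
MOLECULAR_WEIGHT = 38892.6

# All Cys paired: 10 Cys form 5 cystines. No Cys paired: 0 cystines.
all_paired = extinction_280(N_TYR, N_TRP, N_CYS // 2)
none_paired = extinction_280(N_TYR, N_TRP, 0)

for label, ext in (("all Cys paired", all_paired), ("no Cys paired", none_paired)):
    # Abs 0.1% is the absorbance of a 1 g/l solution: coefficient / molecular weight.
    print(f"{label}: {ext}  Abs 0.1% = {ext / MOLECULAR_WEIGHT:.3f}")
```

The two cases differ by 5 x 125 = 625, which accounts for the gap between 56435 and 55810.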

Estimated half-life: The N-terminal residue of the sequence considered is M (Met). The estimated half-life is 30 hours (mammalian reticulocytes, in vitro), >20 hours (yeast, in vivo), and >10 hours (Escherichia coli, in vivo).

Instability index: The instability index (II) is computed to be 34.05. This classifies the protein as stable.

Aliphatic index: 93.59

Grand average of hydropathicity (GRAVY): 0.519
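The aliphatic index and GRAVY are simple functions of amino acid composition. The sketch below uses the Kyte-Doolittle hydropathy scale for GRAVY (mean hydropathy per residue) and the usual aliphatic index formula, X(Ala) + 2.9 * X(Val) + 3.9 * (X(Ile) + X(Leu)), where X is the mole percent. The function names are illustrative, and the sequence is rebuilt from the Human row of the alignment rather than taken from ProtParam.

```python
# Kyte-Doolittle hydropathy values.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Grand average of hydropathicity: summed hydropathy / length."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def aliphatic_index(seq: str) -> float:
    """Aliphatic index from the mole percent of Ala, Val, Ile and Leu."""
    def mole_percent(aa: str) -> float:
        return 100.0 * seq.count(aa) / len(seq)

    return (mole_percent("A")
            + 2.9 * mole_percent("V")
            + 3.9 * (mole_percent("I") + mole_percent("L")))


# Human rows copied from the alignment, with gap characters removed.
HUMAN = "".join([
    "MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY",
    "VTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG",
    "GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWS----",
    "------RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTV",
    "KEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTI",
    "PAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDE--ASATVSKTETSQVAPA",
]).replace("-", "")

print(f"Aliphatic index: {aliphatic_index(HUMAN):.2f}")
print(f"GRAVY: {gravy(HUMAN):.3f}")
```

With the sequence as printed in the alignment, both results round to the values listed above (93.59 and 0.519).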

Bioinformatics References

a Thompson J. D., Higgins D. G., Gibson T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.

b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M. R., Appel R. D., Bairoch A. (2005). Protein Identification and Analysis Tools on the ExPASy Server. In: John M. Walker (ed.), The Proteomics Protocols Handbook, Humana Press, pp. 571-607.
