Macromolecule Insights - Protein Bioinformatics Analysis

Privacy Policy

 


The Lactase Protein

Bookmark and Share

Sequence Alignment of the Lactase Protein (with Clustalw)a

Species Sequence Alignment of Lactase

human   MELSWHVVFIA--LLSFSCWGSDWESDRNFISTAGPLTNDLLHNL---SGLLGDQSSNFV 55
mouse   MELPWTALFLSTFLLGLSCQGNDWESDRNFISAAGPLTNDLVHNL---NHPLGKQGSDLV 57
rat     ------------------------------------------------------------
chicken --MELVCKTVVFFLLVSTSLGIDGELAQDFVAIAGPLPTTLAHRLHLQSHILPKEALDPT 58
                                                                      

human   AGDKDMYVCHQP-LPTFLPEYFSSLHASQITHYKVFLSWAQLLPAGSTQNPDEKTVQCYR 114
mouse   AGDTDIYVCQQP-LPSFLPQYFSSLRASQVTHYKVLLSWAQLLPKGSSKNPDQEAVQCYR 116
rat     ------------------------------------------------------------
chicken VSEAHDDFCQQDTVISQLPQYFSHLREIGVTHYKVFLPWARILPDGDAKKPDEAQVRCYQ 118
                                                                      

human   RLLKALKTARLQPMVILHHQTLPASTLR--RTEAFADLFADYATFAFHSFGDLVGIWFTF 172
mouse   QLLQSLKDAQLQPMVVLFHQMPPTSTIQ--RDGGFADLFADYATLAFQSFGDLVEIWFTF 174
rat     ------------------------------------------------------------
chicken ELLKMLVAADLRPVIVLHHKGVPDTVAVGRKASSFADLFVDYAEFSFYVFGGLADMWLTF 178
                                                                    

human   SDLEEVIKELPHQESRASQLQTLSDAHRKAYEIYHESYAFQGGKLSVVLRAEDIPELLLE 232
mouse   SDLEKVIMGLPHQHLKASGLQTLSDAHRKAFDVYHRKYSSQGGKLSVVLKAEDLPKLLPD 234
rat     --------------------------------------MPTG-------------EPLTF 9
chicken SDLPELLESLPYSDSQVR-VQALAAAHERAYSVYHKKYSFQGGKLSIALGHDDIPSVKTK 237
                                                 *             .    

human   PPISA---LAQDTVDFLSLDLSYECQNEASLRQKLSKLQTIEPKVKVFIFNLKLPDCPST 289
mouse   PSSAA---LVKGSVDFLSLDLSYDCQSISTLPQKLSELQNLEPKVKVFVYTLKPQDCPST 291
rat     TTESS---HLRGSVDFLSLDLSYECQSVATLPQKLSELQNLEPKVKVFIYTLKLEDCPAT 66
chicken PPISCSFHLEHESVDFLSLSLQYWCGNETDFYLKMSEFQNVWKDTEVLLFSLKFLDCGSL 297
        .. :.     : :******.*.* * . : :  *:*::*.:  ..:*:::.**  ** : 

human   MKNPASLLFSLFEAINKDQVLTIGFDINEFLSCSSSSKKSMSCSLTGS-LALQPDQQQDH 348
mouse   GMSPASLLFRLLEAINKDQVQTVGFDVSAFLSCTSSSEESPSCSLTYSSLALQSEQQHET 351
rat     GTSPSSLLISLLEAINKDQIQTVGFDVNAFLSCTSNSEESPSCSLTDS-LALQTEQQQET 125
chicken EENPFIPVAAIVAAINKEKARTIGYDVNEFLDFSSRKLRTSEAGLTGF--SLSSYTLQEN 355
          .*   :  :. ****::  *:*:*:. **. :* . .: ...**    :*..   :: 

human   ETTDSSPASAYQRVWEAFANQSRAERDAFLQDTFPEGFLWGASTGAFNVEGGWAEGGRGV 408
mouse   VVS-SFPGSAYQRVWAALANQSREERDAFLQDVFPEGFLWGVSTGAFNVEGGWAEGGRGP 410
rat     AVP-SSPGSAYQRVWAAFANQSREERDAFLQDVFPEGFLWGISTGAFNVEGGWAEGGRGP 184
chicken LSAAIAPRSSYQTVWEMFANQSELERDTFLQDVFPSGFLWGTSTGAFNIEGAWAEDGKGE 415
          .   * *:** **  :****. ***:****.**.***** ******:**.***.*:* 

human   SIWDPRRPLNTTEGQATLEVASDSYHKVASDVALLCGLRAQVYKFSISWSRIFPMGHGSS 468
mouse   SIWDHYSNLNAAESQATAKVASDSYHKPVSDVALLRGLRADVYKFSISWSRIFPFGQRTS 470
rat     SIWDHYGNLNAAEGQATAKVASDSYHKPASDVALLRGIRAQVYKFSISWSRLFPTGQKST 244
chicken SIWDQFGHEGHVYMNQTTDVACDSYHKTSYDVYLLRGLHPQLYKFSISWPRIFPAGTNET 475
        ****     . .  : * .**.*****   ** ** *::.::*******.*:** *   :

human   PSLPGVAYYNKLIDRLQDAGIEPMATLFHWDLPQALQDHGGWQNESVVDAFLDYAAFCFS 528
mouse   PNLQGVAYYNKLIDSLLDSQVEPMATLFHWDLPQALQEQGGWQNESVVDAFLDYAAFCFS 530
rat     PNRQGVAYYNKLIDRLLDSHIEPMATLFHWDLPQALQEQGGWQNESVVEAFLDYAAFCFS 304
chicken IGLKGVDYYNQLIDRLLEANIEPMVTLFHWDLPQALQVLGGWQNDSIIDAFANYADFCFT 535
         .  ** ***:*** * :: :***.************  *****:*:::** :** ***:

human   TFGDRVKLWVTFHEPWVMSYAGYGTGQHPPGISDPGVASFKVAHLVLKAHARTWHHYNSH 588
mouse   TFGDRVKLWVTFHEPWVISYAGYGTGQHAPAISDPGVASFKVAHLILKAHARTWHHYNYH 590
rat     TFGDRVKLWVTFHEPWVMSYAGYGTGQHAPAISDPGMASFKVAHLILKAHARTWHLYDLH 364
chicken TFGDRVKFWVTFHEPWVISYAGYGTGEHPPGITDPGIASYKVAHTILKAHAKVWHLYNDR 595
        *******:*********:********:*.*.*:***:**:**** :*****:.** *: :

human   HRPQQQGHVGIVLNSDWAEPLSPERPEDLRASERFLHFMLGWFAHPVFVDGDYPATLRTQ 648
mouse   HRQKQQGRVGIVLNSDWAEPLDGKSPQDLAAAERYLHFMLGWFAHPIFIDGDYPAALRAQ 650
rat     HRLQQQGRVGIVLNSDWAEPLDRKSPQDLAAAERFLHFMLGWFAHPIFVDGDYPTTLRAQ 424
chicken YRSQQQGRVGLVLNSDWAEPQTPANSEDVKASERYLQFMLGWFAHPIFVNGDYPDILKAQ 655
        :* :***:**:*********     .:*: *:**:*:*********:*::****  *::*

human   IQQMNRQCSHPVAQLPEFTEAEKQLLKGSADFLGLSHYTSRLISNAPQNTCIPSYDTIGG 708
mouse   IQHTNQQCGRPLAQLPEFTEAEKRLLKGSADFLGLSHYTSRLISKAGQQTCIPSYDNIGG 710
rat     IQHINQQCGHPLAQLPEFTEAEKRLLKGSADFLGLSHYTSRLISKAGRQTCTSSYDNIGG 484
chicken IQEVNQQCSTTVAQLPVFTEEEKTWVKGTADFFGLSHYTSHLVTAVTNGTCTPGYESIGN 715
        **. *:**. .:**** *** **  :**:***:*******:*:: . . ** ..*:.**.

human   FSQHVNHVWPQTSSSWIRVVPWGIRRLLQFVSLEYTRGKVPIYLAGNGMPIGESENLFDD 768
mouse   FSQHVDPKWPQTASPWIRVVPWGIRRLLRFASLEYTKGKLPIFLAGNGMPIGEGSDLFDD 770
rat     FSQHVDPEWPQTASPWIRVVPWGIRRLLRFASMEYTKGKLPIFLAGNGMPVGEEADLFDD 544
chicken FSLHVDPSWPKTASSSIHVVPWGLRRLLKFVSQEYTGTKIPIYIAGNGMPTEAVGDLIND 775
        ** **:  **:*:*. *:*****:****:*.* ***  *:**::******     :*::*

human   SLRVDYFNQYINEVLKAIKEDSVDVRSYIARSLIDGFEGPSGYSQRFGLHHVNFSDSSKS 828
mouse   SMRVNYLNLYINEVLKAVKEDSVDVRSYIARSLIDGYEGPLGYSQRFGLYHVNFNDSSRP 830
rat     SVRVNYFNLYINEVLKAVKEDLVDVRSYIVRSLIDGYEGPLGFSQRFGLYHVNFNDSSRP 604
chicken TLRVDYFRRYINEALKAIKLDAVDVQSYIARSLIDGFEGPGGYSLKFGLHHVNFEDSNRP 835
        ::**:*:. ****.***:* * ***:***.******:*** *:* :***:****.**.:.

human   RTPRKSAYFFTSIIEKNGFLTKGAKRLLPPNTVNLPSKVR-AFTFPSEVPSKAKVVWEKF 887
mouse   RTPRKSAYFFTSIIEKNGFPAKKVKRNPLPLKADFSSRARAAFSFPSEVPSKAKVVWEKF 890
rat     RTPRKSAYFFTSIIEKNGFSAKKVKRNPLPVRADFTSRARVTDSLPSEVPSKAKIVWEKF 664
chicken RTPKASAYFYSSVIENNGFPSKVSDRSSTSVVFGLPTPSKLPSLPASEVPSKSKVVWQKF 895
        ***: ****::*:**:*** :*  .*   .   .:.:  : .   .******:*:**:**

human   SSQPKFERDLFYHGTFRDDFLWGVSSSAYQIEGAWDADGKGPSIWDNFTHTPGSNVKDNA 947
mouse   SRQPKFERDLFYHGTFRDDFLWGVSSSAYQIEGGWDADGKGPSIWDNFTHTPGNGVKDNA 950
rat     SKQPRFERDLFYHGTFRDDFLWGVSSSAYQIEGGWNADGKGPSIWDNFTHTPGNGVKDNA 724
chicken SSQTDFERDMYVYGTFPKDFTWGVSSSAYQIEGGWDADGKGPSVWDNFTHVPGN-IKNND 954
        * *. ****:: :*** .** ************.*:*******:******.**. :*:* 

human   TGDIACDSYHQLDADLNMLRALKVKAYRFSISWSRIFPTGRNSSINSHGVDYYNRLINGL 1007
mouse   TGDIACDSYHQLDADINILRTLKVKSYRFSISWPRIFPTGRNSSINKQGVDYYNKLIDRL 1010
rat     TGDVACDSYHQLDADLNILRTLKVKSYRFSISWSRIFPTGRNSTINKQGVDYYNRLIDSL 784
chicken TGDIACNSYNKVEEDIYLLRALGVKNYRFSLSWPRIFPNGRNNSINSHGVDYYNRLIDGL 1014
        ***:**:**:::: *: :**:* ** ****:**.****.***.:**.:******:**: *

human   VASNIFPMVTLFHWDLPQALQDIGGWENPALIDLFDSYADFCFQTFGDRVKFWMTFNEPM 1067
mouse   LESNIFPMVTLFHWDLPQALQDIGGWENPSLIELFDSYADFCFKTFGDRVKFWMTFNEPW 1070
rat     VDNNIFPMVTLFHWDLPQALQDIGGWENPSLIELFDSYADFCFKTFGDRVKFWMTFNEPW 844
chicken VANNITPIVTLYHWDLPQALQDIGGWENSELIELFDSFADFCFQTFGDRVKFWLTFNEPQ 1074
        : .** *:***:****************. **:****:*****:*********:***** 

human   YLAWLGYGSGEFPPGVKDPGWAPYRIAHAVIKAHARVYHTYDEKYRQEQKGVISLSLSTH 1127
mouse   CSAVLGYSSGIFPPNVQDPGSLSYKVSHVIIKAHARVYHTYDEKYRQEQKGVISLSLNTH 1130
rat     CHVVLGYSSGIFPPSVQEPGWLPYKVSHIVIKAHARVYHTYDEKYRSEQKGVISLSLNTH 904
chicken VIAWVSYGTGEFPPNVNNPGSAPYEVAHTLLKAHARVYHTYDDKYRASQGGVISLCLNID 1134
          . :.*.:* ***.*::**  .*.::* ::***********:*** .* *****.*. .

human   WAEPKSPGVPRDVEAADRMLQFSLGWFAHPIFRNGDYPDTMKWKVGNRSELQHLATSRLP 1187
mouse   WVEPKDPGVQRDVEAADRMLQFNLGWFAHPIFKNGDYPDVMKWNVGNRSELQHLASSRLP 1190
rat     WAEPKDPGLQRDVEAADRMLQFTMGWFAHPIFKNGDYPDVMKWTVGNRSELQHLASSRLP 964
chicken WIEPKTPSNPRDLEAADRYMQFLVGWFAHPVFKNGDYPEVMKWTVGNRSELQNLPSSRLP 1194
        * *** *.  **:***** :** :******:*:*****:.***.********:*.:****

human   SFTEEEKRFIRATADVFCLNTYYSRIVQHKTPRLNPPSYEDDQEMAE-EEDPSWPSTAMN 1246
mouse   SFTEEEKNYIRGTADVFCINTYTSVFAQHVTPRLNPPSYDNDMELKASDMNSSALISMMH 1250
rat     TFTEEEKNYVRGTADVFCINTYTSVFVQHSTPRLNPPSYDDDMELKLIEMNSS--TGVMH 1022
chicken VFTAEEREYIRGTADVFCLNTYTAKLVTHATTRLNPFSYEYDQEIST-DVDSSWPTSALA 1253
         ** **:.::*.******:*** : :. * *.**** **: * *:   : :.*     : 

human   --RAAPWGTRRLLNWIKEEYGDIPIYITENGVGLTN-PNTEDTDRIFYHKTYINEALKAY 1303
mouse   --QDVPWGMRRLLNWIKEEYGNIPIYITENGQGLTN-PTLDDTERIFYHKTYINEALKAY 1307
rat     --PDVPWGTRRLLNWIKEEYGNIPIYITENGQGLEN-PTLDDTERIFYHKTYINEALKAY 1079
chicken GHRAVAWGLRRLLNWVKEEYGNPPMYIIENGVGIKTKSDVDDHTRILYYKTYIDEALKAY 1313
            ..** ******:*****: *:** *** *: . .  :*  **:*:****:******

human   RLDGIDLRGYVAWSLMDNFEWLNGYTVKFGLYHVDFNNTNRPRTARASARYYTEVITNNG 1363
mouse   RLDGVDLRGYSAWALMDNFEWLHGYTMRFGLYHVDFDHVNRPRTARASARYYTEVITNNG 1367
rat     KLDGVDLRGYSAWTLMDDFEWLLGYTMRFGLYHVDFNHVSRPRTARASARYYAEVIANNG 1139
chicken KLDGVNLRGYNAWSFMDFFEWLNGYEPRFGLHEVDFNDPNRPRTPRRSAVYYAEIIRNNG 1373
        :***::**** **::** **** **  :***:.***:. .****.* ** **:*:* ***

human   MPLAREDEFLYGRFPEGFIWSAASAAYQIEGAWRADGKGLSIWDTFSHTPLRVENDAIGD 1423
mouse   MPLAKEDEFLYGEFPKGFIWSAASASYQVEGAWRADGKGLSIWDTFSHTPLKIGNDDNGD 1427
rat     MPLAREDEFLYGEFPKGFIWSAASASYQVEGAWRADGKGLSIWDTFSHTPLRIGNDDNGD 1199
chicken IPLPEENKFLYGEFPKNFCWSVATAAYQIEGAWRADGKGLSIWDKYTHTPLKISNDDNGD 1433
        :**..*::****.**:.* **.*:*:**:***************.::****:: **  **

human   VACDSYHKIAEDLVTLQNLGVSHYRFSISWSRILPDGTTRYINEAGLNYYVRLIDTLLAA 1483
mouse   VACDSYHKIAEDVVALQNLGVSHYRFSISWPRILPDGTTKFINEAGLNYYVRFIDALLAA 1487
rat     VACDSYHKIAEDVVALQNLGVSHYRFSIAWSRILPDGTTKFINEAGLSYYVRFIDALLAA 1259
chicken VACDSYHKIEEDVEMLKRLKVSHYRFSISWSRVLPDGTTRYINEMGLNYYERLIDALLAA 1493
        ********* **:  *:.* ********:*.*:******::*** **.** *:**:****

human   SIQPQVTIYHWDLPQTLQDVGGWENETIVQRFKEYADVLFQRLGDKVKFWITLNEPFVIA 1543
mouse   GITPQVTMYHWDLPQALQDVGGWENETVVQRFKDYADVLFRRLGDKVKFWITLNEPFVIA 1547
rat     GITPQVTIYHWDLPQALQDVGGWENETIVQRFKEYADVLFQRLGDRVKFWITLNEPFVIA 1319
chicken NITPQVTLYHWDLPQALQDIGGWENDTIVQRFKEYAELLFQRLGDKVKFWITLNEPYNTA 1553
        .* ****:*******:***:*****:*:*****:**::**:****:**********:  *

human   YQGYGYGTAAPGVSNRPGTAPYIVGHNLIKAHAEAWHLYNDVYRASQGGVISITISSDWA 1603
mouse   AHGYGSGVSAPGISFRPGTAPYTAGHNLIKAHAEAWHLYNSTYRNSQGGVISITISSDWA 1607
rat     AQGYGTGVSAPGISFRPGTAPYIAGHNLIKAHAEAWHLYNDVYRARQGGTISITISSDWA 1379
chicken YLGYGFGTAAPGISVRPGRAPYVVGHNLIKAHAEAWHLYNETYRAKQGGLISITINSDWA 1613
          *** *.:***:* *** *** .****************..**  *** *****.****

human   EPRDPSNQEDVEAARRYVQFMGGWFAHPIFKNGDYNEVMKTRIRDRSLAAGLNKSRLPEF 1663
mouse   EPRDPSNQEDVEAARRYVQFMGGWFAHPIFKNGDYPEVMKTRIRDRSLAAGLNKSRLPEF 1667
rat     EPRDPTNQGDVEAARRYVQFMGGWFAHPIFKNGDYPEVMKTRIRDRSLAAGLNKSRLPEF 1439
chicken EPRNPHKQEDFDAARQYLQFLIGWFAHPIFKNGDYNEVMKTRIRERSLAQGLSSSRLPEF 1673
        ***:* :* *.:***:*:**: ************* ********:**** **..******

human   TESEKRRINGTYDFFGFNHYTTVLAYNLNYATAISSFDADRGVASIADRSWPDSGSFWLK 1723
mouse   TESEKKKIQGTFDFFGFNHYTTVLAYNLNYAAAVSSFDADRGVASITDRSWPDSGSFWLK 1727
rat     TESEKSRIKGTFDFFGFNHYTTVLAYNLDYPAAFSSFDADRGVASIADSSWPVSGSFWLK 1499
chicken TESEKQRIKGTYDYFGLNHYTTVLAYKYEYSTGILSYDADRGVASVTDRSWLNSGSFWLK 1733
        ***** :*:**:*:**:*********: :*.:.. *:********::* **  *******

human   MTPFGFRRILNWLKEEYNDPPIYVTENGVSQREETDLNDTARIYYLRTYINEALKAVQ-D 1782
mouse   MTPFGFRRILNWLKEEYNNPLIYVTENGVSRRGDPELNDTDRIYYLRSYINEALKAVR-D 1786
rat     VTPFGFRRILNWLKEEYNNPPIYVTENGVSRRGEPELNDTDRIYYLRSYINEALKAVQ-D 1558
chicken VTPFGFRKLLQWIKEEYNNPPIYVTENGVSERGAIDFNDTWRIHYYQNYINEALKAVVLD 1793
        :******::*:*:*****:* *********.*   ::*** **:* :.*********  *

human   KVDLRGYTVWSAMDNFEWATGFSERFGLHFVNYSDPSLPRIPKASAKFYASVVRCNGFPD 1842
mouse   KVDLRGYTVWSIMDNFEWATGFAERFGVHFVNRSDPSLPRIPKASAKVYASIVRCNGFPD 1846
rat     KVDLRGYTVWSIMDNFEWATGFAERFGVHFVNRSDPSLPRIPKASAKFYATIVRCNGFPD 1618
chicken GVDLRGYTAWTLMDNFEWAVGYDERFGFYHVNYTDPTLPRLPKASARYYSQIISCNGFPD 1853
         *******.*: *******.*: ****.:.** :**:***:*****: *: :: ******

human   PATGPHACLHQP-DAGPTISPVRQEEVQFLGLMLGTTEAQTALYVLFSLVLLGVCGLAFL 1901
mouse   PAQGPHPCLQQPEDAGPTASPVKS-EVPFLGLMLGTAEAQTALYVLFALVLLGVCSVAFL 1905
rat     PAQGPHPCLQQPEDAAPTASPVQS-EVPFLGLMLGIAEAQTALYVLFALLLLGACSLAFL 1677
chicken PATGPHPCLETEPDVIPDTTPGLADSVRFLGLDLTSQRAEIALYVLFALSAIGALGLALI 1913
        ** ***.**.   *. *  :*    .* **** *   .*: ******:*  :*. .:*::

human   SYKYCKRSKQGKTQRSQQELSPVSSF 1927
mouse   LYKYCKRSKQGTTQPGHHGLSQISSF 1931
rat     TYKYCRRSKQGNAQPSQHQLSPISSF 1703
chicken SYKYGTISKRYHKQ-SSMELSKM--- 1935
         ***   **:   * .   ** :   

Accession numbers of the lactase sequences used in the protein sequence alignment:  human - EAX11622; mouse - NP_001074547; rat - EDM09876; chicken - NP_001104816

Predicted General Properties of the Lactase Protein  (Homo sapiens) - Primary Sequence Analysis (with ProtParam)b

Number of amino acids: 1927

Molecular weight: 218572.6

Theoretical pI: 5.90

Total number of negatively charged residues (Asp + Glu): 218

Total number of positively charged residues (Arg + Lys): 181

Atomic composition: Carbon C 9907 Hydrogen H 14857 Nitrogen N 2651 Oxygen O 2877 Sulfur S 45

Formula: C9907H14857N2651O2877S45

Total number of atoms: 30337

Extinction coefficients: Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water. Ext. coefficient 422255 Abs 0.1% (=1 g/l) 1.932, assuming ALL Cys residues appear as half cystines Ext. coefficient 421130 Abs 0.1% (=1 g/l) 1.927, assuming NO Cys residues appear as half cystines

Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).

Instability index: The instability index (II) is computed to be 41.09 This classifies the protein as unstable.

Aliphatic index: 76.25

Grand average of hydropathicity (GRAVY): -0.362

Bioinformatics References

a Higgins D., Thompson J., Gibson T. Thompson J. D., Higgins D. G., Gibson T. J.(1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting,position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.

b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A.; Protein Identification and Analysis Tools on the ExPASy Server;
(In) John M. Walker (ed): The Proteomics Protocols Handbook, Humana Press (2005). pp. 571-607  Full text - Copyright Humana Press.

Bookmark and Share