Sequence Alignment of the Lactase Protein (with Clustalw)a
Species Sequence Alignment
of Lactase
human MELSWHVVFIA--LLSFSCWGSDWESDRNFISTAGPLTNDLLHNL---SGLLGDQSSNFV 55
mouse MELPWTALFLSTFLLGLSCQGNDWESDRNFISAAGPLTNDLVHNL---NHPLGKQGSDLV 57
rat ------------------------------------------------------------
chicken --MELVCKTVVFFLLVSTSLGIDGELAQDFVAIAGPLPTTLAHRLHLQSHILPKEALDPT 58
human AGDKDMYVCHQP-LPTFLPEYFSSLHASQITHYKVFLSWAQLLPAGSTQNPDEKTVQCYR 114
mouse AGDTDIYVCQQP-LPSFLPQYFSSLRASQVTHYKVLLSWAQLLPKGSSKNPDQEAVQCYR 116
rat ------------------------------------------------------------
chicken VSEAHDDFCQQDTVISQLPQYFSHLREIGVTHYKVFLPWARILPDGDAKKPDEAQVRCYQ 118
human RLLKALKTARLQPMVILHHQTLPASTLR--RTEAFADLFADYATFAFHSFGDLVGIWFTF 172
mouse QLLQSLKDAQLQPMVVLFHQMPPTSTIQ--RDGGFADLFADYATLAFQSFGDLVEIWFTF 174
rat ------------------------------------------------------------
chicken ELLKMLVAADLRPVIVLHHKGVPDTVAVGRKASSFADLFVDYAEFSFYVFGGLADMWLTF 178
human SDLEEVIKELPHQESRASQLQTLSDAHRKAYEIYHESYAFQGGKLSVVLRAEDIPELLLE 232
mouse SDLEKVIMGLPHQHLKASGLQTLSDAHRKAFDVYHRKYSSQGGKLSVVLKAEDLPKLLPD 234
rat --------------------------------------MPTG-------------EPLTF 9
chicken SDLPELLESLPYSDSQVR-VQALAAAHERAYSVYHKKYSFQGGKLSIALGHDDIPSVKTK 237
* .
human PPISA---LAQDTVDFLSLDLSYECQNEASLRQKLSKLQTIEPKVKVFIFNLKLPDCPST 289
mouse PSSAA---LVKGSVDFLSLDLSYDCQSISTLPQKLSELQNLEPKVKVFVYTLKPQDCPST 291
rat TTESS---HLRGSVDFLSLDLSYECQSVATLPQKLSELQNLEPKVKVFIYTLKLEDCPAT 66
chicken PPISCSFHLEHESVDFLSLSLQYWCGNETDFYLKMSEFQNVWKDTEVLLFSLKFLDCGSL 297
.. :. : :******.*.* * . : : *:*::*.: ..:*:::.** ** :
human MKNPASLLFSLFEAINKDQVLTIGFDINEFLSCSSSSKKSMSCSLTGS-LALQPDQQQDH 348
mouse GMSPASLLFRLLEAINKDQVQTVGFDVSAFLSCTSSSEESPSCSLTYSSLALQSEQQHET 351
rat GTSPSSLLISLLEAINKDQIQTVGFDVNAFLSCTSNSEESPSCSLTDS-LALQTEQQQET 125
chicken EENPFIPVAAIVAAINKEKARTIGYDVNEFLDFSSRKLRTSEAGLTGF--SLSSYTLQEN 355
.* : :. ****:: *:*:*:. **. :* . .: ...** :*.. ::
human ETTDSSPASAYQRVWEAFANQSRAERDAFLQDTFPEGFLWGASTGAFNVEGGWAEGGRGV 408
mouse VVS-SFPGSAYQRVWAALANQSREERDAFLQDVFPEGFLWGVSTGAFNVEGGWAEGGRGP 410
rat AVP-SSPGSAYQRVWAAFANQSREERDAFLQDVFPEGFLWGISTGAFNVEGGWAEGGRGP 184
chicken LSAAIAPRSSYQTVWEMFANQSELERDTFLQDVFPSGFLWGTSTGAFNIEGAWAEDGKGE 415
. * *:** ** :****. ***:****.**.***** ******:**.***.*:*
human SIWDPRRPLNTTEGQATLEVASDSYHKVASDVALLCGLRAQVYKFSISWSRIFPMGHGSS 468
mouse SIWDHYSNLNAAESQATAKVASDSYHKPVSDVALLRGLRADVYKFSISWSRIFPFGQRTS 470
rat SIWDHYGNLNAAEGQATAKVASDSYHKPASDVALLRGIRAQVYKFSISWSRLFPTGQKST 244
chicken SIWDQFGHEGHVYMNQTTDVACDSYHKTSYDVYLLRGLHPQLYKFSISWPRIFPAGTNET 475
**** . . : * .**.***** ** ** *::.::*******.*:** * :
human PSLPGVAYYNKLIDRLQDAGIEPMATLFHWDLPQALQDHGGWQNESVVDAFLDYAAFCFS 528
mouse PNLQGVAYYNKLIDSLLDSQVEPMATLFHWDLPQALQEQGGWQNESVVDAFLDYAAFCFS 530
rat PNRQGVAYYNKLIDRLLDSHIEPMATLFHWDLPQALQEQGGWQNESVVEAFLDYAAFCFS 304
chicken IGLKGVDYYNQLIDRLLEANIEPMVTLFHWDLPQALQVLGGWQNDSIIDAFANYADFCFT 535
. ** ***:*** * :: :***.************ *****:*:::** :** ***:
human TFGDRVKLWVTFHEPWVMSYAGYGTGQHPPGISDPGVASFKVAHLVLKAHARTWHHYNSH 588
mouse TFGDRVKLWVTFHEPWVISYAGYGTGQHAPAISDPGVASFKVAHLILKAHARTWHHYNYH 590
rat TFGDRVKLWVTFHEPWVMSYAGYGTGQHAPAISDPGMASFKVAHLILKAHARTWHLYDLH 364
chicken TFGDRVKFWVTFHEPWVISYAGYGTGEHPPGITDPGIASYKVAHTILKAHAKVWHLYNDR 595
*******:*********:********:*.*.*:***:**:**** :*****:.** *: :
human HRPQQQGHVGIVLNSDWAEPLSPERPEDLRASERFLHFMLGWFAHPVFVDGDYPATLRTQ 648
mouse HRQKQQGRVGIVLNSDWAEPLDGKSPQDLAAAERYLHFMLGWFAHPIFIDGDYPAALRAQ 650
rat HRLQQQGRVGIVLNSDWAEPLDRKSPQDLAAAERFLHFMLGWFAHPIFVDGDYPTTLRAQ 424
chicken YRSQQQGRVGLVLNSDWAEPQTPANSEDVKASERYLQFMLGWFAHPIFVNGDYPDILKAQ 655
:* :***:**:********* .:*: *:**:*:*********:*::**** *::*
human IQQMNRQCSHPVAQLPEFTEAEKQLLKGSADFLGLSHYTSRLISNAPQNTCIPSYDTIGG 708
mouse IQHTNQQCGRPLAQLPEFTEAEKRLLKGSADFLGLSHYTSRLISKAGQQTCIPSYDNIGG 710
rat IQHINQQCGHPLAQLPEFTEAEKRLLKGSADFLGLSHYTSRLISKAGRQTCTSSYDNIGG 484
chicken IQEVNQQCSTTVAQLPVFTEEEKTWVKGTADFFGLSHYTSHLVTAVTNGTCTPGYESIGN 715
**. *:**. .:**** *** ** :**:***:*******:*:: . . ** ..*:.**.
human FSQHVNHVWPQTSSSWIRVVPWGIRRLLQFVSLEYTRGKVPIYLAGNGMPIGESENLFDD 768
mouse FSQHVDPKWPQTASPWIRVVPWGIRRLLRFASLEYTKGKLPIFLAGNGMPIGEGSDLFDD 770
rat FSQHVDPEWPQTASPWIRVVPWGIRRLLRFASMEYTKGKLPIFLAGNGMPVGEEADLFDD 544
chicken FSLHVDPSWPKTASSSIHVVPWGLRRLLKFVSQEYTGTKIPIYIAGNGMPTEAVGDLIND 775
** **: **:*:*. *:*****:****:*.* *** *:**::****** :*::*
human SLRVDYFNQYINEVLKAIKEDSVDVRSYIARSLIDGFEGPSGYSQRFGLHHVNFSDSSKS 828
mouse SMRVNYLNLYINEVLKAVKEDSVDVRSYIARSLIDGYEGPLGYSQRFGLYHVNFNDSSRP 830
rat SVRVNYFNLYINEVLKAVKEDLVDVRSYIVRSLIDGYEGPLGFSQRFGLYHVNFNDSSRP 604
chicken TLRVDYFRRYINEALKAIKLDAVDVQSYIARSLIDGFEGPGGYSLKFGLHHVNFEDSNRP 835
::**:*:. ****.***:* * ***:***.******:*** *:* :***:****.**.:.
human RTPRKSAYFFTSIIEKNGFLTKGAKRLLPPNTVNLPSKVR-AFTFPSEVPSKAKVVWEKF 887
mouse RTPRKSAYFFTSIIEKNGFPAKKVKRNPLPLKADFSSRARAAFSFPSEVPSKAKVVWEKF 890
rat RTPRKSAYFFTSIIEKNGFSAKKVKRNPLPVRADFTSRARVTDSLPSEVPSKAKIVWEKF 664
chicken RTPKASAYFYSSVIENNGFPSKVSDRSSTSVVFGLPTPSKLPSLPASEVPSKSKVVWQKF 895
***: ****::*:**:*** :* .* . .:.: : . .******:*:**:**
human SSQPKFERDLFYHGTFRDDFLWGVSSSAYQIEGAWDADGKGPSIWDNFTHTPGSNVKDNA 947
mouse SRQPKFERDLFYHGTFRDDFLWGVSSSAYQIEGGWDADGKGPSIWDNFTHTPGNGVKDNA 950
rat SKQPRFERDLFYHGTFRDDFLWGVSSSAYQIEGGWNADGKGPSIWDNFTHTPGNGVKDNA 724
chicken SSQTDFERDMYVYGTFPKDFTWGVSSSAYQIEGGWDADGKGPSVWDNFTHVPGN-IKNND 954
* *. ****:: :*** .** ************.*:*******:******.**. :*:*
human TGDIACDSYHQLDADLNMLRALKVKAYRFSISWSRIFPTGRNSSINSHGVDYYNRLINGL 1007
mouse TGDIACDSYHQLDADINILRTLKVKSYRFSISWPRIFPTGRNSSINKQGVDYYNKLIDRL 1010
rat TGDVACDSYHQLDADLNILRTLKVKSYRFSISWSRIFPTGRNSTINKQGVDYYNRLIDSL 784
chicken TGDIACNSYNKVEEDIYLLRALGVKNYRFSLSWPRIFPNGRNNSINSHGVDYYNRLIDGL 1014
***:**:**:::: *: :**:* ** ****:**.****.***.:**.:******:**: *
human VASNIFPMVTLFHWDLPQALQDIGGWENPALIDLFDSYADFCFQTFGDRVKFWMTFNEPM 1067
mouse LESNIFPMVTLFHWDLPQALQDIGGWENPSLIELFDSYADFCFKTFGDRVKFWMTFNEPW 1070
rat VDNNIFPMVTLFHWDLPQALQDIGGWENPSLIELFDSYADFCFKTFGDRVKFWMTFNEPW 844
chicken VANNITPIVTLYHWDLPQALQDIGGWENSELIELFDSFADFCFQTFGDRVKFWLTFNEPQ 1074
: .** *:***:****************. **:****:*****:*********:*****
human YLAWLGYGSGEFPPGVKDPGWAPYRIAHAVIKAHARVYHTYDEKYRQEQKGVISLSLSTH 1127
mouse CSAVLGYSSGIFPPNVQDPGSLSYKVSHVIIKAHARVYHTYDEKYRQEQKGVISLSLNTH 1130
rat CHVVLGYSSGIFPPSVQEPGWLPYKVSHIVIKAHARVYHTYDEKYRSEQKGVISLSLNTH 904
chicken VIAWVSYGTGEFPPNVNNPGSAPYEVAHTLLKAHARVYHTYDDKYRASQGGVISLCLNID 1134
. :.*.:* ***.*::** .*.::* ::***********:*** .* *****.*. .
human WAEPKSPGVPRDVEAADRMLQFSLGWFAHPIFRNGDYPDTMKWKVGNRSELQHLATSRLP 1187
mouse WVEPKDPGVQRDVEAADRMLQFNLGWFAHPIFKNGDYPDVMKWNVGNRSELQHLASSRLP 1190
rat WAEPKDPGLQRDVEAADRMLQFTMGWFAHPIFKNGDYPDVMKWTVGNRSELQHLASSRLP 964
chicken WIEPKTPSNPRDLEAADRYMQFLVGWFAHPVFKNGDYPEVMKWTVGNRSELQNLPSSRLP 1194
* *** *. **:***** :** :******:*:*****:.***.********:*.:****
human SFTEEEKRFIRATADVFCLNTYYSRIVQHKTPRLNPPSYEDDQEMAE-EEDPSWPSTAMN 1246
mouse SFTEEEKNYIRGTADVFCINTYTSVFAQHVTPRLNPPSYDNDMELKASDMNSSALISMMH 1250
rat TFTEEEKNYVRGTADVFCINTYTSVFVQHSTPRLNPPSYDDDMELKLIEMNSS--TGVMH 1022
chicken VFTAEEREYIRGTADVFCLNTYTAKLVTHATTRLNPFSYEYDQEIST-DVDSSWPTSALA 1253
** **:.::*.******:*** : :. * *.**** **: * *: : :.* :
human --RAAPWGTRRLLNWIKEEYGDIPIYITENGVGLTN-PNTEDTDRIFYHKTYINEALKAY 1303
mouse --QDVPWGMRRLLNWIKEEYGNIPIYITENGQGLTN-PTLDDTERIFYHKTYINEALKAY 1307
rat --PDVPWGTRRLLNWIKEEYGNIPIYITENGQGLEN-PTLDDTERIFYHKTYINEALKAY 1079
chicken GHRAVAWGLRRLLNWVKEEYGNPPMYIIENGVGIKTKSDVDDHTRILYYKTYIDEALKAY 1313
..** ******:*****: *:** *** *: . . :* **:*:****:******
human RLDGIDLRGYVAWSLMDNFEWLNGYTVKFGLYHVDFNNTNRPRTARASARYYTEVITNNG 1363
mouse RLDGVDLRGYSAWALMDNFEWLHGYTMRFGLYHVDFDHVNRPRTARASARYYTEVITNNG 1367
rat KLDGVDLRGYSAWTLMDDFEWLLGYTMRFGLYHVDFNHVSRPRTARASARYYAEVIANNG 1139
chicken KLDGVNLRGYNAWSFMDFFEWLNGYEPRFGLHEVDFNDPNRPRTPRRSAVYYAEIIRNNG 1373
:***::**** **::** **** ** :***:.***:. .****.* ** **:*:* ***
human MPLAREDEFLYGRFPEGFIWSAASAAYQIEGAWRADGKGLSIWDTFSHTPLRVENDAIGD 1423
mouse MPLAKEDEFLYGEFPKGFIWSAASASYQVEGAWRADGKGLSIWDTFSHTPLKIGNDDNGD 1427
rat MPLAREDEFLYGEFPKGFIWSAASASYQVEGAWRADGKGLSIWDTFSHTPLRIGNDDNGD 1199
chicken IPLPEENKFLYGEFPKNFCWSVATAAYQIEGAWRADGKGLSIWDKYTHTPLKISNDDNGD 1433
:**..*::****.**:.* **.*:*:**:***************.::****:: ** **
human VACDSYHKIAEDLVTLQNLGVSHYRFSISWSRILPDGTTRYINEAGLNYYVRLIDTLLAA 1483
mouse VACDSYHKIAEDVVALQNLGVSHYRFSISWPRILPDGTTKFINEAGLNYYVRFIDALLAA 1487
rat VACDSYHKIAEDVVALQNLGVSHYRFSIAWSRILPDGTTKFINEAGLSYYVRFIDALLAA 1259
chicken VACDSYHKIEEDVEMLKRLKVSHYRFSISWSRVLPDGTTRYINEMGLNYYERLIDALLAA 1493
********* **: *:.* ********:*.*:******::*** **.** *:**:****
human SIQPQVTIYHWDLPQTLQDVGGWENETIVQRFKEYADVLFQRLGDKVKFWITLNEPFVIA 1543
mouse GITPQVTMYHWDLPQALQDVGGWENETVVQRFKDYADVLFRRLGDKVKFWITLNEPFVIA 1547
rat GITPQVTIYHWDLPQALQDVGGWENETIVQRFKEYADVLFQRLGDRVKFWITLNEPFVIA 1319
chicken NITPQVTLYHWDLPQALQDIGGWENDTIVQRFKEYAELLFQRLGDKVKFWITLNEPYNTA 1553
.* ****:*******:***:*****:*:*****:**::**:****:**********: *
human YQGYGYGTAAPGVSNRPGTAPYIVGHNLIKAHAEAWHLYNDVYRASQGGVISITISSDWA 1603
mouse AHGYGSGVSAPGISFRPGTAPYTAGHNLIKAHAEAWHLYNSTYRNSQGGVISITISSDWA 1607
rat AQGYGTGVSAPGISFRPGTAPYIAGHNLIKAHAEAWHLYNDVYRARQGGTISITISSDWA 1379
chicken YLGYGFGTAAPGISVRPGRAPYVVGHNLIKAHAEAWHLYNETYRAKQGGLISITINSDWA 1613
*** *.:***:* *** *** .****************..** *** *****.****
human EPRDPSNQEDVEAARRYVQFMGGWFAHPIFKNGDYNEVMKTRIRDRSLAAGLNKSRLPEF 1663
mouse EPRDPSNQEDVEAARRYVQFMGGWFAHPIFKNGDYPEVMKTRIRDRSLAAGLNKSRLPEF 1667
rat EPRDPTNQGDVEAARRYVQFMGGWFAHPIFKNGDYPEVMKTRIRDRSLAAGLNKSRLPEF 1439
chicken EPRNPHKQEDFDAARQYLQFLIGWFAHPIFKNGDYNEVMKTRIRERSLAQGLSSSRLPEF 1673
***:* :* *.:***:*:**: ************* ********:**** **..******
human TESEKRRINGTYDFFGFNHYTTVLAYNLNYATAISSFDADRGVASIADRSWPDSGSFWLK 1723
mouse TESEKKKIQGTFDFFGFNHYTTVLAYNLNYAAAVSSFDADRGVASITDRSWPDSGSFWLK 1727
rat TESEKSRIKGTFDFFGFNHYTTVLAYNLDYPAAFSSFDADRGVASIADSSWPVSGSFWLK 1499
chicken TESEKQRIKGTYDYFGLNHYTTVLAYKYEYSTGILSYDADRGVASVTDRSWLNSGSFWLK 1733
***** :*:**:*:**:*********: :*.:.. *:********::* ** *******
human MTPFGFRRILNWLKEEYNDPPIYVTENGVSQREETDLNDTARIYYLRTYINEALKAVQ-D 1782
mouse MTPFGFRRILNWLKEEYNNPLIYVTENGVSRRGDPELNDTDRIYYLRSYINEALKAVR-D 1786
rat VTPFGFRRILNWLKEEYNNPPIYVTENGVSRRGEPELNDTDRIYYLRSYINEALKAVQ-D 1558
chicken VTPFGFRKLLQWIKEEYNNPPIYVTENGVSERGAIDFNDTWRIHYYQNYINEALKAVVLD 1793
:******::*:*:*****:* *********.* ::*** **:* :.********* *
human KVDLRGYTVWSAMDNFEWATGFSERFGLHFVNYSDPSLPRIPKASAKFYASVVRCNGFPD 1842
mouse KVDLRGYTVWSIMDNFEWATGFAERFGVHFVNRSDPSLPRIPKASAKVYASIVRCNGFPD 1846
rat KVDLRGYTVWSIMDNFEWATGFAERFGVHFVNRSDPSLPRIPKASAKFYATIVRCNGFPD 1618
chicken GVDLRGYTAWTLMDNFEWAVGYDERFGFYHVNYTDPTLPRLPKASARYYSQIISCNGFPD 1853
*******.*: *******.*: ****.:.** :**:***:*****: *: :: ******
human PATGPHACLHQP-DAGPTISPVRQEEVQFLGLMLGTTEAQTALYVLFSLVLLGVCGLAFL 1901
mouse PAQGPHPCLQQPEDAGPTASPVKS-EVPFLGLMLGTAEAQTALYVLFALVLLGVCSVAFL 1905
rat PAQGPHPCLQQPEDAAPTASPVQS-EVPFLGLMLGIAEAQTALYVLFALLLLGACSLAFL 1677
chicken PATGPHPCLETEPDVIPDTTPGLADSVRFLGLDLTSQRAEIALYVLFALSAIGALGLALI 1913
** ***.**. *. * :* .* **** * .*: ******:* :*. .:*::
human SYKYCKRSKQGKTQRSQQELSPVSSF 1927
mouse LYKYCKRSKQGTTQPGHHGLSQISSF 1931
rat TYKYCRRSKQGNAQPSQHQLSPISSF 1703
chicken SYKYGTISKRYHKQ-SSMELSKM--- 1935
*** **: * . ** :
Accession numbers of the lactase sequences used in the protein sequence alignment: human - EAX11622; mouse - NP_001074547; rat - EDM09876; chicken - NP_001104816
Predicted General Properties of the Lactase Protein (Homo sapiens) - Primary Sequence Analysis (with ProtParam)b
Number of amino acids: 1927
Molecular weight: 218572.6
Theoretical pI: 5.90
Total number of negatively charged residues (Asp + Glu): 218
Total number of positively charged residues (Arg + Lys): 181
Atomic composition: Carbon C 9907 Hydrogen H 14857 Nitrogen N 2651 Oxygen O 2877 Sulfur S 45
Formula: C9907H14857N2651O2877S45
Total number of atoms: 30337
Extinction coefficients: Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water. Ext. coefficient 422255 Abs 0.1% (=1 g/l) 1.932, assuming ALL Cys residues appear as half cystines Ext. coefficient 421130 Abs 0.1% (=1 g/l) 1.927, assuming NO Cys residues appear as half cystines
Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).
Instability index: The instability index (II) is computed to be 41.09 This classifies the protein as unstable.
Aliphatic index: 76.25
Grand average of hydropathicity (GRAVY): -0.362
Bioinformatics References
a Higgins D., Thompson J., Gibson T. Thompson J. D., Higgins D. G., Gibson T. J.(1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting,position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A.; Protein Identification and Analysis Tools on the ExPASy Server;
(In) John M. Walker (ed): The Proteomics Protocols Handbook, Humana Press (2005). pp. 571-607 Full text - Copyright Humana Press.