Sequence Alignment of the Transferrin Protein (with ClustalW, ref. a)
Species    Alignment of Transferrin
human -------MRLAVGALLVCAVLGLCLAVP-DKTVRWCAVSEHEATKCQSFRDHMKSVIPSD 52
mouse -------MRLTVGALLACAALGLCLAVP-DKTVKWCAVSEHENTKCISFRDHMKTVLPPD 52
rat -------MRFAVGALLACAALGLCLAVP-DKTVKWCAVSEHENTKCISFRDHMKTVLPAD 52
cow -------MRPAVRALLACAVLGLCLADP-ERTVRWCTISTHEANKCASFRENVLRILES- 51
horse -------MRLAIRALLACAVLGLCLA---EQTVRWCTVSNHEVSKCASFRDSMKSIVPA- 49
bee MMLRCNIWTLAVNVLFVNSFLFVIAAQDSSGRIFTICVPEIYSKECDEMKKDSAVK---- 56
frog -------MDFSLRVALCLSMLALCLAIQKEKQVRWCVKSNSELKKCKDLVDTCK----NK 49
salmon -------------------CLATVYAAXVEGMVRWCVKSDKELQKCHDLAAKVA------ 35
cod ------------------------------------------------------------
human GPSVACVKKASYLDCIRAIAANEADAVTLDAGLVYDAYLA---PNNLKPVVAEFYGSKED 109
mouse GPRLACVKKTSYPDCIKAISASEADAMTLDGGWVYDAGLT---PNNLKPVAAEFYGSVEH 109
rat GPRLACVKKTSYQDCIKAISGGEADAITLDGGWVYDAGLT---PNNLKPVAAEFYGSLEH 109
cow GPFVSCVKKTSHMDCIKAISNNEADAVTLDGGLVYEAGLK---PNNLKPVVAEFHGTKDN 108
horse PPLVACVKRTSYLECIKAIADNEADAVTLDAGLVFEAGLS---PYNLKPVVAEFYGSKTE 106
bee GIPVSCISGRDRYECIEKVGKKEADVVAVDPEDMYLAVKDNKLASNAGYNVIEQVRTKEE 116
frog EIKLSCVEKSNTDECSLLFRKTMQMQFVWTGGDVYKGSLQ---PYNLKPIMAENYGSHTE 106
salmon --EFSCVKRDDSFECIKAIKREEADAITLDGGDIYTAGLH---NYNLQPIIAEDYG--ED 88
cod -----GIKEADATECILAIKAGEADAITLDGGEIYTAGQH---PYDLQPIISEKYG--SG 50
:. . :* . .. :: . : *
human PQTFYYAVAVVKKDSGFQMN---QLRGKKSCHTGLGRSAGWNIPIGLLYC--------DL 158
mouse PQTYYYAVAVVKKGTDFQLN---QLEGKKSCHTGLGRSAGWVIPIGLLFC--------KL 158
rat PQTHYLAVAVVKKGTDFQLN---QLQGKKSCHTGLGRSAGWIIPIGLLFC--------NL 158
cow PQTHYYAVAVVKKDTDFKLN---ELRGKKSCHTGLGRSAGWNIPMGKLYK--------EL 157
horse PQTHYYAVAVVKKNSNFQLN---QLQGKKSCHTGLGRSAGWNIPIGLLYW--------QL 155
bee PHAPYRYEAVAVIHKDLPINNVQGLRGLKSCHTGVGRNVGYKIPITKLTAMGVLNNLHDP 176
frog TDTCYYAVAVVKKSSKFTFD---ELKDKKSCHTGIGKTAGWNIIIGLLLERKLL----KW 159
salmon SDTCYYAVAVAKKGTKFGFL---NLHGKKSCHTGLGKSAGWNIPIGTLVTLDQI----QW 141
cod S-SCYYAVAVVKKDTGFSFK---QLRGKKSCHTGIGKTAGWNIPIGTLLTTGQL----VW 102
. : * **. . : : *.. ******:*:..*: * : *
human PEP-RKPLEKAVANFFSGSCAPCADGT----------DFPQLCQLCPG-----CGCSTLN 202
mouse SEP-RSPLEKAVSSFFSGSCVPCADPV----------AFPKLCQLCPG-----CGCSSTQ 202
rat PEP-RKPLEKAVASFFSGSCVPCADPV----------AFPQLCQLCPG-----CGCSPTQ 202
cow PDP-QESIQRAAANFFSASCVPCADQS----------SFPKLCQLCAGKGTDKCACSNHE 206
horse PEP-RESLQKAVSNFFAGSCVPCADRT----------AVPNLCQLCVGKGTDKCACSNHE 204
bee EYSARENELRALSSLFSKGCLVGTWSPDPAINRRLKETYSNMCALCEK----PEVCDYPD 232
frog AGPDSETWRNAVSKFFKASCVPGAKE-------------PKLSQLCAGIKEHKCSRSNNE 206
salmon AGIEDRPVESAVSDFFNASCVPGAN------------TGSELCQLCMG----DCSRSHNE 185
cod SGQEDLPVES-VSTFFSKSCVPGAGGL----------VGGKLCTLCPS----DCSKSATN 147
: :* .* : ::. ** . :
human QYFGYSGAFKCLKDGAGDVAFVKHSTIF----------ENLANKADRDQYELLCLDNTRK 252
mouse PFFGYVGAFKCLKDGGGDVAFVKHTTIF----------EVLPEKADRDQYELLCLDNTRK 252
rat PFFGYVGAFKCLRDGGGDVAFVKHTTIF----------EVLPQKADRDQYELLCLDNTRK 252
cow PYFGYSGAFKCLMEGAGDVAFVKHSTVF----------DNLPNPEDRKNYELLCGDNTRK 256
horse PYFGYSGAFKCLADGAGDVAFVKHSTVL----------ENLPQEADRDEYQLLCRDNTRK 254
bee IYSGYEGALRCLAHNGGEIAWTKVIYVKRFFGLPVGVTAAIPTSENPADYRYFCPDGSKV 292
frog PYYNYAGAFKCLQDDQGDVAFVKQSTVP----------EEFHK-----DYELLCPDNTRK 251
salmon PYYDYSGAFQCLKDGAGEVAFIKHLTVP----------AAEKAS-----YELLCKDNSRA 230
cod PYFGYAGAFKCLKDDAGDVAFINHLTVP----------ASEKAN-----YELLCLDGTRA 192
: .* **::** .. *::*: : : *. :* *.::
human PVDE-YKDCHLAQVPSHTVVARSMGGKEDLIWELLNQAQEHFGKDKSKEFQLFSSP--HG 309
mouse PVDQ-YEDCYLARIPSHAVVARKNNGKEDLIWEILKVAQEHFGKGKSKDFQLFSSP--LG 309
rat PVDQ-YEDCYLARIPSHAVVARNGDGKEDLIWEILKVAQEHFGKGKSKDFQLFGSP--LG 309
cow SVDD-YQECYLAMVPSHAVVARTVGGKEDVIWELLNHAQEHFGKDKPDNFQLFQSP--HG 313
horse SVDE-YKDCYLASIPSHAVVARSVDGKEDLIWGLLNQAQEHFGTEKSKDFHLFSSP--HG 311
bee PIDANTKPCTWAARPWQGYMTNNGVNNVEAVQKELTDLG---------KLGEEEKAD-WW 342
frog SIKE-YKNCNLAKVPAHAVLTRGRDDKSKDIIEFLQEAQ------KTQECKLFRLPG-MG 303
salmon PIDS-YKTCHLARVPAHAVVSRKDTKLANHIYSKLMALK---------DFNLFSSDGYAA 280
cod PIDS-YKTCNLARVPAHAVVSRVDPELAERIFTALTTVT---------GFSFFSSAGFGA 242
.:. : * * * : ::. . : *
human KDLLFKDSAHGFLKVPPRMDAKMYLGYEYVTAIRNLREGTCPEAPTDECKPVKWCALSHH 369
mouse KDLLFKDSAFGLLRVPPRMDYRLYLGHNYVTAIRNQQEGVCPEGSIDNS-PVKWCALSHL 368
rat KDLLFKDSAFGLLRVPPRMDYRLYLGHSYVTAIRNQREGVCPEGSIDSA-PVKWCALSHQ 368
cow KDLLFKDSADGFLKIPSKMDFELYLGYEYVTALQNLRESKPPDSSKDEC-MVKWCAIGHQ 372
horse KDLLFKDSALGFLRIPPAMDTWLYLGYEYVTAIRNLREDIRPEVPKDECKKVKWCAIGHH 371
bee KDIMLLNEKTLAVPAPPVLPENHLKNAKYLDVIE--------RNSGATDKIIRWCTWSEG 394
frog KGSNFQGQRSESYSPPIFYGQFSVPRSRLFQCIQALKEGVKEDDSAAQV-KVRWCTQSKA 362
salmon KNLMFKDSTQNLVQLPMTTDSFLYLGAEYMSTIRSLTT---AQATGATSRAIKWCAVGHN 337
cod ANLMFKDTTQSLVRLPDGSNSFLYLGAKYMASIQSLKK---ESDQPITP-AIKWCAVGHA 298
. : . * . :. ::**: ..
human ERLKCDEWSVNSVG-----KIECVSAETTEDCIAKIMNGEADAMSLDGGFVYIAG-KCGL 423
mouse ERTKCDEWSIISEG-----KIECESAETTEDCIEKIVNGEADAMTLDGGHAYIAG-QCGL 422
rat ERAKCDEWSVSSNG-----QIECESAESTEDCIDKIVNGEADAMSLDGGHAYIAG-QCGL 422
cow ERTKCDRWSGFSGG-----AIECETAENTEECIAKIMKGEADAMSLDGGYLYIAG-KCGL 426
horse EKVKCDEWSVNSGG-----NIECESAQSTEDCIAKIVKGEADAMSLDGGFIYIAG-KCGL 425
bee DLEKCKALTRAAYSRDVRPKYDCTLEKSQDDCLKAIKENNADLTVVSGGSVLRATKEYNT 454
frog EKTKCDDWTTISGG-----AIECTEASTAEECIVQILKGDADAVTLDGGYMYTAG-LCGL 416
salmon EKVKCDAWTINSFTDGD-SRIECQDAPTVDECIKKIMRKEADAIAVDGGEVFTAG-QCGL 395
cod EKKKCDSWSSFSVSDGV-KSVACQISLTVEGCFQRIMRQEADAMSVDGGQVYTAG-KCQL 356
: **. : : * . : *: * . :** :.** *
human VPVLAENYNKS----DNCEDTPEAG-YFAVAVVKKS-ASDLTWDNLKGKKSCHTAVGR-T 476
mouse VPVMAEYYESSNCAIPSQQGIFPKG-YYAVAVVKAS-DTSITWNNLKGKKSCHTGVDR-T 479
rat VPVMAENYDISSCTNP-QSDVFPKG-YYAVAVVKAS-DSSINWNNLKGKKSCHTGVDR-T 478
cow VPVLAENYKTE---GESCKNTPEKG-YLAVAVVKTS-DANINWNNLKDKKSCHTAVDR-T 480
horse VPVLAENYETRS--GSACVDTPEEG-YHAVAVVKSSSDPDLTWNSLKGKKSCHTGVDR-T 481
bee VPIIAESYGSGS------TNFNERP---AVAVVSKS-SSINKLEDLRNKKSCHSGYKDSF 504
frog VPVMGEYYDQDDLTPCQRSCSQAKGVYYAVAIVKKG--TQVSWSNLRGVKTCHTAVGR-T 473
salmon VPVMVEQYDEVR----CSAPG-EASSYFAVAVAKRA--SGLTWKTLQGRRSCHTGLGR-T 447
cod IPAMVEQYNQSL----CSSAGTPQATYFAVAVVKKG--SGVTWDNLRGKRSCHTGLGR-T 409
:* : * * ***:.. . . . . *:. ::**:.
human AGWNIPMGLLYN---KINHCRFDEFFSEGCAPGSKKDSSLCKLCMGSGL---------NL 524
mouse AGWNIPMGMLYN---RINHCKFDEFFSQGCAPGYEKNSTLCDLCIGP-----------LK 525
rat AGWNIPMGLLFS---RINHCKFDEFFSQGCAPGYKKNSTLCDLCIGP-----------AK 524
cow AGWNIPMGLLYS---KINNCKFDEFFSAGCAPGSPRNSSLCALCIGSEKGTG------KE 531
horse AGWNIPMGLLYS---EIKHCEFDKFFREGCAPGYRRNSTLCNLCIGSASGPG------RE 532
bee AGWTAPIYTLKRKGLIKSENEAADFFSGSCAPGAPLDSKLCQQCVGNLASNNDRIRQVTK 564
frog AGWNIPVGLITS---ETANCDFASYVGESCAPGSDVKSNLCALCIGDPEKLSE---REKK 527
salmon AGWNIPMGLIHR---RTMNCDFTTYFSKGCAPGFEVDSPFCAQCRGSGQSVGG---DRAK 501
cod AGWNIPMGLVHS---IHGSCDFGGFFPSGCAPGSEPSSTFCRQCAGSGSGVE----DGSK 462
***. *: : :. .**** .* :* * *
human CEPNNKEGYYGYTGAFRCLVE-KGDVAFVKHQTVPQNTGGKNPDPWAKNLNEKDYELLCL 583
mouse CAPNNKEEYNGYTGAFRCLVE-KGDVAFVKHQTVLDNTEGKNPAEWAKNLKQEDFELLCP 584
rat CAPNNREGYNGYTGAFQCLVE-KGDVAFVKHQTVLENTNGKNTAAWAKDLKQEDFQLLCP 583
cow CVPNSNERYYGYTGAFRCLVE-KGDVAFVKDQTVIQNTDGNNNEAWAKNLKKENFEVLCK 590
horse CEPNNHERYYGYTGAFRCLVE-KGDVAFVKHQTVEQNTDGRNPDDWAKDLKSENFKLLCP 591
bee CKATNEETYRGGKGALSCLLDGKGDVAFVPLTALSE-----------EGVQSKDLALICP 613
frog CSPSASEAYYGYSGAFRCLVE-KGQVGFAKHTTVFENTDGKNPAGWAKDLKSEDFELLCP 586
salmon CKASSEEQYYGYTGAFRCLVEDAGDVAFIKHTIVPEMTDGNGP-VWAQNLMSSDFELLCQ 560
cod CSASSVEKYYGYAGAFRCLVDGAGDVAFIKHTIVADNSDGQGP-AWATALKSSDYQLICP 521
* .. * * * **: **:: *:*.* : : : ..: ::*
human DGT-RKPVEEYANCHLARAPNHAVVTRKDKEACVHKILRQQQHLFGSNVTDCSGNFCLFR 642
mouse DGT-RKPVKDFASCHLAQAPNHVVVSRKEKAARVKAVLTSQETLFG--GSDCTGNFCLFK 641
rat DGT-KKPVTEFATCHLAQAPNHVVVSRKEKAARVSTVLTAQKDLFWKGDKDCTGNFCLFR 642
cow DGT-RKPVTDAENCHLARGPNHAVVSRKDKATCVEKILNKQQDDFGKSVTDCTSNFCLFQ 649
horse DGT-RKSVTEFKSCYLARAPNHAVVSRKEKAACVCQELHNQQASYGKNGSHCPDKFCLFQ 650
bee DGG-RAEINEWERCNLGLEPPRVILSSGAKSPTVLEELTHGTLAASTLYSKRPDLLHLFG 672
frog DGS-RAPVTDYKRCNLAEVPAHAVVTLPDKREQVAKIVVNQQSLYGRKGFQK-DIFQMFQ 644
salmon DGT-TQPVTKFRECHLAKVPAHAVITRPESRGEVVSILLEQQARFGSSGSDS--SFNMFK 617
cod GGVGRAEISDFASCNLAAVPSHAVVTRQDIRDDVVKMLLDQQRKFGIDGSDP--LFRIYE 579
.* : . * *. * :.::: * : . : ::
human SET--KDLLFRDDTVCLAKLHDRNTYEKYLGEEYVKAVGNLRKCSTS------------- 687
mouse STT--KDLLFRDDTKCFVKLPEGTTPEKYLGAEYMQSVGNMRKCSTS------------- 686
rat SST--KDLLFRDDTKCLTKLPEGTTYEEYLGAEYLQAVGNIRKCSTS------------- 687
cow SNS--KDLLFRDDTKCLASIA-KKTYDSYLGDDYVRAMTNLRQCSTS------------- 693
horse SAT--KDLLFRDDTQCLANLQPTTTYKTYLGEKYLTAVANLRQCSTS------------- 695
bee SWSNRPNLLFKDEAKDLVSVNKSWNKWN----DWQETQNNYGAA---------------- 712
frog STGG-KDLLFKDSTQCLLEIPSKTTMQEFLGDKYHTAVTSLNKCSTSNEASWLPAQFHSC 703
salmon TDFG-KNLLFKDSTKCLQEIPSGTKFQGFLGEEYMIAMQSLRECSNSTS----------- 665
cod SKDG-NNLLFKDSTKCLKEIPSLTTADAFLGTGYVNAIMSLRQCPETAS----------- 627
: :***:*.: : .: . : : . .
human ---SLLEACTFRRP---- 698
mouse ---RLLEACTFHKH---- 697
rat ---RLLEACTFHKS---- 698
cow ---KLLEACTFHKP---- 704
horse ---RLLEACTFHRV---- 706
bee ------------------
frog MKIYIMVDCPLSII---- 717
salmon ---DLEKACT-------- 672
cod ---ELEKTCISSSCSTAE 642
Accession numbers of the transferrin protein sequences used in the protein sequence alignment: human - NP_001054; mouse - NP_598738; rat - NP_001013128; cow - NP_803450; horse - NP_001075415; bee - NP_001011572; frog - NP_001079812; salmon - AAF03084; cod - AAB08440
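The alignment rows above can be checked and compared programmatically. The following is a minimal sketch, not the ClustalW method itself: it assumes each row has the form "species segment count", treats the trailing number as the running count of residues with gaps excluded, and computes a simple pairwise percent identity over columns where neither sequence has a gap. The helper names (parse_row, residue_count, percent_identity) are hypothetical and introduced only for illustration.

```python
def parse_row(row):
    """Split one alignment row into (species, aligned_segment, running_count)."""
    parts = row.split()
    count = int(parts[2]) if len(parts) > 2 else None  # some rows carry no count
    return parts[0], parts[1], count


def residue_count(segment):
    """Number of residues in an aligned segment, ignoring gap characters."""
    return sum(1 for ch in segment if ch != "-")


def percent_identity(seg_a, seg_b):
    """Percent of identical residues over columns where both rows are ungapped."""
    pairs = [(x, y) for x, y in zip(seg_a, seg_b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# First block of the alignment, human and mouse rows, copied from the table.
rows = [
    "human -------MRLAVGALLVCAVLGLCLAVP-DKTVRWCAVSEHEATKCQSFRDHMKSVIPSD 52",
    "mouse -------MRLTVGALLACAALGLCLAVP-DKTVKWCAVSEHENTKCISFRDHMKTVLPPD 52",
]

parsed = [parse_row(r) for r in rows]
for species, segment, count in parsed:
    # In the first block the running count equals the residues in that row.
    print(species, residue_count(segment), count)

identity = percent_identity(parsed[0][1], parsed[1][1])
print(f"human vs mouse, first block: {identity:.1f}% identity")
```

For later blocks the trailing number is cumulative, so the per-row residue count has to be added to the previous block's number before comparing.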
Predicted General Properties of Transferrin (Homo sapiens) - Primary Sequence Analysis (with ProtParam, ref. b)
Number of amino acids: 698
Molecular weight: 77049.8
Theoretical pI: 6.81
Total number of negatively charged residues (Asp + Glu): 87
Total number of positively charged residues (Arg + Lys): 85
Atomic composition: Carbon (C) 3389; Hydrogen (H) 5282; Nitrogen (N) 934; Oxygen (O) 1021; Sulfur (S) 50
Formula: C3389H5282N934O1021S50
Total number of atoms: 10676
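The total atom count follows directly from the formula, and the formula also gives a rough cross-check of the molecular weight. The sketch below assumes rounded average atomic masses (not necessarily the exact values ProtParam uses), so the mass is expected to agree with the reported 77049.8 only approximately.

```python
import re

# Rounded average atomic masses; an assumption of this sketch.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "S": 32.06}


def parse_formula(formula):
    """Return {element: count} for a formula such as 'C3389H5282N934O1021S50'."""
    return {el: int(n) for el, n in re.findall(r"([A-Z][a-z]?)(\d+)", formula)}


composition = parse_formula("C3389H5282N934O1021S50")

# Sum of all atom counts; matches the reported total of 10676.
total_atoms = sum(composition.values())

# Approximate mass from the elemental composition.
approx_mass = sum(ATOMIC_MASS[el] * n for el, n in composition.items())

print(total_atoms)
print(round(approx_mass, 1))
```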
Extinction coefficients (in units of M^-1 cm^-1, at 280 nm, measured in water):
Ext. coefficient 85240, Abs 0.1% (= 1 g/l) 1.106, assuming ALL Cys residues appear as half cystines
Ext. coefficient 82740, Abs 0.1% (= 1 g/l) 1.074, assuming NO Cys residues appear as half cystines
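The Abs 0.1% values are the molar extinction coefficient divided by the molecular weight. As a worked check, the sketch below recomputes both values from the numbers reported here. It also estimates how many cystines the difference between the two coefficients implies, assuming the per-cystine contribution of 125 M^-1 cm^-1 at 280 nm described in the ProtParam documentation; that constant is not stated in this page.

```python
MOLECULAR_WEIGHT = 77049.8   # reported above
EXT_ALL_CYSTINE = 85240      # all Cys residues as half cystines
EXT_NO_CYSTINE = 82740       # no Cys residues as half cystines
CYSTINE_COEFF = 125          # assumed per-cystine contribution at 280 nm


def abs_01_percent(ext_coefficient, molecular_weight):
    """Absorbance of a 1 g/l (0.1%) solution in a 1 cm path."""
    return ext_coefficient / molecular_weight


abs_all = round(abs_01_percent(EXT_ALL_CYSTINE, MOLECULAR_WEIGHT), 3)
abs_none = round(abs_01_percent(EXT_NO_CYSTINE, MOLECULAR_WEIGHT), 3)

# The two coefficients differ only by the cystine term.
implied_cystines = (EXT_ALL_CYSTINE - EXT_NO_CYSTINE) / CYSTINE_COEFF

print(abs_all, abs_none, implied_cystines)
```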
Estimated half-life: The N-terminal residue of the sequence considered is M (Met). The estimated half-life is 30 hours (mammalian reticulocytes, in vitro), >20 hours (yeast, in vivo), and >10 hours (Escherichia coli, in vivo).
Instability index: The instability index (II) is computed to be 34.94. This classifies the protein as stable.
Aliphatic index: 73.38
Grand average of hydropathicity (GRAVY): -0.337
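Both the aliphatic index and GRAVY are simple functions of amino acid composition. The sketch below shows one way to compute them, assuming the Kyte-Doolittle hydropathy scale for GRAVY and the usual aliphatic index formula X(Ala) + 2.9 X(Val) + 3.9 (X(Ile) + X(Leu)), with X in mole percent. It is applied here only to a short fragment copied from the human row of the alignment, so its output is not expected to match the whole-protein values of 73.38 and -0.337.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Grand average of hydropathicity: mean hydropathy value per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


def aliphatic_index(sequence):
    """Aliphatic index from the mole percent of Ala, Val, Ile and Leu."""
    n = len(sequence)

    def mole_percent(aa):
        return 100.0 * sequence.count(aa) / n

    return (mole_percent("A")
            + 2.9 * mole_percent("V")
            + 3.9 * (mole_percent("I") + mole_percent("L")))


# First 21 residues of the human row in the alignment (gaps removed).
fragment = "MRLAVGALLVCAVLGLCLAVP"

print(round(gravy(fragment), 3))
print(round(aliphatic_index(fragment), 2))
```

Running the same two functions on the full 698-residue human sequence (accession NP_001054) would be the way to compare against the ProtParam values listed above.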
Bioinformatics References
a Thompson J.D., Higgins D.G., Gibson T.J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
b Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A. (2005). Protein Identification and Analysis Tools on the ExPASy Server. In: John M. Walker (ed.), The Proteomics Protocols Handbook, Humana Press, pp. 571-607.