8.A.77.1.8 Papilin isoform X5 of 3072 aas. This large protein contains three domains that with filters, has an N-terminal domain that hits proteases of TC family 8.A.77 with a minimal e-value of e-51, a large domain in the C-terminal half of the protein that hits proteases of TC family 8.A.13 with a minimal e-value of e-20, and a small C-terminal domain that hits protein of TC family 8.A.23 with a minimal e-value of e-13. Numerous papilins come up in BLAST searches using 1.B.89.1.1 (an outer membrane porin from Actinobacteria) with low scores. between residues 1762 and 2518, there are about 12 repeats of ~60 aas. They are homologous to the amyloid A4 protein (TC# 1.C.50.1.1).
|
Accession Number: | XP_021194905 |
Protein Name: | XP_021194905.1 papilin isoform X5 [Helicoverpa armigera] |
Length: | 3074 |
Molecular Weight: | |
Species: | Helicoverpa armigera [29058] |
Number of TMSs: | 1 |
Substrate |
|
---|
1: MTAFNFRSLL LAAIVLSNCI TWTASRHHYA HNVHQHRTRH RRQGAGLYLP ASYIIPGQEG
61: RSENDGGWGE WGPVSPCSRT CGGGVASQKR ICLQIGQDGQ PQCQGGDTKY FSCQTQDCPA
121: GSGDFRAEQC AEFNDKEFRG FKYNWVPYTK APNPCELNCM PHGERFYFRH KLKVVDGTRC
181: NEDSFDVCVD GRCQQVGCDM MLGSNAREDK CRECRGNGAN CRTTAGIIDS QDLIKGYNDV
241: LLIPEGSTTI IIEEIDASNN YLALRAKNDT YYLNGHYHID FPRTIMIAGA LWTYERSQQG
301: FAAPDKLRCL GPTTEPLYLS LLLQDVNVGI RYEYSVPKDQ APPAHKQYNW VHEEFTPCSA
361: TCGGGFQTRN VTCRSREELE IVDDELCDAG LKPPTNQTCN TDPCPAEWNE GPWGNCSQRC
421: GSGGVRSREV TCQKIIANGI KSIVEDTECF ERLGPKPKLF ESCNEDAPCP TWFVGRWKPC
481: SELCDEGKQT RQVVCHQKKN GRVEVLSDEN CLEEKPEAEK SCLLRPCEGI DWVTSDWVGC
541: DNCLSKTRTR RVVCATYSKQ VINDSFCSYH TRPADQEECE KVPECDVQWY ATQWSKCSVS
601: CGEGVQTRKV FCGVLNDDAV VILEDEKCKD IPRYKDSKPC QVPKEDCPPE WFAGPWENCT
661: KECGGGEQSR RVMCFKGDQP FGNCTDYSVL EASQFCNTGP CNEDELLGVT EEVKDPNVYC
721: EDEEDYEEVG VDEDLSTTTN EMMSDSPSPS AFAEHESTLE GSGSTTPLED LTEEGSGEST
781: FSTGTDWTTT DSDYETGSGD ISTTDEPLSS SPKAAKESGS TTTTVAMPTE SIPRSSVVPS
841: SSTEATTETG ETDTTGTATD ETTSTGTDET GSTDTDSTGS TTEGETGSTT EYTGTTETDI
901: TTESSTTLED SSSTGTESTD STVTSETGST ESSVSADTKE TGSTIESSSP VSITSPTDTT
961: GSTSSGETET TESSGSTSSD VSTDETGSTE ATSTEETGST ESTTTDVTDT DFTDTSNTET
1021: TGTEDTESTV TEETSSTEST VTETTVTSET ESTVTSETES TETETTLTGS TDSESTSTDS
1081: TEESTTGVSV SGSTDSTTTP VAMSSKSGSS TTTTVAMPTE KSSSDASETT EGSTESSGST
1141: VESSTTERES TTESGSTVEE TGSTTVGGST LETESTVESS TASGSEETTE QSTTVSGVES
1201: STVSESGSTT EAAETSESGS TPVGSSTLSG SESTTEVAES SSDSSTETGE TTESSTTVGS
1261: ESTTEGGSTT EGEESTTETG ETSSTGATES TEESSTTESG STEESTTEGL STTEGSTIAG
1321: STEESGSTTE GLTTTEGPSI GSSPWDKVTV LVPHTARTCI PRPKKCKNSR YGCCPDGKTA
1381: AKGPFDAECK TIHNCKESPF KCCPDGVSPA QGPNFKGCPI EPCADTLFGC CQSDNKTAAQ
1441: GNDQEGCPPP PPACASSKFG CCADNETEAS GPEKEGCPET ETTTTGATTE TTTDTTESIE
1501: TSEGSTTAEG TSTETATEYA STTLDPCTGF QYGCCADNQT ESPGPDGQGC PCEATEYGCC
1561: SDGKTPAKGE KDAGCPGPCS TSQHGCCEDG QTPAHGPDFE GCCLLHAFGC CPDNRKPAEG
1621: PHLEGCGCQY TRHGCCPDNV TIAQGPGNEG CGCQYSQHGC CPDKHTIALG PNFEGCACHT
1681: YQFGCCPDGV TTAKGPEQQG CHCFESPFGC CGDEETHATG PEKAGCDCST SKYGCCPDGV
1741: TEATGSKFLG CTDAPENKQA SCSLPTDPGS CHNFTAMWYY DLAYGGCSRF WYGGCEGNGN
1801: RFATKEECED VCVQPAPKDA CKLPSVKGAC DAEYTRWHYN STMEQCVQFR YGGCLGNANN
1861: FDSRELCQKQ CEPTTVEGQC KLPIESGSCS GNYSRWGFNV DSGKCERFTW GGCEGNSNRF
1921: STEAACLLRC LIGAQPPQCS EPQEAGTCSD KQALWSFSVS ENRCMPFYYS GCGGNHNRFT
1981: SREACEQTCP SAYEIDKCTL PAETGECSNY RERWFYDTAI KRCRQFYYGG CGGNENNFNT
2041: EAECEGSCAE LQTTTTTTTA RPTQPQQQRP ENPEPAEYCL LEIDAGPCND TVTRYAYDSA
2101: LGRCVTFEYG GCGGNQNNFP DYEYCSLYCG ATQDICQLPM MTGPCEASLQ RFFYDPATDS
2161: CSQFTYGGCE GNDNRFETRE ACESRCRSRP APRPTPSPTT ITPPPAVDIP AECRSAQETC
2221: SHGGVVWYFD PTRSECVSHA NYENGSDCRY SNTYSSQEGC ERSCGAFKGV DVCKQRMDPG
2281: PCRAYMPKVY FDASTGQCRE FVYGGCLGGA NRFSSVDECS QVCKSEIEDV CSLPPEEGNC
2341: FSYIPQWYYD TLRDQCLQFV YTGCSGNDNR FETKSDCESR CKRPTLTTTT TVAPPQQTEE
2401: SECKTPSSLE PCGANVTMFY YDSERRQCLT GEIGNCGHPN TYRTEEECER RCGAFVGLDP
2461: CGSHLDPGPC RASIPKFYWD SITASCQEYS YGGCDGGPNR FSTVDECESV CKAFRPPVEC
2521: LQRADVGAAC GLPPGARYHY SAALADCVVF AYLGCGGNGN NFRSYQQCLD HCTPNLQPDV
2581: VDCSTYRMEC DALDCKFGTL HYEQDGCERC ICKEDPCLRA NCSASEHCEV YLKRDPGVQG
2641: VEYAAKCIAE ENEVFDCGDY VEHCSRLQCE YGVQRARLPN GCEQCSCVQV EVDCRPLQEE
2701: CDQLICNYGM DKIPGPDGCE RCKCKDYPCA SKSCAAGERC VVSQYWEAVN QEMKYSADCH
2761: QIVKPGACPV EQTTTNEVTC RRDCKDDADC RGVGKCCRRG CSSVCTEPVE QTSPHPLITT
2821: LAPDVPAAPQ ALPAPEPQVE AAEGGKATLR CLFHGNPPPK ITWKYGEVTI DGNAGRYRLQ
2881: SDGALEIVSL YRNDTGVYIC VADNGLGQAT QEIYLAVTVP VTANISTNPT TMSVGRDLYL
2941: ICNLDGFPEP EVYWTKDGYP LQSDGRISIT GSTIVSRLTV GRLTVSDSGL YACHAQNQYS
3001: SQTDTVQINV QQQVVVPAKC TDNPFFANCD LIVHSKFCKH KYYSKFCCKS CVEAGQLDPR
3061: EAELQADQPL RKK