| Name | Accession | Description | Interval | E-value |
| APC_basic | pfam05956 | APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... | 1757-2089 | 6.66e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.82 E-value: 6.66e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1757 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1836
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1837 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1908
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1909 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1983
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1984 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2061
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 148699623 2062 RPETVKRYASLPHISVSRRSDSAVSVPT 2089
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep | pfam18797 | Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... | 356-424 | 2.61e-34 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 126.89 E-value: 2.61e-34
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699623 356 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTET-----PVPIEPQICQATCAVMKLSFDEEYRRAM 424
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGdsnpmPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC | pfam16689 | Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... | 6-57 | 5.36e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 102.37 E-value: 5.36e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 148699623 6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC | pfam11414 | Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... | 124-204 | 4.28e-18 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 80.76 E-value: 4.28e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 202
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 148699623 203 TS 204
Cdd:pfam11414 81 LI 82
|
|
| Arm_APC_u3 super family | cl25003 | Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... | 690-936 | 5.33e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. The actual alignment was detected with superfamily member pfam16629:
Pssm-ID: 435476 Cd Length: 293 Bit Score: 83.87 E-value: 5.33e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 690 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 769
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 770 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 832
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 833 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 908
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 148699623 909 AAHTSLSNDSLNSGSTSDGYCTREHMTP 936
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm | pfam00514 | Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... | 608-647 | 1.46e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.46e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 148699623 608 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1505-1944 | 7.32e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 7.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1505 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1581
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1582 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1658
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1659 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1738
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1739 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1818
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1819 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1898
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 148699623 1899 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1944
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
|
|
| PTZ00449 super family | cl33186 | 104 kDa microneme/rhoptry antigen; Provisional | 1999-2196 | 3.40e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional. The actual alignment was detected with superfamily member PTZ00449:
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 45.84 E-value: 3.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1999 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2070
Cdd:PTZ00449 608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2071 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2148
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 148699623 2149 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2196
Cdd:PTZ00449 765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
|
|
| APC_r | pfam05923 | APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1370-1392 | 6.38e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 6.38e-04
|
| Arm | pfam00514 | Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... | 649-689 | 3.29e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.05 E-value: 3.29e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 148699623 649 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 689
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r | pfam05923 | APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1247-1268 | 3.48e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.48e-03
|
| APC_r | pfam05923 | APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1135-1158 | 5.58e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.58e-03
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1285-1622 | 6.20e-03 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 6.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1285 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1359
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1360 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1439
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1440 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1510
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1511 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1587
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
|
330 340 350
....*....|....*....|....*....|....*
gi 148699623 1588 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1622
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
|
|
|
| Name | Accession | Description | Interval | E-value |
| APC_basic | pfam05956 | APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... | 1757-2089 | 6.66e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.82 E-value: 6.66e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1757 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1836
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1837 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1908
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1909 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1983
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1984 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2061
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 148699623 2062 RPETVKRYASLPHISVSRRSDSAVSVPT 2089
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep | pfam18797 | Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... | 356-424 | 2.61e-34 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 126.89 E-value: 2.61e-34
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699623 356 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTET-----PVPIEPQICQATCAVMKLSFDEEYRRAM 424
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGdsnpmPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC | pfam16689 | Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... | 6-57 | 5.36e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 102.37 E-value: 5.36e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 148699623 6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC | pfam11414 | Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... | 124-204 | 4.28e-18 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 80.76 E-value: 4.28e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 202
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 148699623 203 TS 204
Cdd:pfam11414 81 LI 82
|
|
| Arm_APC_u3 | pfam16629 | Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... | 690-936 | 5.33e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 83.87 E-value: 5.33e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 690 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 769
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 770 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 832
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 833 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 908
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 148699623 909 AAHTSLSNDSLNSGSTSDGYCTREHMTP 936
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm | pfam00514 | Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... | 608-647 | 1.46e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.46e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 148699623 608 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| ARM | smart00185 | Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... | 607-647 | 2.24e-06 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 46.27 E-value: 2.24e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 148699623 607 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 1769-1922 | 2.45e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 49.69 E-value: 2.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1769 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1845
Cdd:PTZ00449 523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148699623 1846 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1922
Cdd:PTZ00449 596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1505-1944 | 7.32e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 7.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1505 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1581
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1582 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1658
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1659 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1738
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1739 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1818
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1819 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1898
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 148699623 1899 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1944
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
|
|
| SAMP | pfam05924 | SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1591-1612 | 9.08e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.04 E-value: 9.08e-05
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 1999-2196 | 3.40e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 45.84 E-value: 3.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1999 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2070
Cdd:PTZ00449 608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2071 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2148
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 148699623 2149 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2196
Cdd:PTZ00449 765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
|
|
| APC_r | pfam05923 | APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1370-1392 | 6.38e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 6.38e-04
|
| Arm | pfam00514 | Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... | 649-689 | 3.29e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.05 E-value: 3.29e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 148699623 649 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 689
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r | pfam05923 | APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1247-1268 | 3.48e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.48e-03
|
| APC_r | pfam05923 | APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... | 1135-1158 | 5.58e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.58e-03
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1285-1622 | 6.20e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 6.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1285 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1359
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1360 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1439
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1440 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1510
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1511 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1587
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
|
330 340 350
....*....|....*....|....*....|....*
gi 148699623 1588 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1622
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
|
|
| COG4372 | COG4372 | Uncharacterized protein, contains DUF3084 domain [Function unknown]; | 7-86 | 7.91e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.04 E-value: 7.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 7 SYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSGQTEVLEQLKALQTDIS 84
Cdd:COG4372 32 QLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAELAQAQEELESLQEEAE 111
|
..
gi 148699623 85 SL 86
Cdd:COG4372 112 EL 113
|
|
|
| Name | Accession | Description | Interval | E-value |
| APC_basic | pfam05956 | APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... | 1757-2089 | 6.66e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.82 E-value: 6.66e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1757 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1836
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1837 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1908
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1909 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1983
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1984 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2061
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 148699623 2062 RPETVKRYASLPHISVSRRSDSAVSVPT 2089
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep | pfam18797 | Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... | 356-424 | 2.61e-34 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 126.89 E-value: 2.61e-34
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699623 356 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTET-----PVPIEPQICQATCAVMKLSFDEEYRRAM 424
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGdsnpmPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
6-57 |
5.36e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 102.37 E-value: 5.36e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 148699623 6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
124-204 |
4.28e-18 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 80.76 E-value: 4.28e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 202
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 148699623 203 TS 204
Cdd:pfam11414 81 LI 82
|
|
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
690-936 |
5.33e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold, before the beta-catenin binding motifs (APC_crr, pfam05923), on APC (adenomatous polyposis coli) proteins in higher eukaryotes. Its function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 83.87 E-value: 5.33e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 690 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 769
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 770 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 832
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 833 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 908
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 148699623 909 AAHTSLSNDSLNSGSTSDGYCTREHMTP 936
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
608-647 |
1.46e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.46e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 148699623 608 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
607-647 |
2.24e-06 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 46.27 E-value: 2.24e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 148699623 607 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1769-1922 |
2.45e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 49.69 E-value: 2.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1769 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1845
Cdd:PTZ00449 523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148699623 1846 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1922
Cdd:PTZ00449 596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1792-2274 |
4.21e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.17 E-value: 4.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1792 SGTTQPETVTKAPSPEQQRSRSLHRPGkiselaalrhpPRSATPPARLAKTPSSSSSQTSPASQPL-PRRSPLATPTGGP 1870
Cdd:PHA03247 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSP 2616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1871 LPGPGGSLVPKSPARALLAKQhKTQKSPVRIPFMQRPARRVPPPLARPSPEPGSRGRAGAEGTPGARGSRLGL-----VR 1945
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAArptvgSL 2695
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1946 MASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTASTSQAASP-------RRGRPALPAvflcssrcdel 2018
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpggpaRPARPPTTA----------- 2764
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2019 rvSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATRRG 2098
Cdd:PHA03247 2765 --GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2099 SDGEARPLP---RVAPPGTTWRR-------IKDEDVPHILRSTLPATALPLRVSSPEDSPAGTPQRKTSDAVVQTEDVAT 2168
Cdd:PHA03247 2843 PGPPPPSLPlggSVAPGGDVRRRppsrspaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2169 SKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEgLSAVIAGFPTSRHGSPSRAARVPPFNYVPS 2248
Cdd:PHA03247 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR-VAVPRFRVPQPAPSREAPASSTPPLTGHSL 3001
|
490 500
....*....|....*....|....*.
gi 148699623 2249 PmAAATMASDSAVEKAPVSSPASLLE 2274
Cdd:PHA03247 3002 S-RVSSWASSLALHEETDPPPVSLKQ 3026
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1505-1944 |
7.32e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 7.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1505 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1581
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1582 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1658
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1659 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1738
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1739 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1818
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1819 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1898
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 148699623 1899 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1944
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1591-1612 |
9.08e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.04 E-value: 9.08e-05
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1999-2196 |
3.40e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 45.84 E-value: 3.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1999 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2070
Cdd:PTZ00449 608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2071 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2148
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 148699623 2149 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2196
Cdd:PTZ00449 765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1905-2239 |
3.66e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.93 E-value: 3.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1905 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRMASARSSGSESSDRSGFRRqltfiKESPGLLRRRRSELS 1984
Cdd:PHA03307 80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML-----RPVGSPGPPPAASPP 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1985 SADSTASTSQAASPRRGRPALPAvflcsSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPE 2064
Cdd:PHA03307 154 AAGASPAAVASDAASSRQAALPL-----SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2065 TVKRYASlphisvsrRSDSAVSVPTTQANATRrgsdgEARPLPRvAPPGTTWRRIkDEDVPHILRSTL-----PATALPL 2139
Cdd:PHA03307 229 ADDAGAS--------SSDSSSSESSGCGWGPE-----NECPLPR-PAPITLPTRI-WEASGWNGPSSRpgpasSSSSPRE 293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2140 RVSSPEDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPP----ASAPFP 2215
Cdd:PHA03307 294 RSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSR 373
|
330 340
....*....|....*....|....
gi 148699623 2216 HEGLSAVIAGFPTSRHGSPSRAAR 2239
Cdd:PHA03307 374 APSSPAASAGRPTRRRARAAVAGR 397
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
2026-2256 |
5.57e-04 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 45.33 E-value: 5.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2026 LAAQRSPqakPGlaPRAPRRTSSESPsrlPVRASPGRPETVKRYASlphisvSRRSDSAVSVPTTQANATRRGSDGEAR- 2104
Cdd:PHA03321 424 LLSSRQP---PG--APAPRRDNDPPP---PPRARPGSTPACARRAR------AQRARDAGPEYVDPLGALRRLPAGAAPp 489
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2105 PLPRVAPPGTTWRRIKDEDVPHiLRSTLPATALPLRVSSPedSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDP 2184
Cdd:PHA03321 490 PEPAAAPSPATYYTRMGGGPPR-LPPRNRATETLRPDWGP--PAAAPPEQMEDPYLEPDDDRFDRRDGAAAAATSHPREA 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2185 PqAPASGPVAPQGSDVDGPV-------------LTKPPASAPFPHEGLS---------AVIAGFPTSRHGSPSRAARVPP 2242
Cdd:PHA03321 567 P-APDDDPIYEGVSDSEEPVyeeiptprvyqnpLPRPMEGAGEPPDLDAptspwveeeNPIYGWGDSPLFSPPPAARFPP 645
|
250
....*....|....
gi 148699623 2243 FNYVPSPMAAATMA 2256
Cdd:PHA03321 646 PDPALSPEPPALPA 659
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1370-1392 |
6.38e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 6.38e-04
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1972-2192 |
2.02e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 2.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1972 SPGLLRRRRSELSSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSP-RQPLAAQRSPQAKPGLAPRAP---RRTS 2047
Cdd:PHA03247 256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDdedGAME 335
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2048 SESP-----SRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATR-------RGSDGEARPLPRVAPPGTT 2115
Cdd:PHA03247 336 VVSPlprprQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPVPASV 415
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148699623 2116 wrriKDEDVPHILRSTLPATALPLRVSSP--EDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGP 2192
Cdd:PHA03247 416 ----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPGADL 490
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
649-689 |
3.29e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.05 E-value: 3.29e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 148699623 649 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 689
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1247-1268 |
3.48e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.48e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1135-1158 |
5.58e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.58e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1285-1622 |
6.20e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 6.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1285 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1359
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1360 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1439
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1440 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1510
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1511 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1587
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
|
330 340 350
....*....|....*....|....*....|....*
gi 148699623 1588 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1622
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1856-2241 |
7.85e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 7.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1856 PLPRRSPLATPTGGPLpgpggslvPKSPARALLAKQHKTQKSPVRI----------------------PFMQRPA---RR 1910
Cdd:PHA03247 2496 PDPGGGGPPDPDAPPA--------PSRLAPAILPDEPVGEPVHPRMltwirgleelasddagdpppplPPAAPPAapdRS 2567
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1911 VPPPLARPSP-EPGSRGRAGAEGTP--GARGSRLGLVRMASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSAD 1987
Cdd:PHA03247 2568 VPPPRPAPRPsEPAVTSRARRPDAPpqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1988 STASTSQAASPRRGRPalpavflcssrcdelrvsPRQPLAAQRSPQAKPglAPRAPRRTSSEsPSRLPVRASPGRPETVK 2067
Cdd:PHA03247 2648 PPERPRDDPAPGRVSR------------------PRRARRLGRAAQASS--PPQRPRRRAAR-PTVGSLTSLADPPPPPP 2706
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2068 RYASLPHISVsrrsdSAVSVPTTQANATRRGSDGEARPLPRVAPPGTTwrrikdEDVPHILRSTLPATALPLRVSSPEDs 2147
Cdd:PHA03247 2707 TPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPA------TPGGPARPARPPTTAGPPAPAPPAA- 2774
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2148 PAGTPQRKTSDAVVQTEDVATSKTNSSTSPS---------LESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEG 2218
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPAdppaavlapAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
410 420
....*....|....*....|...
gi 148699623 2219 lsAVIAGFPTSRHGSPSRAARVP 2241
Cdd:PHA03247 2855 --SVAPGGDVRRRPPSRSPAAKP 2875
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
7-86 |
7.91e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.04 E-value: 7.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 7 SYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSGQTEVLEQLKALQTDIS 84
Cdd:COG4372 32 QLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAELAQAQEELESLQEEAE 111
|
..
gi 148699623 85 SL 86
Cdd:COG4372 112 EL 113
|
|
|