|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
449-736 |
9.76e-166 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435476 Cd Length: 293 Bit Score: 512.21 E-value: 9.76e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 449 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 526
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 527 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 603
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 604 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 683
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1237937751 684 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 736
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1940-2285 |
5.72e-108 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules. :
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 348.79 E-value: 5.72e-108
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1940 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2018
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2019 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2098
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2099 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2172
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2173 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2252
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 1237937751 2253 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2285
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2387-2560 |
2.07e-88 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins. :
Pssm-ID: 399141 Cd Length: 174 Bit Score: 285.74 E-value: 2.07e-88
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2387 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2466
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2467 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2546
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 1237937751 2547 SPKRHSGSYLVTSV 2560
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
753-852 |
1.17e-56 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 406923 Cd Length: 100 Bit Score: 191.66 E-value: 1.17e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 753 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 832
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 1237937751 833 GSNHGINQNVSQSLCQEDDY 852
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1463-1556 |
1.48e-46 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435479 Cd Length: 94 Bit Score: 162.70 E-value: 1.48e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1463 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1542
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 1237937751 1543 KLPNNEDRVRGSFA 1556
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
110-183 |
3.39e-45 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region. :
Pssm-ID: 465870 Cd Length: 74 Bit Score: 158.09 E-value: 3.39e-45
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 110 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 183
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ... |
1000-1085 |
2.43e-35 |
|
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435478 Cd Length: 89 Bit Score: 130.38 E-value: 2.43e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1000 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1077
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 1237937751 1078 PSKSGAQT 1085
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1590-1664 |
4.14e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435480 Cd Length: 81 Bit Score: 118.04 E-value: 4.14e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 1590 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1664
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1379-1432 |
1.61e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 406927 Cd Length: 54 Bit Score: 98.33 E-value: 1.61e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 1379 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1432
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1354-1377 |
1.71e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 48.92 E-value: 1.71e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
366-406 |
4.15e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin. :
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.15e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1750-1769 |
2.97e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin. :
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 2.97e-05
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
229-270 |
4.73e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin. :
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 4.73e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1237937751 229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 270
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
408-448 |
6.30e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats. :
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.30e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 408 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 448
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1433-1454 |
1.14e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin. :
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.14e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1088-1110 |
3.92e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.92e-03
|
| PTZ00449 super family |
cl33186 |
104 kDa microneme/rhoptry antigen; Provisional |
1008-1418 |
6.00e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional The actual alignment was detected with superfamily member PTZ00449:
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 41.98 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1008 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1084
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1085 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1157
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1158 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1226
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1227 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1300
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1301 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1377
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1237937751 1378 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1418
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
57-109 |
7.30e-03 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin. :
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 36.25 E-value: 7.30e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1237937751 57 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 109
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
974-991 |
7.34e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 35.82 E-value: 7.34e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
449-736 |
9.76e-166 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 512.21 E-value: 9.76e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 449 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 526
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 527 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 603
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 604 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 683
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1237937751 684 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 736
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1940-2285 |
5.72e-108 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 348.79 E-value: 5.72e-108
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1940 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2018
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2019 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2098
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2099 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2172
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2173 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2252
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 1237937751 2253 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2285
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2387-2560 |
2.07e-88 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 285.74 E-value: 2.07e-88
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2387 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2466
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2467 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2546
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 1237937751 2547 SPKRHSGSYLVTSV 2560
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
753-852 |
1.17e-56 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 191.66 E-value: 1.17e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 753 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 832
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 1237937751 833 GSNHGINQNVSQSLCQEDDY 852
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1463-1556 |
1.48e-46 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 162.70 E-value: 1.48e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1463 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1542
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 1237937751 1543 KLPNNEDRVRGSFA 1556
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
110-183 |
3.39e-45 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 158.09 E-value: 3.39e-45
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 110 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 183
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ... |
1000-1085 |
2.43e-35 |
|
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 130.38 E-value: 2.43e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1000 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1077
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 1237937751 1078 PSKSGAQT 1085
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1590-1664 |
4.14e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 118.04 E-value: 4.14e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 1590 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1664
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1379-1432 |
1.61e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 98.33 E-value: 1.61e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 1379 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1432
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1354-1377 |
1.71e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 48.92 E-value: 1.71e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
366-406 |
4.15e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.15e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
366-406 |
5.64e-07 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 47.83 E-value: 5.64e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1968-2281 |
1.38e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1968 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2043
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2044 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2121
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2122 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2200
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2201 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2270
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
|
330
....*....|.
gi 1237937751 2271 HSSSLPRVSTW 2281
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1750-1769 |
2.97e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 2.97e-05
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
229-270 |
4.73e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 4.73e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1237937751 229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 270
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
229-269 |
5.44e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 5.44e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 269
Cdd:pfam00514 1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
408-448 |
6.30e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.30e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 408 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 448
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1433-1454 |
1.14e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.14e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1088-1110 |
3.92e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.92e-03
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1008-1418 |
6.00e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 41.98 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1008 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1084
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1085 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1157
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1158 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1226
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1227 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1300
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1301 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1377
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1237937751 1378 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1418
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
57-109 |
7.30e-03 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 36.25 E-value: 7.30e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1237937751 57 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 109
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
974-991 |
7.34e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 35.82 E-value: 7.34e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
449-736 |
9.76e-166 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 512.21 E-value: 9.76e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 449 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 526
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 527 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 603
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 604 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 683
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1237937751 684 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 736
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1940-2285 |
5.72e-108 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 348.79 E-value: 5.72e-108
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1940 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2018
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2019 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2098
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2099 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2172
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2173 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2252
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 1237937751 2253 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2285
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2387-2560 |
2.07e-88 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 285.74 E-value: 2.07e-88
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2387 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2466
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2467 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2546
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 1237937751 2547 SPKRHSGSYLVTSV 2560
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
753-852 |
1.17e-56 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 191.66 E-value: 1.17e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 753 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 832
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 1237937751 833 GSNHGINQNVSQSLCQEDDY 852
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1463-1556 |
1.48e-46 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 162.70 E-value: 1.48e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1463 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1542
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 1237937751 1543 KLPNNEDRVRGSFA 1556
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
110-183 |
3.39e-45 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 158.09 E-value: 3.39e-45
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 110 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 183
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ... |
1000-1085 |
2.43e-35 |
|
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 130.38 E-value: 2.43e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1000 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1077
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 1237937751 1078 PSKSGAQT 1085
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1590-1664 |
4.14e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 118.04 E-value: 4.14e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 1590 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1664
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1379-1432 |
1.61e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 98.33 E-value: 1.61e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 1379 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1432
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1354-1377 |
1.71e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 48.92 E-value: 1.71e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
366-406 |
4.15e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.15e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
366-406 |
5.64e-07 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 47.83 E-value: 5.64e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1968-2281 |
1.38e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1968 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2043
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2044 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2121
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2122 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2200
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2201 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2270
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
|
330
....*....|.
gi 1237937751 2271 HSSSLPRVSTW 2281
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1973-2118 |
8.36e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 51.71 E-value: 8.36e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1973 PASKSPSEGQTATTSPRGAKPSVKSELSPVA----RQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISP 2048
Cdd:PHA03307 278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPApsspRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2049 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2118
Cdd:PHA03307 358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1750-1769 |
2.97e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 2.97e-05
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1966-2261 |
5.94e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 5.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1966 KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQP---LSRPIQSPG 2042
Cdd:PHA03307 101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASsrqAALPLSSPE 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2043 RNSISPGRNGISPPNKLSQLPRTSSP----STASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSE----SA 2114
Cdd:PHA03307 181 ETARAPSSPPAEPPPSTPPAAASPRPprrsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplpRP 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2115 SKGLNQMNNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRQStfiKEAPSPTLRRKLEESASFESLSPSSRPASPTRS 2194
Cdd:PHA03307 261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 2195 QAQTPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEY---NDGRP---AKRHDIARSH--SESPSRLPINRS 2261
Cdd:PHA03307 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPtrrRARAAVAGRArrRDATGRFPAGRP 412
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
229-270 |
4.73e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 4.73e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1237937751 229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 270
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
229-269 |
5.44e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 5.44e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 269
Cdd:pfam00514 1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
408-448 |
6.30e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.30e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1237937751 408 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 448
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1433-1454 |
1.14e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.14e-03
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1971-2231 |
2.18e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.41 E-value: 2.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1971 KTPASKSPSEGQTATTSPRGAKPsvkselSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPgr 2050
Cdd:pfam17823 151 RANASAAPRAAIAAASAPHAASP------APRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP-- 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2051 ngiSPPNKLSQLPrTSSPStASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSiPRSESASKGLNQMNNGNGANKk 2130
Cdd:pfam17823 223 ---AAGTALAAVG-NSSPA-AGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGD-PHARRLSPAKHMPSDTMARNP- 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2131 veLSRMSSTKSSGSESDRSERPVLvrqSTFIKEAPSPTlRRKLEESASFESLSPSSRPASPTRSQAQTPVLSPsLPDMSL 2210
Cdd:pfam17823 296 --AAPMGAQAQGPIIQVSTDQPVH---NTAGEPTPSPS-NTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASP-VPVLHT 368
|
250 260
....*....|....*....|.
gi 1237937751 2211 STHSSVQAGGWRKLPPNLSPT 2231
Cdd:pfam17823 369 SMIPEVEATSPTTQPSPLLPT 389
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1088-1110 |
3.92e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.92e-03
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1969-2285 |
5.84e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 5.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1969 PLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQiGGSSKAPSRSGsrDSTPSRPAQQPLSRPIQSPGRNSISP 2048
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR-EGSPTPPGPSS--PDPPPPTPPPASPPPSPAPDLSEMLR 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2049 GRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKM---------SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2118
Cdd:PHA03307 140 PVGSPGPPPAASPPAAGASPAaVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2119 NQMNNGNGANKKVELSRMSSTKSSGSESD---RSERPVLVRQSTfikeaPSPTLRRKLEESASFESLSPSSRPASPTRSQ 2195
Cdd:PHA03307 220 PAPAPGRSAADDAGASSSDSSSSESSGCGwgpENECPLPRPAPI-----TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2196 AqtPVLSPSLPDMSLSTHSSVQAGGWRKLPP-NLSPTIEYNDGRPAKRHDIARSHSESPS----RLPINRSGTWKREHSK 2270
Cdd:PHA03307 295 S--PSPSPSSPGSGPAPSSPRASSSSSSSREsSSSSTSSSSESSRGAAVSPGPSPSRSPSpsrpPPPADPSSPRKRPRPS 372
|
330
....*....|....*
gi 1237937751 2271 HSSSLPRVSTWRRTG 2285
Cdd:PHA03307 373 RAPSSPAASAGRPTR 387
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1008-1418 |
6.00e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 41.98 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1008 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1084
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1085 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1157
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1158 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1226
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1227 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1300
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1301 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1377
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1237937751 1378 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1418
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
57-109 |
7.30e-03 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 36.25 E-value: 7.30e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1237937751 57 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 109
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
974-991 |
7.34e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 35.82 E-value: 7.34e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1967-2257 |
7.53e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 7.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1967 GPPLKTPASKSPSEGQTATTSPRGAKPSvkselsPVARQTSqIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSI 2046
Cdd:PHA03247 2603 DDRGDPRGPAPPSPLPPDTHAPDPPPPS------PSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2047 SPgrngiSPPNKLSQ--LPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNG 2124
Cdd:PHA03247 2676 AS-----SPPQRPRRraARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT 2750
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2125 NGANKKVE--------LSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPT--LRRKLEESASFESLSPSSRPAS---- 2190
Cdd:PHA03247 2751 PGGPARPArppttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpADPPAAVLAPAAALPPAASPAGplpp 2830
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1237937751 2191 PTRSQAQTPVLSPSLPDMSLSTHSSVQAGG--WRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSRLP 2257
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
|
|
|