|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
760-1047 |
4.50e-167 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 516.45 E-value: 4.50e-167
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 760 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 837
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 838 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 914
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 915 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 994
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 2248185750 995 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 1047
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
2251-2596 |
7.19e-109 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 351.49 E-value: 7.19e-109
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2251 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2329
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2330 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2409
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2410 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2483
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2484 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2563
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 2248185750 2564 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2596
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2698-2871 |
8.34e-90 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 289.98 E-value: 8.34e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2698 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2777
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2778 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2857
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 2248185750 2858 SPKRHSGSYLVTSV 2871
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
1064-1163 |
9.31e-57 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 192.04 E-value: 9.31e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1064 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1143
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 2248185750 1144 GSNHGINQNVSQSLCQEDDY 1163
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1774-1867 |
3.39e-47 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 164.63 E-value: 3.39e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1774 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1853
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 2248185750 1854 KLPNNEDRVRGSFA 1867
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
403-476 |
4.49e-45 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 157.71 E-value: 4.49e-45
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2248185750 403 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 476
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of ... |
1311-1396 |
1.44e-35 |
|
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 131.15 E-value: 1.44e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1311 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1388
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 2248185750 1389 PSKSGAQT 1396
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1901-1975 |
2.18e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 118.81 E-value: 2.18e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185750 1901 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1975
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1690-1743 |
1.15e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 98.71 E-value: 1.15e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 2248185750 1690 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1743
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
137-217 |
2.83e-22 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 93.09 E-value: 2.83e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 137 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 215
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 2248185750 216 TC 217
Cdd:pfam11414 81 LI 82
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1665-1688 |
1.50e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 49.30 E-value: 1.50e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
677-717 |
4.65e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.65e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 677 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 717
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
2061-2080 |
3.47e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 3.47e-05
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
540-581 |
5.30e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 5.30e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2248185750 540 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 581
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
719-759 |
7.07e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 7.07e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 719 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 759
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1744-1765 |
1.33e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.33e-03
|
| PTZ00449 super family |
cl33186 |
104 kDa microneme/rhoptry antigen; Provisional |
1319-1729 |
3.28e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional. The actual alignment was detected with superfamily member PTZ00449:
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.14 E-value: 3.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1319 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1395
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1396 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1468
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1469 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1537
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1538 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1611
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1612 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1688
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 2248185750 1689 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1729
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1285-1302 |
6.83e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 6.83e-03
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
350-402 |
8.19e-03 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 36.25 E-value: 8.19e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 2248185750 350 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 402
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
760-1047 |
4.50e-167 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 516.45 E-value: 4.50e-167
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 760 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 837
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 838 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 914
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 915 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 994
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 2248185750 995 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 1047
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
2251-2596 |
7.19e-109 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 351.49 E-value: 7.19e-109
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2251 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2329
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2330 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2409
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2410 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2483
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2484 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2563
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 2248185750 2564 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2596
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2698-2871 |
8.34e-90 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 289.98 E-value: 8.34e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2698 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2777
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2778 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2857
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 2248185750 2858 SPKRHSGSYLVTSV 2871
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
1064-1163 |
9.31e-57 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 192.04 E-value: 9.31e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1064 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1143
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 2248185750 1144 GSNHGINQNVSQSLCQEDDY 1163
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1774-1867 |
3.39e-47 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 164.63 E-value: 3.39e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1774 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1853
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 2248185750 1854 KLPNNEDRVRGSFA 1867
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
403-476 |
4.49e-45 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 157.71 E-value: 4.49e-45
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2248185750 403 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 476
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of ... |
1311-1396 |
1.44e-35 |
|
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 131.15 E-value: 1.44e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1311 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1388
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 2248185750 1389 PSKSGAQT 1396
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1901-1975 |
2.18e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 118.81 E-value: 2.18e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185750 1901 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1975
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1690-1743 |
1.15e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 98.71 E-value: 1.15e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 2248185750 1690 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1743
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
137-217 |
2.83e-22 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 93.09 E-value: 2.83e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 137 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 215
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 2248185750 216 TC 217
Cdd:pfam11414 81 LI 82
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1665-1688 |
1.50e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 49.30 E-value: 1.50e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
677-717 |
4.65e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.65e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 677 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 717
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
677-717 |
6.32e-07 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 47.83 E-value: 6.32e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 677 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 717
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2279-2592 |
1.40e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 1.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2279 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2354
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2355 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2432
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2433 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2511
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2512 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2581
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
|
330
....*....|.
gi 2248185750 2582 HSSSLPRVSTW 2592
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
2061-2080 |
3.47e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 3.47e-05
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
144-276 |
1.50e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 1.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 144 LEELEKERSLLLADLDKEEKEKDWY--YAQLQNLTKRIDSLP---------LTENFSLQTDM---------TRRQLEYEA 203
Cdd:COG4717 104 LEELEAELEELREELEKLEKLLQLLplYQELEALEAELAELPerleeleerLEELRELEEELeeleaelaeLQEELEELL 183
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2248185750 204 RQIRVAMEEQLgtcQDMEKRAQRRIARIQQIEKDILRIRQLLQsQATEAERSSQNKHETGsHDAERQNEGQGV 276
Cdd:COG4717 184 EQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELE-ELEEELEQLENELEAA-ALEERLKEARLL 251
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
540-581 |
5.30e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 5.30e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2248185750 540 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 581
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
540-580 |
6.10e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.10e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 540 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 580
Cdd:pfam00514 1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
719-759 |
7.07e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 7.07e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 719 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 759
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1744-1765 |
1.33e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.33e-03
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
130-270 |
1.55e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 44.28 E-value: 1.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 130 RRGFVNGSRESTGY--------LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrRQLEY 201
Cdd:TIGR02168 657 PGGVITGGSAKTNSsilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL--------------RKELE 722
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185750 202 EARQIRVAMEEQLGTcqdMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQ 270
Cdd:TIGR02168 723 ELSRQISALRKDLAR---LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1319-1729 |
3.28e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.14 E-value: 3.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1319 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1395
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1396 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1468
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1469 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1537
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1538 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1611
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1612 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1688
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 2248185750 1689 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1729
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1399-1421 |
3.61e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.61e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1285-1302 |
6.83e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 6.83e-03
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
350-402 |
8.19e-03 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 36.25 E-value: 8.19e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 2248185750 350 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 402
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
2251-2596 |
7.19e-109 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 351.49 E-value: 7.19e-109
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2251 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2329
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2330 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2409
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2410 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2483
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2484 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2563
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 2248185750 2564 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2596
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2698-2871 |
8.34e-90 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 289.98 E-value: 8.34e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2698 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2777
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2778 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2857
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 2248185750 2858 SPKRHSGSYLVTSV 2871
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
1064-1163 |
9.31e-57 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 192.04 E-value: 9.31e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1064 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1143
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 2248185750 1144 GSNHGINQNVSQSLCQEDDY 1163
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1774-1867 |
3.39e-47 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 164.63 E-value: 3.39e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1774 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1853
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 2248185750 1854 KLPNNEDRVRGSFA 1867
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
403-476 |
4.49e-45 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 157.71 E-value: 4.49e-45
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2248185750 403 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 476
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of ... |
1311-1396 |
1.44e-35 |
|
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 131.15 E-value: 1.44e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1311 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1388
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 2248185750 1389 PSKSGAQT 1396
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1901-1975 |
2.18e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 118.81 E-value: 2.18e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185750 1901 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1975
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1690-1743 |
1.15e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 98.71 E-value: 1.15e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 2248185750 1690 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1743
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
137-217 |
2.83e-22 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 93.09 E-value: 2.83e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 137 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 215
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 2248185750 216 TC 217
Cdd:pfam11414 81 LI 82
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1665-1688 |
1.50e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 49.30 E-value: 1.50e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
677-717 |
4.65e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.65e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 677 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 717
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
677-717 |
6.32e-07 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate the interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 47.83 E-value: 6.32e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2248185750 677 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 717
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2279-2592 |
1.40e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 1.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2279 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2354
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2355 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2432
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2433 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2511
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2512 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2581
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
|
330
....*....|.
gi 2248185750 2582 HSSSLPRVSTW 2592
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2284-2429 |
8.55e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 51.71 E-value: 8.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2284 PASKSPSEGQTATTSPRGAKPSVKSELSPVA----RQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISP 2359
Cdd:PHA03307 278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPApsspRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2360 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2429
Cdd:PHA03307 358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2277-2572 |
6.34e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 49.01 E-value: 6.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2277 KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQP---LSRPIQSPG 2353
Cdd:PHA03307 101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASsrqAALPLSSPE 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2354 RNSISPGRNGISPPNKLSQLPRTSSP----STASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSE----SA 2425
Cdd:PHA03307 181 ETARAPSSPPAEPPPSTPPAAASPRPprrsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplpRP 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2426 SKGLNQMNNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRQStfiKEAPSPTLRRKLEESASFESLSPSSRPASPTRS 2505
Cdd:PHA03307 261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185750 2506 QAQTPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEY---NDGRP---AKRHDIARSH--SESPSRLPINRS 2572
Cdd:PHA03307 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPtrrRARAAVAGRArrRDATGRFPAGRP 412
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
145-272 |
8.36e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 44.93 E-value: 8.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 145 EELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslQTDMTRRQLEYE-ARQIRVAMEEQLgtcQDMEKR 223
Cdd:COG1196 221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEEL--------EAELAELEAELEeLRLELEELELEL---EEAQAE 289
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 2248185750 224 AQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQNE 272
Cdd:COG1196 290 EYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2282-2542 |
9.00e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 44.57 E-value: 9.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2282 KTPASKSPSEGQTATTSPRGAKPsvkselSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPgr 2361
Cdd:pfam17823 151 RANASAAPRAAIAAASAPHAASP------APRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP-- 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2362 ngiSPPNKLSQLPrTSSPStASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSiPRSESASKGLNQMNNGNGANKk 2441
Cdd:pfam17823 223 ---AAGTALAAVG-NSSPA-AGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGD-PHARRLSPAKHMPSDTMARNP- 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2442 veLSRMSSTKSSGSESDRSERPVLvrqSTFIKEAPSPTlRRKLEESASFESLSPSSRPASPTRSQAQTPVLSPsLPDMSL 2521
Cdd:pfam17823 296 --AAPMGAQAQGPIIQVSTDQPVH---NTAGEPTPSPS-NTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASP-VPVLHT 368
|
250 260
....*....|....*....|.
gi 2248185750 2522 STHSSVQAGGWRKLPPNLSPT 2542
Cdd:pfam17823 369 SMIPEVEATSPTTQPSPLLPT 389
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
144-269 |
1.63e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 44.29 E-value: 1.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 144 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDMTRRQLEYEARQIRVAME 211
Cdd:TIGR02169 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185750 212 EqlgtCQDMEKRAQRRIARIQ-QIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAER 269
Cdd:TIGR02169 312 E----KERELEDAEERLAKLEaEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1319-1729 |
3.28e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.14 E-value: 3.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1319 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1395
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1396 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1468
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1469 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1537
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1538 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1611
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 1612 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1688
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 2248185750 1689 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1729
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1399-1421 |
3.61e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.61e-03
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
144-258 |
4.63e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 4.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 144 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrrqleyeARQIRvAMEEQLgtcQDMEKR 223
Cdd:COG4942 29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL--------------------ARRIR-ALEQEL---AALEAE 84
|
90 100 110
....*....|....*....|....*....|....*
gi 2248185750 224 AQRRIARIQQIEKDILRIRQLLQSQATEAERSSQN 258
Cdd:COG4942 85 LAELEKEIAELRAELEAQKEELAELLRALYRLGRQ 119
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2280-2596 |
6.28e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.47 E-value: 6.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2280 PLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQiGGSSKAPSRSGsrDSTPSRPAQQPLSRPIQSPGRNSISP 2359
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR-EGSPTPPGPSS--PDPPPPTPPPASPPPSPAPDLSEMLR 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2360 GRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKM---------SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2429
Cdd:PHA03307 140 PVGSPGPPPAASPPAAGASPAaVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2430 NQMNNGNGANKKVELSRMSSTKSSGSESD---RSERPVLVRQSTfikeaPSPTLRRKLEESASFESLSPSSRPASPTRSQ 2506
Cdd:PHA03307 220 PAPAPGRSAADDAGASSSDSSSSESSGCGwgpENECPLPRPAPI-----TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2507 AqtPVLSPSLPDMSLSTHSSVQAGGWRKLPP-NLSPTIEYNDGRPAKRHDIARSHSESPS----RLPINRSGTWKREHSK 2581
Cdd:PHA03307 295 S--PSPSPSSPGSGPAPSSPRASSSSSSSREsSSSSTSSSSESSRGAAVSPGPSPSRSPSpsrpPPPADPSSPRKRPRPS 372
|
330
....*....|....*
gi 2248185750 2582 HSSSLPRVSTWRRTG 2596
Cdd:PHA03307 373 RAPSSPAASAGRPTR 387
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1285-1302 |
6.83e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 6.83e-03
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
350-402 |
8.19e-03 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 36.25 E-value: 8.19e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 2248185750 350 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 402
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2278-2568 |
9.02e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 9.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2278 GPPLKTPASKSPSEGQTATTSPRGAKPSvkselsPVARQTSqIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSI 2357
Cdd:PHA03247 2603 DDRGDPRGPAPPSPLPPDTHAPDPPPPS------PSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2358 SPgrngiSPPNKLSQ--LPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNG 2435
Cdd:PHA03247 2676 AS-----SPPQRPRRraARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT 2750
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185750 2436 NGANKKVE--------LSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPT--LRRKLEESASFESLSPSSRPAS---- 2501
Cdd:PHA03247 2751 PGGPARPArppttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpADPPAAVLAPAAALPPAASPAGplpp 2830
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185750 2502 PTRSQAQTPVLSPSLPDMSLSTHSSVQAGG--WRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSRLP 2568
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
|
|
|