Conserved domains on [gi|148699623|gb|EDL31570|]

adenomatosis polyposis coli 2 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1757-2089 6.66e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 316.82  E-value: 6.66e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1757 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1836
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1837 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1908
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1909 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1983
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1984 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2061
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 148699623  2062 RPETVKRYASLPHISVSRRSDSAVSVPT 2089
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
356-424 2.61e-34

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 126.89  E-value: 2.61e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699623   356 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTET-----PVPIEPQICQATCAVMKLSFDEEYRRAM 424
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGdsnpmPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
6-57 5.36e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 102.37  E-value: 5.36e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 148699623     6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
124-204 4.28e-18

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 80.76  E-value: 4.28e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 202
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 148699623   203 TS 204
Cdd:pfam11414   81 LI 82
Arm_APC_u3 super family cl25003
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
690-936 5.33e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


The actual alignment was detected with superfamily member pfam16629:

Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.33e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   690 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 769
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   770 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 832
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   833 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 908
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 148699623   909 AAHTSLSNDSLNSGSTSDGYCTREHMTP 936
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
608-647 1.46e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.46e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 148699623   608 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1505-1944 7.32e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1505 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1581
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1582 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1658
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1659 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1738
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1739 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1818
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1819 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1898
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*.
gi 148699623 1899 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1944
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
1999-2196 3.40e-04

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.84  E-value: 3.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1999 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2070
Cdd:PTZ00449  608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2071 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2148
Cdd:PTZ00449  686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 148699623 2149 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2196
Cdd:PTZ00449  765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1370-1392 6.38e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.38e-04
                           10        20
                   ....*....|....*....|...
gi 148699623  1370 DDSGTDSAEGTPVNFSSAASLSD 1392
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 3.29e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.05  E-value: 3.29e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 148699623   649 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 689
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1247-1268 3.48e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.48e-03
                           10        20
                   ....*....|....*....|..
gi 148699623  1247 SVRFTVEKPDENFSCASSLSAL 1268
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1135-1158 5.58e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.58e-03
                           10        20
                   ....*....|....*....|....
gi 148699623  1135 SSSSENCVQETPLVLSRCSSVSSL 1158
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1285-1622 6.20e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 6.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1285 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1359
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1360 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1439
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1440 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1510
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1511 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1587
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 148699623 1588 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1622
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
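
Each hit above carries an "Interval E-value" row (for example `1757-2089 6.66e-97`) and a "Pssm-ID" summary line. As a minimal sketch, assuming the text layout shown on this page, the two line types could be read into structured values as follows. The helper names `parse_summary` and `parse_interval` are hypothetical and are not part of any NCBI tool or API.

```python
import re

# Hypothetical helpers for the plain-text layout on this page only;
# they are not an NCBI API and make no claim about other output formats.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

INTERVAL_RE = re.compile(r"\s*(\d+)-(\d+)\s+([\d.eE+-]+)\s*")


def parse_summary(line):
    """Return the fields of a 'Pssm-ID: ...' line as a dict, or None."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def parse_interval(line):
    """Return (start, end, e_value) for a row like '1757-2089 6.66e-97'."""
    m = INTERVAL_RE.fullmatch(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  "
        "Bit Score: 316.82  E-value: 6.66e-97"
    )
    start, end, e_value = parse_interval("1757-2089 6.66e-97")
    # Query residues covered by the hit, counting both endpoints.
    print(hit["pssm_id"], end - start + 1)
```

The regular expressions are deliberately strict, so lines that do not match return `None` rather than a partial record.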
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1757-2089 6.66e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 316.82  E-value: 6.66e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1757 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1836
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1837 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1908
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1909 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1983
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1984 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2061
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 148699623  2062 RPETVKRYASLPHISVSRRSDSAVSVPT 2089
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
356-424 2.61e-34

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 126.89  E-value: 2.61e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699623   356 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTET-----PVPIEPQICQATCAVMKLSFDEEYRRAM 424
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGdsnpmPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
6-57 5.36e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 102.37  E-value: 5.36e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 148699623     6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
124-204 4.28e-18

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 80.76  E-value: 4.28e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 202
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 148699623   203 TS 204
Cdd:pfam11414   81 LI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
690-936 5.33e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.33e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   690 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 769
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   770 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 832
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   833 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 908
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 148699623   909 AAHTSLSNDSLNSGSTSDGYCTREHMTP 936
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
608-647 1.46e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.46e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 148699623   608 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
607-647 2.24e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.24e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 148699623    607 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1769-1922 2.45e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 49.69  E-value: 2.45e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1769 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1845
Cdd:PTZ00449  523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148699623 1846 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1922
Cdd:PTZ00449  596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
PHA03247 PHA03247
large tegument protein UL36; Provisional
1505-1944 7.32e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1505 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1581
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1582 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1658
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1659 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1738
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1739 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1818
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1819 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1898
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*.
gi 148699623 1899 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1944
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1591-1612 9.08e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.04  E-value: 9.08e-05
                           10        20
                   ....*....|....*....|..
gi 148699623  1591 SPRAEEELLQRCISLAMPRRRT 1612
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1999-2196 3.40e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.84  E-value: 3.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1999 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2070
Cdd:PTZ00449  608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2071 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2148
Cdd:PTZ00449  686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 148699623 2149 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2196
Cdd:PTZ00449  765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1757-2089 6.66e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 316.82  E-value: 6.66e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1757 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1836
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1837 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1908
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1909 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1983
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623  1984 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2061
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 148699623  2062 RPETVKRYASLPHISVSRRSDSAVSVPT 2089
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
356-424 2.61e-34

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 126.89  E-value: 2.61e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148699623   356 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTET-----PVPIEPQICQATCAVMKLSFDEEYRRAM 424
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGdsnpmPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
6-57 5.36e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 102.37  E-value: 5.36e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 148699623     6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
124-204 4.28e-18

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 80.76  E-value: 4.28e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 202
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 148699623   203 TS 204
Cdd:pfam11414   81 LI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
690-936 5.33e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.33e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   690 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 769
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   770 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 832
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623   833 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 908
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 148699623   909 AAHTSLSNDSLNSGSTSDGYCTREHMTP 936
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
608-647 1.46e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.46e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 148699623   608 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
607-647 2.24e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.24e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 148699623    607 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 647
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1769-1922 2.45e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 49.69  E-value: 2.45e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1769 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1845
Cdd:PTZ00449  523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148699623 1846 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1922
Cdd:PTZ00449  596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
PHA03247 PHA03247
large tegument protein UL36; Provisional
1792-2274 4.21e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 4.21e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1792 SGTTQPETVTKAPSPEQQRSRSLHRPGkiselaalrhpPRSATPPARLAKTPSSSSSQTSPASQPL-PRRSPLATPTGGP 1870
Cdd:PHA03247 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSP 2616
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1871 LPGPGGSLVPKSPARALLAKQhKTQKSPVRIPFMQRPARRVPPPLARPSPEPGSRGRAGAEGTPGARGSRLGL-----VR 1945
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAArptvgSL 2695
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1946 MASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTASTSQAASP-------RRGRPALPAvflcssrcdel 2018
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpggpaRPARPPTTA----------- 2764
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2019 rvSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATRRG 2098
Cdd:PHA03247 2765 --GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2099 SDGEARPLP---RVAPPGTTWRR-------IKDEDVPHILRSTLPATALPLRVSSPEDSPAGTPQRKTSDAVVQTEDVAT 2168
Cdd:PHA03247 2843 PGPPPPSLPlggSVAPGGDVRRRppsrspaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2169 SKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEgLSAVIAGFPTSRHGSPSRAARVPPFNYVPS 2248
Cdd:PHA03247 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR-VAVPRFRVPQPAPSREAPASSTPPLTGHSL 3001
                         490       500
                  ....*....|....*....|....*.
gi 148699623 2249 PmAAATMASDSAVEKAPVSSPASLLE 2274
Cdd:PHA03247 3002 S-RVSSWASSLALHEETDPPPVSLKQ 3026
PHA03247 PHA03247
large tegument protein UL36; Provisional
1505-1944 7.32e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1505 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1581
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1582 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1658
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1659 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1738
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1739 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1818
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1819 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1898
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*.
gi 148699623 1899 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1944
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1591-1612 9.08e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.04  E-value: 9.08e-05
                           10        20
                   ....*....|....*....|..
gi 148699623  1591 SPRAEEELLQRCISLAMPRRRT 1612
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1999-2196 3.40e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.84  E-value: 3.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1999 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2070
Cdd:PTZ00449  608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2071 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2148
Cdd:PTZ00449  686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 148699623 2149 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2196
Cdd:PTZ00449  765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1905-2239 3.66e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 3.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1905 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRMASARSSGSESSDRSGFRRqltfiKESPGLLRRRRSELS 1984
Cdd:PHA03307   80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML-----RPVGSPGPPPAASPP 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1985 SADSTASTSQAASPRRGRPALPAvflcsSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPE 2064
Cdd:PHA03307  154 AAGASPAAVASDAASSRQAALPL-----SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2065 TVKRYASlphisvsrRSDSAVSVPTTQANATRrgsdgEARPLPRvAPPGTTWRRIkDEDVPHILRSTL-----PATALPL 2139
Cdd:PHA03307  229 ADDAGAS--------SSDSSSSESSGCGWGPE-----NECPLPR-PAPITLPTRI-WEASGWNGPSSRpgpasSSSSPRE 293
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2140 RVSSPEDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPP----ASAPFP 2215
Cdd:PHA03307  294 RSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSR 373
                         330       340
                  ....*....|....*....|....
gi 148699623 2216 HEGLSAVIAGFPTSRHGSPSRAAR 2239
Cdd:PHA03307  374 APSSPAASAGRPTRRRARAAVAGR 397
PHA03321 PHA03321
tegument protein VP11/12; Provisional
2026-2256 5.57e-04

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 45.33  E-value: 5.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2026 LAAQRSPqakPGlaPRAPRRTSSESPsrlPVRASPGRPETVKRYASlphisvSRRSDSAVSVPTTQANATRRGSDGEAR- 2104
Cdd:PHA03321  424 LLSSRQP---PG--APAPRRDNDPPP---PPRARPGSTPACARRAR------AQRARDAGPEYVDPLGALRRLPAGAAPp 489
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2105 PLPRVAPPGTTWRRIKDEDVPHiLRSTLPATALPLRVSSPedSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDP 2184
Cdd:PHA03321  490 PEPAAAPSPATYYTRMGGGPPR-LPPRNRATETLRPDWGP--PAAAPPEQMEDPYLEPDDDRFDRRDGAAAAATSHPREA 566
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2185 PqAPASGPVAPQGSDVDGPV-------------LTKPPASAPFPHEGLS---------AVIAGFPTSRHGSPSRAARVPP 2242
Cdd:PHA03321  567 P-APDDDPIYEGVSDSEEPVyeeiptprvyqnpLPRPMEGAGEPPDLDAptspwveeeNPIYGWGDSPLFSPPPAARFPP 645
                         250
                  ....*....|....
gi 148699623 2243 FNYVPSPMAAATMA 2256
Cdd:PHA03321  646 PDPALSPEPPALPA 659
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1370-1392 6.38e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.38e-04
                           10        20
                   ....*....|....*....|...
gi 148699623  1370 DDSGTDSAEGTPVNFSSAASLSD 1392
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
PHA03247 PHA03247
large tegument protein UL36; Provisional
1972-2192 2.02e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 2.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1972 SPGLLRRRRSELSSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSP-RQPLAAQRSPQAKPGLAPRAP---RRTS 2047
Cdd:PHA03247  256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDdedGAME 335
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2048 SESP-----SRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATR-------RGSDGEARPLPRVAPPGTT 2115
Cdd:PHA03247  336 VVSPlprprQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPVPASV 415
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148699623 2116 wrriKDEDVPHILRSTLPATALPLRVSSP--EDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGP 2192
Cdd:PHA03247  416 ----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPGADL 490
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 3.29e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.05  E-value: 3.29e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 148699623   649 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 689
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1247-1268 3.48e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.48e-03
                           10        20
                   ....*....|....*....|..
gi 148699623  1247 SVRFTVEKPDENFSCASSLSAL 1268
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1135-1158 5.58e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.58e-03
                           10        20
                   ....*....|....*....|....
gi 148699623  1135 SSSSENCVQETPLVLSRCSSVSSL 1158
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
PHA03247 PHA03247
large tegument protein UL36; Provisional
1285-1622 6.20e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 6.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1285 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1359
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1360 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1439
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1440 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1510
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1511 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1587
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 148699623 1588 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1622
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
PHA03247 PHA03247
large tegument protein UL36; Provisional
1856-2241 7.85e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 7.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1856 PLPRRSPLATPTGGPLpgpggslvPKSPARALLAKQHKTQKSPVRI----------------------PFMQRPA---RR 1910
Cdd:PHA03247 2496 PDPGGGGPPDPDAPPA--------PSRLAPAILPDEPVGEPVHPRMltwirgleelasddagdpppplPPAAPPAapdRS 2567
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1911 VPPPLARPSP-EPGSRGRAGAEGTP--GARGSRLGLVRMASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSAD 1987
Cdd:PHA03247 2568 VPPPRPAPRPsEPAVTSRARRPDAPpqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 1988 STASTSQAASPRRGRPalpavflcssrcdelrvsPRQPLAAQRSPQAKPglAPRAPRRTSSEsPSRLPVRASPGRPETVK 2067
Cdd:PHA03247 2648 PPERPRDDPAPGRVSR------------------PRRARRLGRAAQASS--PPQRPRRRAAR-PTVGSLTSLADPPPPPP 2706
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2068 RYASLPHISVsrrsdSAVSVPTTQANATRRGSDGEARPLPRVAPPGTTwrrikdEDVPHILRSTLPATALPLRVSSPEDs 2147
Cdd:PHA03247 2707 TPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPA------TPGGPARPARPPTTAGPPAPAPPAA- 2774
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623 2148 PAGTPQRKTSDAVVQTEDVATSKTNSSTSPS---------LESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEG 2218
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPAdppaavlapAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                         410       420
                  ....*....|....*....|...
gi 148699623 2219 lsAVIAGFPTSRHGSPSRAARVP 2241
Cdd:PHA03247 2855 --SVAPGGDVRRRPPSRSPAAKP 2875
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
7-86 7.91e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 41.04  E-value: 7.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148699623    7 SYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSGQTEVLEQLKALQTDIS 84
Cdd:COG4372    32 QLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAELAQAQEELESLQEEAE 111

                  ..
gi 148699623   85 SL 86
Cdd:COG4372   112 EL 113
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
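
As a rough illustration of the E-value threshold listed above, the following sketch parses interval/E-value summary lines of the form used in the hit list (for example `1370-1392 6.38e-04`) and keeps those at or below 0.01. The names `Hit`, `parse_hit`, and `E_VALUE_THRESHOLD` are hypothetical and the three sample lines are copied from the hit list; this is only a reading aid, not how CD-Search itself applies the cutoff.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    start: int      # first query residue covered by the hit
    end: int        # last query residue covered by the hit
    evalue: float   # expectation value reported for the hit


def parse_hit(line: str) -> Hit:
    """Parse a summary line such as '1370-1392 6.38e-04'."""
    interval, evalue = line.split()
    start, end = interval.split("-")
    return Hit(int(start), int(end), float(evalue))


# Assumed to mirror the "E-value threshold: 0.01" preset option.
E_VALUE_THRESHOLD = 0.01

# Sample interval/E-value lines taken from the hit list.
lines = ["1757-2089 6.66e-97", "1370-1392 6.38e-04", "7-86 7.91e-03"]
hits = [parse_hit(line) for line in lines]

# Keep only hits whose E-value does not exceed the threshold.
kept = [h for h in hits if h.evalue <= E_VALUE_THRESHOLD]

# Number of query residues spanned by each kept hit (inclusive interval).
spans = [h.end - h.start + 1 for h in kept]
```

All three sample hits pass the filter, which is consistent with every entry in the list having an E-value below 0.01.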

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.