Conserved domains on  [gi|2248185719|ref|NP_001394385|]

adenomatous polyposis coli protein isoform q [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
649-936 1.74e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 514.52  E-value: 1.74e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  649 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 726
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  727 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 803
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  804 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 883
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2248185719  884 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 936
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
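Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value, and an optional [Multi-domain] tag). The following is a minimal Python sketch for pulling those fields out of such a line. The pattern and the function name parse_summary are illustrative assumptions based only on the layout seen on this page, not an official NCBI format specification.

```python
import re

# Assumed layout, taken from the summary lines on this page:
#   Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
# The "[Multi-domain]" tag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 435476  Cd Length: 293  Bit Score: 514.52  E-value: 1.74e-166"
    print(parse_summary(line))
```

Returning None for non-matching lines lets a caller scan the whole listing and keep only the summary lines, without special handling for alignment rows or descriptions.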
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
2140-2485 3.06e-109

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 352.64  E-value: 3.06e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2140 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2218
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2219 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2298
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2299 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2372
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2373 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2452
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2248185719 2453 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2485
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2587-2760 3.29e-89

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 288.05  E-value: 3.29e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2587 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2666
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2667 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2746
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 2248185719 2747 SPKRHSGSYLVTSV 2760
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
953-1052 2.06e-56

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 191.27  E-value: 2.06e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  953 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1032
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 2248185719 1033 GSNHGINQNVSQSLCQEDDY 1052
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1663-1756 1.38e-46

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 162.70  E-value: 1.38e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1663 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1742
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 2248185719 1743 KLPNNEDRVRGSFA 1756
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_u9 pfam16633
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of ...
1200-1285 2.43e-35

Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 130.38  E-value: 2.43e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1200 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1277
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81

                   ....*...
gi 2248185719 1278 PSKSGAQT 1285
Cdd:pfam16633   82 PSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1790-1864 5.12e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 117.65  E-value: 5.12e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185719 1790 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1864
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
307-365 8.65e-31

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 116.88  E-value: 8.65e-31
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719  307 HLGTKIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 365
Cdd:pfam18797   16 HLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1579-1632 1.75e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.33  E-value: 1.75e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2248185719 1579 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1632
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
127-207 3.71e-22

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 92.70  E-value: 3.71e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  127 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 205
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 2248185719  206 TC 207
Cdd:pfam11414   81 LI 82
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
4-55 6.24e-22

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 90.82  E-value: 6.24e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2248185719    4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1554-1577 1.41e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 49.30  E-value: 1.41e-07
                           10        20
                   ....*....|....*....|....
gi 2248185719 1554 DMPRVYCVEGTPINFSTATSLSDL 1577
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
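The paired rows in each alignment block (query on the "gi" line, domain model on the "Cdd:" line) can be compared column by column. The following is a minimal Python sketch that counts identical residues between two such rows. The function name count_identities is hypothetical, and the handling of gaps is an assumption: any column with "-" in either row is treated as unaligned and skipped, and letter case is ignored.

```python
def count_identities(query_row, model_row):
    """Count identical residues between two equal-length alignment rows.

    Assumption: columns with '-' in either row are unaligned and skipped;
    comparison is case-insensitive.
    Returns (identical, aligned_columns).
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == m.upper():
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Residue strings copied from the APC_r (pfam05923) hit at 1554-1577.
    query = "DMPRVYCVEGTPINFSTATSLSDL"
    model = "DSPKRYCVEGTPANFSRASSLSSL"
    identical, aligned = count_identities(query, model)
    print(f"{identical}/{aligned} identical ({100.0 * identical / aligned:.1f}%)")
```

Only the residue strings are passed in; the row labels and the start and end coordinates on each line have to be stripped first.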
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
566-606 4.47e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.47e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 2248185719   566 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 606
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1950-1969 3.43e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.43e-05
                           10        20
                   ....*....|....*....|
gi 2248185719 1950 DSEDDLLQECISSAMPKKKK 1969
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
8-272 3.43e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719    8 QLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDeamassgqidLLERLKELNLDSSNFPGVKL 87
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----------LKEELKALREALDELRAELT 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   88 RSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRGFVNGSRES-TGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLT 166
Cdd:TIGR02168  814 LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlAAEIEELEELIEELESELEALLNERASLEEALALLR 893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  167 KRIDSLpltenfslqtDMTRRQLEYEARQIRVAMEEqlgtCQDMEKRAQRRIARIQQIEKDIL-RIRQLLQSQATEAERs 245
Cdd:TIGR02168  894 SELEEL----------SEELRELESKRSELRRELEE----LREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEEAEA- 958
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2248185719  246 SQNKHETGSHDAERQ-----NEGQGVGEINMA 272
Cdd:TIGR02168  959 LENKIEDDEEEARRRlkrleNKIKELGPVNLA 990
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
429-470 5.10e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.10e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2248185719   429 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 470
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
608-648 6.80e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.80e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  608 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 648
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1633-1654 1.30e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 38.34  E-value: 1.30e-03
                           10        20
                   ....*....|....*....|..
gi 2248185719 1633 NKAEEGDILAECINSAMPKGKS 1654
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1288-1310 3.40e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.40e-03
                           10        20
                   ....*....|....*....|....
gi 2248185719 1288 SPPEHY-VQETPLMFSRCTSVSSL 1310
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
1208-1618 4.09e-03

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 4.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1208 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1284
Cdd:PTZ00449   480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1285 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1357
Cdd:PTZ00449   558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1358 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1426
Cdd:PTZ00449   638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1427 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1500
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1501 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1577
Cdd:PTZ00449   786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 2248185719 1578 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1618
Cdd:PTZ00449   859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1174-1191 6.44e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 6.44e-03
                           10
                   ....*....|....*...
gi 2248185719 1174 ETIQTYCVEDTPICFSRC 1191
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
 
Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
649-936 1.74e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 514.52  E-value: 1.74e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  649 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 726
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  727 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 803
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  804 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 883
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2248185719  884 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 936
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
2140-2485 3.06e-109

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 352.64  E-value: 3.06e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2140 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2218
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2219 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2298
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2299 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2372
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2373 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2452
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2248185719 2453 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2485
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2587-2760 3.29e-89

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 288.05  E-value: 3.29e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2587 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2666
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2667 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2746
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 2248185719 2747 SPKRHSGSYLVTSV 2760
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
953-1052 2.06e-56

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 191.27  E-value: 2.06e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  953 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1032
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 2248185719 1033 GSNHGINQNVSQSLCQEDDY 1052
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1663-1756 1.38e-46

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 162.70  E-value: 1.38e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1663 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1742
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 2248185719 1743 KLPNNEDRVRGSFA 1756
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_u9 pfam16633
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ...
1200-1285 2.43e-35

Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 130.38  E-value: 2.43e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1200 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1277
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81

                   ....*...
gi 2248185719 1278 PSKSGAQT 1285
Cdd:pfam16633   82 PSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1790-1864 5.12e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC (adenomatous polyposis coli) proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 117.65  E-value: 5.12e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185719 1790 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1864
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
307-365 8.65e-31

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 116.88  E-value: 8.65e-31
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719  307 HLGTKIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 365
Cdd:pfam18797   16 HLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1579-1632 1.75e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC (adenomatous polyposis coli) proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.33  E-value: 1.75e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2248185719 1579 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1632
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
127-207 3.71e-22

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 92.70  E-value: 3.71e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  127 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 205
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 2248185719  206 TC 207
Cdd:pfam11414   81 LI 82
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
4-55 6.24e-22

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 90.82  E-value: 6.24e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2248185719    4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1554-1577 1.41e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis coli (APC) proteins. In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 49.30  E-value: 1.41e-07
                           10        20
                   ....*....|....*....|....
gi 2248185719 1554 DMPRVYCVEGTPINFSTATSLSDL 1577
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
566-606 4.47e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.47e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 2248185719   566 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 606
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
566-606 6.08e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 6.08e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  566 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 606
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
2168-2481 1.04e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 1.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2168 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2243
Cdd:PHA03247  2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2244 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2321
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2322 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2400
Cdd:PHA03247  2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2401 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2470
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
                          330
                   ....*....|.
gi 2248185719 2471 HSSSLPRVSTW 2481
Cdd:PHA03247  2997 TGHSLSRVSSW 3007
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1950-1969 3.43e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis coli (APC) proteins. This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.43e-05
                           10        20
                   ....*....|....*....|
gi 2248185719 1950 DSEDDLLQECISSAMPKKKK 1969
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
134-266 2.16e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 46.68  E-value: 2.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  134 LEELEKERSLLLADLDKEEKEKDWY--YAQLQNLTKRIDSLP---------LTENFSLQTDM---------TRRQLEYEA 193
Cdd:COG4717    104 LEELEAELEELREELEKLEKLLQLLplYQELEALEAELAELPerleeleerLEELRELEEELeeleaelaeLQEELEELL 183
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2248185719  194 RQIRVAMEEQLgtcQDMEKRAQRRIARIQQIEKDILRIRQLLQsQATEAERSSQNKHETGsHDAERQNEGQGV 266
Cdd:COG4717    184 EQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELE-ELEEELEQLENELEAA-ALEERLKEARLL 251
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
8-272 3.43e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719    8 QLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDeamassgqidLLERLKELNLDSSNFPGVKL 87
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----------LKEELKALREALDELRAELT 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   88 RSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRGFVNGSRES-TGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLT 166
Cdd:TIGR02168  814 LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlAAEIEELEELIEELESELEALLNERASLEEALALLR 893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  167 KRIDSLpltenfslqtDMTRRQLEYEARQIRVAMEEqlgtCQDMEKRAQRRIARIQQIEKDIL-RIRQLLQSQATEAERs 245
Cdd:TIGR02168  894 SELEEL----------SEELRELESKRSELRRELEE----LREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEEAEA- 958
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2248185719  246 SQNKHETGSHDAERQ-----NEGQGVGEINMA 272
Cdd:TIGR02168  959 LENKIEDDEEEARRRlkrleNKIKELGPVNLA 990
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
429-470 5.10e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.10e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2248185719   429 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 470
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
429-469 5.86e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 5.86e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  429 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 469
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
608-648 6.80e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.80e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  608 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 648
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1633-1654 1.30e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis coli (APC) proteins. This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 38.34  E-value: 1.30e-03
                           10        20
                   ....*....|....*....|..
gi 2248185719 1633 NKAEEGDILAECINSAMPKGKS 1654
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
11-247 1.59e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.60  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   11 KQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI---EDEAMASSGQIDLLE-RLKELNLDSSnfpgvK 86
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIaalARRIRALEQELAALEaELAELEKEIA-----E 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   87 LRSKM-SLRSYGSREGSVSSRSGECSPvPMGSFPRRGFVNGSRESTgYLEELEKERSLLLADLDKEEKEKDWYYAQLQNL 165
Cdd:COG4942     95 LRAELeAQKEELAELLRALYRLGRQPP-LALLLSPEDFLDAVRRLQ-YLKYLAPARREQAEELRADLAELAALRAELEAE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  166 TKRIDSLpltenfslqtdmtRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERS 245
Cdd:COG4942    173 RAELEAL-------------LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239

                   ..
gi 2248185719  246 SQ 247
Cdd:COG4942    240 AE 241
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
134-259 2.17e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 43.90  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  134 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDMTRRQLEYEARQIRVAME 201
Cdd:TIGR02169  232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719  202 EqlgtCQDMEKRAQRRIARIQ-QIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAER 259
Cdd:TIGR02169  312 E----KERELEDAEERLAKLEaEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1288-1310 3.40e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis coli (APC) proteins. In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.40e-03
                           10        20
                   ....*....|....*....|....
gi 2248185719 1288 SPPEHY-VQETPLMFSRCTSVSSL 1310
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1208-1618 4.09e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 4.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1208 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1284
Cdd:PTZ00449   480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1285 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1357
Cdd:PTZ00449   558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1358 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1426
Cdd:PTZ00449   638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1427 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1500
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1501 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1577
Cdd:PTZ00449   786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 2248185719 1578 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1618
Cdd:PTZ00449   859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1174-1191 6.44e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis coli (APC) proteins. In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 6.44e-03
                           10
                   ....*....|....*...
gi 2248185719 1174 ETIQTYCVEDTPICFSRC 1191
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
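
Each hit above carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value) followed by paired query/Cdd alignment rows. The following is a minimal Python sketch, not part of the NCBI output, of how such text might be read programmatically. The function names parse_summary and percent_identity are hypothetical, the regular expression assumes the exact field order shown in this listing, and the identity figure is a simple column-by-column count (ignoring case and skipping gap columns), which is an assumption and not the statistic CDD itself reports.

```python
import re

# Assumed layout, taken from the listing above:
# "Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.43e-05"
# An optional "[Multi-domain]" tag may follow the Pssm-ID.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line):
    """Return the numeric fields of one hit summary line as a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a Pssm-ID summary line: %r" % line)
    return {
        "pssm_id": int(match.group(1)),
        "cd_length": int(match.group(2)),
        "bit_score": float(match.group(3)),
        "e_value": float(match.group(4)),
    }


def percent_identity(query, subject):
    """Percentage of identical residues over columns without a gap.

    Both strings must be the aligned rows (same length). Case is ignored,
    which is a simplification of this sketch.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    columns = [(q, s) for q, s in zip(query, subject) if q != "-" and s != "-"]
    if not columns:
        raise ValueError("no ungapped columns to compare")
    identical = sum(q.upper() == s.upper() for q, s in columns)
    return 100.0 * identical / len(columns)


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.43e-05"
    )
    print(hit)
    # Rows copied from the SAMP hit at 1950-1969 above.
    print(percent_identity("DSEDDLLQECISSAMPKKKK", "DDEDDLLQECINSAMPKKRR"))
```

Keeping the two helpers separate lets the summary fields be filtered (for example by E-value) before any alignment rows are compared.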
 
Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
649-936 1.74e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 514.52  E-value: 1.74e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  649 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 726
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  727 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 803
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  804 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 883
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2248185719  884 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 936
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
2140-2485 3.06e-109

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 352.64  E-value: 3.06e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2140 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2218
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2219 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2298
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2299 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2372
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2373 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2452
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2248185719 2453 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2485
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2587-2760 3.29e-89

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 288.05  E-value: 3.29e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2587 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2666
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2667 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2746
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 2248185719 2747 SPKRHSGSYLVTSV 2760
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
953-1052 2.06e-56

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 191.27  E-value: 2.06e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  953 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1032
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 2248185719 1033 GSNHGINQNVSQSLCQEDDY 1052
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1663-1756 1.38e-46

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 162.70  E-value: 1.38e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1663 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1742
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 2248185719 1743 KLPNNEDRVRGSFA 1756
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_u9 pfam16633
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ...
1200-1285 2.43e-35

Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 130.38  E-value: 2.43e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1200 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1277
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81

                   ....*...
gi 2248185719 1278 PSKSGAQT 1285
Cdd:pfam16633   82 PSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1790-1864 5.12e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 117.65  E-value: 5.12e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185719 1790 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1864
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
307-365 8.65e-31

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 116.88  E-value: 8.65e-31
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719  307 HLGTKIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 365
Cdd:pfam18797   16 HLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1579-1632 1.75e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.33  E-value: 1.75e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2248185719 1579 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1632
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
127-207 3.71e-22

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 92.70  E-value: 3.71e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  127 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 205
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 2248185719  206 TC 207
Cdd:pfam11414   81 LI 82
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
4-55 6.24e-22

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 90.82  E-value: 6.24e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2248185719    4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1554-1577 1.41e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 49.30  E-value: 1.41e-07
                           10        20
                   ....*....|....*....|....
gi 2248185719 1554 DMPRVYCVEGTPINFSTATSLSDL 1577
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
566-606 4.47e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.47e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 2248185719   566 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 606
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
566-606 6.08e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 6.08e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  566 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 606
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
2168-2481 1.04e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 1.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2168 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2243
Cdd:PHA03247  2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2244 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2321
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2322 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2400
Cdd:PHA03247  2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2401 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2470
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
                          330
                   ....*....|.
gi 2248185719 2471 HSSSLPRVSTW 2481
Cdd:PHA03247  2997 TGHSLSRVSSW 3007
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2173-2318 6.10e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.10  E-value: 6.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2173 PASKSPSEGQTATTSPRGAKPSVKSELSPVA----RQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISP 2248
Cdd:PHA03307   278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPApsspRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2249 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2318
Cdd:PHA03307   358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1950-1969 3.43e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.43e-05
                           10        20
                   ....*....|....*....|
gi 2248185719 1950 DSEDDLLQECISSAMPKKKK 1969
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2166-2461 4.41e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.40  E-value: 4.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2166 KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQP---LSRPIQSPG 2242
Cdd:PHA03307   101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASsrqAALPLSSPE 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2243 RNSISPGRNGISPPNKLSQLPRTSSP----STASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSE----SA 2314
Cdd:PHA03307   181 ETARAPSSPPAEPPPSTPPAAASPRPprrsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplpRP 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2315 SKGLNQMNNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRQStfiKEAPSPTLRRKLEESASFESLSPSSRPASPTRS 2394
Cdd:PHA03307   261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2248185719 2395 QAQTPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEY---NDGRP---AKRHDIARSH--SESPSRLPINRS 2461
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPtrrRARAAVAGRArrRDATGRFPAGRP 412
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
134-266 2.16e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 46.68  E-value: 2.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  134 LEELEKERSLLLADLDKEEKEKDWY--YAQLQNLTKRIDSLP---------LTENFSLQTDM---------TRRQLEYEA 193
Cdd:COG4717    104 LEELEAELEELREELEKLEKLLQLLplYQELEALEAELAELPerleeleerLEELRELEEELeeleaelaeLQEELEELL 183
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2248185719  194 RQIRVAMEEQLgtcQDMEKRAQRRIARIQQIEKDILRIRQLLQsQATEAERSSQNKHETGsHDAERQNEGQGV 266
Cdd:COG4717    184 EQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELE-ELEEELEQLENELEAA-ALEERLKEARLL 251
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
8-272 3.43e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719    8 QLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDeamassgqidLLERLKELNLDSSNFPGVKL 87
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----------LKEELKALREALDELRAELT 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   88 RSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRGFVNGSRES-TGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLT 166
Cdd:TIGR02168  814 LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlAAEIEELEELIEELESELEALLNERASLEEALALLR 893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  167 KRIDSLpltenfslqtDMTRRQLEYEARQIRVAMEEqlgtCQDMEKRAQRRIARIQQIEKDIL-RIRQLLQSQATEAERs 245
Cdd:TIGR02168  894 SELEEL----------SEELRELESKRSELRRELEE----LREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEEAEA- 958
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2248185719  246 SQNKHETGSHDAERQ-----NEGQGVGEINMA 272
Cdd:TIGR02168  959 LENKIEDDEEEARRRlkrleNKIKELGPVNLA 990
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
429-470 5.10e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.10e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2248185719   429 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 470
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
429-469 5.86e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 5.86e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  429 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 469
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
608-648 6.80e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.80e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2248185719  608 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 648
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2171-2431 9.32e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.57  E-value: 9.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2171 KTPASKSPSEGQTATTSPRGAKPsvkselSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPgr 2250
Cdd:pfam17823  151 RANASAAPRAAIAAASAPHAASP------APRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP-- 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2251 ngiSPPNKLSQLPrTSSPStASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSiPRSESASKGLNQMNNGNGANKk 2330
Cdd:pfam17823  223 ---AAGTALAAVG-NSSPA-AGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGD-PHARRLSPAKHMPSDTMARNP- 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2331 veLSRMSSTKSSGSESDRSERPVLvrqSTFIKEAPSPTlRRKLEESASFESLSPSSRPASPTRSQAQTPVLSPsLPDMSL 2410
Cdd:pfam17823  296 --AAPMGAQAQGPIIQVSTDQPVH---NTAGEPTPSPS-NTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASP-VPVLHT 368
                          250       260
                   ....*....|....*....|.
gi 2248185719 2411 STHSSVQAGGWRKLPPNLSPT 2431
Cdd:pfam17823  369 SMIPEVEATSPTTQPSPLLPT 389
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
135-262 1.02e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 44.93  E-value: 1.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  135 EELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtDMTRRQLEYEARQIRVAMEEQLGTCQDMEKRA 214
Cdd:COG1196    221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEEL----------EAELAELEAELEELRLELEELELELEEAQAEE 290
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2248185719  215 QRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQNE 262
Cdd:COG1196    291 YELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1633-1654 1.30e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 38.34  E-value: 1.30e-03
                           10        20
                   ....*....|....*....|..
gi 2248185719 1633 NKAEEGDILAECINSAMPKGKS 1654
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
11-247 1.59e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.60  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   11 KQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI---EDEAMASSGQIDLLE-RLKELNLDSSnfpgvK 86
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIaalARRIRALEQELAALEaELAELEKEIA-----E 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719   87 LRSKM-SLRSYGSREGSVSSRSGECSPvPMGSFPRRGFVNGSRESTgYLEELEKERSLLLADLDKEEKEKDWYYAQLQNL 165
Cdd:COG4942     95 LRAELeAQKEELAELLRALYRLGRQPP-LALLLSPEDFLDAVRRLQ-YLKYLAPARREQAEELRADLAELAALRAELEAE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  166 TKRIDSLpltenfslqtdmtRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERS 245
Cdd:COG4942    173 RAELEAL-------------LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239

                   ..
gi 2248185719  246 SQ 247
Cdd:COG4942    240 AE 241
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
134-259 2.17e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 43.90  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  134 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDMTRRQLEYEARQIRVAME 201
Cdd:TIGR02169  232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719  202 EqlgtCQDMEKRAQRRIARIQ-QIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAER 259
Cdd:TIGR02169  312 E----KERELEDAEERLAKLEaEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
120-260 2.21e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.51  E-value: 2.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  120 RRGFVNGSRESTGY--------LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrRQLEY 191
Cdd:TIGR02168  657 PGGVITGGSAKTNSsilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL--------------RKELE 722
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719  192 EARQIRVAMEEQLGTcqdMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQ 260
Cdd:TIGR02168  723 ELSRQISALRKDLAR---LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1288-1310 3.40e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.40e-03
                           10        20
                   ....*....|....*....|....
gi 2248185719 1288 SPPEHY-VQETPLMFSRCTSVSSL 1310
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2169-2485 3.89e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 3.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2169 PLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQiGGSSKAPSRSGsrDSTPSRPAQQPLSRPIQSPGRNSISP 2248
Cdd:PHA03307    63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR-EGSPTPPGPSS--PDPPPPTPPPASPPPSPAPDLSEMLR 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2249 GRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKM---------SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2318
Cdd:PHA03307   140 PVGSPGPPPAASPPAAGASPAaVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2319 NQMNNGNGANKKVELSRMSSTKSSGSESD---RSERPVLVRQSTfikeaPSPTLRRKLEESASFESLSPSSRPASPTRSQ 2395
Cdd:PHA03307   220 PAPAPGRSAADDAGASSSDSSSSESSGCGwgpENECPLPRPAPI-----TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2396 AqtPVLSPSLPDMSLSTHSSVQAGGWRKLPP-NLSPTIEYNDGRPAKRHDIARSHSESPS----RLPINRSGTWKREHSK 2470
Cdd:PHA03307   295 S--PSPSPSSPGSGPAPSSPRASSSSSSSREsSSSSTSSSSESSRGAAVSPGPSPSRSPSpsrpPPPADPSSPRKRPRPS 372
                          330
                   ....*....|....*
gi 2248185719 2471 HSSSLPRVSTWRRTG 2485
Cdd:PHA03307   373 RAPSSPAASAGRPTR 387
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1208-1618 4.09e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 4.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1208 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1284
Cdd:PTZ00449   480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1285 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1357
Cdd:PTZ00449   558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1358 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1426
Cdd:PTZ00449   638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1427 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1500
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 1501 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1577
Cdd:PTZ00449   786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 2248185719 1578 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1618
Cdd:PTZ00449   859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
134-248 5.52e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.67  E-value: 5.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719  134 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrrqleyeARQIRvAMEEQLgtcQDMEKR 213
Cdd:COG4942     29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL--------------------ARRIR-ALEQEL---AALEAE 84
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 2248185719  214 AQRRIARIQQIEKDILRIRQLLQSQATEAERSSQN 248
Cdd:COG4942     85 LAELEKEIAELRAELEAQKEELAELLRALYRLGRQ 119
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1174-1191 6.44e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 6.44e-03
                           10
                   ....*....|....*...
gi 2248185719 1174 ETIQTYCVEDTPICFSRC 1191
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
PHA03247 PHA03247
large tegument protein UL36; Provisional
2167-2457 7.14e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 7.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2167 GPPLKTPASKSPSEGQTATTSPRGAKPSvkselsPVARQTSqIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSI 2246
Cdd:PHA03247  2603 DDRGDPRGPAPPSPLPPDTHAPDPPPPS------PSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2247 SPgrngiSPPNKLSQ--LPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNG 2324
Cdd:PHA03247  2676 AS-----SPPQRPRRraARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT 2750
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2248185719 2325 NGANKKVE--------LSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPT--LRRKLEESASFESLSPSSRPAS---- 2390
Cdd:PHA03247  2751 PGGPARPArppttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpADPPAAVLAPAAALPPAASPAGplpp 2830
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2248185719 2391 PTRSQAQTPVLSPSLPDMSLSTHSSVQAGG--WRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSRLP 2457
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
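
As a rough illustration of how the E-value threshold relates to the hits listed above, here is a minimal sketch that checks the last three hits against the 0.01 cutoff. The `DomainHit` class and `passes_threshold` function are hypothetical names introduced for this example, and treating the threshold as inclusive is an assumption; neither is part of any NCBI tool.

```python
from dataclasses import dataclass

# "E-value threshold" from the preset options above.
E_VALUE_THRESHOLD = 0.01


@dataclass(frozen=True)
class DomainHit:
    """Hypothetical record for one row of the domain hit list."""
    name: str
    accession: str
    start: int
    end: int
    e_value: float


# Values copied from the hit rows above.
hits = [
    DomainHit("EnvC", "COG4942", 134, 248, 5.52e-03),
    DomainHit("APC_r", "pfam05923", 1174, 1191, 6.44e-03),
    DomainHit("PHA03247", "PHA03247", 2167, 2457, 7.14e-03),
]


def passes_threshold(hit: DomainHit, threshold: float = E_VALUE_THRESHOLD) -> bool:
    # Assumption: a hit is reported when its E-value is at or below the threshold.
    return hit.e_value <= threshold


# Keep the hits that pass, ordered from most to least significant.
reported = sorted((h for h in hits if passes_threshold(h)), key=lambda h: h.e_value)
for h in reported:
    print(f"{h.name:<10}{h.accession:<11}{h.start}-{h.end}  {h.e_value:.2e}")
```

All three hits fall just under the cutoff, so they sit at the weak end of what this search reports.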

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.