|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... |
18-133 |
8.80e-94 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 287.78 E-value: 8.80e-94
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 18 FKFTIPESLDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 21541824 98 QEHQQQVAQAVERAKQVTMAELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
483-768 |
1.20e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 1.20e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDISHPGNKSPVSqldclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 561
Cdd:COG2319 77 HTAAVLSVAFSPDGRLLASASAdGTVRLWDLATGLLLRTLT-----GHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA- 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:COG2319 151 -TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVR 229
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGKD 719
Cdd:COG2319 230 LWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHSGGVNSVAFSPDGKLLASGSDD 309
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 21541824 720 NLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 310 GTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDL 359
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
483-767 |
2.09e-38 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 144.40 E-value: 2.09e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCKLLPDGCTLIVGGEASTLSIWDLaa 561
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTG 717
Cdd:cd00200 161 LWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 21541824 718 KDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 767
Cdd:cd00200 239 EDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
605-644 |
2.87e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.87e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 21541824 605 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
595-766 |
8.78e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 52.40 E-value: 8.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 595 DGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-CPTGEWL 672
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpSESGRSL 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 673 AVGMESSNVEVLHVNKPdkyQLHL-----HESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSSVLS--- 744
Cdd:PLN00181 634 AFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHSFMGhtn 709
|
170 180
....*....|....*....|....*.
gi 21541824 745 ----CDISVDDKYIVTGSGDKKATVY 766
Cdd:PLN00181 710 vknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
607-644 |
3.77e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.77e-06
10 20 30
....*....|....*....|....*....|....*...
gi 21541824 607 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
319-476 |
6.66e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 6.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 319 TPTPRSDMPTPGTSATPGLRPGLGKPPA--IDPLVNQAAAGLRTPLAV--PGPYPAPFG-------MVPHAGMNGELTSP 387
Cdd:PHA03247 2707 TPEPAPHALVSATPLPPGPAAARQASPAlpAAPAPPAVPAGPATPGGParPARPPTTAGppapappAAPAAGPPRRLTRP 2786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 388 GAAYASLHNMSPQMSAAAAAAAVVAYGRSPMV-------GFDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVP 460
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALppaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRP 2866
|
170
....*....|....*.
gi 21541824 461 FPPDALIGPGIPRHAR 476
Cdd:PHA03247 2867 PSRSPAAKPAAPARPP 2882
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
192-463 |
2.24e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 2.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 192 RDREPGTSNSllvPDSLRGTDKRRNGPEFSndikkRKVDDKDSSHYDSDGDKSDDNLVVDVSNEDPSSPRASPA-HSPRE 270
Cdd:pfam03154 82 RQREKGASDT---EEPERATAKKSKTQEIS-----RPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSiPSPQD 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 271 NGIDKNrllkkdassspastaSSASSTSLKSKEMSLHEKASTPVLKSSTPTPRSDMPTPG-TSATPGLRPGLGKPPAIDP 349
Cdd:pfam03154 154 NESDSD---------------SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGpTPSAPSVPPQGSPATSQPP 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 350 LVNQAAAGLRT---------PLAVPGPYPAPFGMVPHAGMNGELTSPGAAyASLHNMSPQMSAAAAAAAVVAYGRSPMVG 420
Cdd:pfam03154 219 NQTQSTAAPHTliqqtptlhPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ-PSLHGQMPPMPHSLQTGPSHMQHPVPPQP 297
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 21541824 421 FDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVPFPP 463
Cdd:pfam03154 298 FPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
|
|
|
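Each hit above carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, optionally tagged `[Multi-domain]`. The following is a minimal Python sketch, assuming that exact layout, for pulling those fields out of the text. The function name `parse_stats_line` and the dictionary keys are hypothetical and chosen only for illustration; they are not part of any NCBI tool.

```python
import re
from typing import Dict, Optional, Union

# Pattern for the per-hit statistics line as it appears in this listing.
# Assumption: fields always occur in this order, with an optional
# "[Multi-domain]" tag between the Pssm-ID and the Cd Length.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>\S+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_stats_line(line: str) -> Optional[Dict[str, Union[int, float, bool]]]:
    """Return the numeric fields of one statistics line, or None if no match."""
    m = _STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Statistics line copied from the TLE_N hit in the listing.
    print(parse_stats_line(
        "Pssm-ID: 461094 Cd Length: 117 Bit Score: 287.78 E-value: 8.80e-94"
    ))
```

Collecting these dictionaries for every hit makes it straightforward to sort or filter hits by E-value, for example to separate the strong TLE_N and WD40 matches from the weak low-complexity matches near the reporting threshold.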
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... |
18-133 |
8.80e-94 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 287.78 E-value: 8.80e-94
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 18 FKFTIPESLDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 21541824 98 QEHQQQVAQAVERAKQVTMAELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
483-768 |
1.20e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 1.20e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDISHPGNKSPVSqldclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 561
Cdd:COG2319 77 HTAAVLSVAFSPDGRLLASASAdGTVRLWDLATGLLLRTLT-----GHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA- 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:COG2319 151 -TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVR 229
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGKD 719
Cdd:COG2319 230 LWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHSGGVNSVAFSPDGKLLASGSDD 309
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 21541824 720 NLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 310 GTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDL 359
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
483-768 |
2.09e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 159.31 E-value: 2.09e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 561
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK-YQLHLHESCVLSLKFAYCGKWFVSTGKD 719
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 21541824 720 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
483-767 |
2.09e-38 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 144.40 E-value: 2.09e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCKLLPDGCTLIVGGEASTLSIWDLaa 561
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTG 717
Cdd:cd00200 161 LWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 21541824 718 KDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 767
Cdd:cd00200 239 EDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
476-728 |
8.51e-38 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 145.82 E-value: 8.51e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 476 RQINTLN-HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEAST 553
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 554 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWT 633
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 634 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLHVN-KPDKYQLHLHESCVLSLKFAYCGK 711
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 21541824 712 WFVSTGKDNLLNAWRTP 728
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
533-768 |
4.16e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.48 E-value: 4.16e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 533 IRSCKLLPDGCTLIVGGEASTLSIWDLAAPTPRIKAELTSSAPacYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQF 612
Cdd:COG2319 39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV--LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 613 QGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK 691
Cdd:COG2319 117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTlTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 21541824 692 YQ-LHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 197 LRtLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTgHSGSVRSVAFSPDGRLLASGSADGTVRLWDL 275
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
577-768 |
3.25e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 109.35 E-value: 3.25e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 577 CYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 655
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 656 FTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 733
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 21541824 734 --FQSkESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
608-768 |
3.37e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 91.63 E-value: 3.37e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 608 LVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 684
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 685 HVNKPDK-YQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDK 761
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 21541824 762 KATVYEV 768
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
551-768 |
3.04e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 90.74 E-value: 3.04e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 551 ASTLSIWDLAAPTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTK 630
Cdd:COG2319 13 SADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 631 LWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK-YQLHLHESCVLSLKFAY 708
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 21541824 709 CGKWFVSTGKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
476-605 |
5.06e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 68.40 E-value: 5.06e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 476 RQINTLN-HGEVVCAVTISNPTRHVYTGGKGC-VKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEAST 553
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 21541824 554 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHN 605
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
605-644 |
2.87e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.87e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 21541824 605 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
595-766 |
8.78e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 52.40 E-value: 8.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 595 DGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-CPTGEWL 672
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpSESGRSL 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 673 AVGMESSNVEVLHVNKPdkyQLHL-----HESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSSVLS--- 744
Cdd:PLN00181 634 AFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHSFMGhtn 709
|
170 180
....*....|....*....|....*.
gi 21541824 745 ----CDISVDDKYIVTGSGDKKATVY 766
Cdd:PLN00181 710 vknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
607-644 |
3.77e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.77e-06
10 20 30
....*....|....*....|....*....|....*...
gi 21541824 607 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
319-476 |
6.66e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 6.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 319 TPTPRSDMPTPGTSATPGLRPGLGKPPA--IDPLVNQAAAGLRTPLAV--PGPYPAPFG-------MVPHAGMNGELTSP 387
Cdd:PHA03247 2707 TPEPAPHALVSATPLPPGPAAARQASPAlpAAPAPPAVPAGPATPGGParPARPPTTAGppapappAAPAAGPPRRLTRP 2786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 388 GAAYASLHNMSPQMSAAAAAAAVVAYGRSPMV-------GFDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVP 460
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALppaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRP 2866
|
170
....*....|....*.
gi 21541824 461 FPPDALIGPGIPRHAR 476
Cdd:PHA03247 2867 PSRSPAAKPAAPARPP 2882
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
563-602 |
7.56e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 7.56e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 21541824 563 TPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWD 602
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
568-649 |
9.20e-04 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 42.37 E-value: 9.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 568 AELTSSAPACYALAISPDSKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTV 640
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 21541824 641 RSWDLREGR 649
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
192-463 |
2.24e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 2.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 192 RDREPGTSNSllvPDSLRGTDKRRNGPEFSndikkRKVDDKDSSHYDSDGDKSDDNLVVDVSNEDPSSPRASPA-HSPRE 270
Cdd:pfam03154 82 RQREKGASDT---EEPERATAKKSKTQEIS-----RPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSiPSPQD 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 271 NGIDKNrllkkdassspastaSSASSTSLKSKEMSLHEKASTPVLKSSTPTPRSDMPTPG-TSATPGLRPGLGKPPAIDP 349
Cdd:pfam03154 154 NESDSD---------------SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGpTPSAPSVPPQGSPATSQPP 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 350 LVNQAAAGLRT---------PLAVPGPYPAPFGMVPHAGMNGELTSPGAAyASLHNMSPQMSAAAAAAAVVAYGRSPMVG 420
Cdd:pfam03154 219 NQTQSTAAPHTliqqtptlhPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ-PSLHGQMPPMPHSLQTGPSHMQHPVPPQP 297
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 21541824 421 FDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVPFPP 463
Cdd:pfam03154 298 FPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
329-467 |
4.25e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.82 E-value: 4.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 329 PGTSATPGLRPGLGKPPAIDPLVNQAAAGLRTPLavPGPYPAPFGMVPHAGMNGELTSPGAAYASLHnmSPQMSAAAAAA 408
Cdd:PHA03378 691 PGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA--RPPAAAPGRARPPAAAPGRARPPAAAPGRAR--PPAAAPGRARP 766
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 409 AVVAYGR-SPMvgfdPPPHmrvptIPPNLAGIPGGKPAysfhvtadgQMQPVPFPPDALI 467
Cdd:PHA03378 767 PAAAPGApTPQ----PPPQ-----APPAPQQRPRGAPT---------PQPPPQAGPTSMQ 808
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
578-602 |
8.08e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 34.63 E-value: 8.08e-03
10 20
....*....|....*....|....*
gi 21541824 578 YALAISPDSKVCFSCCSDGNIAVWD 602
Cdd:pfam00400 15 TSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| COG5276 |
COG5276 |
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function ... |
498-644 |
9.86e-03 |
|
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function unknown];
Pssm-ID: 444087 [Multi-domain] Cd Length: 320 Bit Score: 38.77 E-value: 9.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 498 HVYTG-GKGCVKVWDISHPGNKSPVSQLDCLNRDNYirscKLLPDGCTLIVGGEAST-LSIWDLAAPT-PRIKAELTSSA 574
Cdd:COG5276 31 YAYVAgGSNGLAIVDVSDPANPVLVGSLPTPGGTWR----DVKVSGDYLYVASEGSEgLQIFDISDPAnPKLVGRYDTGG 106
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 21541824 575 PACYALAISpDSKVCFSCCSDGNIAVWDLHNQT---LVRQFQgHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:COG5276 107 SGAHNIAVD-GNYAYVAGGSDNGLVIVDISDPTnpvLVGRYS-LPGQAYLHDVQVVGDYAYVADWEDGLVIVD 177
|
|
|
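The alignment blocks in this listing pair a query row (`gi 21541824`) with a model row (`Cdd:...`), using `-` for gaps and mixing upper- and lower-case residues. As a rough way to compare hits beyond their E-values, the sketch below computes percent identity for one pair of aligned rows. It rests on two assumptions that are not stated in the listing itself: that the two strings are already column-aligned and of equal length, and that case can be ignored when comparing residues. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over columns where neither row has a gap.

    Assumptions (not taken from the listing): both rows are column-aligned,
    '-' marks a gap, and residue case is ignored for comparison.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Keep only columns in which both rows contain a residue.
    pairs = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for q, s in pairs if q == s)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    # Second alignment block of the TLE_N hit, copied from the listing.
    query = "QEHQQQVAQAVERAKQVTMAELNAIIGQ-QQLQAQHL"
    model = "QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL"
    print(percent_identity(query, model))  # 100.0 (the gapped column is skipped)
```

Skipping gapped columns is only one of several conventions for the identity denominator; dividing by the full alignment length instead would give a slightly lower figure for gapped blocks.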