|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... |
22-101 |
2.97e-64 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 209.20 E-value: 2.97e-64
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 22 FKFTISESCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNAICAQVIPFLS 101
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
460-745 |
2.15e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 164.70 E-value: 2.15e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 460 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAa 538
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 539 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 618
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 619 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKD 696
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1720394946 697 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 745
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
427-745 |
3.28e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 155.84 E-value: 3.28e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 427 VSADGQMQPVPFPPDALIGPGIPRHARQINTLNHGEVVCAVTISNPTRHVYTGGKGCVKVWDISHPGNKSPVSQLdclnR 506
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 507 DNYIRSCRLLPDGRTLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLV 586
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 587 RQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMENSNVEVLHV 663
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 664 -TKPDKYQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISVDDKYIVTGSGDKKAT 741
Cdd:COG2319 234 aTGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 1720394946 742 VYEV 745
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
453-705 |
2.81e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 153.14 E-value: 2.81e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 453 RQINTLN-HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEAST 530
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 531 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWT 610
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 611 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMENSNVEVLHV-TKPDKYQLHLHESCVLSLKFAHCGK 688
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLaTGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1720394946 689 WFVSTGKDNLLNAWRTP 705
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
460-744 |
5.44e-39 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 145.94 E-value: 5.44e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 460 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCRLLPDGRTLIVGGEASTLSIWDLaa 538
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 539 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 618
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 619 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPD-KYQLHLHESCVLSLKFAHCGKWFVSTGKD 696
Cdd:cd00200 161 LWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSED 240
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1720394946 697 NLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 744
Cdd:cd00200 241 GTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
508-745 |
1.37e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 1.37e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 508 NYIRSCRLLPDGRTLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVR 587
Cdd:cd00200 10 GGVTCVAFSPDGKLLATGSGDGTIKVWDLE--TGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 588 QFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEV--LHVT 664
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTlRGHTDWVNSVAFSPDGTFVASSSQDGTIKLwdLRTG 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 665 KPdKYQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDKKATV 742
Cdd:cd00200 168 KC-VATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLgtLRGHE-NGVNSVAFSPDGYLLASGSEDGTIRV 245
|
...
gi 1720394946 743 YEV 745
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
585-745 |
3.71e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 94.32 E-value: 3.71e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 585 LVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMENSNVEVL 661
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 662 HVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDK 738
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 1720394946 739 KATVYEV 745
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
453-582 |
1.21e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 73.41 E-value: 1.21e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 453 RQINTLN-HGEVVCAVTISNPTRHVYTGGKGC-VKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEAST 530
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1720394946 531 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHN 582
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
495-743 |
2.73e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 54.32 E-value: 2.73e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 495 KSPVSQLDCLNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAAPtprIKAELTSSAPACYALAISPDSKVCF------- 567
Cdd:PLN00181 471 KADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRSKLSGICWnsyiksq 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 568 --SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-C 643
Cdd:PLN00181 548 vaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 644 PTGEWLAVGMENSNVEVLHVTKPdkyQLHL-----HESCVLSLKFAHCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSS 718
Cdd:PLN00181 628 ESGRSLAFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHS 703
|
250 260 270
....*....|....*....|....*....|..
gi 1720394946 719 VLS-------CDISVDDKYIVTGSGDKKATVY 743
Cdd:PLN00181 704 FMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
582-621 |
4.64e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.54 E-value: 4.64e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720394946 582 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 621
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
584-621 |
5.63e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.49 E-value: 5.63e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1720394946 584 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 621
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
540-579 |
1.09e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 1.09e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720394946 540 TPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWD 579
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
545-626 |
1.11e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 41.98 E-value: 1.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 545 AELTSSAPACYALAISPDSKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTV 617
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 1720394946 618 RSWDLREGR 626
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| COG5276 |
COG5276 |
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function ... |
475-621 |
8.86e-03 |
|
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function unknown];
Pssm-ID: 444087 [Multi-domain] Cd Length: 320 Bit Score: 38.77 E-value: 8.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394946 475 HVYTG-GKGCVKVWDISHPGNKSPVSQLDCLNRDNYirscRLLPDGRTLIVGGEAST-LSIWDLAAPT-PRIKAELTSSA 551
Cdd:COG5276 31 YAYVAgGSNGLAIVDVSDPANPVLVGSLPTPGGTWR----DVKVSGDYLYVASEGSEgLQIFDISDPAnPKLVGRYDTGG 106
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720394946 552 PACYALAISpDSKVCFSCCSDGNIAVWDLHNQT---LVRQFQgHTDGASCIDISNDGTKLWTGGLDNTVRSWD 621
Cdd:COG5276 107 SGAHNIAVD-GNYAYVAGGSDNGLVIVDISDPTnpvLVGRYS-LPGQAYLHDVQVVGDYAYVADWEDGLVIVD 177
|
|
|
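The alignment rows listed above can be compared programmatically. The following is a minimal Python sketch, not part of the search output, that parses one query row and one Cdd consensus row and counts identical residues over the aligned columns. It assumes each row has the layout "label, start, residues, end" separated by whitespace, and that lowercase letters and "-" mark positions that are not aligned; the names `AlignmentRow`, `parse_row`, and `count_identity` are hypothetical helpers introduced only for illustration. The two example rows are copied from the TLE_N (pfam03920) hit.

```python
from typing import NamedTuple, Tuple


class AlignmentRow(NamedTuple):
    label: str     # e.g. "gi 1720394946" or "Cdd:pfam03920"
    start: int     # first residue number shown on the row
    residues: str  # aligned residues, possibly with gaps and lowercase
    end: int       # last residue number shown on the row


def parse_row(line: str) -> AlignmentRow:
    """Split one alignment row into label, start, residues, and end."""
    tokens = line.split()
    if len(tokens) < 4:
        raise ValueError(f"not an alignment row: {line!r}")
    label = " ".join(tokens[:-3])
    return AlignmentRow(label, int(tokens[-3]), tokens[-2], int(tokens[-1]))


def count_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) over columns where both rows are uppercase.

    Assumption: lowercase residues and '-' are treated as unaligned
    and skipped, so they count neither as matches nor as mismatches.
    """
    if len(query) != len(subject):
        raise ValueError("rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the TLE_N hit (query residues 22-101, model positions 1-80).
QUERY_LINE = (
    "gi 1720394946 22 "
    "FKFTISESCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNAICAQVIPFLS"
    " 101"
)
MODEL_LINE = (
    "Cdd:pfam03920 1 "
    "FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS"
    " 80"
)

if __name__ == "__main__":
    query_row = parse_row(QUERY_LINE)
    model_row = parse_row(MODEL_LINE)
    identical, aligned = count_identity(query_row.residues, model_row.residues)
    print(f"{query_row.label} {query_row.start}-{query_row.end} vs {model_row.label}")
    print(f"identical: {identical} of {aligned} aligned columns "
          f"({100.0 * identical / aligned:.2f}%)")
```

For hits that span several alignment blocks (for example the COG2319 and cd00200 hits), the residue strings of consecutive blocks would need to be concatenated per row before calling `count_identity`.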