|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
666-1004 |
9.53e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 135.54 E-value: 9.53e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 666 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 745
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 746 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 823
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 824 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 899
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 900 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 979
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 2462539142 980 LKKGGRCCCFSPDGKALAVGLNDGS 1004
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1369-1756 |
4.93e-31 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 127.72 E-value: 4.93e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1369 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1448
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1449 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF- 1526
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLTGHT-GAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1527 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1604
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1605 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1684
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539142 1685 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1756
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
4-379 |
1.82e-29 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.10 E-value: 1.82e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 4 ALRSLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDWKRGKM 83
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDLATGKL 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 84 LSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALNGDIYV 163
Cdd:COG2319 155 LRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDDGTVRL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 164 W--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR--GDH 236
Cdd:COG2319 189 WdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFSpdGRL 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 237 ILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAV 315
Cdd:COG2319 261 LASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAF 338
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539142 316 NADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 379
Cdd:COG2319 339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
836-1207 |
9.61e-24 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.15 E-value: 9.61e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 836 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 908
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 909 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 984
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 985 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1064
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1065 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1144
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462539142 1145 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1207
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
613-656 |
8.93e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 8.93e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2462539142 613 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 656
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1280-1351 |
8.71e-15 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 70.66 E-value: 8.71e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462539142 1280 VRGSRPPVSraPPQPEKLQTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1351
Cdd:pfam03451 4 IRGRPGAVY--PPSNYYPKDDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1638-1894 |
4.45e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 4.45e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1638 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1715
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1716 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1786
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1787 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1864
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270
....*....|....*....|....*....|....
gi 2462539142 1865 FDFPCPEKFVSLC----FVYYYQFTPNFDVLSSA 1894
Cdd:cd00200 204 WDLSTGKCLGTLRghenGVNSVAFSPDGYLLASG 237
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
666-1004 |
9.53e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 135.54 E-value: 9.53e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 666 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 745
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 746 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 823
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 824 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 899
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 900 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 979
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 2462539142 980 LKKGGRCCCFSPDGKALAVGLNDGS 1004
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
661-1053 |
2.35e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.74 E-value: 2.35e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 661 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 740
Cdd:COG2319 71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 741 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 817
Cdd:COG2319 146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 818 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 895
Cdd:COG2319 198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 896 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 969
Cdd:COG2319 244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 970 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1049
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 2462539142 1050 IYNV 1053
Cdd:COG2319 398 LWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1369-1756 |
4.93e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 127.72 E-value: 4.93e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1369 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1448
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1449 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF- 1526
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLTGHT-GAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1527 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1604
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1605 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1684
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539142 1685 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1756
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
4-379 |
1.82e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.10 E-value: 1.82e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 4 ALRSLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDWKRGKM 83
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDLATGKL 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 84 LSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALNGDIYV 163
Cdd:COG2319 155 LRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDDGTVRL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 164 W--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR--GDH 236
Cdd:COG2319 189 WdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFSpdGRL 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 237 ILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAV 315
Cdd:COG2319 261 LASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAF 338
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539142 316 NADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 379
Cdd:COG2319 339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
6-294 |
1.43e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.74 E-value: 1.43e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 6 RSLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDWKRGKMLS 85
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 86 MAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTYSGALNGDI 161
Cdd:cd00200 88 TLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 162 YVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVRSVCW--RG 234
Cdd:cd00200 160 KLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVNSVAFspDG 231
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462539142 235 DHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 294
Cdd:cd00200 232 YLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
836-1207 |
9.61e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.15 E-value: 9.61e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 836 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 908
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 909 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 984
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 985 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1064
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1065 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1144
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462539142 1145 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1207
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1417-1719 |
4.23e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 4.23e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1417 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1495
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1496 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1573
Cdd:cd00200 81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1574 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1650
Cdd:cd00200 153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462539142 1651 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1719
Cdd:cd00200 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
836-1081 |
3.32e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.17 E-value: 3.32e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 836 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 912
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 913 LVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCFSP 991
Cdd:cd00200 67 ASGSSDKTIRLWDlETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 992 DGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYITH 1071
Cdd:cd00200 146 DGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
|
250
....*....|
gi 2462539142 1072 IDWDIRGKLL 1081
Cdd:cd00200 225 VAFSPDGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
613-656 |
8.93e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 8.93e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2462539142 613 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 656
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1280-1351 |
8.71e-15 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 70.66 E-value: 8.71e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462539142 1280 VRGSRPPVSraPPQPEKLQTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1351
Cdd:pfam03451 4 IRGRPGAVY--PPSNYYPKDDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1638-1894 |
4.45e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 4.45e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1638 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1715
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1716 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1786
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1787 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1864
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270
....*....|....*....|....*....|....
gi 2462539142 1865 FDFPCPEKFVSLC----FVYYYQFTPNFDVLSSA 1894
Cdd:cd00200 204 WDLSTGKCLGTLRghenGVNSVAFSPDGYLLASG 237
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1690-1772 |
1.47e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 45.35 E-value: 1.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1690 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1768
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 2462539142 1769 DIPS 1772
Cdd:pfam12894 81 DLIT 84
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1650-1866 |
2.14e-04 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 46.06 E-value: 2.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1650 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1728
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1729 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKH-- 1805
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGKLlr 156
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462539142 1806 -LMDHA-AIDRITWA----TWTSILGDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1866
Cdd:COG2319 157 tLTGHSgAVTSVAFSpdgkLLASGSDDGTVRLWDLATGKllrtltghtGAVRSVAFSPDGKLLASGSADGTVRLWD 232
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
701-743 |
3.65e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.65e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462539142 701 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 743
Cdd:smart00320 1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1639-1671 |
3.87e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.87e-04
10 20 30
....*....|....*....|....*....|...
gi 2462539142 1639 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1671
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
255-294 |
1.18e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2462539142 255 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 294
Cdd:pfam00400 1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
256-294 |
2.04e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 2.04e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2462539142 256 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 294
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
703-743 |
4.46e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 4.46e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462539142 703 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 743
Cdd:pfam00400 2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
666-1004 |
9.53e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 135.54 E-value: 9.53e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 666 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 745
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 746 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 823
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 824 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 899
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 900 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 979
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 2462539142 980 LKKGGRCCCFSPDGKALAVGLNDGS 1004
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
661-1053 |
2.35e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.74 E-value: 2.35e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 661 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 740
Cdd:COG2319 71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 741 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 817
Cdd:COG2319 146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 818 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 895
Cdd:COG2319 198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 896 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 969
Cdd:COG2319 244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 970 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1049
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 2462539142 1050 IYNV 1053
Cdd:COG2319 398 LWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1369-1756 |
4.93e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 127.72 E-value: 4.93e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1369 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1448
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1449 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF- 1526
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLTGHT-GAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1527 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1604
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1605 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1684
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539142 1685 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1756
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
660-1004 |
2.64e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 125.79 E-value: 2.64e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 660 TQRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTV 739
Cdd:COG2319 112 LLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTGHSG-AVTSVAFSPDGKLLASGSDD--GTV 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 740 VLWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIGrkgyigTLGKNDtmmcavyGWTE 816
Cdd:COG2319 187 RLWDLATGKLLRTLTGHTGAVRSVAFSP---DgkLLASGSAdGTVRLWDLATGKLLR------TLTGHS-------GSVR 250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 817 EMAFS--------GTSTGDVCIWR--DIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikr 884
Cdd:COG2319 251 SVAFSpdgrllasGSADGTVRLWDlaTGELLRTLTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDLATGKLLRTL---- 326
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 885 aalapgskglllednpsiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLR 964
Cdd:COG2319 327 --------------------------------------------------TGH-TGAVRSVAFSPDGKTLASGSDDGTVR 355
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 2462539142 965 IWDLSPSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGS 1004
Cdd:COG2319 356 LWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGT 395
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
4-379 |
1.82e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.10 E-value: 1.82e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 4 ALRSLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDWKRGKM 83
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDLATGKL 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 84 LSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALNGDIYV 163
Cdd:COG2319 155 LRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDDGTVRL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 164 W--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR--GDH 236
Cdd:COG2319 189 WdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFSpdGRL 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 237 ILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAV 315
Cdd:COG2319 261 LASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAF 338
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539142 316 NADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 379
Cdd:COG2319 339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
201-743 |
5.68e-28 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 118.86 E-value: 5.68e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 201 RLWDLTFKPITVIDLRETDQGYKGLSVRSVCWRGDHILVGTQDSEIFEIVVQERNKPFLIMQGHcEGELWALAVHPTKPL 280
Cdd:COG2319 14 ADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 281 AVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSP 359
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 360 DGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGK-RLfYRMPGGKEVTSTEEik 434
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLLASGSADGTvRL-WDLATGKLLRTLTG-- 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 435 gvHWASWTCVSglevngiWpkysdindinSVDGnyigQVLVTADDYGIIKLFRypcLRKGAKFRKYIGHSAHVTNVRWSH 514
Cdd:COG2319 245 --HSGSVRSVA-------F----------SPDG----RLLASGSADGTVRLWD---LATGELLRTLTGHSGGVNSVAFSP 298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 515 DYQWVISiGGADHSVFQWkfiperklkdavhiapqesladshsdesdsdlsdvpeldseieqetqltyrrqvykedlpql 594
Cdd:COG2319 299 DGKLLAS-GSDDGTVRLW-------------------------------------------------------------- 315
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 595 keqckekqksatskrrERAPGNSIRLHfvhgyrgydcrsnlfytqigeivyhvaavgviynrqqntqrfyLGHDDDILCL 674
Cdd:COG2319 316 ----------------DLATGKLLRTL-------------------------------------------TGHTGAVRSV 336
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539142 675 TIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 743
Cdd:COG2319 337 AFSPDGKTLASG--SDDGTVRLWDLATGELLRTLTGHTG-AVTSVAFSPDGRTLASGSAD--GTVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
6-294 |
1.43e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.74 E-value: 1.43e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 6 RSLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDWKRGKMLS 85
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 86 MAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTYSGALNGDI 161
Cdd:cd00200 88 TLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 162 YVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVRSVCW--RG 234
Cdd:cd00200 160 KLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVNSVAFspDG 231
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462539142 235 DHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 294
Cdd:cd00200 232 YLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
192-532 |
1.01e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.01 E-value: 1.01e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 192 ATGGRDGCIRLWDLTFKPITVIDLRETDqgykglSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEG 267
Cdd:COG2319 94 ASASADGTVRLWDLATGLLLRTLTGHTG------AVRSVAFSpdGKTLASGSADGTVrlWDL---ATGKLLRTLTGH-SG 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 268 ELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIK 346
Cdd:COG2319 164 AVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT 243
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 347 DRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGKrlfyrmp 422
Cdd:COG2319 244 GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLAT-----GELLRTLtghsGGVNSVAFSPDGKLLASGSDDGT------- 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 423 ggkevtsteeIKGVHWASWTCVSGLEVNGIWpkysdindINSVDGNYIGQVLVTADDYGIIKLFRypcLRKGAKFRKYIG 502
Cdd:COG2319 312 ----------VRLWDLATGKLLRTLTGHTGA--------VRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTG 370
|
330 340 350
....*....|....*....|....*....|
gi 2462539142 503 HSAHVTNVRWSHDYQWVISiGGADHSVFQW 532
Cdd:COG2319 371 HTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
89-376 |
2.53e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 110.89 E-value: 2.53e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 89 GHTDRIFDISWDlYQPNKLVSCGV-KHIKFWSLCGNalTPKRGVFGKTGDLQTilCLACARDELTYSGALNGDIYVW--K 165
Cdd:cd00200 7 GHTGGVTCVAFS-PDGKLLATGSGdGTIKVWDLETG--ELLRTLKGHTGPVRD--VAASADGTYLASGSSDKTIRLWdlE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 166 GINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQGYKGlSVRSVCWRGDHILV--G 240
Cdd:cd00200 82 TGECVRTLTG-HTSYVSSVAFSPDGriLSSSSRDKTIKVWDVeTGKCLTTL------RGHTD-WVNSVAFSPDGTFVasS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 241 TQDSEIFEIVVQErNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADG 319
Cdd:cd00200 154 SQDGTIKLWDLRT-GKCVATLTGH-TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDG 231
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462539142 320 IHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIY 376
Cdd:cd00200 232 YLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-330 |
1.58e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.58 E-value: 1.58e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 46 HTHGIACLAFDLDGQRLVSVGLDSKnaVCVWDWKRGKMLSMAPGHTDRIFDISWDLYQpNKLVSCGV-KHIKFWSLCGNA 124
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 125 LTpkrGVFgkTGDLQTILCLACARD-ELTYSGALNGDIYVWKGIN--LIRTIQGaHAAGIFSMNACEEGF--ATGGRDGC 199
Cdd:cd00200 85 CV---RTL--TGHTSYVSSVAFSPDgRILSSSSRDKTIKVWDVETgkCLTTLRG-HTDWVNSVAFSPDGTfvASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 200 IRLWDL-TFKPITVIDlretdqGYKGlSVRSVCWRGD--HILVGTQDSEIFeivVQERNKPFLI--MQGHcEGELWALAV 274
Cdd:cd00200 159 IKLWDLrTGKCVATLT------GHTG-EVNSVAFSPDgeKLLSSSSDGTIK---LWDLSTGKCLgtLRGH-ENGVNSVAF 227
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462539142 275 HPTKPLAVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLALGMKDGS 330
Cdd:cd00200 228 SPDGYLLASGSEDGTIRVWDLRTGECVQTLSGhTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1398-1674 |
4.81e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 110.00 E-value: 4.81e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1398 SIHIWDAMNKQTLSILRCyHSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTq 1477
Cdd:COG2319 143 TVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL- 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1478 FVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF-TGTISGDVCVWK-DHILCRIVARAHNGPV 1554
Cdd:COG2319 219 LASGSAdGTVRLWDLATGKLLR------TLTGHS-GSVRSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGV 291
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1555 FAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRC-RAFRLETGQatdcVRSVC-RGKGKILVGTRNAEIIEVGE-KN 1631
Cdd:COG2319 292 NSVAFSPDGKLLASGS------DDGTVRLWDLATGKLlRTLTGHTGA----VRSVAfSPDGKTLASGSDDGTVRLWDlAT 361
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 2462539142 1632 AACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIAD 1674
Cdd:COG2319 362 GELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1369-1866 |
3.46e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.30 E-value: 3.46e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1369 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1448
Cdd:COG2319 30 LLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1449 WQEGAKIASRAGHNQRIFVAEFRPDSDTqFVSVGV-KHVKFWTLAGRALLSkkgllstledarmqtmlaiafgannlTFT 1527
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSPDGKT-LASGSAdGTVRLWDLATGKLLR--------------------------TLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1528 GtisgdvcvwkdhilcrivaraHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdCVR 1606
Cdd:COG2319 160 G---------------------HSGAVTSV-AFSPDGkLLASGS------DDGTVRLWD----------LATGK---LLR 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1607 SVcrgkgkilvgtrnaeiievgeknaacnilvNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHA 1686
Cdd:COG2319 199 TL------------------------------TGH-TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT-GHS 246
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1687 A--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRI 1764
Cdd:COG2319 247 GsvRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL 326
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1765 sycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKhlmdhaaidritwatwtsilgdeVLGIWSRHAekADVNCA 1844
Cdd:COG2319 327 ---TGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE-----------------------LLRTLTGHT--GAVTSV 378
|
490 500
....*....|....*....|..
gi 2462539142 1845 CVSHSGISLVTGDDFGMVKLFD 1866
Cdd:COG2319 379 AFSPDGRTLASGSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
836-1207 |
9.61e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.15 E-value: 9.61e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 836 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 908
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 909 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 984
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 985 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1064
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1065 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1144
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462539142 1145 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1207
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
653-870 |
1.28e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.41 E-value: 1.28e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 653 IYNRQQNTQRFYL-GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASV 731
Cdd:cd00200 77 LWDLETGECVRTLtGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDW-VNSVAFSPDGTFVASS 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 732 GIDdsHTVVLWDWKKGEKLSIARGSKDKIFVVKmnpYVPD--KLITAGI-KHMKFWRKAGGGLigrkgyIGTL-GKNDTM 807
Cdd:cd00200 154 SQD--GTIKLWDLRTGKCVATLTGHTGEVNSVA---FSPDgeKLLSSSSdGTIKLWDLSTGKC------LGTLrGHENGV 222
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462539142 808 MCAVYGWTEEMAFSGTSTGDVCIW--RDIFLVKTVKAHDGPVFSM--HALEKGFVTGGKDGIVALWD 870
Cdd:cd00200 223 NSVAFSPDGYLLASGSEDGTIRVWdlRTGECVQTLSGHTNSVTSLawSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1417-1719 |
4.23e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 4.23e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1417 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1495
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1496 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1573
Cdd:cd00200 81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1574 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1650
Cdd:cd00200 153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462539142 1651 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1719
Cdd:cd00200 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
169-533 |
1.85e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.94 E-value: 1.85e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 169 LIRTIQGaHAAGIFSM--NACEEGFATGGRDGCIRLWDLTFKpitviDLRETDQGYKGlSVRSVCWRGDH--ILVGTQDS 244
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADGtyLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 245 EIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLA 323
Cdd:cd00200 74 TIR-LWDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 324 LGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQRyKKVGECLGSLSFITHLDWSSD 403
Cdd:cd00200 152 SSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSPD 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 404 SRYLqtndgngkrlfyrmpggkevtsteeikgvhwaswtcvsglevngiwpkysdindinsvdgnyigqvlVTADDYGII 483
Cdd:cd00200 231 GYLL-------------------------------------------------------------------ASGSEDGTI 243
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 2462539142 484 KLFRypcLRKGAKFRKYIGHSAHVTNVRWSHDYQWVISiGGADHSVFQWK 533
Cdd:cd00200 244 RVWD---LRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
836-1081 |
3.32e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.17 E-value: 3.32e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 836 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 912
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 913 LVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCFSP 991
Cdd:cd00200 67 ASGSSDKTIRLWDlETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 992 DGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYITH 1071
Cdd:cd00200 146 DGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
|
250
....*....|
gi 2462539142 1072 IDWDIRGKLL 1081
Cdd:cd00200 225 VAFSPDGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
613-656 |
8.93e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 8.93e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2462539142 613 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 656
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1359-1671 |
1.20e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 85.46 E-value: 1.20e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1359 YQEHNDDILCLTVNQHPKFInivATGqvgdSADmsataPSIHIWDaMNKQTLSILRCYHSKGVCSVSFSATGKLLLSVGL 1438
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLL---ATG----SGD-----GTIKVWD-LETGELLRTLKGHTGPVRDVAASADGTYLASGSS 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1439 DpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdtQFVSVGVKH--VKFWTLAgrallsKKGLLSTLEDARMQTMlA 1516
Cdd:cd00200 72 D--KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE------TGKCLTTLRGHTDWVN-S 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1517 IAF-GANNLTFTGTISGDVCVWKDHIL-CRIVARAHNGPVFAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRCRA- 1593
Cdd:cd00200 141 VAFsPDGTFVASSSQDGTIKLWDLRTGkCVATLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLGt 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539142 1594 FRLETGQATDCVRSvcrGKGKILVGTRNAEIIEVGE-KNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1671
Cdd:cd00200 215 LRGHENGVNSVAFS---PDGYLLASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1280-1351 |
8.71e-15 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 70.66 E-value: 8.71e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462539142 1280 VRGSRPPVSraPPQPEKLQTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1351
Cdd:pfam03451 4 IRGRPGAVY--PPSNYYPKDDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1605-1866 |
2.88e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 2.88e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1605 VRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVN 1682
Cdd:cd00200 12 VTCVAfsPDGKLLATGSGDGTIKVWDLETGELLRTLKGH-TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1683 lGHAA--RTVCYSPEGDMVAIGMKNGefiillvsSLKIW----GKK----RDRRCAIHDIRFSPDSRYLAVGSSENSVDF 1752
Cdd:cd00200 91 -GHTSyvSSVAFSPDGRILSSSSRDK--------TIKVWdvetGKClttlRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1753 YDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKHLMD-HAAIDRITWATW-------TSIL 1824
Cdd:cd00200 162 WDLRTGKCVATL---TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTlRGHENGVNSVAFspdgyllASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2462539142 1825 GDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1866
Cdd:cd00200 239 EDGTIRVWDLRTGEcvqtlsghtNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1638-1894 |
4.45e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 4.45e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1638 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1715
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1716 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1786
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1787 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1864
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270
....*....|....*....|....*....|....
gi 2462539142 1865 FDFPCPEKFVSLC----FVYYYQFTPNFDVLSSA 1894
Cdd:cd00200 204 WDLSTGKCLGTLRghenGVNSVAFSPDGYLLASG 237
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1690-1772 |
1.47e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 45.35 E-value: 1.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1690 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1768
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 2462539142 1769 DIPS 1772
Cdd:pfam12894 81 DLIT 84
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1650-1866 |
2.14e-04 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 46.06 E-value: 2.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1650 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1728
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1729 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKH-- 1805
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGKLlr 156
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462539142 1806 -LMDHA-AIDRITWA----TWTSILGDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1866
Cdd:COG2319 157 tLTGHSgAVTSVAFSpdgkLLASGSDDGTVRLWDLATGKllrtltghtGAVRSVAFSPDGKLLASGSADGTVRLWD 232
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1639-1671 |
2.95e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 2.95e-04
10 20 30
....*....|....*....|....*....|...
gi 2462539142 1639 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1671
Cdd:pfam00400 8 EGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
701-743 |
3.65e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.65e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462539142 701 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 743
Cdd:smart00320 1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1639-1671 |
3.87e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.87e-04
10 20 30
....*....|....*....|....*....|...
gi 2462539142 1639 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1671
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
255-294 |
1.18e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2462539142 255 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 294
Cdd:pfam00400 1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
191-295 |
2.02e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 42.60 E-value: 2.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 191 FATGGRDGCIRLWDLTF--KPITVIDLRETdqGYKGLSVRSvcwRGDHILVGTQDSEIFEIVVQErNKPFLIMQGHCEGE 268
Cdd:cd22857 195 IVTGTGYHQVRLYDTRAqrRPVVSVDFGET--PIKAVAEDP---DGHTVYVGDTSGDLASIDLRT-GKLLGCFKGKCGGS 268
|
90 100
....*....|....*....|....*..
gi 2462539142 269 LWALAVHPTKPLAVTGSDDRSVRIWSL 295
Cdd:cd22857 269 IRSIARHPELPLIASCGLDRYLRIWDT 295
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
256-294 |
2.04e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 2.04e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2462539142 256 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 294
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
935-967 |
2.97e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 2.97e-03
10 20 30
....*....|....*....|....*....|...
gi 2462539142 935 QGHmEGEVWGLATHPYLPICATVSDDKTLRIWD 967
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
703-743 |
4.46e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 4.46e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462539142 703 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 743
Cdd:pfam00400 2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1647-1757 |
9.64e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 40.06 E-value: 9.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539142 1647 WGLATHPSRDF-FLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVCYSPEGDMVAIGMKNGEFIILLVS-----SLKIWg 1720
Cdd:COG3391 113 RGLAVDPDGGRlYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVSvidtaTGKVV- 191
|
90 100 110
....*....|....*....|....*....|....*...
gi 2462539142 1721 KKRDRRCAIHDIRFSPDSRYLAVGSSE-NSVDFYDLTL 1757
Cdd:COG3391 192 ATIPVGGGPVGVAVSPDGRRLYVANRGsNTSNGGSNTV 229
|
|
|