|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
365-745 |
1.28e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 143.90 E-value: 1.28e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 365 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNL 443
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 444 DSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELH 523
Cdd:COG2319 150 ATG------------------------KLLRTLTGHSGA-------------VTSVAFSPDGKLLASGSDDGTVRLWDLA 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 524 FMDELIKVEAHDAEVLCLEYSkPEtGvTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFagTRDVQMI-SCGA 602
Cdd:COG2319 193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAF--SPDGRLLaSGSA 266
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 603 DKSIYFRSAQqaSDGLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHV 682
Cdd:COG2319 267 DGTVRLWDLA--TGELLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAF 338
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755519504 683 DPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL 745
Cdd:COG2319 339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
365-743 |
3.03e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.58 E-value: 3.03e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 365 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqRACLPSGTFLTCSSDNTIRFWNLD 444
Cdd:cd00200 14 CVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG---HTGPVRDV---------AASADGTYLASGSSDKTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 445 SASDTRwqknIFsdsllkvvyvendIQHLQDlshfpdrgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELHF 524
Cdd:cd00200 82 TGECVR----TL-------------TGHTSY--------------------VSSVAFSPDGRILSSSSRDKTIKVWDVET 124
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 525 MDELIKVEAHDAEVLCLEYSKPETgvtLLASASRDRLIHVLNVEKNYnLEQTLDDHSSSITAIKFAGTRDvQMISCGADK 604
Cdd:cd00200 125 GKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDLRTGK-CVATLTGHTGEVNSVAFSPDGE-KLLSSSSDG 199
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 605 SIyfrsaqqasdglhfvrthhvaekttlydmdiditqkyvavacqdrnvRVYNTVSGKQKKCYkgsQGDEGSLLKVHVDP 684
Cdd:cd00200 200 TI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSVAFSP 229
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 755519504 685 SGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:cd00200 230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
94-445 |
7.72e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.07 E-value: 7.72e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 94 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 173
Cdd:COG2319 144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 174 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 251
Cdd:COG2319 222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 252 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 329
Cdd:COG2319 290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 330 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 409
Cdd:COG2319 324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
|
330 340 350
....*....|....*....|....*....|....*..
gi 755519504 410 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 445
Cdd:COG2319 377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
112-442 |
1.69e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 1.69e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 112 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 190
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 191 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVTSTvplvgrsgilgelHNN-IFCGVACGRGRMA 267
Cdd:cd00200 84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTTLRG-------------HTDwVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 268 gntFCVSYSGL-----LCQFNEKRVL---DKWINlkvslssCLCVSD--ELIFCGCTDGIVRIFqahsllyltNLPKPHY 337
Cdd:cd00200 151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSPdgEKLLSSSSDGTIKLW---------DLSTGKC 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 338 LGVDVAHGldssflfhrkaEAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypef 417
Cdd:cd00200 212 LGTLRGHE-----------NGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL------ 267
|
330 340
....*....|....*....|....*.
gi 755519504 418 edqrACLPSGTFL-TCSSDNTIRFWN 442
Cdd:cd00200 268 ----AWSPDGKRLaSGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
705-744 |
2.89e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.68 E-value: 2.89e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 755519504 705 SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH 744
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1009-1515 |
1.33e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1009 AAAHSSAPQTDP-----GPHLTMTAGKPEYPSteELSQPELPGLGNGSLPQTPEQEKFLRHhfETLTDAPTEGPMGIFLE 1083
Cdd:PHA03247 2562 AAPDRSVPPPRPaprpsEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPPSPLPPD--THAPDPPPPSPSPAANE 2637
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1084 LFHGSLGDIKISETEDYFFNP---RLSISTQFLSRLQKTSrCPPRLPLHLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSS 1160
Cdd:PHA03247 2638 PDPHPPPTVPPPERPRDDPAPgrvSRPRRARRLGRAAQAS-SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALV 2716
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1161 DGTNVLSG-QKAEETQEALSLL-VPPLSGLTSCVP----PSSVPPTDRKPPTPT--SVLTTGREQSISAPSSCSYLESTT 1232
Cdd:PHA03247 2717 SATPLPPGpAAARQASPALPAApAPPAVPAGPATPggpaRPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1233 SSHAKTTRSISLGDSEGPVTAELPQSLHKPLSPGqelqaiPTTVALTSSIKDHEPAPlswgnhearASLKLTLSSVCEQL 1312
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP------PTSAQPTAPPPPPGPPP---------PSLPLGGSVAPGGD 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1313 LS--PPPQEPPITHVWSQEPvdvppsmavTVASFCAPsPVDMSTlglhSSMFLPKTSASGPLTPPAHLQLLETRS----- 1385
Cdd:PHA03247 2862 VRrrPPSRSPAAKPAAPARP---------PVRRLARP-AVSRST----ESFALPPDQPERPPQPQAPPPPQPQPQppppp 2927
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1386 ------RVPGSTAALLEPTPDASGVIADSPGHWD----------TEVP---TPELLGSVESVLHRLQTAFQEALDLYRML 1446
Cdd:PHA03247 2928 qpqpppPPPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrVAVPrfrVPQPAPSREAPASSTPPLTGHSLSRVSSW 3007
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1447 VSSSQLGPEQQQAQTELASTFhWILNQLEASNCMAAANLAPPQT-LPSPDPLSLPTLCPLASPNLQALLE 1515
Cdd:PHA03247 3008 ASSLALHEETDPPPVSLKQTL-WPPDDTEDSDADSLFDSDSERSdLEALDPLPPEPHDPFAHEPDPATPE 3076
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
706-743 |
1.88e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 1.88e-04
10 20 30
....*....|....*....|....*....|....*...
gi 755519504 706 GECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
146-186 |
4.27e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 4.27e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 755519504 146 KTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 186
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
365-745 |
1.28e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 143.90 E-value: 1.28e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 365 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNL 443
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 444 DSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELH 523
Cdd:COG2319 150 ATG------------------------KLLRTLTGHSGA-------------VTSVAFSPDGKLLASGSDDGTVRLWDLA 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 524 FMDELIKVEAHDAEVLCLEYSkPEtGvTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFagTRDVQMI-SCGA 602
Cdd:COG2319 193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAF--SPDGRLLaSGSA 266
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 603 DKSIYFRSAQqaSDGLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHV 682
Cdd:COG2319 267 DGTVRLWDLA--TGELLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAF 338
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755519504 683 DPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL 745
Cdd:COG2319 339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
365-743 |
3.03e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.58 E-value: 3.03e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 365 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqRACLPSGTFLTCSSDNTIRFWNLD 444
Cdd:cd00200 14 CVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG---HTGPVRDV---------AASADGTYLASGSSDKTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 445 SASDTRwqknIFsdsllkvvyvendIQHLQDlshfpdrgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELHF 524
Cdd:cd00200 82 TGECVR----TL-------------TGHTSY--------------------VSSVAFSPDGRILSSSSRDKTIKVWDVET 124
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 525 MDELIKVEAHDAEVLCLEYSKPETgvtLLASASRDRLIHVLNVEKNYnLEQTLDDHSSSITAIKFAGTRDvQMISCGADK 604
Cdd:cd00200 125 GKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDLRTGK-CVATLTGHTGEVNSVAFSPDGE-KLLSSSSDG 199
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 605 SIyfrsaqqasdglhfvrthhvaekttlydmdiditqkyvavacqdrnvRVYNTVSGKQKKCYkgsQGDEGSLLKVHVDP 684
Cdd:cd00200 200 TI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSVAFSP 229
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 755519504 685 SGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:cd00200 230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
493-745 |
4.60e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 112.81 E-value: 4.60e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 493 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSKpetGVTLLASASRDRLIHVLNVEKNyN 572
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 573 LEQTLDDHSSSITAIKFAGTRDVqMISCGADKSIyfRSAQQASDGLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 652
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTI--KVWDVETGKCLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 653 VRVYNTVSGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLI 732
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 755519504 733 TVSGDSCVFIWHL 745
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
43-568 |
1.08e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 111.54 E-value: 1.08e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 43 LRRRTRLAAAPEDTVQNRVTLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVVLNPKENKQQHIFNTTRKSLSALAFSP 122
Cdd:COG2319 9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 123 DGKYIVTGenGHRPAVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 201
Cdd:COG2319 89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 202 VIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVtstvpLVGrsgilgelHNNIFCGVAcgrgrmagntfcVSYSGllc 280
Cdd:COG2319 165 VTSVAFSPDGKLLASGSdDGTVRLWDLATGKLLRT-----LTG--------HTGAVRSVA------------FSPDG--- 216
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 281 qfnekrvldkwinlkvslssclcvsdELIFCGCTDGIVRIFQAHSLLYLTNLPkphylgvdvAHGldssflfhrkaEAVY 360
Cdd:COG2319 217 --------------------------KLLASGSADGTVRLWDLATGKLLRTLT---------GHS-----------GSVR 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 361 pdtvALTFDPVHQWLSCVYKDHSIYIWDVKDiDEVSKIWSElfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIR 439
Cdd:COG2319 251 ----SVAFSPDGRLLASGSADGTVRLWDLAT-GELLRTLTG--HSGGVNSV----------AFSPDGKLLaSGSDDGTVR 313
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 440 FWNLDSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRI 519
Cdd:COG2319 314 LWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVRL 356
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 755519504 520 HELHFMDELIKVEAHDAEVLCLEYSKPEtgvTLLASASRDRLIHVLNVE 568
Cdd:COG2319 357 WDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
534-743 |
2.90e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 107.42 E-value: 2.90e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 534 HDAEVLCLEYSkpeTGVTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGTRDvQMISCGADKSIYFrsaqQ 613
Cdd:cd00200 8 HTGGVTCVAFS---PDGKLLATGSGDGTIKVWDLETG-ELLRTLKGHTGPVRDVAASADGT-YLASGSSDKTIRL----W 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 614 ASDGLHFVRTHHVAEKTtLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDegsLLKVHVDPSGTFLATSC 693
Cdd:cd00200 79 DLETGECVRTLTGHTSY-VSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW---VNSVAFSPDGTFVASSS 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 755519504 694 SDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:cd00200 155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
481-751 |
2.21e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 104.61 E-value: 2.21e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 481 DRGSENGTPMDMKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkPETgvTLLASASRDR 560
Cdd:COG2319 66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 561 LIHVLNVEKNyNLEQTLDDHSSSITAIKFA--GTRdvqMISCGADKSIYFRSAQQASDgLHFVRTHhvaeKTTLYDMDID 638
Cdd:COG2319 143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSpdGKL---LASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFS 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 639 ITQKYVAVACQDRNVRVYNTVSGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEI 718
Cdd:COG2319 214 PDGKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGG 290
|
250 260 270
....*....|....*....|....*....|....*
gi 755519504 719 VTGMKFTYDCRHLITVSGDSCVFIWHL--GPEITT 751
Cdd:COG2319 291 VNSVAFSPDGKLLASGSDDGTVRLWDLatGKLLRT 325
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
94-445 |
7.72e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.07 E-value: 7.72e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 94 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 173
Cdd:COG2319 144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 174 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 251
Cdd:COG2319 222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 252 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 329
Cdd:COG2319 290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 330 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 409
Cdd:COG2319 324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
|
330 340 350
....*....|....*....|....*....|....*..
gi 755519504 410 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 445
Cdd:COG2319 377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
3-519 |
1.29e-22 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 102.30 E-value: 1.29e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 3 AALAAGGYTRSDTIEKLSSVMAGVPARRNQSSPPPAPPLCLRRRTRLAAAPEDTVQNRVTLEKVLGITAQNSSGLTCDPG 82
Cdd:COG2319 11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 83 TGHVAYLAGCVVVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGENGHRpaVRIWDVEEKTQVAEMLGHKYGVACV 162
Cdd:COG2319 91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTGHSGAVTSV 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 163 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVtstvp 240
Cdd:COG2319 169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLWDLATGKLLRT----- 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 241 LVGRSGILgelhnnifcgvacgrgrmagntFCVSYSGllcqfnekrvldkwinlkvslssclcvSDELIFCGCTDGIVRI 320
Cdd:COG2319 242 LTGHSGSV----------------------RSVAFSP---------------------------DGRLLASGSADGTVRL 272
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 321 FqahsllyltnlpkphylgvDVAHGLDSSFLFHRKAeAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWS 400
Cdd:COG2319 273 W-------------------DLATGELLRTLTGHSG-GVN----SVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTG 328
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 401 elfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNLdsasDTRWQKNIFSdsllkvvyvendiQHlqdlshf 479
Cdd:COG2319 329 ---HTGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDL----ATGELLRTLT-------------GH------- 371
|
490 500 510 520
....*....|....*....|....*....|....*....|
gi 755519504 480 pdrgsengtpmdmKAGVRVMQVSPDGQHLASGDRSGNLRI 519
Cdd:COG2319 372 -------------TGAVTSVAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
112-442 |
1.69e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 1.69e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 112 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 190
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 191 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVTSTvplvgrsgilgelHNN-IFCGVACGRGRMA 267
Cdd:cd00200 84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTTLRG-------------HTDwVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 268 gntFCVSYSGL-----LCQFNEKRVL---DKWINlkvslssCLCVSD--ELIFCGCTDGIVRIFqahsllyltNLPKPHY 337
Cdd:cd00200 151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSPdgEKLLSSSSDGTIKLW---------DLSTGKC 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 338 LGVDVAHGldssflfhrkaEAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypef 417
Cdd:cd00200 212 LGTLRGHE-----------NGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL------ 267
|
330 340
....*....|....*....|....*.
gi 755519504 418 edqrACLPSGTFL-TCSSDNTIRFWN 442
Cdd:cd00200 268 ----AWSPDGKRLaSGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
300-606 |
1.14e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 88.16 E-value: 1.14e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 300 SCLCVSD--ELIFCGCTDGIVRIFQAHSLLYLTNLpKPHYLGV-DVAHGLDSSFLF------------------------ 352
Cdd:cd00200 13 TCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTL-KGHTGPVrDVAASADGTYLAsgssdktirlwdletgecvrtltg 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 353 HRKAeaVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIwseLFHSSFVWNVEVypefedqracLPSGTFLTC 432
Cdd:cd00200 92 HTSY--VS----SVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL---RGHTDWVNSVAF----------SPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 433 SS-DNTIRFWNLDSasdtrwqknifsdsllkvvyvendiqhlqdlshfpdrGSENGTPMDMKAGVRVMQVSPDGQHLASG 511
Cdd:cd00200 153 SSqDGTIKLWDLRT-------------------------------------GKCVATLTGHTGEVNSVAFSPDGEKLLSS 195
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 512 DRSGNLRIHELHfMDELIKV-EAHDAEVLCLEYSKPEtgvTLLASASRDRLIHVLNVEKNYNLeQTLDDHSSSITAIKFA 590
Cdd:cd00200 196 SSDGTIKLWDLS-TGKCLGTlRGHENGVNSVAFSPDG---YLLASGSEDGTIRVWDLRTGECV-QTLSGHTNSVTSLAWS 270
|
330
....*....|....*.
gi 755519504 591 GTRDVqMISCGADKSI 606
Cdd:cd00200 271 PDGKR-LASGSADGTI 285
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
154-564 |
1.03e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 82.38 E-value: 1.03e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 154 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASt 231
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLWDLETG- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 232 eaKVTSTvpLVGrsgilgelHNnifcgvacgrgrmaGNTFCVSYSgllcqfnekrvldkwinlkvslssclcVSDELIFC 311
Cdd:cd00200 84 --ECVRT--LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 312 GCTDGIVRIFQAHSLLYLTnlpkphylgvdvahgldsSFLFHRKaeavypDTVALTFDPVHQWLSCVYKDHSIYIWDVKD 391
Cdd:cd00200 111 SSRDKTIKVWDVETGKCLT------------------TLRGHTD------WVNSVAFSPDGTFVASSSQDGTIKLWDLRT 166
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 392 IdevSKIWSELFHSSFVWNVEVYPEfedqraclpSGTFLTCSSDNTIRFWNLDSAsdtrwqknifsdsllkvvyvendiQ 471
Cdd:cd00200 167 G---KCVATLTGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------K 210
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 472 HLQDLshfpdRGSENgtpmdmkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkPETGVt 551
Cdd:cd00200 211 CLGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS-PDGKR- 275
|
410
....*....|...
gi 755519504 552 lLASASRDRLIHV 564
Cdd:cd00200 276 -LASGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
481-751 |
1.05e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 78.03 E-value: 1.05e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 481 DRGSENGTPMDMKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkpeTGVTLLASASRDR 560
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 561 LIHVLNVEKNYNLeQTLDDHSSSITAIKFAgtrdvqmiscgadksiyfrsaqqaSDGlhfvrthhvaekttlydmdidit 640
Cdd:COG2319 101 TVRLWDLATGLLL-RTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 641 qKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVT 720
Cdd:COG2319 133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
|
250 260 270
....*....|....*....|....*....|...
gi 755519504 721 GMKFTYDCRHLITVSGDSCVFIWHL--GPEITT 751
Cdd:COG2319 209 SVAFSPDGKLLASGSADGTVRLWDLatGKLLRT 241
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
633-756 |
1.46e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 70.06 E-value: 1.46e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 633 YDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSqgdEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKM 712
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGH---TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTL 89
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 755519504 713 FGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL-GPEITTCMKQH 756
Cdd:cd00200 90 TGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVeTGKCLTTLRGH 134
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
88-225 |
1.53e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 70.06 E-value: 1.53e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 88 YLAGC----VVVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGENGHrpAVRIWDVEEKTQVAEMLGHKYGVACVA 163
Cdd:cd00200 107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 755519504 164 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 225
Cdd:cd00200 185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
705-744 |
2.89e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.68 E-value: 2.89e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 755519504 705 SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH 744
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1009-1515 |
1.33e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1009 AAAHSSAPQTDP-----GPHLTMTAGKPEYPSteELSQPELPGLGNGSLPQTPEQEKFLRHhfETLTDAPTEGPMGIFLE 1083
Cdd:PHA03247 2562 AAPDRSVPPPRPaprpsEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPPSPLPPD--THAPDPPPPSPSPAANE 2637
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1084 LFHGSLGDIKISETEDYFFNP---RLSISTQFLSRLQKTSrCPPRLPLHLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSS 1160
Cdd:PHA03247 2638 PDPHPPPTVPPPERPRDDPAPgrvSRPRRARRLGRAAQAS-SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALV 2716
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1161 DGTNVLSG-QKAEETQEALSLL-VPPLSGLTSCVP----PSSVPPTDRKPPTPT--SVLTTGREQSISAPSSCSYLESTT 1232
Cdd:PHA03247 2717 SATPLPPGpAAARQASPALPAApAPPAVPAGPATPggpaRPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1233 SSHAKTTRSISLGDSEGPVTAELPQSLHKPLSPGqelqaiPTTVALTSSIKDHEPAPlswgnhearASLKLTLSSVCEQL 1312
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP------PTSAQPTAPPPPPGPPP---------PSLPLGGSVAPGGD 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1313 LS--PPPQEPPITHVWSQEPvdvppsmavTVASFCAPsPVDMSTlglhSSMFLPKTSASGPLTPPAHLQLLETRS----- 1385
Cdd:PHA03247 2862 VRrrPPSRSPAAKPAAPARP---------PVRRLARP-AVSRST----ESFALPPDQPERPPQPQAPPPPQPQPQppppp 2927
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1386 ------RVPGSTAALLEPTPDASGVIADSPGHWD----------TEVP---TPELLGSVESVLHRLQTAFQEALDLYRML 1446
Cdd:PHA03247 2928 qpqpppPPPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrVAVPrfrVPQPAPSREAPASSTPPLTGHSLSRVSSW 3007
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 1447 VSSSQLGPEQQQAQTELASTFhWILNQLEASNCMAAANLAPPQT-LPSPDPLSLPTLCPLASPNLQALLE 1515
Cdd:PHA03247 3008 ASSLALHEETDPPPVSLKQTL-WPPDDTEDSDADSLFDSDSERSdLEALDPLPPEPHDPFAHEPDPATPE 3076
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
706-743 |
1.88e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 1.88e-04
10 20 30
....*....|....*....|....*....|....*...
gi 755519504 706 GECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
635-727 |
3.25e-04 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 41.11 E-value: 3.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519504 635 MDIditqkyVAVACQDRNVRVYNTvSGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISLIDFYSGECVAKMF 713
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 755519504 714 GHSEIVTGMKFTYD 727
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
118-171 |
4.19e-03 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 41.95 E-value: 4.19e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 755519504 118 LAFSPDGKYIV---TGENGHRpAVRIWDVEEK--TQVAEmlgHKYGVACVAFSPNMKHI 171
Cdd:COG4946 437 LAWSPDSKWLAyskPGPNQLS-QIFLYDVETGktVQLTD---GRYDDGSPAFSPDGKYL 491
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
146-186 |
4.27e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 4.27e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 755519504 146 KTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 186
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
714-762 |
6.12e-03 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 40.40 E-value: 6.12e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 755519504 714 GHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL-GPEITTCMKQHLLEINH 762
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLeTGELLRTLKGHTGPVRD 56
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
103-142 |
9.22e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.37 E-value: 9.22e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 755519504 103 KQQHIFNTTRKSLSALAFSPDGKYIVTG-ENGHrpaVRIWD 142
Cdd:smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGsDDGT---IKLWD 40
|
|
|