Conserved domains on  [gi|21541824|ref|NP_005068|]

transducin-like enhancer protein 1 isoform 2 [Homo sapiens]

Protein Classification

WD40 domain-containing protein (domain architecture ID 10511324)

WD40 domain-containing protein similar to Drosophila melanogaster protein groucho and Homo sapiens transducin-like enhancer proteins; also contains a Groucho/TLE N-terminal Q-rich domain

List of domain hits

Name Accession Description Interval E-value
TLE_N pfam03920
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ...
18-133 8.80e-94

Groucho/TLE N-terminal Q-rich domain; the N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.


Pssm-ID: 461094  Cd Length: 117  Bit Score: 287.78  E-value: 8.80e-94
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824    18 FKFTIPESLDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLS 97
Cdd:pfam03920   1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 21541824    98 QEHQQQVAQAVERAKQVTMAELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920  81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
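The TLE_N alignment above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts identical, mismatched, and gapped columns between the query row and the pfam03920 row. The function name `alignment_stats` is hypothetical, and the comparison is case-insensitive on the assumption that lowercase letters only flag residues outside the aligned block.

```python
def alignment_stats(query: str, subject: str) -> dict:
    """Count identical, mismatched, and gapped columns in a pairwise alignment."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    stats = {"columns": len(query), "identities": 0, "mismatches": 0, "gaps": 0}
    # Compare case-insensitively (assumption: lowercase only marks
    # residues outside the aligned block, not a different residue).
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            stats["gaps"] += 1
        elif q == s:
            stats["identities"] += 1
        else:
            stats["mismatches"] += 1
    return stats


# Rows copied from the TLE_N alignment above (query residues 18-133).
QUERY = (
    "FKFTIPESLDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLS"
    "QEHQQQVAQAVERAKQVTMAELNAIIGQ-QQLQAQHL"
)
PFAM03920 = (
    "FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS"
    "QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL"
)

stats = alignment_stats(QUERY, PFAM03920)
percent_identity = 100.0 * stats["identities"] / stats["columns"]
print(stats, round(percent_identity, 1))
```

Percent identity here is taken over all alignment columns, including the gapped one; dividing by ungapped columns instead is an equally common convention.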
WD40 COG2319
WD40 repeat [General function prediction only];
483-768 1.20e-43

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 162.77  E-value: 1.20e-43
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDISHPGNKSPVSqldclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 561
Cdd:COG2319  77 HTAAVLSVAFSPDGRLLASASAdGTVRLWDLATGLLLRTLT-----GHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA- 150
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:COG2319 151 -TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVR 229
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGKD 719
Cdd:COG2319 230 LWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHSGGVNSVAFSPDGKLLASGSDD 309
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|
gi 21541824 720 NLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 310 GTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDL 359
PHA03247 super family cl33720
large tegument protein UL36; Provisional
319-476 6.66e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 6.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   319 TPTPRSDMPTPGTSATPGLRPGLGKPPA--IDPLVNQAAAGLRTPLAV--PGPYPAPFG-------MVPHAGMNGELTSP 387
Cdd:PHA03247 2707 TPEPAPHALVSATPLPPGPAAARQASPAlpAAPAPPAVPAGPATPGGParPARPPTTAGppapappAAPAAGPPRRLTRP 2786
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   388 GAAYASLHNMSPQMSAAAAAAAVVAYGRSPMV-------GFDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVP 460
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALppaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRP 2866
                         170
                  ....*....|....*.
gi 21541824   461 FPPDALIGPGIPRHAR 476
Cdd:PHA03247 2867 PSRSPAAKPAAPARPP 2882
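Each hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. For readers who want to work with these values programmatically, here is a small sketch that parses one such line with a regular expression. The names `SUMMARY_RE` and `parse_hit_summary` are hypothetical, and the pattern assumes the field order and labels shown in this listing.

```python
import re

# Pattern for the per-hit summary line; field order is assumed to match
# the lines shown in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_summary(line: str) -> dict:
    """Turn one 'Pssm-ID: ...' summary line into a dict of typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a hit summary line: %r" % line)
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["multi"] is not None,
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


hit = parse_hit_summary(
    "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 162.77  E-value: 1.20e-43"
)
print(hit)
```

Parsing the E-value as a float keeps hits sortable by significance without string comparisons.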
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
483-767 2.09e-38

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 144.40  E-value: 2.09e-38
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCKLLPDGCTLIVGGEASTLSIWDLaa 561
Cdd:cd00200   8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:cd00200  81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTG 717
Cdd:cd00200 161 LWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGS 238
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|.
gi 21541824 718 KDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 767
Cdd:cd00200 239 EDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
605-644 2.87e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 47.31  E-value: 2.87e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 21541824    605 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
595-766 8.78e-07

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 52.40  E-value: 8.78e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824  595 DGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-CPTGEWL 672
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpSESGRSL 633
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824  673 AVGMESSNVEVLHVNKPdkyQLHL-----HESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSSVLS--- 744
Cdd:PLN00181 634 AFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHSFMGhtn 709
                        170       180
                 ....*....|....*....|....*.
gi 21541824  745 ----CDISVDDKYIVTGSGDKKATVY 766
Cdd:PLN00181 710 vknfVGLSVSDGYIATGSETNEVFVY 735
WD40 pfam00400
WD domain, G-beta repeat;
607-644 3.77e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 3.77e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 21541824   607 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:pfam00400   2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
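The WD40 description in this listing notes that a repeat typically carries a GH dipeptide near its N-terminus and a WD dipeptide at its C-terminus, with a conserved core in between. As a rough illustration only, the sketch below scans a sequence for GH...WD spans with a regular expression. The function name `find_gh_wd_spans` and the core-length bounds `min_core` and `max_core` are assumptions chosen for this example, not values taken from the CDD models, and a simple motif scan is no substitute for the profile-based search that produced these hits.

```python
import re


def find_gh_wd_spans(seq, first_residue=1, min_core=20, max_core=40):
    """Return (start, end, core_length) for each GH...WD span in seq.

    Coordinates are 1-based and shifted so that seq[0] has the number
    first_residue. min_core and max_core are illustrative assumptions.
    """
    # Lazy quantifier: stop at the first WD that leaves a core of
    # at least min_core residues after the GH dipeptide.
    pattern = re.compile(r"GH([A-Z]{%d,%d}?)WD" % (min_core, max_core))
    spans = []
    for m in pattern.finditer(seq.upper()):
        start = first_residue + m.start()
        end = first_residue + m.end() - 1
        spans.append((start, end, len(m.group(1))))
    return spans


# Query segment from the pfam00400 hit above (residues 607-644).
segment = "TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD"
spans = find_gh_wd_spans(segment, first_residue=607)
print(spans)
```

The `first_residue` offset lets the reported coordinates line up with the residue numbering used in the alignment rows.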
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
192-463 2.24e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 2.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   192 RDREPGTSNSllvPDSLRGTDKRRNGPEFSndikkRKVDDKDSSHYDSDGDKSDDNLVVDVSNEDPSSPRASPA-HSPRE 270
Cdd:pfam03154  82 RQREKGASDT---EEPERATAKKSKTQEIS-----RPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSiPSPQD 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   271 NGIDKNrllkkdassspastaSSASSTSLKSKEMSLHEKASTPVLKSSTPTPRSDMPTPG-TSATPGLRPGLGKPPAIDP 349
Cdd:pfam03154 154 NESDSD---------------SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGpTPSAPSVPPQGSPATSQPP 218
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   350 LVNQAAAGLRT---------PLAVPGPYPAPFGMVPHAGMNGELTSPGAAyASLHNMSPQMSAAAAAAAVVAYGRSPMVG 420
Cdd:pfam03154 219 NQTQSTAAPHTliqqtptlhPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ-PSLHGQMPPMPHSLQTGPSHMQHPVPPQP 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|...
gi 21541824   421 FDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVPFPP 463
Cdd:pfam03154 298 FPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
 
WD40 COG2319
WD40 repeat [General function prediction only];
483-768 2.09e-42

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 159.31  E-value: 2.09e-42
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 483 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 561
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 562 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 641
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 642 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK-YQLHLHESCVLSLKFAYCGKWFVSTGKD 719
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|
gi 21541824 720 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
476-728 8.51e-38

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 145.82  E-value: 8.51e-38
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 476 RQINTLN-HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEAST 553
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 554 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWT 633
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 634 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLHVN-KPDKYQLHLHESCVLSLKFAYCGK 711
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
                       250
                ....*....|....*..
gi 21541824 712 WFVSTGKDNLLNAWRTP 728
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
533-768 4.16e-30

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 123.48  E-value: 4.16e-30
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 533 IRSCKLLPDGCTLIVGGEASTLSIWDLAAPTPRIKAELTSSAPacYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQF 612
Cdd:COG2319  39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV--LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL 116
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 613 QGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK 691
Cdd:COG2319 117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTlTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
                       170       180       190       200       210       220       230
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 21541824 692 YQ-LHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 197 LRtLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTgHSGSVRSVAFSPDGRLLASGSADGTVRLWDL 275
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
577-768 3.25e-26

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 109.35  E-value: 3.25e-26
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 577 CYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 655
Cdd:cd00200  12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 656 FTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 733
Cdd:cd00200  92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
                       170       180       190
                ....*....|....*....|....*....|....*..
gi 21541824 734 --FQSkESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
608-768 3.37e-20

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 91.63  E-value: 3.37e-20
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 608 LVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 684
Cdd:cd00200   1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 685 HVNKPDK-YQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDK 761
Cdd:cd00200  79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157

                ....*..
gi 21541824 762 KATVYEV 768
Cdd:cd00200 158 TIKLWDL 164
WD40 COG2319
WD40 repeat [General function prediction only];
551-768 3.04e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 90.74  E-value: 3.04e-19
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 551 ASTLSIWDLAAPTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTK 630
Cdd:COG2319  13 SADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 631 LWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK-YQLHLHESCVLSLKFAY 708
Cdd:COG2319  93 LASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLTGHSGAVTSVAFSP 172
                       170       180       190       200       210       220
                ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 21541824 709 CGKWFVSTGKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYEV 768
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
WD40 COG2319
WD40 repeat [General function prediction only];
476-605 5.06e-12


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 68.40  E-value: 5.06e-12
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 476 RQINTLN-HGEVVCAVTISNPTRHVYTGGKGC-VKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEAST 553
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|..
gi 21541824 554 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHN 605
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
605-644 2.87e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 47.31  E-value: 2.87e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 21541824    605 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
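Each alignment block pairs a query row (gi 21541824) with a domain-model row (Cdd:...), column by column. The following is a minimal sketch, not part of the CDD output, of how identical columns in such a pair could be counted. It assumes that "-" marks a gap and that lowercase letters mark unaligned residues; the function name `percent_identity` is hypothetical.

```python
from __future__ import annotations


def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Count identical columns in one pairwise alignment block.

    Assumptions (not stated on the page itself): '-' is a gap and
    lowercase letters are unaligned residues; both are skipped.
    Returns (identical_columns, aligned_columns, percent_identity).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")

    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column
        if q.islower() or s.islower():
            continue  # assumed unaligned residue
        aligned += 1
        if q == s:
            identical += 1

    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# Rows copied from the smart00320 hit at residues 605-644.
QUERY = "NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD"
MODEL = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"

if __name__ == "__main__":
    ident, cols, pct = percent_identity(QUERY, MODEL)
    print(f"{ident}/{cols} identical columns ({pct:.1f}%)")
```

For multi-row alignments, the rows of each sequence would need to be concatenated before calling the function.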
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
595-766 8.78e-07


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 52.40  E-value: 8.78e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824  595 DGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-CPTGEWL 672
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpSESGRSL 633
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824  673 AVGMESSNVEVLHVNKPdkyQLHL-----HESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSSVLS--- 744
Cdd:PLN00181 634 AFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHSFMGhtn 709
                        170       180
                 ....*....|....*....|....*.
gi 21541824  745 ----CDISVDDKYIVTGSGDKKATVY 766
Cdd:PLN00181 710 vknfVGLSVSDGYIATGSETNEVFVY 735
WD40 pfam00400
WD domain, G-beta repeat;
607-644 3.77e-06


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 3.77e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 21541824   607 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:pfam00400   2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
PHA03247 PHA03247
large tegument protein UL36; Provisional
319-476 6.66e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 6.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   319 TPTPRSDMPTPGTSATPGLRPGLGKPPA--IDPLVNQAAAGLRTPLAV--PGPYPAPFG-------MVPHAGMNGELTSP 387
Cdd:PHA03247 2707 TPEPAPHALVSATPLPPGPAAARQASPAlpAAPAPPAVPAGPATPGGParPARPPTTAGppapappAAPAAGPPRRLTRP 2786
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   388 GAAYASLHNMSPQMSAAAAAAAVVAYGRSPMV-------GFDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVP 460
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALppaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRP 2866
                         170
                  ....*....|....*.
gi 21541824   461 FPPDALIGPGIPRHAR 476
Cdd:PHA03247 2867 PSRSPAAKPAAPARPP 2882
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
563-602 7.56e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.68  E-value: 7.56e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 21541824    563 TPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWD 602
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
NBCH_WD40 pfam20426
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ...
568-649 9.20e-04

Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.


Pssm-ID: 466575 [Multi-domain]  Cd Length: 350  Bit Score: 42.37  E-value: 9.20e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   568 AELTSSAPACYALAISPDSKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTV 640
Cdd:pfam20426  75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148

                  ....*....
gi 21541824   641 RSWDLREGR 649
Cdd:pfam20426 149 MVWEVLRGR 157
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
192-463 2.24e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 2.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   192 RDREPGTSNSllvPDSLRGTDKRRNGPEFSndikkRKVDDKDSSHYDSDGDKSDDNLVVDVSNEDPSSPRASPA-HSPRE 270
Cdd:pfam03154  82 RQREKGASDT---EEPERATAKKSKTQEIS-----RPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSiPSPQD 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   271 NGIDKNrllkkdassspastaSSASSTSLKSKEMSLHEKASTPVLKSSTPTPRSDMPTPG-TSATPGLRPGLGKPPAIDP 349
Cdd:pfam03154 154 NESDSD---------------SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGpTPSAPSVPPQGSPATSQPP 218
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824   350 LVNQAAAGLRT---------PLAVPGPYPAPFGMVPHAGMNGELTSPGAAyASLHNMSPQMSAAAAAAAVVAYGRSPMVG 420
Cdd:pfam03154 219 NQTQSTAAPHTliqqtptlhPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ-PSLHGQMPPMPHSLQTGPSHMQHPVPPQP 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|...
gi 21541824   421 FDPPPHMRVPTIPPNLAGIPGGKPAYSFHVTADGQMQPVPFPP 463
Cdd:pfam03154 298 FPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
PHA03378 PHA03378
EBNA-3B; Provisional
329-467 4.25e-03


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.82  E-value: 4.25e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824  329 PGTSATPGLRPGLGKPPAIDPLVNQAAAGLRTPLavPGPYPAPFGMVPHAGMNGELTSPGAAYASLHnmSPQMSAAAAAA 408
Cdd:PHA03378 691 PGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA--RPPAAAPGRARPPAAAPGRARPPAAAPGRAR--PPAAAPGRARP 766
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824  409 AVVAYGR-SPMvgfdPPPHmrvptIPPNLAGIPGGKPAysfhvtadgQMQPVPFPPDALI 467
Cdd:PHA03378 767 PAAAPGApTPQ----PPPQ-----APPAPQQRPRGAPT---------PQPPPQAGPTSMQ 808
WD40 pfam00400
WD domain, G-beta repeat;
578-602 8.08e-03


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 34.63  E-value: 8.08e-03
                          10        20
                  ....*....|....*....|....*
gi 21541824   578 YALAISPDSKVCFSCCSDGNIAVWD 602
Cdd:pfam00400  15 TSLAFSPDGKLLASGSDDGTVKVWD 39
COG5276 COG5276
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function ...
498-644 9.86e-03

Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function unknown];


Pssm-ID: 444087 [Multi-domain]  Cd Length: 320  Bit Score: 38.77  E-value: 9.86e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21541824 498 HVYTG-GKGCVKVWDISHPGNKSPVSQLDCLNRDNYirscKLLPDGCTLIVGGEAST-LSIWDLAAPT-PRIKAELTSSA 574
Cdd:COG5276  31 YAYVAgGSNGLAIVDVSDPANPVLVGSLPTPGGTWR----DVKVSGDYLYVASEGSEgLQIFDISDPAnPKLVGRYDTGG 106
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 21541824 575 PACYALAISpDSKVCFSCCSDGNIAVWDLHNQT---LVRQFQgHTDGASCIDISNDGTKLWTGGLDNTVRSWD 644
Cdd:COG5276 107 SGAHNIAVD-GNYAYVAGGSDNGLVIVDISDPTnpvLVGRYS-LPGQAYLHDVQVVGDYAYVADWEDGLVIVD 177
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
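
The E-value threshold of 0.01 explains why every hit listed above has an E-value below that cutoff. The following is a minimal sketch, not part of the CDD output, of how hits could be re-filtered against a stricter cutoff. The tuples are copied from the hit list above; the names `HITS`, `interval_length`, and `filter_hits` are hypothetical, and treating the threshold as inclusive (<=) is an assumption.

```python
# (name, accession, interval, E-value) copied from the hit list above.
HITS = [
    ("WD40", "COG2319", "551-768", "3.04e-19"),
    ("WD40", "COG2319", "476-605", "5.06e-12"),
    ("WD40", "smart00320", "605-644", "2.87e-07"),
    ("PLN00181", "PLN00181", "595-766", "8.78e-07"),
    ("WD40", "pfam00400", "607-644", "3.77e-06"),
    ("PHA03247", "PHA03247", "319-476", "6.66e-04"),
    ("WD40", "smart00320", "563-602", "7.56e-04"),
    ("NBCH_WD40", "pfam20426", "568-649", "9.20e-04"),
    ("Atrophin-1", "pfam03154", "192-463", "2.24e-03"),
    ("PHA03378", "PHA03378", "329-467", "4.25e-03"),
    ("WD40", "pfam00400", "578-602", "8.08e-03"),
    ("COG5276", "COG5276", "498-644", "9.86e-03"),
]


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


def filter_hits(hits, threshold):
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [hit for hit in hits if float(hit[3]) <= threshold]


if __name__ == "__main__":
    # Apply a stricter, illustrative cutoff than the page's 0.01.
    for name, accession, interval, evalue in filter_hits(HITS, 1e-6):
        print(name, accession, interval, interval_length(interval), evalue)
```

Lowering the cutoff in this way drops the weaker matches to low-complexity, proline-rich models (PHA03247, pfam03154, PHA03378) while keeping the strongest WD40 hits.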
