NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|767912363|ref|XP_011542464|]
View 

TAF5-like RNA polymerase II p300/CBP-associated factor-associated factor 65 kDa subunit 5L isoform X2 [Homo sapiens]

Protein Classification

TAF5 family protein( domain architecture ID 10169025)

TATA binding protein (TBP) associated factor 5 (TAF5) family protein, similar to TAF5 which is one of several TAFs that bind TBP and are involved in forming the transcription factor IID (TFIID) complex

Gene Ontology:  GO:0006357

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
201-470 1.16e-78

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 252.14  E-value: 1.16e-78
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 201 NTAEISPDSKLLAAGFDNSCIKLWSLRSkklksephqvdvsrihlacdileeeddeddnaGTEMKILRGHCGPVYSTRFL 280
Cdd:COG2319  166 TSVAFSPDGKLLASGSDDGTVRLWDLAT--------------------------------GKLLRTLTGHTGAVRSVAFS 213
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 281 ADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDC 360
Cdd:COG2319  214 PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNS 293
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 361 VKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRGHTD 440
Cdd:COG2319  294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTG 373
                        250       260       270
                 ....*....|....*....|....*....|
gi 767912363 441 NITSLTFSPDSGLIASASMDNSVRVWDIRN 470
Cdd:COG2319  374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
TAF5_NTD2 cd08044
TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated ...
13-130 3.35e-36

TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated Factor 5 (TAF5), involved in forming Transcription Factor IID (TFIID); The TATA Binding Protein (TBP) Associated Factor 5 (TAF5) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TAF5 contains three domains, two conserved sequence motifs at the N-terminal and one at the C-terminal region. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the preinitiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. In yeast and human cells, TAFs have been found as components of other complexes besides TFIID. TAF5 may play a major role in forming TFIID and its related complexes. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. TAF5 has a paralog gene (TAF5L) which has a redundant function. Several hypotheses are proposed for TAFs functions such as serving as activator-binding sites, core-promoter recognition or a role in essential catalytic activity. C-terminus of TAF5 contains six WD40 repeats that likely form a closed beta propeller structure and may be involved in protein-protein interaction. The first part of the TAF5 N-terminal (TAF5_NTD1) homodimerizes in the absence of other TAFs. The second conserved N-terminal part of TAF5 (TAF5_NTD2) has an alpha-helical domain. One study has shown that TAF5_NTD2 homodimerizes only at high concentration of calcium but not any other metals. No dimerization was observed in other structural studies of TAF_NTD2. Several TAFs interact via histone-fold (HFD) motifs; HFD is the interaction motif involved in heterodimerization of the core histones and their assembly into nucleosome octamer. However, TAF5 does not have a HFD motif.


:

Pssm-ID: 176269  Cd Length: 133  Bit Score: 130.78  E-value: 3.35e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363  13 SDSQHSHEVMPLLYPLFVYLHLNLVQNSPKSTVESFYSRFHGMFlqNASQKDVIEQLQTTQTIQDILSNFKLRAFLDNKY 92
Cdd:cd08044   18 SLDIYKYELSQLLYPIFVHSYLDLVASGHLEEAKSFFERFSGDF--EDSHSEDIKKLSSITTPEHLKENELAKLFRSNKY 95
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 767912363  93 VVRLQEDSYNYLIRYLQSDNNTALCKVLTLHIHLDVQP 130
Cdd:cd08044   96 VIRMSRDAYSLLLRFLESWGGSLLLKILNEHIDIDVRD 133
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
201-470 1.16e-78

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 252.14  E-value: 1.16e-78
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 201 NTAEISPDSKLLAAGFDNSCIKLWSLRSkklksephqvdvsrihlacdileeeddeddnaGTEMKILRGHCGPVYSTRFL 280
Cdd:COG2319  166 TSVAFSPDGKLLASGSDDGTVRLWDLAT--------------------------------GKLLRTLTGHTGAVRSVAFS 213
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 281 ADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDC 360
Cdd:COG2319  214 PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNS 293
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 361 VKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRGHTD 440
Cdd:COG2319  294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTG 373
                        250       260       270
                 ....*....|....*....|....*....|
gi 767912363 441 NITSLTFSPDSGLIASASMDNSVRVWDIRN 470
Cdd:COG2319  374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
205-510 1.82e-71

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 229.53  E-value: 1.82e-71
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 205 ISPDSKLLAAGFDNSCIKLWSLRSkklksephqvdvsrihlacdileeeddeddnaGTEMKILRGHCGPVYSTRFLADSS 284
Cdd:cd00200   17 FSPDGKLLATGSGDGTIKVWDLET--------------------------------GELLRTLKGHTGPVRDVAASADGT 64
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 285 GLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDCVKFH 364
Cdd:cd00200   65 YLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFS 144
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 365 PNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRGHTDNITS 444
Cdd:cd00200  145 PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767912363 445 LTFSPDSGLIASASMDNSVRVWDIRNTYCSAPADGSSSElvgvytgqmsnVLSVQFMACNLLLVTG 510
Cdd:cd00200  225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS-----------VTSLAWSPDGKRLASG 279
TAF5_NTD2 cd08044
TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated ...
13-130 3.35e-36

TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated Factor 5 (TAF5), involved in forming Transcription Factor IID (TFIID); The TATA Binding Protein (TBP) Associated Factor 5 (TAF5) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TAF5 contains three domains, two conserved sequence motifs at the N-terminal and one at the C-terminal region. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the preinitiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. In yeast and human cells, TAFs have been found as components of other complexes besides TFIID. TAF5 may play a major role in forming TFIID and its related complexes. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. TAF5 has a paralog gene (TAF5L) which has a redundant function. Several hypotheses are proposed for TAFs functions such as serving as activator-binding sites, core-promoter recognition or a role in essential catalytic activity. C-terminus of TAF5 contains six WD40 repeats that likely form a closed beta propeller structure and may be involved in protein-protein interaction. The first part of the TAF5 N-terminal (TAF5_NTD1) homodimerizes in the absence of other TAFs. The second conserved N-terminal part of TAF5 (TAF5_NTD2) has an alpha-helical domain. One study has shown that TAF5_NTD2 homodimerizes only at high concentration of calcium but not any other metals. No dimerization was observed in other structural studies of TAF_NTD2. Several TAFs interact via histone-fold (HFD) motifs; HFD is the interaction motif involved in heterodimerization of the core histones and their assembly into nucleosome octamer. However, TAF5 does not have a HFD motif.


Pssm-ID: 176269  Cd Length: 133  Bit Score: 130.78  E-value: 3.35e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363  13 SDSQHSHEVMPLLYPLFVYLHLNLVQNSPKSTVESFYSRFHGMFlqNASQKDVIEQLQTTQTIQDILSNFKLRAFLDNKY 92
Cdd:cd08044   18 SLDIYKYELSQLLYPIFVHSYLDLVASGHLEEAKSFFERFSGDF--EDSHSEDIKKLSSITTPEHLKENELAKLFRSNKY 95
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 767912363  93 VVRLQEDSYNYLIRYLQSDNNTALCKVLTLHIHLDVQP 130
Cdd:cd08044   96 VIRMSRDAYSLLLRFLESWGGSLLLKILNEHIDIDVRD 133
TFIID_NTD2 pfam04494
WD40 associated region in TFIID subunit, NTD2 domain; This region is an all-alpha domain ...
13-124 1.61e-33

WD40 associated region in TFIID subunit, NTD2 domain; This region is an all-alpha domain associated with the WD40 helical bundle of the TAF5 subunit of transcription factor TFIID. The domain has distant structural similarity to RNA polymerase II CTD interacting factors. It contains several conserved clefts that are likely to be critical for TFIID complex assembly. The TAF5 subunit is present twice in the TFIID complex and is critical for the function and assembly of the complex, and the NTD2 and N-terminal domain is crucial for homodimerization.


Pssm-ID: 461330  Cd Length: 130  Bit Score: 123.37  E-value: 1.61e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363   13 SDSQHSHEVMPLLYPLFVYLHLNLVQNSPKSTVESFYSRFHGMFLqnASQKDVIEQLQTTQTIQDILSNFKLRAFLDNKY 92
Cdd:pfam04494  21 SLDIYKPELRRLLYPVFVHSYLDLVAKGHIEEAKEFFEKFRGDHE--ALHGDDLRKLAGITLPEHLEENELAKLFRSNKY 98
                          90       100       110
                  ....*....|....*....|....*....|..
gi 767912363   93 VVRLQEDSYNYLIRYLQSDNNTALCKVLTLHI 124
Cdd:pfam04494  99 RIRLSRYSFDLLLRFLQENESSVILRIINEHL 130
PTZ00421 PTZ00421
coronin; Provisional
350-483 7.21e-12

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 67.61  E-value: 7.21e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 350 IYAGHLADVDCVKFHP-NSNYLATGSTDKTVRLWSA-----QQGNSVRL--FTGHRGPVLSLAFSPNGK-YLASAGEDQR 420
Cdd:PTZ00421  70 ILLGQEGPIIDVAFNPfDPQKLFTASEDGTIMGWGIpeeglTQNISDPIvhLQGHTKKVGIVSFHPSAMnVLASAGADMV 149
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767912363 421 LKLWDLASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDIRNtyCSAPADGSSSE 483
Cdd:PTZ00421 150 VNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRD--GTIVSSVEAHA 210
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
428-467 8.84e-11

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 56.94  E-value: 8.84e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 767912363   428 SGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWD 467
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
429-467 5.42e-10

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 54.66  E-value: 5.42e-10
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 767912363  429 GTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWD 467
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
201-470 1.16e-78

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 252.14  E-value: 1.16e-78
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 201 NTAEISPDSKLLAAGFDNSCIKLWSLRSkklksephqvdvsrihlacdileeeddeddnaGTEMKILRGHCGPVYSTRFL 280
Cdd:COG2319  166 TSVAFSPDGKLLASGSDDGTVRLWDLAT--------------------------------GKLLRTLTGHTGAVRSVAFS 213
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 281 ADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDC 360
Cdd:COG2319  214 PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNS 293
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 361 VKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRGHTD 440
Cdd:COG2319  294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTG 373
                        250       260       270
                 ....*....|....*....|....*....|
gi 767912363 441 NITSLTFSPDSGLIASASMDNSVRVWDIRN 470
Cdd:COG2319  374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
201-510 1.01e-77

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 249.44  E-value: 1.01e-77
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 201 NTAEISPDSKLLAAGFDNSCIKLWSLRSkklksephqvdvsrihlacdileeeddeddnaGTEMKILRGHCGPVYSTRFL 280
Cdd:COG2319  124 RSVAFSPDGKTLASGSADGTVRLWDLAT--------------------------------GKLLRTLTGHSGAVTSVAFS 171
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 281 ADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDC 360
Cdd:COG2319  172 PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS 251
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 361 VKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRGHTD 440
Cdd:COG2319  252 VAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG 331
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 441 NITSLTFSPDSGLIASASMDNSVRVWDIrntycsapadgSSSELVGVYTGQMSNVLSVQFMACNLLLVTG 510
Cdd:COG2319  332 AVRSVAFSPDGKTLASGSDDGTVRLWDL-----------ATGELLRTLTGHTGAVTSVAFSPDGRTLASG 390
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
205-510 1.82e-71

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 229.53  E-value: 1.82e-71
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 205 ISPDSKLLAAGFDNSCIKLWSLRSkklksephqvdvsrihlacdileeeddeddnaGTEMKILRGHCGPVYSTRFLADSS 284
Cdd:cd00200   17 FSPDGKLLATGSGDGTIKVWDLET--------------------------------GELLRTLKGHTGPVRDVAASADGT 64
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 285 GLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDCVKFH 364
Cdd:cd00200   65 YLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFS 144
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 365 PNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRGHTDNITS 444
Cdd:cd00200  145 PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767912363 445 LTFSPDSGLIASASMDNSVRVWDIRNTYCSAPADGSSSElvgvytgqmsnVLSVQFMACNLLLVTG 510
Cdd:cd00200  225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS-----------VTSLAWSPDGKRLASG 279
WD40 COG2319
WD40 repeat [General function prediction only];
260-500 2.29e-69

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 227.87  E-value: 2.29e-69
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 260 AGTEMKILRGHCGPVYSTRFLADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARL 339
Cdd:COG2319   67 AGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRL 146
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 340 WSFDRTYPLRIYAGHLADVDCVKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQ 419
Cdd:COG2319  147 WDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADG 226
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 420 RLKLWDLASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDIrntycsapadgSSSELVGVYTGQMSNVLSVQ 499
Cdd:COG2319  227 TVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDL-----------ATGELLRTLTGHSGGVNSVA 295

                 .
gi 767912363 500 F 500
Cdd:COG2319  296 F 296
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
263-510 2.05e-66

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 216.43  E-value: 2.05e-66
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 263 EMKILRGHCGPVYSTRFLADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSF 342
Cdd:cd00200    1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 343 DRTYPLRIYAGHLADVDCVKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLK 422
Cdd:cd00200   81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 423 LWDLASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDIRntycsapadgsSSELVGVYTGQMSNVLSVQFMA 502
Cdd:cd00200  161 LWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS-----------TGKCLGTLRGHENGVNSVAFSP 229

                 ....*...
gi 767912363 503 CNLLLVTG 510
Cdd:cd00200  230 DGYLLASG 237
WD40 COG2319
WD40 repeat [General function prediction only];
267-510 3.27e-66

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 219.40  E-value: 3.27e-66
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 267 LRGHCGPVYSTRFLADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTY 346
Cdd:COG2319   32 LLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGL 111
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 347 PLRIYAGHLADVDCVKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDL 426
Cdd:COG2319  112 LLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDL 191
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 427 ASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDIRntycsapadgsSSELVGVYTGQMSNVLSVQFMACNLL 506
Cdd:COG2319  192 ATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA-----------TGKLLRTLTGHSGSVRSVAFSPDGRL 260

                 ....
gi 767912363 507 LVTG 510
Cdd:COG2319  261 LASG 264
WD40 COG2319
WD40 repeat [General function prediction only];
278-510 2.76e-51

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 180.11  E-value: 2.76e-51
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 278 RFLADSSGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLAD 357
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 358 VDCVKFHPNSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRG 437
Cdd:COG2319   81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767912363 438 HTDNITSLTFSPDSGLIASASMDNSVRVWDIRntycsapadgsSSELVGVYTGQMSNVLSVQFMACNLLLVTG 510
Cdd:COG2319  161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA-----------TGKLLRTLTGHTGAVRSVAFSPDGKLLASG 222
TAF5_NTD2 cd08044
TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated ...
13-130 3.35e-36

TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated Factor 5 (TAF5), involved in forming Transcription Factor IID (TFIID); The TATA Binding Protein (TBP) Associated Factor 5 (TAF5) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TAF5 contains three domains, two conserved sequence motifs at the N-terminal and one at the C-terminal region. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the preinitiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. In yeast and human cells, TAFs have been found as components of other complexes besides TFIID. TAF5 may play a major role in forming TFIID and its related complexes. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. TAF5 has a paralog gene (TAF5L) which has a redundant function. Several hypotheses are proposed for TAFs functions such as serving as activator-binding sites, core-promoter recognition or a role in essential catalytic activity. C-terminus of TAF5 contains six WD40 repeats that likely form a closed beta propeller structure and may be involved in protein-protein interaction. The first part of the TAF5 N-terminal (TAF5_NTD1) homodimerizes in the absence of other TAFs. The second conserved N-terminal part of TAF5 (TAF5_NTD2) has an alpha-helical domain. One study has shown that TAF5_NTD2 homodimerizes only at high concentration of calcium but not any other metals. No dimerization was observed in other structural studies of TAF_NTD2. Several TAFs interact via histone-fold (HFD) motifs; HFD is the interaction motif involved in heterodimerization of the core histones and their assembly into nucleosome octamer. However, TAF5 does not have a HFD motif.


Pssm-ID: 176269  Cd Length: 133  Bit Score: 130.78  E-value: 3.35e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363  13 SDSQHSHEVMPLLYPLFVYLHLNLVQNSPKSTVESFYSRFHGMFlqNASQKDVIEQLQTTQTIQDILSNFKLRAFLDNKY 92
Cdd:cd08044   18 SLDIYKYELSQLLYPIFVHSYLDLVASGHLEEAKSFFERFSGDF--EDSHSEDIKKLSSITTPEHLKENELAKLFRSNKY 95
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 767912363  93 VVRLQEDSYNYLIRYLQSDNNTALCKVLTLHIHLDVQP 130
Cdd:cd08044   96 VIRMSRDAYSLLLRFLESWGGSLLLKILNEHIDIDVRD 133
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
205-383 1.12e-34

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 131.69  E-value: 1.12e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 205 ISPDSKLLAAGFDNSCIKLWSLRSKKLksephqvdvsrihlacdileeeddeddnagteMKILRGHCGPVYSTRFLADSS 284
Cdd:cd00200  143 FSPDGTFVASSSQDGTIKLWDLRTGKC--------------------------------VATLTGHTGEVNSVAFSPDGE 190
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 285 GLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDCVKFH 364
Cdd:cd00200  191 KLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS 270
                        170
                 ....*....|....*....
gi 767912363 365 PNSNYLATGSTDKTVRLWS 383
Cdd:cd00200  271 PDGKRLASGSADGTIRIWD 289
TFIID_NTD2 pfam04494
WD40 associated region in TFIID subunit, NTD2 domain; This region is an all-alpha domain ...
13-124 1.61e-33

WD40 associated region in TFIID subunit, NTD2 domain; This region is an all-alpha domain associated with the WD40 helical bundle of the TAF5 subunit of transcription factor TFIID. The domain has distant structural similarity to RNA polymerase II CTD interacting factors. It contains several conserved clefts that are likely to be critical for TFIID complex assembly. The TAF5 subunit is present twice in the TFIID complex and is critical for the function and assembly of the complex, and the NTD2 and N-terminal domain is crucial for homodimerization.


Pssm-ID: 461330  Cd Length: 130  Bit Score: 123.37  E-value: 1.61e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363   13 SDSQHSHEVMPLLYPLFVYLHLNLVQNSPKSTVESFYSRFHGMFLqnASQKDVIEQLQTTQTIQDILSNFKLRAFLDNKY 92
Cdd:pfam04494  21 SLDIYKPELRRLLYPVFVHSYLDLVAKGHIEEAKEFFEKFRGDHE--ALHGDDLRKLAGITLPEHLEENELAKLFRSNKY 98
                          90       100       110
                  ....*....|....*....|....*....|..
gi 767912363   93 VVRLQEDSYNYLIRYLQSDNNTALCKVLTLHI 124
Cdd:pfam04494  99 RIRLSRYSFDLLLRFLQENESSVILRIINEHL 130
PTZ00421 PTZ00421
coronin; Provisional
350-483 7.21e-12

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 67.61  E-value: 7.21e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 350 IYAGHLADVDCVKFHP-NSNYLATGSTDKTVRLWSA-----QQGNSVRL--FTGHRGPVLSLAFSPNGK-YLASAGEDQR 420
Cdd:PTZ00421  70 ILLGQEGPIIDVAFNPfDPQKLFTASEDGTIMGWGIpeeglTQNISDPIvhLQGHTKKVGIVSFHPSAMnVLASAGADMV 149
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767912363 421 LKLWDLASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDIRNtyCSAPADGSSSE 483
Cdd:PTZ00421 150 VNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRD--GTIVSSVEAHA 210
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
428-467 8.84e-11

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 56.94  E-value: 8.84e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 767912363   428 SGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWD 467
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
386-425 2.91e-10

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 55.40  E-value: 2.91e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 767912363   386 QGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWD 425
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
429-467 5.42e-10

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 54.66  E-value: 5.42e-10
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 767912363  429 GTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWD 467
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
284-466 8.80e-10

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 61.26  E-value: 8.80e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 284 SGLLSCSEDMSIRYWDLGSFTNTVLYQGHAYPVWDLDISPYS-LYFASGSHDRTARLWSFDRTYPLRIYAGHlADVDCVK 362
Cdd:PLN00181 546 SQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADpTLLASGSDDGSVKLWSINQGVSIGTIKTK-ANICCVQ 624
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 363 FHPNSNY-LATGSTDKTVRLWSAQQgNSVRLFT--GHRGPVLSLAFSpNGKYLASAGEDQRLKLWDL---ASG---TLYK 433
Cdd:PLN00181 625 FPSESGRsLAFGSADHKVYYYDLRN-PKLPLCTmiGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDLsmsISGineTPLH 702
                        170       180       190
                 ....*....|....*....|....*....|...
gi 767912363 434 ELRGHTDNITSLTFSPDSGLIASASMDNSVRVW 466
Cdd:PLN00181 703 SFMGHTNVKNFVGLSVSDGYIATGSETNEVFVY 735
WD40 pfam00400
WD domain, G-beta repeat;
387-425 1.19e-09

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 53.50  E-value: 1.19e-09
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 767912363  387 GNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWD 425
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
347-383 4.13e-09

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 51.93  E-value: 4.13e-09
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767912363   347 PLRIYAGHLADVDCVKFHPNSNYLATGSTDKTVRLWS 383
Cdd:smart00320   4 LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
400-470 4.41e-09

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 59.28  E-value: 4.41e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767912363  400 VLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELRG-HTDNITSLTFSPDSGLIA-SASMDN---SVRVWDIRN 470
Cdd:COG4946   391 VFNPVWSPDGKKIAFTDNRGRLWVVDLASGKVRKVDTDgYGDGISDLAWSPDSKWLAySKPGPNqlsQIFLYDVET 466
WD40 pfam00400
WD domain, G-beta repeat;
347-383 6.57e-09

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 51.58  E-value: 6.57e-09
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 767912363  347 PLRIYAGHLADVDCVKFHPNSNYLATGSTDKTVRLWS 383
Cdd:pfam00400   3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
329-468 2.66e-07

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 53.55  E-value: 2.66e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 329 ASGSHDRTARLWSFDRTYPLRIYAGHLADVDCVKFHP-NSNYLATGSTDKTVRLWSAQQGNSVRLFTGhRGPVLSLAF-S 406
Cdd:PLN00181 549 ASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKT-KANICCVQFpS 627
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767912363 407 PNGKYLASAGEDQRLKLWDLASGTL-YKELRGHTDNITSLTFSpDSGLIASASMDNSVRVWDI 468
Cdd:PLN00181 628 ESGRSLAFGSADHKVYYYDLRNPKLpLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDL 689
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
309-341 2.99e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.84  E-value: 2.99e-06
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767912363   309 YQGHAYPVWDLDISPYSLYFASGSHDRTARLWS 341
Cdd:smart00320   8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
361-448 6.96e-06

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 44.58  E-value: 6.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363  361 VKFHPNSNYLATGSTDKTVRLwsaQQGNSVRLFTG----HRGPVLSLAFSPNGKYLASAGEDQRLKLWDLASGTLYKELR 436
Cdd:pfam12894   1 MSWCPTMDLIALATEDGELLL---HRLNWQRVWTLspdkEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                          90
                  ....*....|..
gi 767912363  437 GHTDNITSLTFS 448
Cdd:pfam12894  78 AGSDLITCLGWG 89
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
261-299 7.36e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.68  E-value: 7.36e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 767912363   261 GTEMKILRGHCGPVYSTRFLADSSGLLSCSEDMSIRYWD 299
Cdd:smart00320   2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PTZ00420 PTZ00420
coronin; Provisional
379-470 7.59e-06

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 48.41  E-value: 7.59e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 379 VRLWSAQQGNSVRLFTGHRGPVLSLAFSP-NGKYLASAGEDQRLKLWDLA-SGTLYKE-------LRGHTDNITSLTFSP 449
Cdd:PTZ00420  56 IRLENQMRKPPVIKLKGHTSSILDLQFNPcFSEILASGSEDLTIRVWEIPhNDESVKEikdpqciLKGHKKKISIIDWNP 135
                         90       100
                 ....*....|....*....|..
gi 767912363 450 DSGLI-ASASMDNSVRVWDIRN 470
Cdd:PTZ00420 136 MNYYImCSSGFDSFVNIWDIEN 157
WD40 pfam00400
WD domain, G-beta repeat;
309-341 7.93e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 42.72  E-value: 7.93e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 767912363  309 YQGHAYPVWDLDISPYSLYFASGSHDRTARLWS 341
Cdd:pfam00400   7 LEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
261-299 1.96e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.56  E-value: 1.96e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 767912363  261 GTEMKILRGHCGPVYSTRFLADSSGLLSCSEDMSIRYWD 299
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
326-470 2.92e-04

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 42.37  E-value: 2.92e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 326 LYFASGSHDRTARLWSFDRTYPLRIYAGhlADVDCVKFHPNSNYL-ATGSTDKTVRLWSAQQGNSVRLFTGHRGPVlSLA 404
Cdd:COG3391   82 LYVANSGSGRVSVIDLATGKVVATIPVG--GGPRGLAVDPDGGRLyVADSGNGRVSVIDTATGKVVATIPVGAGPH-GIA 158
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767912363 405 FSPNGKYL--ASAGEDQRLKL---WDLASGTLYKELRGHtDNITSLTFSPDSGLI--------ASASMDNSVRVWDIRN 470
Cdd:COG3391  159 VDPDGKRLyvANSGSNTVSVIvsvIDTATGKVVATIPVG-GGPVGVAVSPDGRRLyvanrgsnTSNGGSNTVSVIDLAT 236
TolB COG0823
Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, ...
373-485 5.67e-04

Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 440585 [Multi-domain]  Cd Length: 158  Bit Score: 40.43  E-value: 5.67e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 373 GSTDktVRLWSAQQGNSVRLfTGHRGPVLSLAFSPNGKYLA-SAGEDQRLKLW--DLASGTLYKELRGHTDNiTSLTFSP 449
Cdd:COG0823    9 GNSD--IYVVDLDGGEPRRL-TNSPGIDTSPAWSPDGRRIAfTSDRGGGPQIYvvDADGGEPRRLTFGGGYN-ASPSWSP 84
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 767912363 450 DSGLIASASMDNSvrVWDIRntycSAPADGSSSELV 485
Cdd:COG0823   85 DGKRLAFVSRSDG--RFDIY----VLDLDGGAPRRL 114
PTZ00420 PTZ00420
coronin; Provisional
267-430 7.51e-04

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 42.24  E-value: 7.51e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 267 LRGHCGPVYSTRFLADSSGLL-SCSEDMSIRYWDLGSFTNTV--------LYQGHAYPVWDLDISPYSLY-FASGSHDRT 336
Cdd:PTZ00420  70 LKGHTSSILDLQFNPCFSEILaSGSEDLTIRVWEIPHNDESVkeikdpqcILKGHKKKISIIDWNPMNYYiMCSSGFDSF 149
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 337 ARLW-------SFDRTYPLRIYA------GHLADVDCVKFHPN----------SNYLATGSTDKTVRLWsaqqgnsVRLF 393
Cdd:PTZ00420 150 VNIWdienekrAFQINMPKKLSSlkwnikGNLLSGTCVGKHMHiidprkqeiaSSFHIHDGGKNTKNIW-------IDGL 222
                        170       180       190
                 ....*....|....*....|....*....|....*..
gi 767912363 394 TGHRGPVLSLAFSPNGKylasagedQRLKLWDLASGT 430
Cdd:PTZ00420 223 GGDDNYILSTGFSKNNM--------REMKLWDLKNTT 251
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
365-450 7.54e-04

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 42.33  E-value: 7.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363  365 PNSNYLATgsTDKTVRLW-----SaqqGNSVRLFTG-HRGPVLSLAFSPNGKYLA----SAGEDQRLKLWDLASGTLYKE 434
Cdd:COG4946   398 PDGKKIAF--TDNRGRLWvvdlaS---GKVRKVDTDgYGDGISDLAWSPDSKWLAyskpGPNQLSQIFLYDVETGKTVQL 472
                          90
                  ....*....|....*.
gi 767912363  435 LRGHTDNiTSLTFSPD 450
Cdd:COG4946   473 TDGRYDD-GSPAFSPD 487
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
412-500 1.20e-03

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 41.61  E-value: 1.20e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 412 LASAGEDQRLKLWDLASGTLYKELRGHTDNITSLTF-SPDSGLIASASMDNSVRVWDIRNtycsapadgssselvGVYTG 490
Cdd:PLN00181 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYsSADPTLLASGSDDGSVKLWSINQ---------------GVSIG 612
                         90
                 ....*....|...
gi 767912363 491 QM---SNVLSVQF 500
Cdd:PLN00181 613 TIktkANICCVQF 625
TolB COG0823
Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, ...
363-458 3.08e-03

Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 440585 [Multi-domain]  Cd Length: 158  Bit Score: 38.50  E-value: 3.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 363 FHPNSNYLA-TGSTDKTVRLW--SAQQGNSVRLfTGHRGPVLSLAFSPNGKYLA-SAGEDQRLKLW--DLASGtlykELR 436
Cdd:COG0823   38 WSPDGRRIAfTSDRGGGPQIYvvDADGGEPRRL-TFGGGYNASPSWSPDGKRLAfVSRSDGRFDIYvlDLDGG----APR 112
                         90       100
                 ....*....|....*....|..
gi 767912363 437 GHTDNITSLTFSPDSGLIASAS 458
Cdd:COG0823  113 RLTDGPGSPSWSPDGRRIVFSS 134
TolB COG0823
Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, ...
315-424 3.14e-03

Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 440585 [Multi-domain]  Cd Length: 158  Bit Score: 38.50  E-value: 3.14e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767912363 315 PVWdldiSPY--SLYFASgSHDRTARLWSFDR--TYPLRIYAGHLADVDCVkFHPNSNYLA-TGSTDKTVRLW--SAQQG 387
Cdd:COG0823   36 PAW----SPDgrRIAFTS-DRGGGPQIYVVDAdgGEPRRLTFGGGYNASPS-WSPDGKRLAfVSRSDGRFDIYvlDLDGG 109
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 767912363 388 NSVRLFTGHRGPvlslAFSPNGKYLA-SAGEDQRLKLW 424
Cdd:COG0823  110 APRRLTDGPGSP----SWSPDGRRIVfSSDRGGRPDLY 143
NBCH_WD40 pfam20426
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ...
406-468 6.99e-03

Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.


Pssm-ID: 466575 [Multi-domain]  Cd Length: 350  Bit Score: 38.90  E-value: 6.99e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767912363  406 SPNGKYLASAGE-DQRLKLWDLASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDI 468
Cdd:pfam20426  90 TPSENFLISCGNwENSFQVISLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTVMVWEV 153
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH