NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|755527378|ref|XP_011246836|]
View 

zinc finger protein 821 isoform X1 [Mus musculus]

Protein Classification

C2H2-type zinc finger protein( domain architecture ID 10442881)

Cys2His2 (C2H2)-type zinc finger protein may be involved in transcriptional regulation

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COG2433 super family cl43687
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
261-335 5.45e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


The actual alignment was detected with superfamily member COG2433:

Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 48.70  E-value: 5.45e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755527378 261 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 335
Cdd:COG2433  432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
151-173 7.51e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


:

Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 33.81  E-value: 7.51e-03
                          10        20
                  ....*....|....*....|...
gi 755527378  151 YMCPVCGRALSSPGSLGRHLLIH 173
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
 
Name Accession Description Interval E-value
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
261-335 5.45e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 48.70  E-value: 5.45e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755527378 261 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 335
Cdd:COG2433  432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
294-355 1.50e-04

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 43.33  E-value: 1.50e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755527378  294 REVRRMRDREAKRLQRMQETDEQRARRLQRD---REAMRLKRANETPEKRQARLIREREaKRLKR 355
Cdd:pfam07946 260 KKAKKTREEEIEKIKKAAEEERAEEAQEKKEeakKKEREEKLAKLSPEEQRKYEEKERK-KEQRK 323
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
276-388 3.54e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.57  E-value: 3.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  276 ERERTAKKSR-RDNETPEEREVRRMRDREAKR-------LQRMQETDEQRARRLQRDREAMRLK----RANETPEKRQAR 343
Cdd:TIGR01642   4 EPDREREKSRgRDRDRSSERPRRRSRDRSRFRdrhrrsrERSYREDSRPRDRRRYDSRSPRSLRyssvRRSRDRPRRRSR 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 755527378  344 LI------REREAKRLKRRLEKMDMMLRAQFGQDPSAM----AALAAEMNFFQLP 388
Cdd:TIGR01642  84 SVrsieqhRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYelvtADQAKASQVFSVP 138
PTZ00121 PTZ00121
MAEBL; Provisional
273-359 1.20e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 41.28  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  273 QRLERERTAKKSRRDNETPEEREVRRMRD-----REAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIRE 347
Cdd:PTZ00121 1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
                          90
                  ....*....|..
gi 755527378  348 REAKRLKRRLEK 359
Cdd:PTZ00121 1286 AEEKKKADEAKK 1297
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
151-173 7.51e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 33.81  E-value: 7.51e-03
                          10        20
                  ....*....|....*....|...
gi 755527378  151 YMCPVCGRALSSPGSLGRHLLIH 173
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
 
Name Accession Description Interval E-value
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
261-335 5.45e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 48.70  E-value: 5.45e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755527378 261 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 335
Cdd:COG2433  432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
260-365 8.10e-05

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 44.85  E-value: 8.10e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378 260 ALRRQNEPLEVRlqRLERERTAKKSRRDNETPEEREVRRMRDR------EAKRLQRMQETDEQRARRLQRDREAMRLKRA 333
Cdd:COG2433  381 ALEELIEKELPE--EEPEAEREKEHEERELTEEEEEIRRLEEQverleaEVEELEAELEEKDERIERLERELSEARSEER 458
                         90       100       110
                 ....*....|....*....|....*....|..
gi 755527378 334 NETPEKRQARlIREREAKRLKRRLEKMDMMLR 365
Cdd:COG2433  459 REIRKDREIS-RLDREIERLERELEEERERIE 489
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
294-355 1.50e-04

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 43.33  E-value: 1.50e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755527378  294 REVRRMRDREAKRLQRMQETDEQRARRLQRD---REAMRLKRANETPEKRQARLIREREaKRLKR 355
Cdd:pfam07946 260 KKAKKTREEEIEKIKKAAEEERAEEAQEKKEeakKKEREEKLAKLSPEEQRKYEEKERK-KEQRK 323
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
243-367 2.27e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.22  E-value: 2.27e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378 243 AAYRKLLETQTPSVRKWALRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQ 322
Cdd:COG4717  119 EKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLA 198
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 755527378 323 RDREAMRLKRANETPEKRQARlireREAKRLKRRLEKMDMMLRAQ 367
Cdd:COG4717  199 EELEELQQRLAELEEELEEAQ----EELEELEEELEQLENELEAA 239
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
250-371 3.25e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 42.92  E-value: 3.25e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378 250 ETQTPSVRKWALRRQNEPLEVRLQRLERErtAKKSRRDNETpEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMR 329
Cdd:COG2433  393 EEPEAEREKEHEERELTEEEEEIRRLEEQ--VERLEAEVEE-LEAELEEKDERIERLERELSEARSEERREIRKDREISR 469
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 755527378 330 LKRANETPEKRQARLirEREAKRLKRRLEKMDMMLRAQFGQD 371
Cdd:COG2433  470 LDREIERLERELEEE--RERIEELKRKLERLKELWKLEHSGE 509
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
276-388 3.54e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.57  E-value: 3.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  276 ERERTAKKSR-RDNETPEEREVRRMRDREAKR-------LQRMQETDEQRARRLQRDREAMRLK----RANETPEKRQAR 343
Cdd:TIGR01642   4 EPDREREKSRgRDRDRSSERPRRRSRDRSRFRdrhrrsrERSYREDSRPRDRRRYDSRSPRSLRyssvRRSRDRPRRRSR 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 755527378  344 LI------REREAKRLKRRLEKMDMMLRAQFGQDPSAM----AALAAEMNFFQLP 388
Cdd:TIGR01642  84 SVrsieqhRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYelvtADQAKASQVFSVP 138
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
264-359 5.66e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 42.03  E-value: 5.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  264 QNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRM----QETDEQRARRLQRDREAMRLKRA------ 333
Cdd:pfam17380 416 QQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQVerlrQQEEERKRKKLELEKEKRDRKRAeeqrrk 495
                          90       100
                  ....*....|....*....|....*....
gi 755527378  334 ---NETPEKRQARLIREREAKRLKRRLEK 359
Cdd:pfam17380 496 ileKELEERKQAMIEEERKRKLLEKEMEE 524
PTZ00121 PTZ00121
MAEBL; Provisional
273-359 1.20e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 41.28  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  273 QRLERERTAKKSRRDNETPEEREVRRMRD-----REAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIRE 347
Cdd:PTZ00121 1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
                          90
                  ....*....|..
gi 755527378  348 REAKRLKRRLEK 359
Cdd:PTZ00121 1286 AEEKKKADEAKK 1297
PTZ00121 PTZ00121
MAEBL; Provisional
263-359 3.31e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 40.12  E-value: 3.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  263 RQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmrdreAKRLQRMQETDEQRARRLQRD-----REAMRLKRANETP 337
Cdd:PTZ00121 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK-----AEELKKAEEENKIKAAEEAKKaeedkKKAEEAKKAEEDE 1687
                          90       100
                  ....*....|....*....|..
gi 755527378  338 EKRQARLIREREAKRLKRRLEK 359
Cdd:PTZ00121 1688 KKAAEALKKEAEEAKKAEELKK 1709
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
261-367 3.51e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 39.53  E-value: 3.51e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378 261 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETpEKR 340
Cdd:COG1196  307 LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELE-ELA 385
                         90       100
                 ....*....|....*....|....*..
gi 755527378 341 QARLIREREAKRLKRRLEKMDMMLRAQ 367
Cdd:COG1196  386 EELLEALRAAAELAAQLEELEEAEEAL 412
PTZ00121 PTZ00121
MAEBL; Provisional
273-359 6.76e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 38.97  E-value: 6.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  273 QRLERERTAKKSRRDNETPEEREVRRM----RDREAKRLQRMQETDEqrARRLQRDREAMRLKRANETPEKRQARLIRER 348
Cdd:PTZ00121 1128 RKAEEARKAEDARKAEEARKAEDAKRVeiarKAEDARKAEEARKAED--AKKAEAARKAEEVRKAEELRKAEDARKAEAA 1205
                          90
                  ....*....|.
gi 755527378  349 EAKRLKRRLEK 359
Cdd:PTZ00121 1206 RKAEEERKAEE 1216
PRK00247 PRK00247
putative inner membrane protein translocase component YidC; Validated
257-342 6.83e-03

putative inner membrane protein translocase component YidC; Validated


Pssm-ID: 178945 [Multi-domain]  Cd Length: 429  Bit Score: 38.68  E-value: 6.83e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378 257 RKWALRRQN----------EPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREA---KRLQRMQETDEQRARRLQR 323
Cdd:PRK00247 304 FLWTLRRNRlrmiitpwraPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREInreARQERAAAMARARARRAAV 383
                         90
                 ....*....|....*....
gi 755527378 324 DREAMRLKRANETPEKRQA 342
Cdd:PRK00247 384 KAKKKGLIDASPNEDTPSE 402
Nop53 pfam07767
Nop53 (60S ribosomal biogenesis); This nucleolar family of proteins are involved in 60S ...
269-359 7.02e-03

Nop53 (60S ribosomal biogenesis); This nucleolar family of proteins are involved in 60S ribosomal biogenesis. They are specifically involved in the processing beyond the 27S stage of 25S rRNA maturation. This family contains sequences that bear similarity to the glioma tumour suppressor candidate region gene 2 protein (p60). This protein has been found to interact with herpes simplex type 1 regulatory proteins.


Pssm-ID: 462259 [Multi-domain]  Cd Length: 353  Bit Score: 38.43  E-value: 7.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  269 EVRLQRLERE----RTAKKSRRDNETPEEREVRRM----RDREAKRLQRMQETDEQRARRLQRdREAMRLKRANETPEKR 340
Cdd:pfam07767 206 EAEKKRLKEEekleRVLEKIAESAATAEAREEKRKtkaqRNKEKRRKEEEREAKEEKALKKKL-AQLERLKEIAKEIAEK 284
                          90
                  ....*....|....*....
gi 755527378  341 QARLIREREAKRLKRRLEK 359
Cdd:pfam07767 285 EKEREEKAEARKREKRKKK 303
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
151-173 7.51e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 33.81  E-value: 7.51e-03
                          10        20
                  ....*....|....*....|...
gi 755527378  151 YMCPVCGRALSSPGSLGRHLLIH 173
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
271-384 8.12e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 38.38  E-value: 8.12e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378 271 RLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIREREA 350
Cdd:COG1196  285 EAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAE 364
                         90       100       110
                 ....*....|....*....|....*....|....
gi 755527378 351 KRLKRRLEKMDMMLRAQFGQDPSAMAALAAEMNF 384
Cdd:COG1196  365 EALLEAEAELAEAEEELEELAEELLEALRAAAEL 398
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
285-359 9.19e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 37.98  E-value: 9.19e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755527378  285 RRDNETPEEREVRRMRDREAKRlqrmqETDEQRARRLQRDREAMRLKRANETPEKRQARLIREREAKRLKRRLEK 359
Cdd:TIGR01622   2 YRDRERERLRDSSSAGDRDRRR-----DKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYR 71
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
260-358 9.63e-03

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 37.98  E-value: 9.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755527378  260 ALRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAK-RLQRMQETDEQRARRLQRDREAMRLKRANETPE 338
Cdd:pfam13868 137 EEQAEWKELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKeREIARLRAQQEKAQDEKAERDELRAKLYQEEQE 216
                          90       100
                  ....*....|....*....|
gi 755527378  339 KRQARLIREREAKRLKRRLE 358
Cdd:pfam13868 217 RKERQKEREEAEKKARQRQE 236
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH