Conserved domains on  [gi|1034609085|ref|XP_016882663|]
transcription elongation factor SPT5 isoform X1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
21-71 2.39e-28

KOW domain of Spt5, repeat 3; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240507  Cd Length: 51  Bit Score: 107.23  E-value: 2.39e-28
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1034609085  21 YFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 71
Cdd:cd06083     1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
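The per-hit statistics line shown above each alignment has a fixed `label: value` layout, so it can be pulled apart programmatically. The following is a minimal Python sketch, assuming the line is available as plain text exactly as it appears on this page. The names `HitStats` and `parse_stats` are hypothetical and are not part of any NCBI tool.

```python
import re
from dataclasses import dataclass


@dataclass
class HitStats:
    """Hypothetical container for one hit's statistics line."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Matches lines such as
#   "Pssm-ID: 240507  Cd Length: 51  Bit Score: 107.23  E-value: 2.39e-28"
# An optional bracketed tag (e.g. "[Multi-domain]") may follow the Pssm-ID.
_STATS = re.compile(
    r"Pssm-ID:\s*(\d+)(?:\s*\[[^\]]*\])?\s+"
    r"Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_stats(line: str) -> HitStats:
    """Parse one statistics line; raise ValueError if the layout differs."""
    m = _STATS.search(line)
    if m is None:
        raise ValueError(f"unrecognized statistics line: {line!r}")
    return HitStats(
        pssm_id=int(m.group(1)),
        cd_length=int(m.group(2)),
        bit_score=float(m.group(3)),
        e_value=float(m.group(4)),
    )


stats = parse_stats(
    "Pssm-ID: 240507  Cd Length: 51  Bit Score: 107.23  E-value: 2.39e-28"
)
print(stats)
```

The regular expression is deliberately strict, so a change in the page layout surfaces as an error rather than as silently wrong numbers.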
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
578-634 2.37e-27

KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240510  Cd Length: 58  Bit Score: 104.52  E-value: 2.37e-27
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034609085 578 EHLEPITPTKNNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLKILNLRFLGKL 634
Cdd:cd06086     1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
252-301 7.90e-26

KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240509  Cd Length: 52  Bit Score: 100.25  E-value: 7.90e-26
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1034609085 252 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 301
Cdd:cd06085     2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
322-439 1.50e-23

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 96.05  E-value: 1.50e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  322 GSQTPMYG-SGSRTPMYGSQTP----LQDGSRTPHYGSQTPLHDG--SRTPAQSGAWdPNNPNTPSRAEEEYEYAFDDEP 394
Cdd:smart01104   1 GGRTPAWGaSGSKTPAWGSRTPgtaaGGAPTARGGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1034609085  395 TPSPQA--YGGTPNPQTPGYpdpssPQVNPQYNPQTPGTPAMYNTDQ 439
Cdd:smart01104  80 SASSWAagPGGAYGAPTPGY-----GGTPSAYGPATPGGGAMAGSAS 121
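Gapped alignment blocks like the one above can be summarized by counting identical residues over the aligned columns. The sketch below is one way to do that in Python, assuming the two rows are copied from the page with their coordinates stripped. The function name `aligned_identity` is hypothetical, and comparing lowercase residues case-insensitively is a simplification made here, not a rule taken from the CD-Search output.

```python
from typing import Tuple


def aligned_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Columns with a gap ('-') in either row are skipped. Letters are
    compared case-insensitively (an assumption of this sketch).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Second block of the smart01104 alignment (query residues 395-439).
query = "TPSPQA--YGGTPNPQTPGYpdpssPQVNPQYNPQTPGTPAMYNTDQ"
subject = "SASSWAagPGGAYGAPTPGY-----GGTPSAYGPATPGGGAMAGSAS"

identical, aligned = aligned_identity(query, subject)
print(f"{identical}/{aligned} identical over aligned columns")
```

Percent identity computed this way depends on how gap columns are treated, so the denominator should be stated whenever the figure is reported.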
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
147-189 3.83e-20

KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240508  Cd Length: 43  Bit Score: 83.72  E-value: 3.83e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1034609085 147 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGGMFVCK 189
Cdd:cd06084     1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
PHA03269 super family cl29788
envelope glycoprotein C; Provisional
375-520 3.44e-08

envelope glycoprotein C; Provisional


The actual alignment was detected with superfamily member PHA03269:

Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 56.66  E-value: 3.44e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 375 NPNTPSRAEEEYEYAFDDEPTPSPQayggtPNPQTPGYPDPS-SPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQ 453
Cdd:PHA03269   21 NLNTNIPIPELHTSAATQKPDPAPA-----PHQAASRAPDPAvAPTSAASRKPD--------------LAQAPTPAASEK 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 454 PSPSPQSYHQV--APSPAGYQNTHSPASYHPTPSPM-AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 520
Cdd:PHA03269   82 FDPAPAPHQAAsrAPDPAVAPQLAAAPKPDAAEAFTsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
KOW super family cl00354
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
1-20 1.52e-04

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); the KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


The actual alignment was detected with superfamily member cd06082:

Pssm-ID: 469738  Cd Length: 51  Bit Score: 39.79  E-value: 1.52e-04
                          10        20
                  ....*....|....*....|
gi 1034609085   1 MPKHEDLKDMLEFPAQELRK 20
Cdd:cd06082    32 MPKHEDLKEPLEFPAKELRK 51
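The Interval column in the list above gives each hit's position on the query. The 21-71 hit aligns 51 residues without gaps, which suggests the intervals are 1-based and inclusive. Under that assumption, the Python sketch below checks which of the listed hits cover the same residues. The dictionary `hits` simply restates the intervals from the list, and `overlap` is a hypothetical helper.

```python
from itertools import combinations

# Intervals restated from the hit list above (assumed 1-based, inclusive).
hits = {
    "KOW_Spt5_3": (21, 71),
    "KOW_Spt5_6": (578, 634),
    "KOW_Spt5_5": (252, 301),
    "CTD": (322, 439),
    "KOW_Spt5_4": (147, 189),
    "PHA03269": (375, 520),
    "KOW": (1, 20),
}


def overlap(a, b):
    """Number of residues shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Collect every pair of hits that shares at least one residue.
overlapping = [
    (name1, name2, overlap(iv1, iv2))
    for (name1, iv1), (name2, iv2) in combinations(hits.items(), 2)
    if overlap(iv1, iv2) > 0
]

for name1, name2, shared in overlapping:
    print(f"{name1} and {name2} share {shared} residues")
```

Overlapping hits in a low-complexity, proline-rich region are a reason to read the weaker E-values with caution.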
 
Name Accession Description Interval E-value
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
322-372 6.18e-13

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 64.00  E-value: 6.18e-13
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034609085 322 GSQTPMYGS--GSRTPMY---GSQTPL--QDGSRTPHY--GSQTPLHD--GSRTPAQSGAWD 372
Cdd:pfam12815   1 GSRTPAYNSagGSRTPAWgadGSRTPAygGAGGRTPAYnqGGKTPAWGgaGSRTPAYYGAWG 62
PHA03269 PHA03269
envelope glycoprotein C; Provisional
375-520 3.44e-08

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 56.66  E-value: 3.44e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 375 NPNTPSRAEEEYEYAFDDEPTPSPQayggtPNPQTPGYPDPS-SPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQ 453
Cdd:PHA03269   21 NLNTNIPIPELHTSAATQKPDPAPA-----PHQAASRAPDPAvAPTSAASRKPD--------------LAQAPTPAASEK 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 454 PSPSPQSYHQV--APSPAGYQNTHSPASYHPTPSPM-AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 520
Cdd:PHA03269   82 FDPAPAPHQAAsrAPDPAVAPQLAAAPKPDAAEAFTsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
PHA03247 PHA03247
large tegument protein UL36; Provisional
295-512 8.86e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 8.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  295 VDRQRLTTVGSRRPGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPlqdGSRTPHYGSQTPLHDGSRTPAQSGAWDPN 374
Cdd:PHA03247  2661 VSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP---EPAPHALVSATPLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  375 NPNTPsraeeeyeyafddePTPSPQAYGGTPN----PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPqg 450
Cdd:PHA03247  2738 APAPP--------------AVPAGPATPGGPArparPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-- 2801
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034609085  451 syqPSPSPQSYHQVAPSPAgYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTP-GAPSPGG 512
Cdd:PHA03247  2802 ---WDPADPPAAVLAPAAA-LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLgGSVAPGG 2860
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
394-520 1.46e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 1.46e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 394 PTPSPQAYGGTPNPQTPGYPDPSSPQVN-PQYNPQTPGTP--AMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAG 470
Cdd:pfam03154 188 PPGTTQAATAGPTPSAPSVPPQGSPATSqPPNQTQSTAAPhtLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLP 267
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034609085 471 YQNTHSPASYHP---------TPSPMAYQASPSPSPVGYSPMTPG----APSPGGYNPHTPGS 520
Cdd:pfam03154 268 QPSLHGQMPPMPhslqtgpshMQHPVPPQPFPLTPQSSQSQVPPGpspaAPGQSQQRIHTPPS 330
KOW smart00739
KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
21-48 9.65e-06

KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 42.32  E-value: 9.65e-06
                           10        20
                   ....*....|....*....|....*...
gi 1034609085   21 YFKMGDHVKVIAGRFEGDTGLIVRVEEN 48
Cdd:smart00739   1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
256-287 1.92e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 41.60  E-value: 1.92e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034609085 256 IGQTVRISQGPYKGYIGVVKDATESTARVELH 287
Cdd:pfam00467   1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
25-53 3.78e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 40.83  E-value: 3.78e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1034609085  25 GDHVKVIAGRFEGDTGLIVRVEE--NFVILF 53
Cdd:pfam00467   2 GDVVRVIAGPFKGKVGKVVEVDDkkNRVLVE 32
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
360-527 4.22e-05

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 46.60  E-value: 4.22e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 360 DGSRTPAQSgawdPNNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPNP----QTPGYPDPSSPQVNPQYNPQT--PGTP 432
Cdd:COG5180   222 DHPRPEAAS----SPKVDPPSTSEARSRPATVDaQPEMRPPADAKERRRaaigDTPAAEPPGLPVLEAGSEPQSdaPEAE 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 433 AMYNTDQFSPYAAPSPQGSYQPSPS-----PQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSpspvgySPMTPGA 507
Cdd:COG5180   298 TARPIDVKGVASAPPATRPVRPPGGardpgTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPG------KPLEQGA 371
                         170       180
                  ....*....|....*....|....
gi 1034609085 508 PSPG--GYN--PHTPGSGIEQNSS 527
Cdd:COG5180   372 PRPGssGGDgaPFQPPNGAPQPGL 395
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
1-20 1.52e-04

KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240506  Cd Length: 51  Bit Score: 39.79  E-value: 1.52e-04
                          10        20
                  ....*....|....*....|
gi 1034609085   1 MPKHEDLKDMLEFPAQELRK 20
Cdd:cd06082    32 MPKHEDLKEPLEFPAKELRK 51
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
374-510 3.16e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.60  E-value: 3.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 374 NNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPN-PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQG- 450
Cdd:NF033839  249 DNVNTKVEIENTVHKIFADmDAVVTKFKKGLTQDtPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKp 328
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034609085 451 SYQPSPSPQSYH-QVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSpvgySPMTPGAPSP 510
Cdd:NF033839  329 KPEVKPQPEKPKpEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKP 385
SP7_N cd22542
N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins ...
352-520 1.37e-03

N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP7, also called Osterix (Osx) in humans, is highly conserved among bone-forming vertebrates. It plays a major role, along with Runx2 and Dlx5, in driving the differentiation of mesenchymal precursor cells into osteoblasts and eventually osteocytes. SP7 also plays a regulatory role by inhibiting chondrocyte differentiation, maintaining the balance between differentiation of mesenchymal precursor cells into ossified bone or cartilage. Mutations of this gene have been associated with multiple dysfunctional bone phenotypes in vertebrates. SP7 is thought to play a role in diseases such as osteogenesis imperfecta. SP7 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus.
SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. This model represents the N-terminal domain of SP7.


Pssm-ID: 411691 [Multi-domain]  Cd Length: 297  Bit Score: 41.04  E-value: 1.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 352 YGSQTPLHDgSRTPAQSGAWDPNNPNTP--------SRAEE----EYEYAFDD-----EPTPSPQA---YGGTPNPQTPG 411
Cdd:cd22542    26 FGGSSPIRD-SATPGKPGNNPGKKPYSLgsdlssakSRSSElmgdSYTATFSSgnglmSPSGSPQAsttYGNDYNPFSHS 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 412 YPDPSSPQ----VNPQYNPQTPGTPAMYNT-DQFSPY-----AAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYH 481
Cdd:cd22542   105 FPTSSGSQdpslLVSKGHPSADCLPSVYTSlDMAHPYgswykTGIHPGISSSSTNATASWWDMHSNTNWLSAQGQPDGLQ 184
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 1034609085 482 PTPSPMAYQASPSPSPVGYSPMTPgaPSPGGYNPHTPGS 520
Cdd:cd22542   185 ASLQPVPAQTPLNPQLPSYTEFTT--LNPAPYPAVGISS 221
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
146-177 1.74e-03

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 36.21  E-value: 1.74e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034609085 146 VKDIVKVIDGPHSGREGEIRHLFRSFAFLHCK 177
Cdd:pfam00467   1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
392-518 5.69e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.75  E-value: 5.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 392 DEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQynPQTPGTPAMYNTDQFSPYAAPSPQ-GSYQPSPSPQSYH-QVAPSPA 469
Cdd:NF033839  370 EKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQ--PEKPKPEVKPQPEKPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPE 447
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1034609085 470 GYQNTHSPASYHPTPSPMAYQASPSPSpVGYSPMTP----GAPSPGGYNPHTP 518
Cdd:NF033839  448 KPKPEVKPQPETPKPEVKPQPEKPKPE-VKPQPEKPkpdnSKPQADDKKPSTP 499
KOW smart00739
KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
256-276 9.87e-03

KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 33.84  E-value: 9.87e-03
                           10        20
                   ....*....|....*....|.
gi 1034609085  256 IGQTVRISQGPYKGYIGVVKD 276
Cdd:smart00739   4 VGDTVRVIAGPFKGKVGKVLE 24
 
Name Accession Description Interval E-value
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
21-71 2.39e-28

KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240507  Cd Length: 51  Bit Score: 107.23  E-value: 2.39e-28
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1034609085  21 YFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 71
Cdd:cd06083     1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
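As a rough cross-check on a hit like the one above, the query row and the Cdd row can be compared column by column. The Python sketch below is not part of the CD-Search output. The function name `aligned_identity` is hypothetical, and the sketch assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned block, so both kinds of column are skipped.

```python
def aligned_identity(query_row: str, subject_row: str) -> float:
    """Fraction of aligned columns in which both rows carry the same residue."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    # Keep only columns where both symbols are uppercase letters.
    # Assumption: '-' is a gap and lowercase marks an unaligned residue.
    columns = [
        (q, s)
        for q, s in zip(query_row, subject_row)
        if q.isalpha() and q.isupper() and s.isalpha() and s.isupper()
    ]
    if not columns:
        return 0.0
    matches = sum(1 for q, s in columns if q == s)
    return matches / len(columns)


# Rows copied from the cd06083 (KOW_Spt5_3) alignment, query residues 21-71.
query = "YFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC"
subject = "HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS"
print(f"identity over aligned columns: {aligned_identity(query, subject):.1%}")
```

Identity computed this way is only a quick descriptive figure. The bit score and E-value reported with each hit come from the position-specific scoring matrix, not from this pairwise count.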
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
578-634 2.37e-27

KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that has so far been found in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. KOW_Spt5 domains play critical roles in recruiting multiple other eukaryotic transcription elongation and RNA biogenesis factors, and are also involved in binding the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240510  Cd Length: 58  Bit Score: 104.52  E-value: 2.37e-27
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034609085 578 EHLEPITPTKNNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLKILNLRFLGKL 634
Cdd:cd06086     1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
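Each hit carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, optionally with a `[Multi-domain]` flag. A minimal sketch for pulling those fields out of the text, assuming the line layout shown on this page, might look like the following. The names `STATS_RE` and `parse_hit_stats` are hypothetical and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump.
# Assumption: field labels and their order are exactly as shown on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line: str) -> dict:
    """Return the numeric fields of one summary line as a dictionary."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


stats = parse_hit_stats(
    "Pssm-ID: 240510  Cd Length: 58  Bit Score: 104.52  E-value: 2.37e-27"
)
print(stats)
```

Parsing the E-value as a float makes it straightforward to filter hits by a threshold of the reader's choosing.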
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
252-301 7.90e-26

KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that has so far been found in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. KOW_Spt5 domains play critical roles in recruiting multiple other eukaryotic transcription elongation and RNA biogenesis factors, and are also involved in binding the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240509  Cd Length: 52  Bit Score: 100.25  E-value: 7.90e-26
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1034609085 252 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 301
Cdd:cd06085     2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
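In each alignment row, the numbers on either side of the sequence give the first and last residue positions, so the number of letters in the row should equal end minus start plus one. The sketch below checks that for a single row of this text dump. The function name `row_is_consistent` is hypothetical, and the sketch assumes whitespace-separated fields with the sequence as the second-to-last token and `-` as the only non-residue symbol.

```python
def row_is_consistent(row: str) -> bool:
    """Check that the residue count in an alignment row matches its coordinates."""
    # Assumed layout: <label ...> <start> <sequence> <end>
    *_, start, sequence, end = row.split()
    residues = sum(1 for ch in sequence if ch.isalpha())  # gaps ('-') are not counted
    return residues == int(end) - int(start) + 1


# Query row copied from the cd06085 (KOW_Spt5_5) alignment.
row = "gi 1034609085 252 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 301"
print(row_is_consistent(row))
```

A check like this is mainly useful for catching rows that were truncated or merged when the page was copied as text.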
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
322-439 1.50e-23

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 96.05  E-value: 1.50e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  322 GSQTPMYG-SGSRTPMYGSQTP----LQDGSRTPHYGSQTPLHDG--SRTPAQSGAWdPNNPNTPSRAEEEYEYAFDDEP 394
Cdd:smart01104   1 GGRTPAWGaSGSKTPAWGSRTPgtaaGGAPTARGGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1034609085  395 TPSPQA--YGGTPNPQTPGYpdpssPQVNPQYNPQTPGTPAMYNTDQ 439
Cdd:smart01104  80 SASSWAagPGGAYGAPTPGY-----GGTPSAYGPATPGGGAMAGSAS 121
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
147-189 3.83e-20

KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that has so far been found in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. KOW_Spt5 domains play critical roles in recruiting multiple other eukaryotic transcription elongation and RNA biogenesis factors, and are also involved in binding the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240508  Cd Length: 43  Bit Score: 83.72  E-value: 3.83e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1034609085 147 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGGMFVCK 189
Cdd:cd06084     1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
322-372 6.18e-13

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 64.00  E-value: 6.18e-13
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034609085 322 GSQTPMYGS--GSRTPMY---GSQTPL--QDGSRTPHY--GSQTPLHD--GSRTPAQSGAWD 372
Cdd:pfam12815   1 GSRTPAYNSagGSRTPAWgadGSRTPAygGAGGRTPAYnqGGKTPAWGgaGSRTPAYYGAWG 62
PHA03269 PHA03269
envelope glycoprotein C; Provisional
375-520 3.44e-08

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 56.66  E-value: 3.44e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 375 NPNTPSRAEEEYEYAFDDEPTPSPQayggtPNPQTPGYPDPS-SPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQ 453
Cdd:PHA03269   21 NLNTNIPIPELHTSAATQKPDPAPA-----PHQAASRAPDPAvAPTSAASRKPD--------------LAQAPTPAASEK 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 454 PSPSPQSYHQV--APSPAGYQNTHSPASYHPTPSPM-AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 520
Cdd:PHA03269   82 FDPAPAPHQAAsrAPDPAVAPQLAAAPKPDAAEAFTsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
PHA03247 PHA03247
large tegument protein UL36; Provisional
295-512 8.86e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 8.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  295 VDRQRLTTVGSRRPGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPlqdGSRTPHYGSQTPLHDGSRTPAQSGAWDPN 374
Cdd:PHA03247  2661 VSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP---EPAPHALVSATPLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  375 NPNTPsraeeeyeyafddePTPSPQAYGGTPN----PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPqg 450
Cdd:PHA03247  2738 APAPP--------------AVPAGPATPGGPArparPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-- 2801
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034609085  451 syqPSPSPQSYHQVAPSPAgYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTP-GAPSPGG 512
Cdd:PHA03247  2802 ---WDPADPPAAVLAPAAA-LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLgGSVAPGG 2860
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
304-369 9.89e-08

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 49.37  E-value: 9.89e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034609085 304 GSRRPGGMTSTYGRTPMY-------------GSQTPMYGSGSRTPMYGsqtplQDGSRTPHYGSQTplhDGSRTPAQSG 369
Cdd:pfam12815   1 GSRTPAYNSAGGSRTPAWgadgsrtpayggaGGRTPAYNQGGKTPAWG-----GAGSRTPAYYGAW---GGSRTPAYGG 71
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
339-520 1.87e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.39  E-value: 1.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 339 SQTPLQDGSRTPHYGSQTPLHDgsrtPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPtPSPQAYGGTPNPQTPGYPdPSSP 418
Cdd:pfam03154 259 SQVSPQPLPQPSLHGQMPPMPH----SLQTGPSHMQHPVPPQPFPLTPQSSQSQVP-PGPSPAAPGQSQQRIHTP-PSQS 332
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 419 QVNPQYNP-QTPGTPAMYNTdqfsPYAAPSPQGSYQPSPSPQSY----HQVAPSPAGYQN---------------THSPA 478
Cdd:pfam03154 333 QLQSQQPPrEQPLPPAPLSM----PHIKPPPTTPIPQLPNPQSHkhppHLSGPSPFQMNSnlppppalkplsslsTHHPP 408
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 1034609085 479 SYHPTPSPMAYQASPSPSPVGYSPM---TPGAPSPGGYNPHTPGS 520
Cdd:pfam03154 409 SAHPPPLQLMPQSQQLPPPPAQPPVltqSQSLPPPAASHPPTSGL 453
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
326-527 4.79e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.23  E-value: 4.79e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 326 PMYGSGSRTPMYGSQTPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPsrAEEEYEYAFDDEPTPSPQayggTP 405
Cdd:pfam03154 294 PPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPP--APLSMPHIKPPPTTPIPQ----LP 367
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 406 NPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQgsyqPSP---SPQSyHQVAPSPAG------YQNTHS 476
Cdd:pfam03154 368 NPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAH----PPPlqlMPQS-QQLPPPPAQppvltqSQSLPP 442
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034609085 477 PASYHPTPSpmAYQASPSPSPVGYSPMTPGAP----SPGGYNPHTP--GSGIEQNSS 527
Cdd:pfam03154 443 PAASHPPTS--GLHQVPSQSPFPQHPFVPGGPppitPPSGPPTSTSsaMPGIQPPSS 497
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
147-184 8.99e-07

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); The KOW domain is known as an RNA-binding motif that has so far been found in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 46.06  E-value: 8.99e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1034609085 147 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGG 184
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFGIVTVKGATGSKG 38
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
257-300 9.92e-07

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); The KOW domain is known as an RNA-binding motif that has so far been found in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 45.67  E-value: 9.92e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1034609085 257 GQTVRISQGPYKGYIGVVKDATEST--ARVELH--STCQTISVDRQRL 300
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFgiVTVKGAtgSKGAELKVRFDDV 48
PHA03247 PHA03247
large tegument protein UL36; Provisional
318-532 1.29e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 1.29e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  318 TPMYGSQTPMygsgSRTPMYGSQTPLQDGSRTPHYGSQTPLHDGSR-TPAQSGAWDPNNPNTPsRAEEEYEYAFDDEPTP 396
Cdd:PHA03247  2823 SPAGPLPPPT----SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRrPPSRSPAAKPAAPARP-PVRRLARPAVSRSTES 2897
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  397 SPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHS 476
Cdd:PHA03247  2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQ--------------PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034609085  477 PASYHPTPSPMAYQASPSPSPvgySPMTPGAPSPGgyNPHTPGSGIeqnsSDWVTT 532
Cdd:PHA03247  2964 LGALVPGRVAVPRFRVPQPAP---SREAPASSTPP--LTGHSLSRV----SSWASS 3010
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
25-69 1.34e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); The KOW domain is known as an RNA-binding motif that has so far been found in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 45.29  E-value: 1.34e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1034609085  25 GDHVKVIAGRFEGDTGLIVRVEENFVIL----FSDLTMHELKVLPRDLQ 69
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFGIVtvkgATGSKGAELKVRFDDVD 49
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
394-520 1.46e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 1.46e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 394 PTPSPQAYGGTPNPQTPGYPDPSSPQVN-PQYNPQTPGTP--AMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAG 470
Cdd:pfam03154 188 PPGTTQAATAGPTPSAPSVPPQGSPATSqPPNQTQSTAAPhtLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLP 267
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034609085 471 YQNTHSPASYHP---------TPSPMAYQASPSPSPVGYSPMTPG----APSPGGYNPHTPGS 520
Cdd:pfam03154 268 QPSLHGQMPPMPhslqtgpshMQHPVPPQPFPLTPQSSQSQVPPGpspaAPGQSQQRIHTPPS 330
PRK10263 PRK10263
DNA translocase FtsK; Provisional
393-519 1.67e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 51.62  E-value: 1.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  393 EPTPSPQAYGGTPNPQtpgYPDPSSPQVNP---QYNPQTPGTPAMYNTDQFSPYAAPSP-QGSYQPSPSPQSYHQVAPSP 468
Cdd:PRK10263   370 EPVIAPAPEGYPQQSQ---YAQPAVQYNEPlqqPVQPQQPYYAPAAEQPAQQPYYAPAPeQPAQQPYYAPAPEQPVAGNA 446
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034609085  469 AGYQNTHSPasYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPG 519
Cdd:PRK10263   447 WQAEEQQST--FAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPV 495
PHA03247 PHA03247
large tegument protein UL36; Provisional
339-519 2.42e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 2.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  339 SQTPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYP----- 413
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHppptv 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  414 --------DPSSPQVNPQYNPQTPGTPAMYN--TDQFSPYAAPSPQGSYQPS---PSPQSYHQVAPSPAGYQNTHSPASY 480
Cdd:PHA03247  2647 ppperprdDPAPGRVSRPRRARRLGRAAQASspPQRPRRRAARPTVGSLTSLadpPPPPPTPEPAPHALVSATPLPPGPA 2726
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1034609085  481 HPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPG 519
Cdd:PHA03247  2727 AARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
308-521 2.77e-06

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 50.39  E-value: 2.77e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 308 PGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQ-------TPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNP---- 376
Cdd:pfam09606 231 PQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQmqqmpqgVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYqqqq 310
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 377 --NTPSRAEEEYEYAFDDEPTPS--------PQAYGGTPNPQTPGypdpssPQVNPQYNPQTPGTPAMYNTDQFSPYAAP 446
Cdd:pfam09606 311 trQQQQQQGGNHPAAHQQQMNQSvgqggqvvALGGLNHLETWNPG------NFGGLGANPMQRGQPGMMSSPSPVPGQQV 384
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034609085 447 SPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASyHPTPSPmAYQASPSPSPVGY--SPMTPGAPSPGGyNPHTPGSG 521
Cdd:pfam09606 385 RQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPG-GMIPSP-ALIPSPSPQMSQQpaQQRTIGQDSPGG-SLNTPGQS 458
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
277-524 2.99e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 50.34  E-value: 2.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 277 ATESTARVELHSTCQTISvdrqrlTTVGSRRPGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPlQDGSRTPHYGSQT 356
Cdd:pfam17823 170 AASPAPRTAASSTTAASS------TTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALA-AVGNSSPAAGTVT 242
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 357 PLhDGSRTPAQSGawdpnnpnTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGypDPSSPQVNPQYNPQTPGTPAMYN 436
Cdd:pfam17823 243 AA-VGTVTPAALA--------TLAAAAGTVASAAGTINMGDPHARRLSPAKHMPS--DTMARNPAAPMGAQAQGPIIQVS 311
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 437 TDQfsPYAAPSPqgsyQPSPSPQSYHQVAPSPAGYQNTHSPASyhPTPSPMAYQASPSPSPVGYSPMTPGA--------P 508
Cdd:pfam17823 312 TDQ--PVHNTAG----EPTPSPSNTTLEPNTPKSVASTNLAVV--TTTKAQAKEPSASPVPVLHTSMIPEVeatspttqP 383
                         250
                  ....*....|....*.
gi 1034609085 509 SPGGYNPHTPGSGIEQ 524
Cdd:pfam17823 384 SPLLPTQGAAGPGILL 399
PHA03378 PHA03378
EBNA-3B; Provisional
313-512 3.72e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.07  E-value: 3.72e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 313 STYGRTPMYGSQTPMYGSGSRTPMYGSQTPLQD-----------GSRTPHYGSQTPLHDGSRTPAQSGAWdPNNPNTPSR 381
Cdd:PHA03378  603 SQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmqpitfnvlVFPTPHQPPQVEITPYKPTWTQIGHI-PYQPSPTGA 681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 382 AEEEYEYAFDDEPTPSPQAYGGTPNPQTPgyPDPSS-PQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQS 460
Cdd:PHA03378  682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAP--PGRAQrPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 759
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034609085 461 YHQVAPSPAGyqnthSPASYHPTPSPMA-------YQASPSPSP---VGYSPMTPGAPSPGG 512
Cdd:PHA03378  760 APGRARPPAA-----APGAPTPQPPPQAppapqqrPRGAPTPQPppqAGPTSMQLMPRAAPG 816
KOW smart00739
KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
21-48 9.65e-06

KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 42.32  E-value: 9.65e-06
                           10        20
                   ....*....|....*....|....*...
gi 1034609085   21 YFKMGDHVKVIAGRFEGDTGLIVRVEEN 48
Cdd:smart00739   1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
PHA03378 PHA03378
EBNA-3B; Provisional
349-518 1.16e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.52  E-value: 1.16e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 349 TPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEE---YEYAFDDEPTPSPQAYGGTPNPQTPGYPDPS-SPQVNPQ- 423
Cdd:PHA03378  582 TSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETsapRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHqPPQVEITp 661
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 424 ------------YNPQTPG---------TPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHP 482
Cdd:PHA03378  662 ykptwtqighipYQPSPTGantmlpiqwAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAP 741
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 1034609085 483 TPS--PMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTP 518
Cdd:PHA03378  742 GRArpPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
PHA03377 PHA03377
EBNA-3C; Provisional
288-518 1.68e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 48.13  E-value: 1.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  288 STCQTISVDRQRLTTVGSRRPGGMTSTygrTPMYGSQTPMYgSGSRTPMYGSQTPLQD---GSRTPHYGSQTPLHDGSRT 364
Cdd:PHA03377   686 SVFVLPSVDAGRAQPSEESHLSSMSPT---QPISHEEQPRY-EDPDDPLDLSLHPDQApppSHQAPYSGHEEPQAQQAPY 761
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  365 PaqsGAWDPNNPNTPSRAEEEyeyafddeptpsPQAYGGTPNpQTPGYPDPSSPQVN-----------PQY----NPQTP 429
Cdd:PHA03377   762 P---GYWEPRPPQAPYLGYQE------------PQAQGVQVS-SYPGYAGPWGLRAQhpryrhswaywSQYpghgHPQGP 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  430 GTP-AMYNTDQFSPYAAP-----SPQGSYQPSPSP----QSYHQVAPSPAGYQNTHSPASYHPTPS----PMAYQASPSP 495
Cdd:PHA03377   826 WAPrPPHLPPQWDGSAGHgqdqvSQFPHLQSETGPprlqLSQVPQLPYSQTLVSSSAPSWSSPQPRapirPIPTRFPPPP 905
                          250       260
                   ....*....|....*....|...
gi 1034609085  496 SPVGYSpMTPGAPSPGGYNPHTP 518
Cdd:PHA03377   906 MPLQDS-MAVGCDSSGTACPSMP 927
PRK10263 PRK10263
DNA translocase FtsK; Provisional
415-526 1.91e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.16  E-value: 1.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  415 PSSPQVNPQYNPQTpgtpamynTDQFSPYAAPSPQGSYQPSPSPQSYHQ-VAPSPAGYQNTHSPASYHPTPSPMAYQASP 493
Cdd:PRK10263   740 PHEPLFTPIVEPVQ--------QPQQPVAPQQQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1034609085  494 SPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNS 526
Cdd:PRK10263   812 APQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDT 844
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
256-287 1.92e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 41.60  E-value: 1.92e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034609085 256 IGQTVRISQGPYKGYIGVVKDATESTARVELH 287
Cdd:pfam00467   1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
PHA03247 PHA03247
large tegument protein UL36; Provisional
361-516 2.08e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 2.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  361 GSRTPAQSGAWDPNNPNTPSRAEEEYeyaFDDEPTPSP-----------------QAYGGTPNPQTPGYPDPSSPQV--N 421
Cdd:PHA03247  2494 AAPDPGGGGPPDPDAPPAPSRLAPAI---LPDEPVGEPvhprmltwirgleelasDDAGDPPPPLPPAAPPAAPDRSvpP 2570
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  422 PQYNPQTPGtPAMyNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQA-SPSPSPVGY 500
Cdd:PHA03247  2571 PRPAPRPSE-PAV-TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPdPHPPPTVPP 2648
                          170
                   ....*....|....*.
gi 1034609085  501 SPMTPGAPSPGGYNPH 516
Cdd:PHA03247  2649 PERPRDDPAPGRVSRP 2664
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
25-53 3.78e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 40.83  E-value: 3.78e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1034609085  25 GDHVKVIAGRFEGDTGLIVRVEE--NFVILF 53
Cdd:pfam00467   2 GDVVRVIAGPFKGKVGKVVEVDDkkNRVLVE 32
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
360-527 4.22e-05

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 46.60  E-value: 4.22e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 360 DGSRTPAQSgawdPNNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPNP----QTPGYPDPSSPQVNPQYNPQT--PGTP 432
Cdd:COG5180   222 DHPRPEAAS----SPKVDPPSTSEARSRPATVDaQPEMRPPADAKERRRaaigDTPAAEPPGLPVLEAGSEPQSdaPEAE 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 433 AMYNTDQFSPYAAPSPQGSYQPSPS-----PQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSpspvgySPMTPGA 507
Cdd:COG5180   298 TARPIDVKGVASAPPATRPVRPPGGardpgTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPG------KPLEQGA 371
                         170       180
                  ....*....|....*....|....
gi 1034609085 508 PSPG--GYN--PHTPGSGIEQNSS 527
Cdd:COG5180   372 PRPGssGGDgaPFQPPNGAPQPGL 395
PHA03378 PHA03378
EBNA-3B; Provisional
314-510 6.04e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.21  E-value: 6.04e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 314 TYGRTPMYGSQTPMYGSGSRTPMYGSQTP-----------LQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRA 382
Cdd:PHA03378  578 TSPTTSQLASSAPSYAQTPWPVPHPSQTPeppttqshipeTSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 383 EeeyeyafddeptPSPQAYGGTPNPQTPGYPDPSSPQVN--PQYNPQTPGTPAMYNTDQFSPYAAPS----PQGSYQPSP 456
Cdd:PHA03378  658 E------------ITPYKPTWTQIGHIPYQPSPTGANTMlpIQWAPGTMQPPPRAPTPMRPPAAPPGraqrPAAATGRAR 725
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1034609085 457 SPQSYHQVAPSPAGYQNTHSPASYHPTPS-PMAYQASPSPSPVGyspmTPGAPSP 510
Cdd:PHA03378  726 PPAAAPGRARPPAAAPGRARPPAAAPGRArPPAAAPGRARPPAA----APGAPTP 776
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
393-516 6.73e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 6.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 393 EPTPSPQAYGGTPNPQTPGyPDPSSPQVNPQYNPQTPGTPAmyntdqfsPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQ 472
Cdd:PRK07764  391 AGAPAAAAPSAAAAAPAAA-PAPAAAAPAAAAAPAPAAAPQ--------PAPAPAPAPAPPSPAGNAPAGGAPSPPPAAA 461
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1034609085 473 NTHSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPH 516
Cdd:PRK07764  462 PSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
396-524 7.42e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.91  E-value: 7.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 396 PSPQAYGGTPNPQTPGYPDPSS-PQVNPqynpqTPGTPAMynTDQFSPYAAPSPQGSyQPSPSPQSYHQVAPS--PAGYQ 472
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQaATAGP-----TPSAPSV--PPQGSPATSQPPNQT-QSTAAPHTLIQQTPTlhPQRLP 243
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1034609085 473 NTHSPASYHPTPSPMAY-QASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQ 524
Cdd:pfam03154 244 SPHPPLQPMTQPPPPSQvSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQ 296
PHA03247 PHA03247
large tegument protein UL36; Provisional
325-555 7.49e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 7.49e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  325 TPMYGSGSRTPmyGSQTPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGT 404
Cdd:PHA03247  2741 PPAVPAGPATP--GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  405 PNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFS-----PYAAPSPQGSYQPSPSPQSYHQVA--PSPAGYQNTHSP 477
Cdd:PHA03247  2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSvapggDVRRRPPSRSPAAKPAAPARPPVRrlARPAVSRSTESF 2898
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034609085  478 ASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNSSDWVTTDIQVKVRDTYLDTQVVGQTGVIR 555
Cdd:PHA03247  2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPR 2976
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
364-529 9.56e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 45.47  E-value: 9.56e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 364 TPAQSGAWDPN------NPNTPSRAEEEYEYAfDDEPTPSPQAYGGTPNPQTP---GYP------DPSSPQVNPQYNP-- 426
Cdd:PRK08691  359 APLAAASCDANavientELQSPSAQTAEKETA-AKKPQPRPEAETAQTPVQTAsaaAMPsegktaGPVSNQENNDVPPwe 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 427 ------QTPGTPAmyntdqfsPYAAPSPQGSYQPSPSPQSyhQVAPSPAGYQNTHSPASYHPTPSPmaYQASPSPSPVGY 500
Cdd:PRK08691  438 dapdeaQTAAGTA--------QTSAKSIQTASEAETPPEN--QVSKNKAADNETDAPLSEVPSENP--IQATPNDEAVET 505
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 1034609085 501 SPMTPGAPSPGGYNPHTPGS------GIEQNSSDW 529
Cdd:PRK08691  506 ETFAHEAPAEPFYGYGFPDNdcppedGAEIPPPDW 540
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
394-508 1.36e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 45.06  E-value: 1.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 394 PTPSPQAYGGTPNPQTPGyPDPSSPQVNPQYNPQTPGTPAmyntdqfsPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQN 473
Cdd:PRK14959  387 EGPASGGAATIPTPGTQG-PQGTAPAAGMTPSSAAPATPA--------PSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMP 457
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1034609085 474 THSPASYHPTPSPMAYQASPSPS-PVGYSPMTPGAP 508
Cdd:PRK14959  458 EASPVPGAPDSVASASDAPPTLGdPSDTAEHTPSGP 493
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
341-528 1.39e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  341 TPLQDGSRTPHYGSQ-----------TPLHDGSRTPAqSGAWDPNNPNTPSRAEE-EYEYAFDDEPTPSPQAYGGTPNPQ 408
Cdd:PHA03307    26 ATPGDAADDLLSGSQgqlvsdsaelaAVTVVAGAAAC-DRFEPPTGPPPGPGTEApANESRSTPTWSLSTLAPASPAREG 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  409 TPGYPDPSSPqvnpqynpqtPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNThSPASYHPTP---- 484
Cdd:PHA03307   105 SPTPPGPSSP----------DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAASSrqaa 173
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1034609085  485 --SPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNSSD 528
Cdd:PHA03307   174 lpLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
1-20 1.52e-04

KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif found so far in some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins, and Mtr4. KOW_Spt5 domains play critical roles in recruiting multiple other eukaryotic transcription elongation and RNA biogenesis factors and are additionally involved in binding the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240506  Cd Length: 51  Bit Score: 39.79  E-value: 1.52e-04
                          10        20
                  ....*....|....*....|
gi 1034609085   1 MPKHEDLKDMLEFPAQELRK 20
Cdd:cd06082    32 MPKHEDLKEPLEFPAKELRK 51
PHA02682 PHA02682
ORF080 virion core protein; Provisional
355-540 2.00e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 43.70  E-value: 2.00e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 355 QTPLHDGSRTPAQSgawdpnnPNTPSRAEeeyeyafddePTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAM 434
Cdd:PHA02682   79 QSPLAPSPACAAPA-------PACPACAP----------AAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACP 141
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 435 YNTDQFSPyAAPSPQgsyqPSPSPqsyhqvAPSPAGYQNTHSPASYhPTPSPMAYQASPSPSPVgyspmtpgapspggYN 514
Cdd:PHA02682  142 PSTRQCPP-APPLPT----PKPAP------AAKPIFLHNQLPPPDY-PAASCPTIETAPAASPV--------------LE 195
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1034609085 515 PHTPGSGIEQNSSDWVT-----TDIQVKVRD 540
Cdd:PHA02682  196 PRIPDKIIDADNDDKDLikkelADIADSVRD 226
PHA03291 PHA03291
envelope glycoprotein I; Provisional
389-536 2.46e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 43.79  E-value: 2.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 389 AFDDEPTPSPQAYGGTPnpqTPGYPDPSSPQVNPQYNPqtpgtpamynTDQFSPyAAPSPQGSYQPSPspqsyhQVAPSP 468
Cdd:PHA03291  165 AFPAEGTLAAPPLGEGS---ADGSCDPALPLSAPRLGP----------ADVFVP-ATPRPTPRTTASP------ETTPTP 224
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034609085 469 AgyqNTHSPASyHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNSSDWVTTDIQV 536
Cdd:PHA03291  225 S---TTTSPPS-TTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASRYELTVTQI 288
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
345-533 2.78e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 2.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 345 DGSRTPHYGSQTPLHDGSR--TPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNP 422
Cdd:PRK07764  596 GGEGPPAPASSGPPEEAARpaAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 423 QYNPQTPGTPAMyntdqfSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSP 502
Cdd:PRK07764  676 AAPAAPPPAPAP------AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPP 749
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1034609085 503 MTPGAPSPGGYNPHTPGSGIEQNSSDWVTTD 533
Cdd:PRK07764  750 DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
374-510 3.16e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.60  E-value: 3.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 374 NNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPN-PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQG- 450
Cdd:NF033839  249 DNVNTKVEIENTVHKIFADmDAVVTKFKKGLTQDtPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKp 328
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034609085 451 SYQPSPSPQSYH-QVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSpvgySPMTPGAPSP 510
Cdd:NF033839  329 KPEVKPQPEKPKpEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKP 385
dnaA PRK14086
chromosomal replication initiator protein DnaA;
335-499 3.41e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 43.66  E-value: 3.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 335 PMY-GSQTPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNnPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYP 413
Cdd:PRK14086  103 RRTsEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTAR-PAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYA 181
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 414 DPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSY-------QPSPSPQSYHQV--APSPAGYQNTHSPASYHPTP 484
Cdd:PRK14086  182 SPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRprrdrtdRPEPPPGAGHVHrgGPGPPERDDAPVVPIRPSAP 261
                         170
                  ....*....|....*
gi 1034609085 485 SPMAYQASPSPSPVG 499
Cdd:PRK14086  262 GPLAAQPAPAPGPGE 276
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
303-511 5.16e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 5.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 303 VGSRRPGGMTStyGRTPMYGSQTPM-YGSGSRTPMYGSQTPLQDgSRTPHYGSQTPlhdGSRTPAQSGAWDPNNPNTPSR 381
Cdd:pfam05109 473 VTSPTPAGTTS--GASPVTPSPSPRdNGTESKAPDMTSPTSAVT-TPTPNATSPTP---AVTTPTPNATSPTLGKTSPTS 546
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 382 AeeeYEYAFDDEPTPSPQAYGGTPNPQTPGY-------------PDPSSPQV---NPQYNPQ------TPGTPAMYNTDQ 439
Cdd:pfam05109 547 A---VTTPTPNATSPTPAVTTPTPNATIPTLgktsptsavttptPNATSPTVgetSPQANTTnhtlggTSSTPVVTSPPK 623
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 440 FSPYAAPSPQGSYQPSPS------PQSYHQ-VAPSPAGYQNTHSP--ASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSP 510
Cdd:pfam05109 624 NATSAVTTGQHNITSSSTssmslrPSSISEtLSPSTSDNSTSHMPllTSAHPTGGENITQVTPASTSTHHVSTSSPAPRP 703

                  .
gi 1034609085 511 G 511
Cdd:pfam05109 704 G 704
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
405-591 5.25e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.26  E-value: 5.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 405 PNPQTPGYPDPSSPQVNPQYNPQTPGTPAMyntdqfspyAAPSPQGSYQPSPSPQSyhQVAPSPAGYQNTHSPASYHPTP 484
Cdd:PRK14950  364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAA---------AANIPPKEPVRETATPP--PVPPRPVAPPVPHTPESAPKLT 432
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 485 spmayqasPSPSPVGYSPMTPGAPSPGGYNP--HTPGSGIEQNSSDWVTTDIQVKVRDTYLdtQVVGQTGViRSVTggmc 562
Cdd:PRK14950  433 --------RAAIPVDEKPKYTPPAPPKEEEKalIADGDVLEQLEAIWKQILRDVPPRSPAV--QALLSSGV-RPVS---- 497
                         170       180       190
                  ....*....|....*....|....*....|
gi 1034609085 563 svyLKDSEKVVSISSE-HLEPITPTKNNKV 591
Cdd:PRK14950  498 ---VEKNTLTLSFKSKfHKDKIEEPENRKI 524
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
396-527 5.75e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 5.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 396 PSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTH 475
Cdd:PRK12323  392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRP 471
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1034609085 476 SPASYHPTPSPMAYQASPSPSPVGYSP---MTPGAPSPGGYNPHTPGSGIEQNSS 527
Cdd:PRK12323  472 VAAAAAAAPARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAAPAGWVAESI 526
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
365-520 5.76e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 5.76e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 365 PAQSGAWDPNNPNTPSRAEEEYEYAfdDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPG----TPAMYNTDQF 440
Cdd:PRK07764  592 PGAAGGEGPPAPASSGPPEEAARPA--APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKhvavPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 441 SPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHP---------TPSPMAYQASPSPSPVGYSPMTPGAPSPG 511
Cdd:PRK07764  670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAgqaddpaaqPPQAAQGASAPSPAADDPVPLPPEPDDPP 749

                  ....*....
gi 1034609085 512 GYNPHTPGS 520
Cdd:PRK07764  750 DPAGAPAQP 758
PTZ00395 PTZ00395
Sec24-related protein; Provisional
338-516 6.27e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 43.14  E-value: 6.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  338 GSQTPLQDGSRTPHYGSQTPL-HDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTP--NP--QTPGY 412
Cdd:PTZ00395   345 GSPNAASAGAPFNGLGNQADGgHINQVHPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGysNPgnSNPGY 424
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  413 PDP---SSPQVNPQY------NPQTPGTPamYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPT 483
Cdd:PTZ00395   425 NNApnsNTPYNNPPNsntpysNPPNSNPP--YSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPT 502
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1034609085  484 PSPMAyqASPSPSPVGYSPMTPGAPSPGGYNPH 516
Cdd:PTZ00395   503 ANQPA--ANNFHGAAGNSVGNPFASRPFGSAPY 533
dnaA PRK14086
chromosomal replication initiator protein DnaA;
393-519 6.67e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.89  E-value: 6.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 393 EPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQsyhqvAPSPAGYQ 472
Cdd:PRK14086   94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPR-----AADDYGWQ 168
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1034609085 473 NT-HSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGyNPHTPG 519
Cdd:PRK14086  169 QQrLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRR-DYDHPR 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
376-518 9.01e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 9.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  376 PNTPS-RAEEEYEYAFDDEPTPSPQAyGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMY----NTDQFSPYAAPSPQG 450
Cdd:PHA03247  2475 PGAPVyRRPAEARFPFAAGAAPDPGG-GGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLtwirGLEELASDDAGDPPP 2553
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  451 SYQPSPSPQSYHQVAPSPagyqnthSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAP--SPGGYNPHTP 518
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPP-------RPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSP 2616
PRK10263 PRK10263
DNA translocase FtsK; Provisional
335-500 1.32e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 1.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  335 PMYGSQTPLQDGSR--TPHYGSQTPlhDGSRTPAQSGaWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGY 412
Cdd:PRK10263   345 PVASVDVPPAQPTVawQPVPGPQTG--EPVIAPAPEG-YPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYY 421
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  413 -PDPSSPQVNPQYNPQtPGTPAMYNtdqfsPYAAPSPQGSYQPSPSPQSYHQ-VAPSPAGYQNTHSPASYHPT---PSPM 487
Cdd:PRK10263   422 aPAPEQPAQQPYYAPA-PEQPVAGN-----AWQAEEQQSTFAPQSTYQTEQTyQQPAAQEPLYQQPQPVEQQPvvePEPV 495
                          170
                   ....*....|...
gi 1034609085  488 AYQASPSPSPVGY 500
Cdd:PRK10263   496 VEETKPARPPLYY 508
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
395-510 1.33e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 1.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 395 TPSPQAYGGTPNPQTPGY--PD-----PSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPS 467
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFaaPNtttglPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1034609085 468 PAgyQNTHSPASYHPTPSPMAyqasPSPSPVGYSPmTPGAPSP 510
Cdd:pfam05109 502 KA--PDMTSPTSAVTTPTPNA----TSPTPAVTTP-TPNATSP 537
SP7_N cd22542
N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins ...
352-520 1.37e-03

N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP7, also called Osterix (Osx) in humans, is highly conserved among bone-forming vertebrates. It plays a major role, along with Runx2 and Dlx5, in driving the differentiation of mesenchymal precursor cells into osteoblasts and eventually osteocytes. SP7 also plays a regulatory role by inhibiting chondrocyte differentiation, maintaining the balance between differentiation of mesenchymal precursor cells into ossified bone or cartilage. Mutations of this gene have been associated with multiple dysfunctional bone phenotypes in vertebrates. SP7 is thought to play a role in diseases such as Osteogenesis imperfecta. SP7 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. This model represents the N-terminal domain of SP7.
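The degenerate consensus quoted in the description above (N = any nucleotide, R = A/G, Y = C/T) can be expanded into a regular expression for scanning a DNA string. The following is a minimal sketch only, not part of the CDD record: the helper names `motif_to_regex` and `find_motif` and the example sequence are hypothetical, only the three ambiguity codes used in the description plus the four plain bases are handled, and only the strand given as input is scanned.

```python
import re

# Nucleotide codes: the four bases plus the three ambiguity codes
# that appear in the consensus (R = A/G, Y = C/T, N = any base).
CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}

def motif_to_regex(motif):
    """Translate a degenerate motif such as 'NNRCRCCYY' into a regex string."""
    return "".join(CODES[ch] for ch in motif.upper())

def find_motif(motif, seq):
    """Return 0-based start positions of all (possibly overlapping) matches."""
    # A zero-width lookahead lets overlapping occurrences be reported.
    pattern = re.compile("(?=" + motif_to_regex(motif) + ")")
    return [m.start() for m in pattern.finditer(seq.upper())]

if __name__ == "__main__":
    # Made-up example sequence; the 9-mer AAGCACCCT fits the consensus.
    print(find_motif("NNRCRCCYY", "TTAAGCACCCTGG"))
```

To cover binding sites on the opposite strand, the same function would need to be run on the reverse complement of the sequence as well; that step is left out of this sketch.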


Pssm-ID: 411691 [Multi-domain]  Cd Length: 297  Bit Score: 41.04  E-value: 1.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 352 YGSQTPLHDgSRTPAQSGAWDPNNPNTP--------SRAEE----EYEYAFDD-----EPTPSPQA---YGGTPNPQTPG 411
Cdd:cd22542    26 FGGSSPIRD-SATPGKPGNNPGKKPYSLgsdlssakSRSSElmgdSYTATFSSgnglmSPSGSPQAsttYGNDYNPFSHS 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 412 YPDPSSPQ----VNPQYNPQTPGTPAMYNT-DQFSPY-----AAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYH 481
Cdd:cd22542   105 FPTSSGSQdpslLVSKGHPSADCLPSVYTSlDMAHPYgswykTGIHPGISSSSTNATASWWDMHSNTNWLSAQGQPDGLQ 184
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 1034609085 482 PTPSPMAYQASPSPSPVGYSPMTPgaPSPGGYNPHTPGS 520
Cdd:cd22542   185 ASLQPVPAQTPLNPQLPSYTEFTT--LNPAPYPAVGISS 221
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
360-520 1.44e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  360 DGSRTPAQSGAWDPNNPNTPSRAeeeyeyAFDDEPTPSPQAYGGTPNPQTPGYPDPSSP--QVNPQYNPQTPGTPAMYNT 437
Cdd:PHA03307   238 DSSSSESSGCGWGPENECPLPRP------APITLPTRIWEASGWNGPSSRPGPASSSSSprERSPSPSPSSPGSGPAPSS 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  438 DQFSPYAAPSPQGSyQPSPSPQSyhqVAPSPAGyqnTHSPASYHPTPSPmayqASPSPSPVGYSPMTPGAPSPGGYNPHT 517
Cdd:PHA03307   312 PRASSSSSSSRESS-SSSTSSSS---ESSRGAA---VSPGPSPSRSPSP----SRPPPPADPSSPRKRPRPSRAPSSPAA 380

                   ...
gi 1034609085  518 PGS 520
Cdd:PHA03307   381 SAG 383
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
378-518 1.47e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 40.02  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 378 TPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQfsPYAAPSPQGSYQPSPS 457
Cdd:pfam15240  29 PSLISEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPP--QGGPRPPPGKPQGPPP 106
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034609085 458 PQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSPvGYSPMTPGAPSPGGYNPHTP 518
Cdd:pfam15240 107 QGGNQQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPPPP-GNPQGPPQRPPQPGNPQGPP 166
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
146-177 1.74e-03

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 36.21  E-value: 1.74e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034609085 146 VKDIVKVIDGPHSGREGEIRHLFRSFAFLHCK 177
Cdd:pfam00467   1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
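
An alignment block like the one above can be summarized by counting the columns in which the query row and the domain-model row carry the same residue. The sketch below is illustrative only: the function name `aligned_identity` is hypothetical, `-` is assumed to mark a gap, and the comparison is made case-insensitive as a simplifying assumption. The two strings are copied from the pfam00467 alignment above.

```python
def aligned_identity(query: str, subject: str) -> tuple:
    """Count identical residues over columns where neither row has a gap.

    Returns (identical_columns, compared_columns).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    compared = 0
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            continue  # gap column: not counted either way
        compared += 1
        if q == s:
            identical += 1
    return identical, compared


# Rows copied from the KOW (pfam00467) hit above.
query = "VKDIVKVIDGPHSGREGEIRHLFRSFAFLHCK"    # gi 1034609085, residues 146-177
subject = "KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE"  # Cdd:pfam00467, positions 1-32

identical, compared = aligned_identity(query, subject)
percent_identity = 100.0 * identical / compared
```

Percent identity computed this way is only a rough descriptive figure. It is not the bit score or E-value reported on the summary line, which come from the profile-based search itself.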
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
354-515 1.86e-03

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 41.02  E-value: 1.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 354 SQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAyggtpnpqtpgYPDPSSPQVNPQYNPQTPGTPA 433
Cdd:PHA03325  266 SSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLP-----------PPPVRRPRVKHPEAGKEEPDGA 334
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 434 MYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGyqnthSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGY 513
Cdd:PHA03325  335 RNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPGSSL-----AAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPP 409

                  ..
gi 1034609085 514 NP 515
Cdd:PHA03325  410 SI 411
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
341-510 2.11e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.29  E-value: 2.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 341 TPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEEYE--YAFDDEPTPSPQAYGGTPNPQTPGYPDPSS- 417
Cdd:pfam03154  75 SPLKSAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSdgRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDn 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 418 -----------------PQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQsyhqVAPSPAGYQNTHSPASY 480
Cdd:pfam03154 155 esdsdssaqqqilqtqpPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPA----TSQPPNQTQSTAAPHTL 230
                         170       180       190
                  ....*....|....*....|....*....|
gi 1034609085 481 HPTPSPMAYQASPSPSPvGYSPMTPGAPSP 510
Cdd:pfam03154 231 IQQTPTLHPQRLPSPHP-PLQPMTQPPPPS 259
PHA03369 PHA03369
capsid maturational protease; Provisional
364-461 2.35e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 41.14  E-value: 2.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 364 TPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVnpqynPQTPGTPAMYNTDQFSPY 443
Cdd:PHA03369  353 LTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQF-----CGDPGLVSPYNPQSPGTS 427
                          90
                  ....*....|....*...
gi 1034609085 444 AAPSPQGSYQPSPSPQSY 461
Cdd:PHA03369  428 YGPEPVGPVPPQPTNPYV 445
PRK10263 PRK10263
DNA translocase FtsK; Provisional
395-518 2.45e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 2.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  395 TPSPQAYGGTPNP--QTPGYPDPSSPQVNPQYNPQT---PGTPamyntdqfSPYAAPSPQGsYQPSP---SPQSYHQvAP 466
Cdd:PRK10263   327 TTATQSWAAPVEPvtQTPPVASVDVPPAQPTVAWQPvpgPQTG--------EPVIAPAPEG-YPQQSqyaQPAVQYN-EP 396
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034609085  467 SPAGYQNTHSPASYHPTPSPMAYQASPSP-SPVGYSPMTPGAPSPGGYNPHTP 518
Cdd:PRK10263   397 LQQPVQPQQPYYAPAAEQPAQQPYYAPAPeQPAQQPYYAPAPEQPVAGNAWQA 449
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
402-521 2.93e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 2.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 402 GGTPNPQTPGYPDPSSPQVnPQYNPQTPGTPAMyntdqfspyAAPSPQGSYQPSPSPQSYHQVAPSPAgyqnthSPASYH 481
Cdd:PRK07764  389 GGAGAPAAAAPSAAAAAPA-AAPAPAAAAPAAA---------AAPAPAAAPQPAPAPAPAPAPPSPAG------NAPAGG 452
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 1034609085 482 PTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSG 521
Cdd:PRK07764  453 APSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAA 492
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
346-518 3.12e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.92  E-value: 3.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  346 GSRTPHYGSQTPLHDGSRTPAQSGAWDPnnPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQvnPQYN 425
Cdd:PHA03307   773 ALLEPAEPQRGAGSSPPVRAEAAFRRPG--RLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAAR--PPPA 848
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  426 PQTPGTPAMyntDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASyHPTPSPMAyqaspsPSPVGYSPMTP 505
Cdd:PHA03307   849 RSSESSKSK---PAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAG-APAPRPRP------APRVKLGPMPP 918
                          170       180
                   ....*....|....*....|
gi 1034609085  506 GAPSP-GGY------NPHTP 518
Cdd:PHA03307   919 GGPDPrGGFrrvppgDLHTP 938
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
342-511 3.40e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.54  E-value: 3.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  342 PLQDGSRTPHYGSQT--PLHDGSRTPAQSGAWDPNNP-------NTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGY 412
Cdd:PHA03307   195 PSTPPAAASPRPPRRssPISASASSPAPAPGRSAADDagasssdSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  413 PDPSSPQVNPQYNPQTPGtpamyntdqfSPYAAPSPQ---GSYQPSPSPQSYHQVAPSPAGyqnTHSPASYHPTPSPMAY 489
Cdd:PHA03307   275 WNGPSSRPGPASSSSSPR----------ERSPSPSPSspgSGPAPSSPRASSSSSSSRESS---SSSTSSSSESSRGAAV 341
                          170       180
                   ....*....|....*....|..
gi 1034609085  490 QASPSPSPVGYSPMTPGAPSPG 511
Cdd:PHA03307   342 SPGPSPSRSPSPSRPPPPADPS 363
PHA03269 PHA03269
envelope glycoprotein C; Provisional
431-532 3.93e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 40.10  E-value: 3.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 431 TPAMYN---TDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTPGA 507
Cdd:PHA03269   28 IPELHTsaaTQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAAA 107
                          90       100
                  ....*....|....*....|....*
gi 1034609085 508 PSPggyNPHTPGSGIEQNSSDWVTT 532
Cdd:PHA03269  108 PKP---DAAEAFTSAAQAHEAPADA 129
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
398-519 3.95e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.08  E-value: 3.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 398 PQAYGGTPNP---QTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAG---- 470
Cdd:PRK14951  366 PAAAAEAAAPaekKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAApaav 445
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1034609085 471 YQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPG 519
Cdd:PRK14951  446 ALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
dnaA PRK14086
chromosomal replication initiator protein DnaA;
373-521 3.98e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 40.19  E-value: 3.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 373 PNNPNTPSRAEEeyeyafddeptPSPQAYGGTPNPQTPGYPDPSSPQVNPQYnPQTPGTPAMY--NTDQFSPYAAPSPQG 450
Cdd:PRK14086   96 APPPPHARRTSE-----------PELPRPGRRPYEGYGGPRADDRPPGLPRQ-DQLPTARPAYpaYQQRPEPGAWPRAAD 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 451 SYQPSPSPQSYhqvaPSPAGYQnthSPASYHPTPSPMAY----------QASPSPSPVGYSPMTPGA-------PSPGGY 513
Cdd:PRK14086  164 DYGWQQQRLGF----PPRAPYA---SPASYAPEQERDREpydagrpeydQRRRDYDHPRPDWDRPRRdrtdrpePPPGAG 236

                  ....*...
gi 1034609085 514 NPHTPGSG 521
Cdd:PRK14086  237 HVHRGGPG 244
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
366-510 3.99e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 3.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 366 AQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGT---PNPQTPGYPDPSSPQVNPQYNPQTpgTPAMYNTDQFSP 442
Cdd:pfam03154 176 AQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATsqpPNQTQSTAAPHTLIQQTPTLHPQR--LPSPHPPLQPMT 253
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 443 YAAPSPQGSYQPSPSPQSY------------------HQVAPSPAGYQNTHSPASYHPTPSPMAyqASPSPSPVGYSPMT 504
Cdd:pfam03154 254 QPPPPSQVSPQPLPQPSLHgqmppmphslqtgpshmqHPVPPQPFPLTPQSSQSQVPPGPSPAA--PGQSQQRIHTPPSQ 331

                  ....*.
gi 1034609085 505 PGAPSP 510
Cdd:pfam03154 332 SQLQSQ 337
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
323-484 4.17e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 40.03  E-value: 4.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 323 SQTPMYGSGSRTPMYGSQT----PLQDG-SRTPHYGSQTPLHDgSRTPAQSGAWDP--NNPNTPSRAEEEYEYAFDDEPT 395
Cdd:pfam05539 215 STEPVGTQGTTTSSNPEPQteppPSQRGpSGSPQHPPSTTSQD-QSTTGDGQEHTQrrKTPPATSNRRSPHSTATPPPTT 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 396 PSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPyaaPSPQ------GSYQPSpSPQSYHQVAPSPA 469
Cdd:pfam05539 294 KRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDP---PKPNsicygvGIYNEA-LPRGCDIVVPLCS 369
                         170
                  ....*....|....*
gi 1034609085 470 GYqNTHSPASYHPTP 484
Cdd:pfam05539 370 TY-TIMCMDTYYSKP 383
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
588-630 5.31e-03

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 35.27  E-value: 5.31e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 1034609085 588 NNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLK--ILNLRF 630
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFGIVTVKGATGSKgaELKVRF 45
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
392-518 5.69e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.75  E-value: 5.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 392 DEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQynPQTPGTPAMYNTDQFSPYAAPSPQ-GSYQPSPSPQSYH-QVAPSPA 469
Cdd:NF033839  370 EKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQ--PEKPKPEVKPQPEKPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPE 447
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1034609085 470 GYQNTHSPASYHPTPSPMAYQASPSPSpVGYSPMTP----GAPSPGGYNPHTP 518
Cdd:NF033839  448 KPKPEVKPQPETPKPEVKPQPEKPKPE-VKPQPEKPkpdnSKPQADDKKPSTP 499
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
394-511 6.21e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.58  E-value: 6.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 394 PTPSPQAYGGTPNPQTPgypDPSSPQVNPQYNPQTPGTPAmyntdqfSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQN 473
Cdd:PRK07764  404 AAPAAAPAPAAAAPAAA---AAPAPAAAPQPAPAPAPAPA-------PPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAA 473
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 1034609085 474 THSPASyhPTPSPMAYQASPSPSPVgysPMTPGAPSPG 511
Cdd:PRK07764  474 PEPTAA--PAPAPPAAPAPAAAPAA---PAAPAAPAGA 506
DUF1373 pfam07117
Protein of unknown function (DUF1373); This family consists of several hypothetical proteins ...
376-490 6.42e-03

Protein of unknown function (DUF1373); This family consists of several hypothetical proteins which seem to be specific to Oryzias latipes (Japanese ricefish). Members of this family are typically around 200 residues in length. The function of this family is unknown.


Pssm-ID: 462093 [Multi-domain]  Cd Length: 212  Bit Score: 38.62  E-value: 6.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 376 PNTPSRAEEEYEY----AFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGS 451
Cdd:pfam07117  42 PPRPEEEEGQGGGggtfPFPGSPEPEPGGGGSGPMPMSASAPEPEPAKAKPQRPAPAQGHGHGGGGDSDSSGSGSGHQGS 121
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1034609085 452 YQP---SPSPQSYHQVAPSPAGYQNTHSPasyHPTPSPMAYQ 490
Cdd:pfam07117 122 GGAgagAGAPGHQHEQEQESSSSDDDDED---EFEFTPEEDE 160
PHA03369 PHA03369
capsid maturational protease; Provisional
404-541 6.63e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 39.60  E-value: 6.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 404 TPNPQTPGYPDPSSPQVNPqynPQTPGTPAM---YNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSpAGYQNTHSPASY 480
Cdd:PHA03369  353 LTAPSRVLAAAAKVAVIAA---PQTHTGPADrqrPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLV-SPYNPQSPGTSY 428
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034609085 481 HPTPSPMAYQASPSPS--PVGYSPMT-PGAPSPGGYnpHTPGS-GIEQNSSDWVTTDIQVKVRDT 541
Cdd:PHA03369  429 GPEPVGPVPPQPTNPYvmPISMANMVyPGHPQEHGH--ERKRKrGGELKEELIETLKLVKKLKEE 491
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
317-510 6.66e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.77  E-value: 6.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  317 RTPMYGSQTPMYGSGSRTPMYGSQTPLQDGSRTPhygsQTPLHDGSRTPAQSGAwDPNNPNTPSRAEeeyeyafdDEPTP 396
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAP----ASPAREGSPTPPGPSS-PDPPPPTPPPAS--------PPPSP 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085  397 SPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTdqfSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGyqnths 476
Cdd:PHA03307   131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQ---AALPLSSPEETARAPSSPPAEPPPSTPPAA------ 201
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1034609085  477 PASYHPTPSPMAyqASPSPSPVGYSPMTPGAPSP 510
Cdd:PHA03307   202 ASPRPPRRSSPI--SASASSPAPAPGRSAADDAG 233
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
331-469 9.76e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.20  E-value: 9.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034609085 331 GSRTPMYGSQTPLQDGSRTPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEEYEyafddeptPSPQAYGGTPNPQTP 410
Cdd:PRK07764  674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA--------PSPAADDPVPLPPEP 745
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034609085 411 GYPDPSSPQVNPQYNPQTPGTPAmyntdqfspyAAPSPQGSYQPSPSPQSYHQVAPSPA 469
Cdd:PRK07764  746 DDPPDPAGAPAQPPPPPAPAPAA----------APAAAPPPSPPSEEEEMAEDDAPSMD 794
KOW smart00739
KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
256-276 9.87e-03

KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 33.84  E-value: 9.87e-03
                           10        20
                   ....*....|....*....|.
gi 1034609085  256 IGQTVRISQGPYKGYIGVVKD 276
Cdd:smart00739   4 VGDTVRVIAGPFKGKVGKVLE 24
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
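
Each hit in the list above carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`. The sketch below shows one way to read such lines and compare them with the E-value threshold stated here. It is a hypothetical helper, not an NCBI tool: the regular expression is inferred from the lines as they appear on this page, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption, not a spec).
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"  # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_summary(line: str):
    """Parse one hit summary line into a dict, or return None if it does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    flag = m["flag"]
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": flag is not None and flag.lower() == "multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "evalue": float(m["evalue"]),
    }


E_VALUE_THRESHOLD = 0.01  # value stated in the preset options

# Two summary lines copied from the hit list on this page.
lines = [
    "Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 36.21  E-value: 1.74e-03",
    "Pssm-ID: 128978  Cd Length: 28  Bit Score: 33.84  E-value: 9.87e-03",
]
hits = [parse_summary(line) for line in lines]
reported = [h for h in hits if h["evalue"] <= E_VALUE_THRESHOLD]
```

Both example hits fall under the 0.01 threshold, consistent with their appearing in the list. The last entry on the page (9.87e-03) sits just below the cutoff.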

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.