Conserved domains on  [gi|1677538249|ref|NP_001165439|]

cadherin-related family member 5 isoform 4 precursor [Homo sapiens]

Protein Classification

cadherin repeat domain-containing protein (domain architecture ID 10182011)

cadherin repeat domain-containing protein similar to Homo sapiens desmoglein-2, which is involved in the interaction of plaque proteins and intermediate filaments mediating cell-cell adhesion; cadherins are calcium-dependent cell adhesion proteins that preferentially interact with themselves in connecting cells

CATH:  2.60.40.60
Gene Ontology:  GO:0007156|GO:0005509
SCOP:  4007535


List of domain hits

Name Accession Description Interval E-value
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
274-343 5.65e-10

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherins, desmocollin, and desmoglein, spanning a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat-containing proteins exist as monomers, homodimers, or heterodimers.



Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 56.94  E-value: 5.65e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1677538249 274 IYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSVP--SPMTFLLLVKGQ-QADLARYSVTQVTVE 343
Cdd:cd11304    19 VSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDreEQSSYTLTVTATdGGGPPLSSTATVTIT 91
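The two alignment rows above can be scored with a few lines of Python. This is a minimal sketch and not part of the CD-Search output: the function name `aligned_identity` is hypothetical, and it assumes that uppercase letters mark aligned residues while lowercase letters and `-` mark unaligned residues and gaps, as the rows on this page appear to show.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column counts as aligned only when both rows show an
    uppercase residue; lowercase letters and '-' are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if not (q.isupper() and s.isupper()):
            continue  # gap or unaligned (lowercase) position
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the cd11304 hit at query residues 274-343.
query_row = "IYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSVP--SPMTFLLLVKGQ-QADLARYSVTQVTVE"
model_row = "VSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDreEQSSYTLTVTATdGGGPPLSSTATVTIT"

identical, aligned = aligned_identity(query_row, model_row)
print(f"{identical} identical residues over {aligned} aligned columns")
```

Percent identity computed this way is only a rough descriptive number; the Bit Score and E-value reported by CD-Search come from the position-specific scoring matrix, not from identity counts.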
PHA03247 super family cl33720
large tegument protein UL36; Provisional
454-645 2.26e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 2.26e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENstshqPATPGGDT 533
Cdd:PHA03247  2681 QRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG-----PATPGGPA 2755
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  534 AQTPKPGTSQPMPPG--VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPA-TPGGGTAQTPEAG-----TSQ 605
Cdd:PHA03247  2756 RPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApAAALPPAASPAGPlppptSAQ 2835
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1677538249  606 PMPPGMGTSTSHQPTTPGGGTA------QTPEPGTSQPMPLSKSTP 645
Cdd:PHA03247  2836 PTAPPPPPGPPPPSLPLGGSVApggdvrRRPPSRSPAAKPAAPARP 2881
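Each hit on this page carries a summary line with the Pssm-ID, Cd Length, Bit Score, and E-value. The following is a minimal sketch for pulling those fields out of the text as displayed here; the regular expression and the names `parse_summary` and `model_coverage` are assumptions made for illustration and are not an NCBI API.

```python
import re

# Pattern modelled on the summary lines shown on this page (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[[^\]]*\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Return the numeric fields of a summary line, or None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned region (1-based, inclusive)."""
    return (model_end - model_start + 1) / cd_length


line = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 2.26e-09"
hit = parse_summary(line)
# Model positions 2681-2881 are taken from the Cdd:PHA03247 rows above.
print(hit, round(model_coverage(2681, 2881, hit["cd_length"]), 3))
```

A low coverage value, as for a short match against a 3151-residue model, is one hint that a hit may reflect compositional similarity rather than a shared domain.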
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
37-120 4.34e-06

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherins, desmocollin, and desmoglein, spanning a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat-containing proteins exist as monomers, homodimers, or heterodimers.



Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 45.77  E-value: 4.34e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  37 FEVEENTNVTEPLVDIHV-----PEGQEVT--LGALSTPFAFRIQGN--QLFLNVTPDYEEKSLLEAQLLCQSGGT--LV 105
Cdd:cd11304     4 VSVPENAPPGTVVLTVSAtdpdsGENGEVTysIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTATDGGGppLS 83
                          90
                  ....*....|....*
gi 1677538249 106 TQLRVFVSVLDVNDN 120
Cdd:cd11304    84 STATVTITVLDVNDN 98
 
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
451-646 1.15e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 55.69  E-value: 1.15e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 451 VSEQEPPSTEAGGTTGPwTSTTSEVPRPPEpSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQP---- 526
Cdd:pfam05109 595 VGETSPQANTTNHTLGG-TSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPllts 672
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 527 ATPGG------------------DTAQTPKPGT-SQPMPPGvGTSTSHQPATPSGgTAQTPEPGTSQPMPPSmGTSTSHQ 587
Cdd:pfam05109 673 AHPTGgenitqvtpaststhhvsTSSPAPRPGTtSQASGPG-NSSTSTKPGEVNV-TKGTPPKNATSPQAPS-GQKTAVP 749
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 588 PATPGGGTAQTPEAGTSQPmppGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPS 646
Cdd:pfam05109 750 TVTSTGGKANSTTGGKHTT---GHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSS 805
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
276-314 2.85e-06

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 45.80  E-value: 2.85e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1677538249  276 AEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARS 314
Cdd:smart00112   2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKP 40
Cadherin pfam00028
Cadherin domain;
253-343 6.34e-06

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 45.37  E-value: 6.34e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 253 YHGAVPTGhILPSPLVLRpgpIYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSV--PSPMTFLLLVKGQQA 330
Cdd:pfam00028   1 YSASVPEN-APVGTEVLT---VTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEATDS 76
                          90
                  ....*....|....
gi 1677538249 331 DL-ARYSVTQVTVE 343
Cdd:pfam00028  77 GGpPLSSTATVTIT 90
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
71-122 3.23e-05

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 42.72  E-value: 3.23e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249   71 FRI--QGNQLFLNVTPDYEEKSLLEAQLLCQSGGT--LVTQLRVFVSVLDVNDNAP 122
Cdd:smart00112  26 FSIdpETGEITTTKPLDREEQPEYTLTVEATDGGGppLSSTATVTITVLDVNDNAP 81
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
419-617 7.36e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.28  E-value: 7.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 419 TTTLAQAGAFYAEVEAHNTVTSGTATTVieiqvseqepPSTEAGGTTGPWTSTTSEVPRPPEPSqgpsTTSSGGGTGPHP 498
Cdd:COG3469    38 TATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATSSTTSTTATATAAAAAATS----TSATLVATSTAS 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 499 PSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmpPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPP 578
Cdd:COG3469   104 GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGST----TTTTTVSGTETATGGTTTTSTTTTTTSASTTP 179
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 1677538249 579 SMGTSTShqpATPGGGTAQTPEAGTSQPMPPGMGTSTSH 617
Cdd:COG3469   180 SATTTAT---ATTASGATTPSATTTATTTGPPTPGLPKH 215
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
476-637 6.86e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.75  E-value: 6.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 476 PRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG-----GPPGAENSTSHQPATPGGDT---AQTPKPGTsQPMPP 547
Cdd:NF033839  292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpevkpQPEKPKPEVKPQLETPKPEVkpqPEKPKPEV-KPQPE 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 548 GVGTSTSHQPATPSGGTAQTPEPGTS--QPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQ--PMPPGMGTSTSHQPTTPg 623
Cdd:NF033839  371 KPKPEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQPEKP- 449
                         170
                  ....*....|....
gi 1677538249 624 gGTAQTPEPGTSQP 637
Cdd:NF033839  450 -KPEVKPQPETPKP 462
 
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
311-637 1.36e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 58.46  E-value: 1.36e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 311 VARSVPSPMTFLLLVKGQQADLARYSVTQVTVEAVAAAGSPPRFPQRLyrgTVARGAGAGVVVKDAAAPSQPLRIQAQDP 390
Cdd:PRK07764  438 APAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPP---AAPAPAAAPAAPAAPAAPAGADDAATLRE 514
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 391 EFSDLNSAITyritNHSHFRME-------------GEVVLTTTTLAQAGAFYAEVEAHNTVTSGTATTVIEIQVS-EQEP 456
Cdd:PRK07764  515 RWPEILAAVP----KRSRKTWAillpeatvlgvrgDTLVLGFSTGGLARRFASPGNAEVLVTALAEELGGDWQVEaVVGP 590
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 457 PSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQT 536
Cdd:PRK07764  591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 537 PKPGTSQPMPPGvGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPP------- 609
Cdd:PRK07764  671 AKAGGAAPAAPP-PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPpepddpp 749
                         330       340       350
                  ....*....|....*....|....*....|
gi 1677538249 610 --GMGTSTSHQPTTPGGGTAQTPEPGTSQP 637
Cdd:PRK07764  750 dpAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
PHA03247 PHA03247
large tegument protein UL36; Provisional
454-647 1.82e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 1.82e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPG--- 530
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarp 2690
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  531 -----GDTAQTPKPG-TSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPggGTAQTPeAGTS 604
Cdd:PHA03247  2691 tvgslTSLADPPPPPpTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP--ARPPTT-AGPP 2767
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1677538249  605 QPMPPGmGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSS 647
Cdd:PHA03247  2768 APAPPA-APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA 2809
PHA03247 PHA03247
large tegument protein UL36; Provisional
433-648 2.03e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 2.03e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  433 EAHNTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTP 512
Cdd:PHA03247  2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPA 2872
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  513 GGPPGAENSTSHQPATPggdtaQTPKPGTSQPMPPgVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPG 592
Cdd:PHA03247  2873 AKPAAPARPPVRRLARP-----AVSRSTESFALPP-DQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249  593 GGTAQTPEAGTSQPMPPGMGTSTSHQPTTpgggTAQTPEPGTSQPMPLSKSTPSSG 648
Cdd:PHA03247  2947 TDPAGAGEPSGAVPQPWLGALVPGRVAVP----RFRVPQPAPSREAPASSTPPLTG 2998
PHA03378 PHA03378
EBNA-3B; Provisional
467-653 1.23e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 52.38  E-value: 1.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 467 PWTSTTSEVP----RPPEPSQGPSTTSSGGGTGPHP--PSGTTLRPPTSSTPGGPPGAENS-----------TSHQPATP 529
Cdd:PHA03378  600 PHPSQTPEPPttqsHIPETSAPRQWPMPLRPIPMRPlrMQPITFNVLVFPTPHQPPQVEITpykptwtqighIPYQPSPT 679
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 530 GGDTAQTPK--PGTSQP-------MPPGVGTSTSHQPATPSGGTAQTPE--PGTSQP-------MPPSMGTSTSHQPATP 591
Cdd:PHA03378  680 GANTMLPIQwaPGTMQPppraptpMRPPAAPPGRAQRPAAATGRARPPAaaPGRARPpaaapgrARPPAAAPGRARPPAA 759
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249 592 GGGTAQTPEA--GTSQPMPPGMGTSTSHQptTPGGGTAQTPEP-GTSQPMPLSKSTPSSGGGPSE 653
Cdd:PHA03378  760 APGRARPPAAapGAPTPQPPPQAPPAPQQ--RPRGAPTPQPPPqAGPTSMQLMPRAAPGQQGPTK 822
PHA03378 PHA03378
EBNA-3B; Provisional
452-653 1.39e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 51.99  E-value: 1.39e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 452 SEQEPPSTEAGGTTGPWTSTtsevprPPEPSQGPSTTSSGGGTGPHPPsGTTLRPPTSSTPGGPPGA--------ENSTS 523
Cdd:PHA03378  650 TPHQPPQVEITPYKPTWTQI------GHIPYQPSPTGANTMLPIQWAP-GTMQPPPRAPTPMRPPAAppgraqrpAAATG 722
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 524 HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE--PGTSQPMPPSMGTSTSHQPATPGGGTAQTPEA 601
Cdd:PHA03378  723 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAaaPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQA 802
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 602 G-----TSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKS-----TPSSGGGPSE 653
Cdd:PHA03378  803 GptsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQaaagpTPSPGSGTSD 864
PHA03247 PHA03247
large tegument protein UL36; Provisional
455-724 2.55e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 2.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  455 EPPSTEAGGTTGPwTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTT----------LRPPTSSTPGGP-PGAENSTS 523
Cdd:PHA03247  2700 DPPPPPPTPEPAP-HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGpatpggparpARPPTTAGPPAPaPPAAPAAG 2778
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  524 HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPA-TPSGGTAQTPEPG-----TSQPMPPSMGTSTSHQPATPGGGTAq 597
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApAAALPPAASPAGPlppptSAQPTAPPPPPGPPPPSLPLGGSVA- 2857
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  598 tPEAGTSQPMPPGmgtSTSHQPTTPgggtAQTPEPGTSQPmPLSKSTPSSGGGPSEDKRFSVVDMAALGGVLGALLLLAL 677
Cdd:PHA03247  2858 -PGGDVRRRPPSR---SPAAKPAAP----ARPPVRRLARP-AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1677538249  678 LGLAVLVHKHYGPRLKCCCGKAPEPQPQGFDNQAFLPDHKANWAPVP 724
Cdd:PHA03247  2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
474-653 4.39e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 50.54  E-value: 4.39e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 474 EVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSt 553
Cdd:pfam03154 175 QAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMT- 253
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 554 shQPATPSGGTAQ-TPEPGTSQPMPPsmgtstshQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPgggtaQTPEP 632
Cdd:pfam03154 254 --QPPPPSQVSPQpLPQPSLHGQMPP--------MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGP-----SPAAP 318
                         170       180
                  ....*....|....*....|....
gi 1677538249 633 GTSQPM---PLSKSTPSSGGGPSE 653
Cdd:pfam03154 319 GQSQQRihtPPSQSQLQSQQPPRE 342
Cadherin pfam00028
Cadherin domain;
253-343 6.34e-06

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 45.37  E-value: 6.34e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 253 YHGAVPTGhILPSPLVLRpgpIYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSV--PSPMTFLLLVKGQQA 330
Cdd:pfam00028   1 YSASVPEN-APVGTEVLT---VTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEATDS 76
                          90
                  ....*....|....
gi 1677538249 331 DL-ARYSVTQVTVE 343
Cdd:pfam00028  77 GGpPLSSTATVTIT 90
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-651 6.80e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 6.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  456 PPSTEAGGTTGPwtSTTSEVPRPPEPSQGPSTTSSGGGTGPHP-PSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTA 534
Cdd:PHA03247  2569 PPPRPAPRPSEP--AVTSRARRPDAPPQSARPRAPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  535 QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS-HQPATPGGGTAQTPEAGTSQ-PMPPGMG 612
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSlADPPPPPPTPEPAPHALVSAtPLPPGPA 2726
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1677538249  613 TSTSHQPTTPGGGTAQTPEPGTSQPM-PLSKSTPSSGGGP 651
Cdd:PHA03247  2727 AARQASPALPAAPAPPAVPAGPATPGgPARPARPPTTAGP 2766
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
498-613 1.80e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 1.80e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPggDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMP 577
Cdd:PRK07764  398 APSAAAAAPAAAPAPAAAAPAAAAAPAPAAAP--QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPE 475
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1677538249 578 PSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGT 613
Cdd:PRK07764  476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
504-624 2.16e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 48.14  E-value: 2.16e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 504 LRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPM---PPGVGTSTSHQPATPSGGTAQTPEP--GTSQPMPP 578
Cdd:PRK14959  365 LMPVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQgtaPAAGMTPSSAAPATPAPSAAPSPRVpwDDAPPAPP 444
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1677538249 579 SMGTSTSHQPATPGGGTAQTPEAGTSQP--MPPGMGTSTSHQPTTPGG 624
Cdd:PRK14959  445 RSGIPPRPAPRMPEASPVPGAPDSVASAsdAPPTLGDPSDTAEHTPSG 492
PHA03247 PHA03247
large tegument protein UL36; Provisional
417-652 2.59e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 2.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  417 LTTTTLAQAGAFYAEVEAHNTVTSGTATTVieiQVSEQEPPSTEAggTTGPWTSTTSEVP-RPPEPSQGPSTTSSGGGTG 495
Cdd:PHA03247  2721 LPPGPAAARQASPALPAAPAPPAVPAGPAT---PGGPARPARPPT--TAGPPAPAPPAAPaAGPPRRLTRPAVASLSESR 2795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  496 PHPPSGTTLRPPTSSTPggPPGAENSTSHQPAT---PGGDTAQTPKPGTSQPMPPGVGTSTSHQPATP--SGGTAQTPEP 570
Cdd:PHA03247  2796 ESLPSPWDPADPPAAVL--APAAALPPAASPAGplpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAA 2873
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  571 GTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSH-QPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGG 649
Cdd:PHA03247  2874 KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953

                   ...
gi 1677538249  650 GPS 652
Cdd:PHA03247  2954 EPS 2956
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
71-122 3.23e-05

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 42.72  E-value: 3.23e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249   71 FRI--QGNQLFLNVTPDYEEKSLLEAQLLCQSGGT--LVTQLRVFVSVLDVNDNAP 122
Cdd:smart00112  26 FSIdpETGEITTTKPLDREEQPEYTLTVEATDGGGppLSSTATVTITVLDVNDNAP 81
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
431-661 3.61e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 3.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  431 EVEAHNTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPH---PPSGTTLRPP 507
Cdd:PHA03307    83 ESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAaspPAAGASPAAV 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  508 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQtPEPGTSQPMPPSMGTSTSHQ 587
Cdd:PHA03307   163 ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA-PAPGRSAADDAGASSSDSSS 241
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249  588 PATPGGGTAQTPEAGTSQPMPPGMGTST-SHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRFSVVD 661
Cdd:PHA03307   242 SESSGCGWGPENECPLPRPAPITLPTRIwEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASS 316
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
502-645 4.73e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 4.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 502 TTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPAtpsggtaQTPEPGTSQPMPPSMG 581
Cdd:PRK07764  376 ARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA-------PAPAPAPAPPSPAGNA 448
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 582 TSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTP 645
Cdd:PRK07764  449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL 512
PRK11901 PRK11901
hypothetical protein; Reviewed
508-652 4.82e-05

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 46.21  E-value: 4.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTsqpMPPGVGTSTSHQPATPSGGTAQTPEPGT-----SQ------PM 576
Cdd:PRK11901   87 LSSGNQSSPSAANNTSDGHDASGVKNTAPPQDIS---APPISPTPTQAAPPQTPNGQQRIELPGNisdalSQqqgqvnAA 163
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 577 PPSMGTSTSHQP---ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPgggtaqTPEPGTSQPmPLSKSTPSSGGGPS 652
Cdd:PRK11901  164 SQNAQGNTSTLPtapATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH------HKTATVAVP-PATSGKPKSGAASA 235
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
456-640 5.63e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 5.63e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 456 PPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGT---------GPHPPSGTTLRPPTSSTPGGPPGAENSTS--- 523
Cdd:pfam03154 188 PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAphtliqqtpTLHPQRLPSPHPPLQPMTQPPPPSQVSPQplp 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 524 ----HQPATPGGDTAQT-----PKPGTSQPMP------------------PGVGTSTSHQPatPSGGTAQTPEPGTSQPM 576
Cdd:pfam03154 268 qpslHGQMPPMPHSLQTgpshmQHPVPPQPFPltpqssqsqvppgpspaaPGQSQQRIHTP--PSQSQLQSQQPPREQPL 345
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677538249 577 PPSmGTSTSHQPATPGGGTAQTPEAGT-------SQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPL 640
Cdd:pfam03154 346 PPA-PLSMPHIKPPPTTPIPQLPNPQShkhpphlSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPL 415
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
435-654 5.86e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 5.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  435 HNTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPwtsTTSEVPRPPEP-SQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG 513
Cdd:PHA03307    39 SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGP---PPGPGTEAPANeSRSTPTWSLSTLAPASPAREGSPTPPGPSSPD 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  514 GPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGG 593
Cdd:PHA03307   116 PPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPP 195
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677538249  594 GTAqtPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 654
Cdd:PHA03307   196 STP--PAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENE 254
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
374-657 5.99e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 5.99e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 374 KDAAAPSQPLRIQAQDPEFSDL----------NSAITYRITNHShfrmEGEVVLT-------TTTLAQAGAFyaeveaHN 436
Cdd:PRK12323  296 KIALAQVVPAAVQDDWPEADDIrrlagrfdaqEVQLFYQIANLG----RSELALApdeyagfTMTLLRMLAF------RP 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 437 TVTSGTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPP 516
Cdd:PRK12323  366 GQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG 445
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 517 GAENSTSHQPATPggdtAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE-PGTSQPMPPSMGTSTSHQ--PATPGG 593
Cdd:PRK12323  446 GAPAPAPAPAAAP----AAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDdPPPWEELPPEFASPAPAQpdAAPAGW 521
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 594 GTAQTPEAGTSQPMPPGmgtstshqPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRF 657
Cdd:PRK12323  522 VAESIPDPATADPDDAF--------ETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMF 577
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
454-661 6.64e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 46.46  E-value: 6.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTT----LRPPTSSTPGGPPGAENSTSHQPATP 529
Cdd:PLN03209  326 QRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTayedLKPPTSPIPTPPSSSPASSKSVDAVA 405
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 530 GGDTAQT-PKPGTSQPMP---PGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPS-MGTSTSHQPATPG-GGTAQTPEAGT 603
Cdd:PLN03209  406 KPAEPDVvPSPGSASNVPevePAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTgVSPSVSSTSSVPAvPDTAPATAATD 485
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1677538249 604 SQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRFSVVD 661
Cdd:PLN03209  486 AAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALAD 543
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
419-617 7.36e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.28  E-value: 7.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 419 TTTLAQAGAFYAEVEAHNTVTSGTATTVieiqvseqepPSTEAGGTTGPWTSTTSEVPRPPEPSqgpsTTSSGGGTGPHP 498
Cdd:COG3469    38 TATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATSSTTSTTATATAAAAAATS----TSATLVATSTAS 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 499 PSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmpPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPP 578
Cdd:COG3469   104 GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGST----TTTTTVSGTETATGGTTTTSTTTTTTSASTTP 179
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 1677538249 579 SMGTSTShqpATPGGGTAQTPEAGTSQPMPPGMGTSTSH 617
Cdd:COG3469   180 SATTTAT---ATTASGATTPSATTTATTTGPPTPGLPKH 215
PRK13700 PRK13700
conjugal transfer protein TraD; Provisional
538-614 8.56e-05

conjugal transfer protein TraD; Provisional


Pssm-ID: 184256 [Multi-domain]  Cd Length: 732  Bit Score: 46.11  E-value: 8.56e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 538 KPGTSQPMPPGVGTSTSHQPATPSGGTAQTPePGTSQPMPPSMG-TSTSHQPATPGGGT----AQTPEAGTSQPMPPGMG 612
Cdd:PRK13700  604 EPDVPEVASGEDVTQAEQPQQPQQPQQPQQP-QQPQQPVSPVINdKKSDAGVNVPAGGIeqelKMKPEEEMEQQLPPGIS 682

                  ..
gi 1677538249 613 TS 614
Cdd:PRK13700  683 ES 684
PHA03255 PHA03255
BDLF3; Provisional
508-650 1.84e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 43.74  E-value: 1.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTP-----GGPPGAENSTSHQPATPGGDTAQTP-KPGTSQPMPPGVGTSTSHQPATpSGGTAQTPEPGTSQPMPPSMG 581
Cdd:PHA03255   25 TSSGSstasaGNVTGTTAVTTPSPSASGPSTNQSTtLTTTSAPITTTAILSTNTTTVT-STGTTVTPVPTTSNASTINVT 103
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677538249 582 TSTSHQ--PATPGGGTAQTPEAGTSQPMPPGMGTSTSH-------QPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGG 650
Cdd:PHA03255  104 TKVTAQniTATEAGTGTSTGVTSNVTTRSSSTTSATTRitnattlAPTLSSKGTSNATKTTAELPTVPDERQPSLSYG 181
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
444-622 1.84e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 44.92  E-value: 1.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 444 TTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPePSQGPSTTSSGGGTGPHPPSGTT----LRPPTSSTPGGPPGAE 519
Cdd:PLN03209  384 TSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSA-SNVPEVEPAQVEAKKTRPLSPYAryedLKPPTSPSPTAPTGVS 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 520 NSTSHQPATPG-GDTA--------QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPAT 590
Cdd:PLN03209  463 PSVSSTSSVPAvPDTApataatdaAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
                         170       180       190
                  ....*....|....*....|....*....|...
gi 1677538249 591 PGGGTAQ-TPEAGTSQPMPPGMGTSTSHQPTTP 622
Cdd:PLN03209  543 DEQHHAQpKPRPLSPYTMYEDLKPPTSPTPSPV 575
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
417-652 2.39e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.57  E-value: 2.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 417 LTTTTLAQAGAFYAEVEAHNTVTSGTATTVieiqvsEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTssgggtgp 496
Cdd:pfam17823  97 LSEPATREGAADGAASRALAAAASSSPSSA------AQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAI-------- 162
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 497 hppsgTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSH-QPATPSGGTA------QTPE 569
Cdd:pfam17823 163 -----AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAAtATGHPAAGTAlaavgnSSPA 237
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 570 PGT-----SQPMPPSMGTSTSH-QPATPGGGTAQT--PEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLS 641
Cdd:pfam17823 238 AGTvtaavGTVTPAALATLAAAaGTVASAAGTINMgdPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVH 317
                         250
                  ....*....|.
gi 1677538249 642 KSTPSSGGGPS 652
Cdd:pfam17823 318 NTAGEPTPSPS 328
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
452-656 2.63e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 2.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  452 SEQEPPSteaGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGG 531
Cdd:PHA03307    67 PPTGPPP---GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  532 DT---AQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQT-PEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPM 607
Cdd:PHA03307   144 PGpppAASPPAAGASPAAVASDAASSRQAALPLSSPEETaRAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPA 223
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1677538249  608 PPGM-----GTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKR 656
Cdd:PHA03307   224 PGRSaaddaGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNG 277
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
558-648 3.57e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 3.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 558 ATPSGGTAQTPEPGTSQPM---PPSMGTSTSHQP------ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQ 628
Cdd:PRK14971  369 ASGGRGPKQHIKPVFTQPAaapQPSAAAAASPSPsqssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
                          90       100
                  ....*....|....*....|
gi 1677538249 629 TPEPGTSQPMPLSKsTPSSG 648
Cdd:PRK14971  449 PAQFKEEKKIPVSK-VSSLG 467
PHA03377 PHA03377
EBNA-3C; Provisional
454-652 3.60e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 44.27  E-value: 3.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  454 QEPPSTEAGGTTGPWTSTTSEVPRP-PEPSQGPSTTSSGGGTGPHPPSGTtlRP------PTSSTPGGP------PGAEN 520
Cdd:PHA03377   663 QQEPSSRRQPATQSTPPRPSWLPSVfVLPSVDAGRAQPSEESHLSSMSPT--QPisheeqPRYEDPDDPldlslhPDQAP 740
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  521 STSHQPATPGGD---TAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE---PGTSQPMPPSMGTSTSHQPATPGGG 594
Cdd:PHA03377   741 PPSHQAPYSGHEepqAQQAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGyagPWGLRAQHPRYRHSWAYWSQYPGHG 820
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249  595 TAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPG------TSQPMPLSKSTPSSGGGPS 652
Cdd:PHA03377   821 HPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFPHLQSETGpprlqlSQVPQLPYSQTLVSSSAPS 884
PHA03264 PHA03264
envelope glycoprotein D; Provisional
501-614 3.80e-04

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 43.84  E-value: 3.80e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 501 GTTLRPPTSSTPGGPPGAENSTShqPATPGGDTAQT-PKPGTSQPMPPG---VGTSTSHQPATPSGGTAQTPEPGTSQPM 576
Cdd:PHA03264  252 GVVPPYFEESKGYEPPPAPSGGS--PAPPGDDRPEAkPEPGPVEDGAPGretGGEGEGPEPAGRDGAAGGEPKPGPPRPA 329
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1677538249 577 PPSMGTS-------TSHQPATPGggtaqTPEAGTSQPMPPGMGTS 614
Cdd:PHA03264  330 PDADRPEgwpsleaITFPPPTPA-----TPAVPRARPVIVGTGIA 369
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
504-654 4.26e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 4.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  504 LRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTP-KPGTSQPMPPGVGTSTSHQPATPSGGTAQtPEPGTSQPMPPSMGT 582
Cdd:PHA03307   774 LLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPaADAASRTASKRKSRSHTPDGGSESSGPAR-PPGAAARPPPARSSE 852
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677538249  583 STSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 654
Cdd:PHA03307   853 SSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPR 924
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
534-653 4.43e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.90  E-value: 4.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 534 AQTPKPGTSQPMPPGVG----TSTSHQPATPSGGTAQTPEPGTSQPM---PPSMGTSTSHQPATPGGGTAQTPEA--GTS 604
Cdd:PRK14959  360 AMLPRLMPVESLRPSGGgasaPSGSAAEGPASGGAATIPTPGTQGPQgtaPAAGMTPSSAAPATPAPSAAPSPRVpwDDA 439
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1677538249 605 QPMPPGMGTSTSHQPTTPGGgtaqTPEPGTSQPMPLSKSTPSSGGGPSE 653
Cdd:PRK14959  440 PPAPPRSGIPPRPAPRMPEA----SPVPGAPDSVASASDAPPTLGDPSD 484
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
497-652 4.64e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.80  E-value: 4.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 497 HPPSGTTLRPPTSSTpGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMP------PGVGTSTSHQPATPSGGTAQTPEP 570
Cdd:pfam17823  90 HTPHGTDLSEPATRE-GAADGAASRALAAAASSSPSSAAQSLPAAIAALPseafsaPRAAACRANASAAPRAAIAAASAP 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 571 GTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTS-QPMPPGMGTSTSHQPT-TPGGGTAqTPEPGTSQPMPLSKSTPSSG 648
Cdd:pfam17823 169 HAASPAPRTAASSTTAASSTTAASSAPTTAASSApATLTPARGISTAATATgHPAAGTA-LAAVGNSSPAAGTVTAAVGT 247

                  ....
gi 1677538249 649 GGPS 652
Cdd:pfam17823 248 VTPA 251
PHA03247 PHA03247
large tegument protein UL36; Provisional
497-652 5.11e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 5.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  497 HPPSGTTLRPPTSSTPggPPGAENSTS---HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTP-EPGT 572
Cdd:PHA03247   346 HYPLGFPKRRRPTWTP--PSSLEDLSAgrhHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPaPTPV 423
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  573 SQPMPPSmgTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGG----GTAQTPEPGTSQPMPLSKSTPSSG 648
Cdd:PHA03247   424 PASAPPP--PATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKAldalRERRPPEPPGADLAELLGRHPDTA 501

                   ....
gi 1677538249  649 GGPS 652
Cdd:PHA03247   502 GTVV 505
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
527-623 5.57e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.61  E-value: 5.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 527 ATPGGDTAQTPKPGTSQPM---PPGVGTSTSHQP------ATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQ 597
Cdd:PRK14971  369 ASGGRGPKQHIKPVFTQPAaapQPSAAAAASPSPsqssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
                          90       100
                  ....*....|....*....|....*.
gi 1677538249 598 TPEAGTSQPMPPgMGTSTSHQPTTPG 623
Cdd:PRK14971  449 PAQFKEEKKIPV-SKVSSLGPSTLRP 473
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
441-644 5.84e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 5.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 441 GTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVP-------RPPEPSQGPSTTSSGGGTGPHPPSgttlRPPTSSTPG 513
Cdd:PTZ00449  547 GKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPkdpkhpkDPEEPKKPKRPRSAQRPTRPKSPK----LPELLDIPK 622
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 514 GPPGAENSTS-------HQPATP----GGDTAQTPKPGTSqPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGT 582
Cdd:PTZ00449  623 SPKRPESPKSpkrppppQRPSSPerpeGPKIIKSPKPPKS-PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFES 701
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 583 STSHQPATPGGGTAQTPeagtsQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKST 644
Cdd:PTZ00449  702 ILKETLPETPGTPFTTP-----RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERT 758
PRK13700 PRK13700
conjugal transfer protein TraD; Provisional
508-583 6.99e-04

conjugal transfer protein TraD; Provisional


Pssm-ID: 184256 [Multi-domain]  Cd Length: 732  Bit Score: 43.41  E-value: 6.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVG-TSTSHQPATPSGGT----AQTPEPGTSQPMPPSMGT 582
Cdd:PRK13700  604 EPDVPEVASGEDVTQAEQPQQPQQPQQPQQPQQPQQPVSPVINdKKSDAGVNVPAGGIeqelKMKPEEEMEQQLPPGISE 683

                  .
gi 1677538249 583 S 583
Cdd:PRK13700  684 S 684
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
418-586 8.58e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 8.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 418 TTTTLAQAGAFYAEVEAHNTVTSGTATTVIeiQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPepsqgpsttssgGGTGPH 497
Cdd:COG3469    64 TAASSTAATSSTTSTTATATAAAAAATSTS--ATLVATSTASGANTGTSTVTTTSTGAGSVT------------STTSST 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTShqpATPSGGTAQTPEPGTSQPMP 577
Cdd:COG3469   130 AGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTAT---ATTASGATTPSATTTATTTG 206

                  ....*....
gi 1677538249 578 PSMGTSTSH 586
Cdd:COG3469   207 PPTPGLPKH 215
motB PRK12799
flagellar motor protein MotB; Reviewed
517-636 9.35e-04

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 42.40  E-value: 9.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 517 GAENSTSHQPATPGGDTAQTPKPGTSQPMPPG---VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGG 593
Cdd:PRK12799  294 DTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSpavIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVN 373
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1677538249 594 GTAQTPEAGTSQPMPPGMGTSTSHQPTT--PGGGTAQTPEPGTSQ 636
Cdd:PRK12799  374 MQPQPMSTTETQQSSTGNITSTANGPTTslPAAPASNIPVSPTSR 418
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
469-652 1.02e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 1.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 469 TSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTL----RPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQP 544
Cdd:pfam05109 413 TTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLpsstHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSP 492
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 545 MPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMP-----------PSMGTSTSHQPATPGGGTAQTPEAGTSQPMP----P 609
Cdd:pfam05109 493 SPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPavttptpnatsPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPnatiP 572
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 1677538249 610 GMG-TSTSHQPTTPgggTAQTPEPGTSQPMPLSKSTPSSGGGPS 652
Cdd:pfam05109 573 TLGkTSPTSAVTTP---TPNATSPTVGETSPQANTTNHTLGGTS 613
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
509-652 1.10e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.53  E-value: 1.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 509 SSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQP 588
Cdd:PRK07003  395 AVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADS 474
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 589 ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGT------------AQTPEPGTSQPMPLSKSTPSSGGGPS 652
Cdd:PRK07003  475 GSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARApaaasredapaaAAPPAPEARPPTPAAAAPAARAGGAA 550
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
459-652 1.54e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.94  E-value: 1.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 459 TEAGGTTGPWTSTTSEVPrppePSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPK 538
Cdd:COG5164    84 AQNQGGTRPAGNTGGTTP----AGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPP 159
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 539 PGTSQPMPPGVGTSTShqpATPSGGTAQTPEPGTSQPMPPSMGTSTSHQ--PATPGGGTAQTPEAGTSQPMPPGMGTSTS 616
Cdd:COG5164   160 GDGGSTTPPGPGGSTT---PPDDGGSTTPPNKGETGTDIPTGGTPRQGPdgPVKKDDKNGKGNPPDDRGGKTGPKDQRPK 236
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 1677538249 617 HQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPS 652
Cdd:COG5164   237 TNPIERRGPERPEAAALPAELTALEAENRAANPEPA 272
G_path_suppress pfam15991
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ...
498-651 1.61e-03

G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.


Pssm-ID: 464961 [Multi-domain]  Cd Length: 272  Bit Score: 41.44  E-value: 1.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTsHQPATP---------GGDTAQTPKPGTSQPMPPG----VGTSTSHQPATPSGGT 564
Cdd:pfam15991 114 PQLSMQGQPHHQQHPGPQVGVLKRT-RSPSPPvqqqayykqPAFSPGYAEHGQQKHDDGRrgydVARFGSWNKSTAQYPP 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 565 AQTPEPGTSQPMPPSmgtSTSHQPATpgGGTAQTPEAGTSQPMPPGMgtstsHQPTTPGGgtaqTPEPGTSQPMPLSKST 644
Cdd:pfam15991 193 SGQLFYPTHQYLPPP---QTQGQADA--RLQTIYPQPGYALPLQQQY-----EHANQPSP----FVSSSPLKQMQSPKAG 258

                  ....*..
gi 1677538249 645 PSSGGGP 651
Cdd:pfam15991 259 PGPQPMQ 265
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
498-623 1.82e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 1.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPgTSQPMP 577
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAP-AAAPAA 444
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1677538249 578 PSMGTSTSHQPA----TPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPG 623
Cdd:PRK14951  445 VALAPAPPAQAApetvAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
PHA03269 PHA03269
envelope glycoprotein C; Provisional
507-632 1.93e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.64  E-value: 1.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 507 PTSSTPGGPPGAENSTSHQPATPggDTAQTPKPGTSQPMPPGVgtsTSHQPATpsggtaQTPEPGTSqpmppsmgtSTSH 586
Cdd:PHA03269   46 PHQAASRAPDPAVAPTSAASRKP--DLAQAPTPAASEKFDPAP---APHQAAS------RAPDPAVA---------PQLA 105
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1677538249 587 QPATPGGGTAQTpEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEP 632
Cdd:PHA03269  106 AAPKPDAAEAFT-SAAQAHEAPADAGTSAASKKPDPAAHTQHSPPP 150
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
502-610 2.29e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.33  E-value: 2.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 502 TTLRPPTSSTPGGPpgaeNSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMG 581
Cdd:PRK14950  358 ALLVPVPAPQPAKP----TAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433
                          90       100       110
                  ....*....|....*....|....*....|
gi 1677538249 582 TStshqpatpgggtAQTPEAGTS-QPMPPG 610
Cdd:PRK14950  434 AA------------IPVDEKPKYtPPAPPK 451
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
456-610 2.40e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 2.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  456 PPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQ 535
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249  536 TPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPG 610
Cdd:PHA03307   363 SSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSG 437
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
449-652 2.43e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 2.43e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 449 IQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGtgphPPSGTTLRPPTSSTPGGP------PGAENST 522
Cdd:pfam03154 249 LQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPV----PPQPFPLTPQSSQSQVPPgpspaaPGQSQQR 324
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 523 SHQPatPGGDTAQTPKPGTSQPMPPGvGTSTSHQPATPSGGTAQTPEPGT-------SQPMPPSMGT------------- 582
Cdd:pfam03154 325 IHTP--PSQSQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIPQLPNPQShkhpphlSGPSPFQMNSnlppppalkplss 401
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 583 -STSHQPATPGGGTAQTPEAGTSQP---MPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPmPLSKSTPSSGGGPS 652
Cdd:pfam03154 402 lSTHHPPSAHPPPLQLMPQSQQLPPppaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQS-PFPQHPFVPGGPPP 474
PHA03269 PHA03269
envelope glycoprotein C; Provisional
445-570 2.52e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.25  E-value: 2.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 445 TVIEIQVSEQEPPSTEAGGTT-GPWTSTTSEVPRPPEPSQGpsttssgggtgphPPSGTTLRP-----PTSSTPGGPPGA 518
Cdd:PHA03269   33 TSAATQKPDPAPAPHQAASRApDPAVAPTSAASRKPDLAQA-------------PTPAASEKFdpapaPHQAASRAPDPA 99
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 519 ENSTSHQPATPGGDTAQTPKPgTSQPMPPGVGTSTSHQPATPSGGTAQTPEP 570
Cdd:PHA03269  100 VAPQLAAAPKPDAAEAFTSAA-QAHEAPADAGTSAASKKPDPAAHTQHSPPP 150
PHA03247 PHA03247
large tegument protein UL36; Provisional
442-621 2.70e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 2.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  442 TATTVIEIQVSEQEPPSTEAGGTT---GPWTSTTSEVPRPPEPSqgpsttssgggTGPHPPSGTTLRP----PTSSTPGG 514
Cdd:PHA03247  2833 SAQPTAPPPPPGPPPPSLPLGGSVapgGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPavsrSTESFALP 2901
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  515 PPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTShqPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATpggg 594
Cdd:PHA03247  2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP---- 2975
                          170       180
                   ....*....|....*....|....*..
gi 1677538249  595 TAQTPEAGTSQPMPPGMGTSTSHQPTT 621
Cdd:PHA03247  2976 RFRVPQPAPSREAPASSTPPLTGHSLS 3002
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
507-649 3.19e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 41.15  E-value: 3.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 507 PTSSTPGGPPGAE--------------NSTSHQPATPGGDTAQT------PKPGTSQPMPPGVGTSTSHQPATPSGGTAQ 566
Cdd:pfam09606 101 PMGPGPGGPMGQQmggpgtasnllaslGRPQMPMGGAGFPSQMSrvgrmqPGGQAGGMMQPSSGQPGSGTPNQMGPNGGP 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 567 ------TPEPGTSQPM----PPSMGTSTSHQPATPGGGTAQtpEAGTSQPMPPG------MGTSTSHQPTTPGGGTAQTp 630
Cdd:pfam09606 181 gqgqagGMNGGQQGPMggqmPPQMGVPGMPGPADAGAQMGQ--QAQANGGMNPQqmggapNQVAMQQQQPQQQGQQSQL- 257
                         170
                  ....*....|....*....
gi 1677538249 631 EPGTSQPMPLSKSTPSSGG 649
Cdd:pfam09606 258 GMGINQMQQMPQGVGGGAG 276
PRK12495 PRK12495
hypothetical protein; Provisional
507-628 3.69e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 39.85  E-value: 3.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 507 PTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPePGTSQPMPPSMGTSTSH 586
Cdd:PRK12495   62 PTCQQPVTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSAT-DEAATDPPATAAARDGP 140
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1677538249 587 QPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQ 628
Cdd:PRK12495  141 TPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQ 182
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
498-651 4.09e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.79  E-value: 4.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAEN-------------STSHQPAT-PGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGG 563
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamrAQAKKPAQqPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQ 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 564 TAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTS--HQPTTPGGGTAQTPEPGT----SQP 637
Cdd:pfam09770 249 QPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVGYPQNPQpgvqPAP 328
                         170
                  ....*....|....
gi 1677538249 638 MPLSKSTPSSGGGP 651
Cdd:pfam09770 329 AHQAHRQQGSFGRQ 342
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
454-631 4.44e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 40.42  E-value: 4.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSttsEVPRPPEPSQGPSTTssgggtgpHPPSG--TTLRPPTSSTPGGPPGAENSTSHQPATPgg 531
Cdd:pfam05539 166 KEPKTAVTTSKTTSWPT---EVSHPTYPSQVTPQS--------QPATQghQTATANQRLSSTEPVGTQGTTTSSNPEP-- 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 532 dtAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSmGTSTSHQPATPgGGTAQTPEAGTSQPMPPGM 611
Cdd:pfam05539 233 --QTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATS-NRRSPHSTATP-PPTTKRQETGRPTPRPTAT 308
                         170       180
                  ....*....|....*....|
gi 1677538249 612 GTSTSHQPTTPGGGTAQTPE 631
Cdd:pfam05539 309 TQSGSSPPHSSPPGVQANPT 328
motB PRK12799
flagellar motor protein MotB; Reviewed
517-647 6.10e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 40.08  E-value: 6.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 517 GAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTStshQPATpSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTA 596
Cdd:PRK12799  289 GLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIP---SPAV-IPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTV 364
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 597 QTPEAGTSQPMPPGMGTSTSHQPTTpGGGTAQTPEPGTSQP-MPLSKSTPSS 647
Cdd:PRK12799  365 ALPAAEPVNMQPQPMSTTETQQSST-GNITSTANGPTTSLPaAPASNIPVSP 415
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
470-606 6.27e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 40.28  E-value: 6.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 470 STTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSST-PGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmppG 548
Cdd:pfam05109 695 STSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATsPQAPSGQKTAVPTVTSTGGKANSTTGGKHTT-----G 769
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 549 VGTSTSHQPATPSGGTAQTPEP--GTSQPMPPSmgTSTSHQP----ATPGGGTAQT--PEAGTSQP 606
Cdd:pfam05109 770 HGARTSTEPTTDYGGDSTTPRTryNATTYLPPS--TSSKLRPrwtfTSPPVTTAQAtvPVPPTSQP 833
PRK10263 PRK10263
DNA translocase FtsK; Provisional
506-638 6.40e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.45  E-value: 6.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249  506 PPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS 585
Cdd:PRK10263   751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY 830
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249  586 HQPATPgggTAQTPEAGTSQPMPPGMGTSTS-HQPTTP-GGGTAQTPEPGTSQPM 638
Cdd:PRK10263   831 QQPQQP---VAPQPQDTLLHPLLMRNGDSRPlHKPTTPlPSLDLLTPPPSEVEPV 882
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
476-637 6.86e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.75  E-value: 6.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 476 PRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG-----GPPGAENSTSHQPATPGGDT---AQTPKPGTsQPMPP 547
Cdd:NF033839  292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpevkpQPEKPKPEVKPQLETPKPEVkpqPEKPKPEV-KPQPE 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 548 GVGTSTSHQPATPSGGTAQTPEPGTS--QPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQ--PMPPGMGTSTSHQPTTPg 623
Cdd:NF033839  371 KPKPEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQPEKP- 449
                         170
                  ....*....|....
gi 1677538249 624 gGTAQTPEPGTSQP 637
Cdd:NF033839  450 -KPEVKPQPETPKP 462
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
436-646 7.01e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 39.94  E-value: 7.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 436 NTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPW----------TSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLR 505
Cdd:pfam17823  45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSaahlnstevtAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 506 PPTSSTP---GGPPGAENST-------SHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTS-Q 574
Cdd:pfam17823 125 SAAQSLPaaiAALPSEAFSApraaacrANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSApA 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 575 PMPPSMGTSTSH-QPATPGGGTA-----------QTPEAGTSQPMPPGMGTSTSH-QPTTPGGGTAQTPEPGTSQPMPlS 641
Cdd:pfam17823 205 TLTPARGISTAAtATGHPAAGTAlaavgnsspaaGTVTAAVGTVTPAALATLAAAaGTVASAAGTINMGDPHARRLSP-A 283

                  ....*
gi 1677538249 642 KSTPS 646
Cdd:pfam17823 284 KHMPS 288
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
459-596 7.71e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.97  E-value: 7.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 459 TEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSsgggtgphPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPK 538
Cdd:PRK07764  386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA--------PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPP 457
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 539 PGTSQPMPPG-VGTSTSHQPATPSGGTAQTPEPGTSQPMPPsmgtstshQPATPGGGTA 596
Cdd:PRK07764  458 PAAAPSAQPApAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA--------APAAPAGADD 508
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
476-639 8.32e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 39.41  E-value: 8.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 476 PRPPEPSQGPSTTSSGGGTGPHPPsgtTLRPPTSSTPGGPPGAEnsTSHQPATPGGDTAQTPK-----PGTSQPMPPgvg 550
Cdd:pfam15279 128 PKPHEPPSLPPPPLPPKKGRRHRP---GLHPPLGRPPGSPPMSM--TPRGLLGKPQQHPPPSPlpafmEPSSMPPPF--- 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 551 TSTSHQPATPSGGTAQTPEPGT-SQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMgtstshQPTTPGGGTAQT 629
Cdd:pfam15279 200 LRPPPSIPQPNSPLSNPMLPGIgPPPKPPRNLGPPSNPMHRPPFSPHHPPPPPTPPGPPPGL------PPPPPRGFTPPF 273
                         170
                  ....*....|
gi 1677538249 630 PEPGTSQPMP 639
Cdd:pfam15279 274 GPPFPPVNMM 283
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
505-635 9.00e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 39.70  E-value: 9.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 505 RPPTSStpGGPPGAENSTSHQPATPGGDTA---QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMG 581
Cdd:PRK14951  365 KPAAAA--EAAAPAEKKTPARPEAAAPAAApvaQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 582 TSTSHQPATPgggtAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTS 635
Cdd:PRK14951  443 AAVALAPAPP----AQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
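
Each hit in the list above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters give an E-value threshold of 0.01. The following is a minimal sketch of how such a line could be parsed and checked against that threshold. The function names, the regular expression, and the choice of a "less than or equal" comparison are assumptions made for illustration; they are not part of any NCBI tool.

```python
import re

# Assumed layout of a per-hit summary line, taken from this report.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of a summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Hypothetical filter: keep hits whose E-value does not exceed the threshold."""
    return hit["e_value"] <= threshold


example = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 5.11e-04"
hit = parse_summary(example)
print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

A lower E-value indicates a hit less likely to arise by chance, so tightening `threshold` keeps only the stronger matches.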

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.