Conserved domains on  [gi|1695091487|ref|NP_068743|]

cadherin-related family member 5 isoform 1 precursor [Homo sapiens]

Protein Classification

cadherin repeat domain-containing protein (domain architecture ID 10182011)

cadherin repeat domain-containing protein similar to Homo sapiens desmoglein-2, which is involved in the interaction of plaque proteins and intermediate filaments mediating cell-cell adhesion; cadherins are calcium-dependent cell adhesion proteins that preferentially interact with themselves in connecting cells

CATH:  2.60.40.60
Gene Ontology:  GO:0007156|GO:0005509
SCOP:  4007535

List of domain hits

Name Accession Description Interval E-value
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
274-343 5.69e-10

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 56.94  E-value: 5.69e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1695091487 274 IYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSVP--SPMTFLLLVKGQ-QADLARYSVTQVTVE 343
Cdd:cd11304    19 VSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDreEQSSYTLTVTATdGGGPPLSSTATVTIT 91
PHA03247 super family cl33720
large tegument protein UL36; Provisional
452-651 8.90e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.03  E-value: 8.90e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  452 SEQEPPSTDVPPS----PEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENs 527
Cdd:PHA03247  2669 RLGRAAQASSPPQrprrRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG- 2747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  528 tshqPATPGGDTAQTPKPGTSQPMPPG--VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPA-TPGGGTAQT 604
Cdd:PHA03247  2748 ----PATPGGPARPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApAAALPPAAS 2823
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1695091487  605 PEAG-----TSQPMPPGMGTSTSHQPTTPGGGTA------QTPEPGTSQPMPLSKSTP 651
Cdd:PHA03247  2824 PAGPlppptSAQPTAPPPPPGPPPPSLPLGGSVApggdvrRRPPSRSPAAKPAAPARP 2881
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
37-120 4.37e-06

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 45.77  E-value: 4.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  37 FEVEENTNVTEPLVDIHV-----PEGQEVT--LGALSTPFAFRIQGN--QLFLNVTPDYEEKSLLEAQLLCQSGGT--LV 105
Cdd:cd11304     4 VSVPENAPPGTVVLTVSAtdpdsGENGEVTysIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTATDGGGppLS 83
                          90
                  ....*....|....*
gi 1695091487 106 TQLRVFVSVLDVNDN 120
Cdd:cd11304    84 STATVTITVLDVNDN 98
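
Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch, not part of the NCBI output, of how such a line could be turned into a record for sorting or filtering. The regular expression and the `parse_summary` name are assumptions based only on the line layout shown on this page; other CD-Search output formats may differ.

```python
import re

# Hypothetical pattern matching the summary line layout seen in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<flag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.]+(?:e[+-]?\d+)?)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Summary line copied from the cd11304 hit at residues 274-343.
example = "Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 56.94  E-value: 5.69e-10"
hit = parse_summary(example)
print(hit)
```

Records parsed this way can be sorted by `e_value` to reproduce the ordering used in the hit list.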
 
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
456-652 3.48e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 54.15  E-value: 3.48e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 456 PPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEpSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQP--- 532
Cdd:pfam05109 593 PTVGETSPQANTTNHTLGGTSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPllt 671
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 533 -ATPGG------------------DTAQTPKPGT-SQPMPPGvGTSTSHQPATPSGgTAQTPEPGTSQPMPPSmGTSTSH 592
Cdd:pfam05109 672 sAHPTGgenitqvtpaststhhvsTSSPAPRPGTtSQASGPG-NSSTSTKPGEVNV-TKGTPPKNATSPQAPS-GQKTAV 748
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 593 QPATPGGGTAQTPEAGTSQPmppGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPS 652
Cdd:pfam05109 749 PTVTSTGGKANSTTGGKHTT---GHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSS 805
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
276-314 3.26e-06

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 45.80  E-value: 3.26e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1695091487  276 AEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARS 314
Cdd:smart00112   2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKP 40
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
416-623 4.39e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 4.39e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 416 VLTTTTLAQAGAFYAEVEAHNTVTSGTATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTS 495
Cdd:COG3469    16 SATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSAT 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 496 SGGGTGPHPPSGTTLRPPTSSTPGGPPGaeNSTSHQPATPGGDTAQTPKPGTSqpmppGVGTSTSHQPATPSGGTAQTPE 575
Cdd:COG3469    96 LVATSTASGANTGTSTVTTTSTGAGSVT--STTSSTAGSTTTSGASATSSAGS-----TTTTTTVSGTETATGGTTTTST 168
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 1695091487 576 PGTSQPMPPSMGTSTSHQPATPGGGTAqTPEAGTSQPMPPGMGTSTSH 623
Cdd:COG3469   169 TTTTTSASTTPSATTTATATTASGATT-PSATTTATTTGPPTPGLPKH 215
Cadherin pfam00028
Cadherin domain;
253-343 6.77e-06

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 44.98  E-value: 6.77e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 253 YHGAVPTGhILPSPLVLRpgpIYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSV--PSPMTFLLLVKGQQA 330
Cdd:pfam00028   1 YSASVPEN-APVGTEVLT---VTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEATDS 76
                          90
                  ....*....|....
gi 1695091487 331 DL-ARYSVTQVTVE 343
Cdd:pfam00028  77 GGpPLSSTATVTIT 90
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
71-122 3.52e-05

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 42.72  E-value: 3.52e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1695091487   71 FRI--QGNQLFLNVTPDYEEKSLLEAQLLCQSGGT--LVTQLRVFVSVLDVNDNAP 122
Cdd:smart00112  26 FSIdpETGEITTTKPLDREEQPEYTLTVEATDGGGppLSSTATVTITVLDVNDNAP 81
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-643 9.92e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.37  E-value: 9.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 482 PRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG-----GPPGAENSTSHQPATPGGDT---AQTPKPGTsQPMPP 553
Cdd:NF033839  292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpevkpQPEKPKPEVKPQLETPKPEVkpqPEKPKPEV-KPQPE 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 554 GVGTSTSHQPATPSGGTAQTPEPGTS--QPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQ--PMPPGMGTSTSHQPTTPg 629
Cdd:NF033839  371 KPKPEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQPEKP- 449
                         170
                  ....*....|....
gi 1695091487 630 gGTAQTPEPGTSQP 643
Cdd:NF033839  450 -KPEVKPQPETPKP 462
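
The alignment blocks above pair a query row with a domain-model row, and the Cd Length field gives the full model length. As a rough illustration, the sketch below estimates how much of a model an alignment spans and what fraction of aligned columns are identical. The function names are hypothetical, and the handling of gaps and lower-case residues (gap columns skipped, case ignored) is a simplifying assumption rather than the scoring CD-Search itself uses.

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned interval (inclusive)."""
    return (model_end - model_start + 1) / cd_length


def identity_stats(query_row, model_row):
    """Count aligned (gap-free) columns and identical residues in one row pair."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # skip columns where either row has a gap
        aligned += 1
        if q.upper() == m.upper():
            identical += 1
    return aligned, identical


# Rows copied from the smart00112 hit at query residues 276-314 (model 2-40, Cd Length 81).
query = "AEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARS"
model = "ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKP"

aligned, identical = identity_stats(query, model)
coverage = model_coverage(2, 40, 81)
print(f"aligned columns: {aligned}, identical: {identical}, "
      f"model coverage: {coverage:.2f}")
```

A short, partial match such as this one covers under half of the model, which is one reason the longer cd11304 alignments over the same region are more informative.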
 
PHA03247 PHA03247
large tegument protein UL36; Provisional
451-653 6.79e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.95  E-value: 6.79e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  451 VSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSH 530
Cdd:PHA03247  2602 VDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ 2681
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  531 QPATPG--------GDTAQTPKPG-TSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPggGT 601
Cdd:PHA03247  2682 RPRRRAarptvgslTSLADPPPPPpTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP--AR 2759
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1695091487  602 AQTPeAGTSQPMPPGmGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSS 653
Cdd:PHA03247  2760 PPTT-AGPPAPAPPA-APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA 2809
PHA03247 PHA03247
large tegument protein UL36; Provisional
457-730 4.75e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 4.75e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  457 PSTDVPPSPEAGGTTGPWTSTTSEVPRPPEpsqgPSTTSSGGGTGPHPPsgTTLRPPTSSTPGGPPGAENSTSHQPATPG 536
Cdd:PHA03247  2717 SATPLPPGPAAARQASPALPAAPAPPAVPA----GPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  537 GDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQT-PEPGTSQPMPPSMGTSTSHQPATPGGGTAqtPEAGTSQPMPP 615
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPlPPPTSAQPTAPPPPPGPPPPSLPLGGSVA--PGGDVRRRPPS 2868
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  616 GmgtSTSHQPTTPgggtAQTPEPGTSQPmPLSKSTPSSGGGPSEDKRFSVVDMAALGGVLGALLLLALLGLAVLVHKHYG 695
Cdd:PHA03247  2869 R---SPAAKPAAP----ARPPVRRLARP-AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1695091487  696 PRLKCCCGKAPEPQPQGFDNQAFLPDHKANWAPVP 730
Cdd:PHA03247  2941 PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
462-643 2.65e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 2.65e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 462 PPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQ 541
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 542 TPKPGTSQPMPPGvGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPP------ 615
Cdd:PRK07764  670 PAKAGGAAPAAPP-PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPpepddp 748
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1695091487 616 ---GMGTSTSHQPTTPGGGTAQTPEPGTSQP 643
Cdd:PRK07764  749 pdpAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-657 3.71e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 3.71e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  456 PPSTD--VPPSPEAGGTTGPwtSTTSEVPRPPEPSQGPSTTSSGGGTGPHP-PSGTTLRPPTSSTPGGPPGAENSTSHQP 532
Cdd:PHA03247  2561 PAAPDrsVPPPRPAPRPSEP--AVTSRARRPDAPPQSARPRAPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSPAANEP 2638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  533 ATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS-HQPATPGGGTAQTPEAGTSQ 611
Cdd:PHA03247  2639 DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSlADPPPPPPTPEPAPHALVSA 2718
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1695091487  612 -PMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPM-PLSKSTPSSGGGP 657
Cdd:PHA03247  2719 tPLPPGPAAARQASPALPAAPAPPAVPAGPATPGgPARPARPPTTAGP 2766
PHA03378 PHA03378
EBNA-3B; Provisional
447-659 3.82e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.92  E-value: 3.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 447 IEIQVSEQEPPSTDVPPSPEAGGTTGPWT--STTSEVP----RPPEPSQGPSTTSSGGGTGPHP--PSGTTLRPPTSSTP 518
Cdd:PHA03378  572 LQIQPLTSPTTSQLASSAPSYAQTPWPVPhpSQTPEPPttqsHIPETSAPRQWPMPLRPIPMRPlrMQPITFNVLVFPTP 651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 519 GGPPGAENS-----------TSHQPATPGGDTAQTPK--PGTSQP-------MPPGVGTSTSHQPATPSGGTAQTPE--P 576
Cdd:PHA03378  652 HQPPQVEITpykptwtqighIPYQPSPTGANTMLPIQwaPGTMQPppraptpMRPPAAPPGRAQRPAAATGRARPPAaaP 731
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 577 GTSQP-------MPPSMGTSTSHQPATPGGGTAQTPEA--GTSQPMPPGMGTSTSHQptTPGGGTAQTPEP-GTSQPMPL 646
Cdd:PHA03378  732 GRARPpaaapgrARPPAAAPGRARPPAAAPGRARPPAAapGAPTPQPPPQAPPAPQQ--RPRGAPTPQPPPqAGPTSMQL 809
                         250
                  ....*....|...
gi 1695091487 647 SKSTPSSGGGPSE 659
Cdd:PHA03378  810 MPRAAPGQQGPTK 822
PHA03378 PHA03378
EBNA-3B; Provisional
457-659 4.09e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.92  E-value: 4.09e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 457 PSTDVPPSPEAGGTTGPWTSTtsevprPPEPSQGPSTTSSGGGTGPHPPsGTTLRPPTSSTPGGPPGA--------ENST 528
Cdd:PHA03378  649 PTPHQPPQVEITPYKPTWTQI------GHIPYQPSPTGANTMLPIQWAP-GTMQPPPRAPTPMRPPAAppgraqrpAAAT 721
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 529 SHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE--PGTSQPMPPSMGTSTSHQPATPGGGTAQTPE 606
Cdd:PHA03378  722 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAaaPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQ 801
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1695091487 607 AG-----TSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKS-----TPSSGGGPSE 659
Cdd:PHA03378  802 AGptsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQaaagpTPSPGSGTSD 864
PHA03247 PHA03247
large tegument protein UL36; Provisional
442-654 5.62e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 5.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  442 TATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPsqgPSTTSSGGGTGPHP-PSGTTLRPPTSSTPGG 520
Cdd:PHA03247  2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP---PPTSAQPTAPPPPPgPPPPSLPLGGSVAPGG 2860
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  521 PPGAENSTSHQPATPGGDT---------AQTPKPGTSQPMPPgVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS 591
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAAPArppvrrlarPAVSRSTESFALPP-DQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1695091487  592 HQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTpgggTAQTPEPGTSQPMPLSKSTPSSG 654
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP----RFRVPQPAPSREAPASSTPPLTG 2998
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
456-658 8.57e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.85  E-value: 8.57e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 456 PPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGT---------GPHPPSGTTLRPPTSSTPGGPPGAEN 526
Cdd:pfam03154 182 SPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAphtliqqtpTLHPQRLPSPHPPLQPMTQPPPPSQV 261
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 527 STS-------HQPATPGGDTAQTPKPGTSQPMPPgvgtstshQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPatPGG 599
Cdd:pfam03154 262 SPQplpqpslHGQMPPMPHSLQTGPSHMQHPVPP--------QPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTP--PSQ 331
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1695091487 600 GTAQTPEAGTSQPMPPGmGTSTSHQPTTPGGGTAQTPEPgTSQPMPLSKSTPSSGGGPS 658
Cdd:pfam03154 332 SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIPQLPNP-QSHKHPPHLSGPSPFQMNS 388
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-736 9.90e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 9.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  456 PPSTDVP------PSPEAGGTTGPWTSTTSEVPRP---PEPSQGPSTTSSGGGTGPHPPSGTTLR--------------- 511
Cdd:PHA03247  2618 PPDTHAPdppppsPSPAANEPDPHPPPTVPPPERPrddPAPGRVSRPRRARRLGRAAQASSPPQRprrraarptvgslts 2697
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  512 ----PPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPgTSQPMPPGVGTStshqPATPSGGTAQTPEPGTSQPMPPS-- 585
Cdd:PHA03247  2698 ladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL-PAAPAPPAVPAG----PATPGGPARPARPPTTAGPPAPApp 2772
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  586 MGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTS----HQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDK 661
Cdd:PHA03247  2773 AAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1695091487  662 RFSVVDMAALGGVLGALLLLALLGLAVLVHKHYGPRLKccCGKAPEPQPQGFDNQAFLPDHKANWAPVPSPTHDP 736
Cdd:PHA03247  2853 GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
PHA03247 PHA03247
large tegument protein UL36; Provisional
417-658 1.11e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 1.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  417 LTTTTLAQAGAFYAEVEAHNTVTSGTATTVIEIQVSEQEPPSTDVPPSPEAggTTGPWTSTTSEVPRPPEPSQGPSTTSS 496
Cdd:PHA03247  2721 LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP--PAAPAAGPPRRLTRPAVASLSESRESL 2798
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  497 GGGTGPHPPSGTTLRPPTSSTPGGPPGAenstshqPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATP--SGGTAQTP 574
Cdd:PHA03247  2799 PSPWDPADPPAAVLAPAAALPPAASPAG-------PLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSP 2871
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  575 EPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSH-QPTTPGGGTAQTPEPGTSQPMPLSKSTPSS 653
Cdd:PHA03247  2872 AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951

                   ....*
gi 1695091487  654 GGGPS 658
Cdd:PHA03247  2952 AGEPS 2956
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
276-314 3.26e-06

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 45.80  E-value: 3.26e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1695091487  276 AEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARS 314
Cdd:smart00112   2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKP 40
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
37-120 4.37e-06

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 45.77  E-value: 4.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  37 FEVEENTNVTEPLVDIHV-----PEGQEVT--LGALSTPFAFRIQGN--QLFLNVTPDYEEKSLLEAQLLCQSGGT--LV 105
Cdd:cd11304     4 VSVPENAPPGTVVLTVSAtdpdsGENGEVTysIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTATDGGGppLS 83
                          90
                  ....*....|....*
gi 1695091487 106 TQLRVFVSVLDVNDN 120
Cdd:cd11304    84 STATVTITVLDVNDN 98
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
416-623 4.39e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 4.39e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 416 VLTTTTLAQAGAFYAEVEAHNTVTSGTATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTS 495
Cdd:COG3469    16 SATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSAT 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 496 SGGGTGPHPPSGTTLRPPTSSTPGGPPGaeNSTSHQPATPGGDTAQTPKPGTSqpmppGVGTSTSHQPATPSGGTAQTPE 575
Cdd:COG3469    96 LVATSTASGANTGTSTVTTTSTGAGSVT--STTSSTAGSTTTSGASATSSAGS-----TTTTTTVSGTETATGGTTTTST 168
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 1695091487 576 PGTSQPMPPSMGTSTSHQPATPGGGTAqTPEAGTSQPMPPGMGTSTSH 623
Cdd:COG3469   169 TTTTTSASTTPSATTTATATTASGATT-PSATTTATTTGPPTPGLPKH 215
Cadherin pfam00028
Cadherin domain;
253-343 6.77e-06

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 44.98  E-value: 6.77e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 253 YHGAVPTGhILPSPLVLRpgpIYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSV--PSPMTFLLLVKGQQA 330
Cdd:pfam00028   1 YSASVPEN-APVGTEVLT---VTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEATDS 76
                          90
                  ....*....|....
gi 1695091487 331 DL-ARYSVTQVTVE 343
Cdd:pfam00028  77 GGpPLSSTATVTIT 90
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
480-659 7.07e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.77  E-value: 7.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 480 EVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSt 559
Cdd:pfam03154 175 QAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMT- 253
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 560 shQPATPSGGTAQ-TPEPGTSQPMPPsmgtstshQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPgggtaQTPEP 638
Cdd:pfam03154 254 --QPPPPSQVSPQpLPQPSLHGQMPP--------MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGP-----SPAAP 318
                         170       180
                  ....*....|....*....|....
gi 1695091487 639 GTSQPM---PLSKSTPSSGGGPSE 659
Cdd:pfam03154 319 GQSQQRihtPPSQSQLQSQQPPRE 342
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
437-636 1.27e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.14  E-value: 1.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 437 TVTSGTATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTST----TSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTlRP 512
Cdd:pfam05109 406 TRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTglpsSTHVPTNLTAPASTGPTVSTADVTSPTPAGTT-SG 484
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 513 PTSSTPGGPPGAENSTSHQP--------ATPGGDTAQTPKPGTSQPMP----PGVGTSTSHQPATPSGGTAQTPEPGTSQ 580
Cdd:pfam05109 485 ASPVTPSPSPRDNGTESKAPdmtsptsaVTTPTPNATSPTPAVTTPTPnatsPTLGKTSPTSAVTTPTPNATSPTPAVTT 564
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 581 PMP----PSMGTSTSHQPATPGGGTAQTPEAGTSQPmppgMGTSTSHQPttpgGGTAQTP 636
Cdd:pfam05109 565 PTPnatiPTLGKTSPTSAVTTPTPNATSPTVGETSP----QANTTNHTL----GGTSSTP 616
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
457-667 1.87e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 48.38  E-value: 1.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 457 PSTDVPPsPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTT----LRPPTSSTPGGPPGAENSTSHQP 532
Cdd:PLN03209  324 PSQRVPP-KESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTayedLKPPTSPIPTPPSSSPASSKSVD 402
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 533 ATPGGDTAQT-PKPGTSQPMP---PGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPS-MGTSTSHQPATPG-GGTAQTPE 606
Cdd:PLN03209  403 AVAKPAEPDVvPSPGSASNVPevePAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTgVSPSVSSTSSVPAvPDTAPATA 482
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1695091487 607 AGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRFSVVD 667
Cdd:PLN03209  483 ATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALAD 543
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
504-619 2.12e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 2.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 504 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPggDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMP 583
Cdd:PRK07764  398 APSAAAAAPAAAPAPAAAAPAAAAAPAPAAAP--QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPE 475
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1695091487 584 PSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGT 619
Cdd:PRK07764  476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
510-630 2.46e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 47.75  E-value: 2.46e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 510 LRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPM---PPGVGTSTSHQPATPSGGTAQTPEP--GTSQPMPP 584
Cdd:PRK14959  365 LMPVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQgtaPAAGMTPSSAAPATPAPSAAPSPRVpwDDAPPAPP 444
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1695091487 585 SMGTSTSHQPATPGGGTAQTPEAGTSQP--MPPGMGTSTSHQPTTPGG 630
Cdd:PRK14959  445 RSGIPPRPAPRMPEASPVPGAPDSVASAsdAPPTLGDPSDTAEHTPSG 492
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
453-663 2.66e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.95  E-value: 2.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 453 EQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPE-----PSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENS 527
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAapaaaPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 528 TSHQPATPggdtAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE-PGTSQPMPPSMGTSTSHQ--PATPGGGTAQT 604
Cdd:PRK12323  451 APAPAAAP----AAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDdPPPWEELPPEFASPAPAQpdAAPAGWVAESI 526
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1695091487 605 PEAGTSQPMPPGmgtstshqPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRF 663
Cdd:PRK12323  527 PDPATADPDDAF--------ETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMF 577
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
71-122 3.52e-05

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 42.72  E-value: 3.52e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1695091487   71 FRI--QGNQLFLNVTPDYEEKSLLEAQLLCQSGGT--LVTQLRVFVSVLDVNDNAP 122
Cdd:smart00112  26 FSIdpETGEITTTKPLDREEQPEYTLTVEATDGGGppLSSTATVTITVLDVNDNAP 81
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
508-651 5.91e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.90  E-value: 5.91e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 508 TTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPAtpsggtaQTPEPGTSQPMPPSMG 587
Cdd:PRK07764  376 ARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA-------PAPAPAPAPPSPAGNA 448
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1695091487 588 TSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTP 651
Cdd:PRK07764  449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL 512
PRK11901 PRK11901
hypothetical protein; Reviewed
514-658 6.28e-05

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 45.83  E-value: 6.28e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 514 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTsqpMPPGVGTSTSHQPATPSGGTAQTPEPGT-----SQ------PM 582
Cdd:PRK11901   87 LSSGNQSSPSAANNTSDGHDASGVKNTAPPQDIS---APPISPTPTQAAPPQTPNGQQRIELPGNisdalSQqqgqvnAA 163
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1695091487 583 PPSMGTSTSHQP---ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPgggtaqTPEPGTSQPmPLSKSTPSSGGGPS 658
Cdd:PRK11901  164 SQNAQGNTSTLPtapATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH------HKTATVAVP-PATSGKPKSGAASA 235
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
450-616 6.77e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.70  E-value: 6.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  450 QVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTS 529
Cdd:PHA03307   271 EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  530 HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGT 609
Cdd:PHA03307   351 PSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARY 430

                   ....*..
gi 1695091487  610 SQPMPPG 616
Cdd:PHA03307   431 PLLTPSG 437
PRK13700 PRK13700
conjugal transfer protein TraD; Provisional
544-620 9.25e-05

conjugal transfer protein TraD; Provisional


Pssm-ID: 184256 [Multi-domain]  Cd Length: 732  Bit Score: 46.11  E-value: 9.25e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 544 KPGTSQPMPPGVGTSTSHQPATPSGGTAQTPePGTSQPMPPSMG-TSTSHQPATPGGGT----AQTPEAGTSQPMPPGMG 618
Cdd:PRK13700  604 EPDVPEVASGEDVTQAEQPQQPQQPQQPQQP-QQPQQPVSPVINdKKSDAGVNVPAGGIeqelKMKPEEEMEQQLPPGIS 682

                  ..
gi 1695091487 619 TS 620
Cdd:PRK13700  683 ES 684
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-658 1.11e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 1.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  456 PPSTDVPPSPEAGGTTGP-----WTSTTSEVPRPPEpsqgpsttssgggtgpHPPSGTTLRPPTSSTPggPPGAENSTS- 529
Cdd:PHA03247   310 PAPPDPPPPAPAGDAEEEddedgAMEVVSPLPRPRQ----------------HYPLGFPKRRRPTWTP--PSSLEDLSAg 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  530 --HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTP-EPGTSQPMPPSmgTSTSHQPATPGGGTAQTPE 606
Cdd:PHA03247   372 rhHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPaPTPVPASAPPP--PATPLPSAEPGSDDGPAPP 449
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1695091487  607 AGTSQPMPPGMGTSTSHQPTTPGG----GTAQTPEPGTSQPMPLSKSTPSSGGGPS 658
Cdd:PHA03247   450 PERQPPAPATEPAPDDPDDATRKAldalRERRPPEPPGADLAELLGRHPDTAGTVV 505
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
448-667 1.36e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 1.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  448 EIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENS 527
Cdd:PHA03307    84 SRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  528 TSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEP-----GTSQPMPPSM-----GTSTSHQPATP 597
Cdd:PHA03307   164 SDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPisasaSSPAPAPGRSaaddaGASSSDSSSSE 243
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1695091487  598 GGGTAQTPEAGTSQPMPPGMGTST---SHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRFSVVD 667
Cdd:PHA03307   244 SSGCGWGPENECPLPRPAPITLPTriwEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASS 316
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
441-650 1.38e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.84  E-value: 1.38e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 441 GTATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEpSQGPSTTSSGGGTGPHPPSGTtlRPPTSSTPGG 520
Cdd:PTZ00449  547 GKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE-EPKKPKRPRSAQRPTRPKSPK--LPELLDIPKS 623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 521 PPGAENSTS-------HQPATP----GGDTAQTPKPGTSqPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTS 589
Cdd:PTZ00449  624 PKRPESPKSpkrppppQRPSSPerpeGPKIIKSPKPPKS-PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESI 702
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1695091487 590 TSHQPATPGGGTAQTPeagtsQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKST 650
Cdd:PTZ00449  703 LKETLPETPGTPFTTP-----RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERT 758
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
456-628 1.40e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 45.30  E-value: 1.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 456 PPSTDVPPSPEAGGTTGPWTSTTSEVPRPPePSQGPSTTSSGGGTGPHPPSGTT----LRPPTSSTPGGPPGAENSTSHQ 531
Cdd:PLN03209  390 PPSSSPASSKSVDAVAKPAEPDVVPSPGSA-SNVPEVEPAQVEAKKTRPLSPYAryedLKPPTSPSPTAPTGVSPSVSST 468
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 532 PATPG-GDTA--------QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTA 602
Cdd:PLN03209  469 SSVPAvPDTApataatdaAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHA 548
                         170       180
                  ....*....|....*....|....*..
gi 1695091487 603 Q-TPEAGTSQPMPPGMGTSTSHQPTTP 628
Cdd:PLN03209  549 QpKPRPLSPYTMYEDLKPPTSPTPSPV 575
PHA03255 PHA03255
BDLF3; Provisional
514-656 2.03e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 43.74  E-value: 2.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 514 TSSTP-----GGPPGAENSTSHQPATPGGDTAQTP-KPGTSQPMPPGVGTSTSHQPATpSGGTAQTPEPGTSQPMPPSMG 587
Cdd:PHA03255   25 TSSGSstasaGNVTGTTAVTTPSPSASGPSTNQSTtLTTTSAPITTTAILSTNTTTVT-STGTTVTPVPTTSNASTINVT 103
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1695091487 588 TSTSHQ--PATPGGGTAQTPEAGTSQPMPPGMGTSTSH-------QPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGG 656
Cdd:PHA03255  104 TKVTAQniTATEAGTGTSTGVTSNVTTRSSSTTSATTRitnattlAPTLSSKGTSNATKTTAELPTVPDERQPSLSYG 181
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
476-660 2.91e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 2.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  476 STTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTL----------RPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKP 545
Cdd:PHA03307    62 CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLApasparegspTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  546 GTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAqtPEAGTSQPMPPGMGTSTSHQP 625
Cdd:PHA03307   142 GSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP--PAAASPRPPRRSSPISASASS 219
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1695091487  626 TTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 660
Cdd:PHA03307   220 PAPAPGRSAADDAGASSSDSSSSESSGCGWGPENE 254
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
420-657 3.42e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 3.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 420 TTLAQAGAFYAEVEAHNTVTSGTATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTST------------TSEVPRPPEP 487
Cdd:pfam05109 413 TTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTadvtsptpagttSGASPVTPSP 492
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 488 SQGPSTTSSGGGTGPHPPSGTTLRPP--TSSTPGGPPGAENSTSHQPATPGGDTA-QTPKPGTSQPMpPGVGTSTSHQPA 564
Cdd:pfam05109 493 SPRDNGTESKAPDMTSPTSAVTTPTPnaTSPTPAVTTPTPNATSPTLGKTSPTSAvTTPTPNATSPT-PAVTTPTPNATI 571
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 565 TPSGGTAQTPEPGTSQP--MPPSMGTSTSHQPATPG--GGTAQTPEAgTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGT 640
Cdd:pfam05109 572 PTLGKTSPTSAVTTPTPnaTSPTVGETSPQANTTNHtlGGTSSTPVV-TSPPKNATSAVTTGQHNITSSSTSSMSLRPSS 650
                         250
                  ....*....|....*...
gi 1695091487 641 -SQPMPLSKSTPSSGGGP 657
Cdd:pfam05109 651 iSETLSPSTSDNSTSHMP 668
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
564-654 3.69e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 3.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 564 ATPSGGTAQTPEPGTSQPM---PPSMGTSTSHQP------ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQ 634
Cdd:PRK14971  369 ASGGRGPKQHIKPVFTQPAaapQPSAAAAASPSPsqssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
                          90       100
                  ....*....|....*....|
gi 1695091487 635 TPEPGTSQPMPLSKsTPSSG 654
Cdd:PRK14971  449 PAQFKEEKKIPVSK-VSSLG 467
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
440-658 4.44e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.80  E-value: 4.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 440 SGTATTVIEIQVSEQ-EPPSTDVPPSPeAGGTTGPWTSTTSEVPRPPEpsqgpsttssgggtgpHPPSGTTLRPPTSSTp 518
Cdd:pfam17823  43 SGDAVPRADNKSSEQ*NFCAATAAPAP-VTLTKGTSAAHLNSTEVTAE----------------HTPHGTDLSEPATRE- 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 519 GGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMP------PGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSH 592
Cdd:pfam17823 105 GAADGAASRALAAAASSSPSSAAQSLPAAIAALPseafsaPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTA 184
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1695091487 593 QPATPGGGTAQTPEAGTS-QPMPPGMGTSTSHQPT-TPGGGTAqTPEPGTSQPMPLSKSTPSSGGGPS 658
Cdd:pfam17823 185 ASSTTAASSAPTTAASSApATLTPARGISTAATATgHPAAGTA-LAAVGNSSPAAGTVTAAVGTVTPA 251
PHA03264 PHA03264
envelope glycoprotein D; Provisional
507-620 4.63e-04

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 43.46  E-value: 4.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 507 GTTLRPPTSSTPGGPPGAENSTShqPATPGGDTAQT-PKPGTSQPMPPG---VGTSTSHQPATPSGGTAQTPEPGTSQPM 582
Cdd:PHA03264  252 GVVPPYFEESKGYEPPPAPSGGS--PAPPGDDRPEAkPEPGPVEDGAPGretGGEGEGPEPAGRDGAAGGEPKPGPPRPA 329
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1695091487 583 PPSMGTS-------TSHQPATPGggtaqTPEAGTSQPMPPGMGTS 620
Cdd:PHA03264  330 PDADRPEgwpsleaITFPPPTPA-----TPAVPRARPVIVGTGIA 369
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
540-659 4.95e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.52  E-value: 4.95e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 540 AQTPKPGTSQPMPPGVG----TSTSHQPATPSGGTAQTPEPGTSQPM---PPSMGTSTSHQPATPGGGTAQTPEA--GTS 610
Cdd:PRK14959  360 AMLPRLMPVESLRPSGGgasaPSGSAAEGPASGGAATIPTPGTQGPQgtaPAAGMTPSSAAPATPAPSAAPSPRVpwDDA 439
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1695091487 611 QPMPPGMGTSTSHQPTTPGGgtaqTPEPGTSQPMPLSKSTPSSGGGPSE 659
Cdd:PRK14959  440 PPAPPRSGIPPRPAPRMPEA----SPVPGAPDSVASASDAPPTLGDPSD 484
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
510-660 5.14e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 5.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  510 LRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTP-KPGTSQPMPPGVGTSTSHQPATPSGGTAQtPEPGTSQPMPPSMGT 588
Cdd:PHA03307   774 LLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPaADAASRTASKRKSRSHTPDGGSESSGPAR-PPGAAARPPPARSSE 852
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1695091487  589 STSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 660
Cdd:PHA03307   853 SSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPR 924
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
533-629 5.77e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.61  E-value: 5.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 533 ATPGGDTAQTPKPGTSQPM---PPGVGTSTSHQP------ATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQ 603
Cdd:PRK14971  369 ASGGRGPKQHIKPVFTQPAaapQPSAAAAASPSPsqssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
                          90       100
                  ....*....|....*....|....*.
gi 1695091487 604 TPEAGTSQPMPPgMGTSTSHQPTTPG 629
Cdd:PRK14971  449 PAQFKEEKKIPV-SKVSSLGPSTLRP 473
PRK13700 PRK13700
conjugal transfer protein TraD; Provisional
514-589 7.35e-04

conjugal transfer protein TraD; Provisional


Pssm-ID: 184256 [Multi-domain]  Cd Length: 732  Bit Score: 43.03  E-value: 7.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 514 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVG-TSTSHQPATPSGGT----AQTPEPGTSQPMPPSMGT 588
Cdd:PRK13700  604 EPDVPEVASGEDVTQAEQPQQPQQPQQPQQPQQPQQPVSPVINdKKSDAGVNVPAGGIeqelKMKPEEEMEQQLPPGISE 683

                  .
gi 1695091487 589 S 589
Cdd:PRK13700  684 S 684
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
465-658 7.74e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.30  E-value: 7.74e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 465 PEAGGTTGPWTSTTSEVPRP-PEPSQGPSttssgggtgphPPSGTTLRPPTSSTPGGPPGAensTSHQPATPGGDTAQTP 543
Cdd:PRK07003  360 PAVTGGGAPGGGVPARVAGAvPAPGARAA-----------AAVGASAVPAVTAVTGAAGAA---LAPKAAAAAAATRAEA 425
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 544 KPgtSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSH 623
Cdd:PRK07003  426 PP--AAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAAT 503
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 1695091487 624 QPTTPGGGT------------AQTPEPGTSQPMPLSKSTPSSGGGPS 658
Cdd:PRK07003  504 PAAVPDARApaaasredapaaAAPPAPEARPPTPAAAAPAARAGGAA 550
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
440-658 9.17e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 42.71  E-value: 9.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 440 SGTATTVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGpsttssgggtgphPPSGTTLRPPTSSTPG 519
Cdd:COG5164    68 NQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATG-------------PPDDGGSTTPPSGGST 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 520 GPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTShqpATPSGGTAQTPEPGTSQPMPPSMGTSTSHQ--PATP 597
Cdd:COG5164   135 TPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTPPGPGGSTT---PPDDGGSTTPPNKGETGTDIPTGGTPRQGPdgPVKK 211
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1695091487 598 GGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPS 658
Cdd:COG5164   212 DDKNGKGNPPDDRGGKTGPKDQRPKTNPIERRGPERPEAAALPAELTALEAENRAANPEPA 272
motB PRK12799
flagellar motor protein MotB; Reviewed
523-642 1.07e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 42.40  E-value: 1.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 523 GAENSTSHQPATPGGDTAQTPKPGTSQPMPPG---VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGG 599
Cdd:PRK12799  294 DTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSpavIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVN 373
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1695091487 600 GTAQTPEAGTSQPMPPGMGTSTSHQPTT--PGGGTAQTPEPGTSQ 642
Cdd:PRK12799  374 MQPQPMSTTETQQSSTGNITSTANGPTTslPAAPASNIPVSPTSR 418
PHA03377 PHA03377
EBNA-3C; Provisional
452-658 1.57e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 42.35  E-value: 1.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  452 SEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPE--------PSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPggPPG 523
Cdd:PHA03377   668 SRRQPATQSTPPRPSWLPSVFVLPSVDAGRAQPSEeshlssmsPTQPISHEEQPRYEDPDDPLDLSLHPDQAPPP--SHQ 745
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  524 AENSTSHQPATPggdtaQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE---PGTSQPMPPSMGTSTSHQPATPGGG 600
Cdd:PHA03377   746 APYSGHEEPQAQ-----QAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGyagPWGLRAQHPRYRHSWAYWSQYPGHG 820
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1695091487  601 TAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPG------TSQPMPLSKSTPSSGGGPS 658
Cdd:PHA03377   821 HPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFPHLQSETGpprlqlSQVPQLPYSQTLVSSSAPS 884
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
415-658 1.57e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 1.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 415 VVLTTTTLA---QAGAFYAEVEAHNTVTSGTATTVIEIQVSEQEPPSTDVPPSPEAGGTTgpwTSTTSEVPRPPEPSQGP 491
Cdd:pfam17823  70 VTLTKGTSAahlNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQS---LPAAIAALPSEAFSAPR 146
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 492 STTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSH-QPATPSGGT 570
Cdd:pfam17823 147 AAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAAtATGHPAAGT 226
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 571 A------QTPEPGT-----SQPMPPSMGTSTSH-QPATPGGGTAQT--PEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTP 636
Cdd:pfam17823 227 AlaavgnSSPAAGTvtaavGTVTPAALATLAAAaGTVASAAGTINMgdPHARRLSPAKHMPSDTMARNPAAPMGAQAQGP 306
                         250       260
                  ....*....|....*....|..
gi 1695091487 637 EPGTSQPMPLSKSTPSSGGGPS 658
Cdd:pfam17823 307 IIQVSTDQPVHNTAGEPTPSPS 328
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
438-604 1.79e-03

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 41.65  E-value: 1.79e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 438 VTSGtATTVIEIQVSEQEPPST--DVPPSPEAGG------TTGPWTSTT----SEVPRPPEPSQGPSTTSSGggtgphpp 505
Cdd:PRK13335   16 LTTG-AITVTTQSVKAEKIQSTkvDKVPTLKAERlaminiTAGANSATTqaanTRQERTPKLEKAPNTNEEK-------- 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 506 SGTTLRPPTSStpggPPGAENSTSHQPATPGgdtaqtpkPGTSQPmppgvgtSTSHQPATPSggTAQTPEPGTSQPMPps 585
Cdd:PRK13335   87 TSASKIEKISQ----PKQEEQKSLNISATPA--------PKQEQS-------QTTTESTTPK--TKVTTPPSTNTPQP-- 143
                         170
                  ....*....|....*....
gi 1695091487 586 MGTSTSHQPATPGGGTAQT 604
Cdd:PRK13335  144 MQSTKSDTPQSPTIKQAQT 162
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
456-663 1.90e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 1.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 456 PPSTDVPPSPEAGG------TTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPP------SGTTLRPPtsstPGGPPG 523
Cdd:pfam03154 358 PPTTPIPQLPNPQShkhpphLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPplqlmpQSQQLPPP----PAQPPV 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 524 AENSTSHQPAtpggdTAQTPKPGTSQPMPPgvGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMgtSTSHQPATPGGGTAQ 603
Cdd:pfam03154 434 LTQSQSLPPP-----AASHPPTSGLHQVPS--QSPFPQHPFVPGGPPPITPPSGPPTSTSSAM--PGIQPPSSASVSSSG 504
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1695091487 604 TPEAGTSQPMPP----GMGTSTSHQPTTPgggtaqTPEPGTSQPMPLSKSTPSSGggpSEDKRF 663
Cdd:pfam03154 505 PVPAAVSCPLPPvqikEEALDEAEEPESP------PPPPRSPSPEPTVVNTPSHA---SQSARF 559
G_path_suppress pfam15991
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ...
504-657 1.96e-03

G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.


Pssm-ID: 464961 [Multi-domain]  Cd Length: 272  Bit Score: 41.06  E-value: 1.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 504 PPSGTTLRPPTSSTPGGPPGAENSTsHQPATP---------GGDTAQTPKPGTSQPMPPG----VGTSTSHQPATPSGGT 570
Cdd:pfam15991 114 PQLSMQGQPHHQQHPGPQVGVLKRT-RSPSPPvqqqayykqPAFSPGYAEHGQQKHDDGRrgydVARFGSWNKSTAQYPP 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 571 AQTPEPGTSQPMPPSmgtSTSHQPATpgGGTAQTPEAGTSQPMPPGMgtstsHQPTTPGGgtaqTPEPGTSQPMPLSKST 650
Cdd:pfam15991 193 SGQLFYPTHQYLPPP---QTQGQADA--RLQTIYPQPGYALPLQQQY-----EHANQPSP----FVSSSPLKQMQSPKAG 258

                  ....*..
gi 1695091487 651 PSSGGGP 657
Cdd:pfam15991 259 PGPQPMQ 265
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
504-629 1.98e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.62  E-value: 1.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 504 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPgTSQPMP 583
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAP-AAAPAA 444
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1695091487 584 PSMGTSTSHQPA----TPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPG 629
Cdd:PRK14951  445 VALAPAPPAQAApetvAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
PHA03269 PHA03269
envelope glycoprotein C; Provisional
513-638 2.35e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.64  E-value: 2.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 513 PTSSTPGGPPGAENSTSHQPATPggDTAQTPKPGTSQPMPPGVgtsTSHQPATpsggtaQTPEPGTSqpmppsmgtSTSH 592
Cdd:PHA03269   46 PHQAASRAPDPAVAPTSAASRKP--DLAQAPTPAASEKFDPAP---APHQAAS------RAPDPAVA---------PQLA 105
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1695091487 593 QPATPGGGTAQTpEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEP 638
Cdd:PHA03269  106 AAPKPDAAEAFT-SAAQAHEAPADAGTSAASKKPDPAAHTQHSPPP 150
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
508-616 2.50e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.33  E-value: 2.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 508 TTLRPPTSSTPGGPpgaeNSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMG 587
Cdd:PRK14950  358 ALLVPVPAPQPAKP----TAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433
                          90       100       110
                  ....*....|....*....|....*....|
gi 1695091487 588 TStshqpatpgggtAQTPEAGTS-QPMPPG 616
Cdd:PRK14950  434 AA------------IPVDEKPKYtPPAPPK 451
dnaA PRK14086
chromosomal replication initiator protein DnaA;
445-634 2.65e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.35  E-value: 2.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 445 TVIEIQVSEQEPPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPS---QGPSTTSSGGGTGPHPPSGTTLRPPTSSTP--- 518
Cdd:PRK14086   91 SAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPrqdQLPTARPAYPAYQQRPEPGAWPRAADDYGWqqq 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 519 --GGPPGAENSTSHQPATPGGDTAQTP---KPGTSQPMPPGVGTSTSHQPatPSGGTAQTPEPgtsqpmPPSMGTSTSHQ 593
Cdd:PRK14086  171 rlGFPPRAPYASPASYAPEQERDREPYdagRPEYDQRRRDYDHPRPDWDR--PRRDRTDRPEP------PPGAGHVHRGG 242
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 1695091487 594 PATPGGGTAQTPEAGTSQPMPPGMGTSTShqpTTPGGGTAQ 634
Cdd:PRK14086  243 PGPPERDDAPVVPIRPSAPGPLAAQPAPA---PGPGEPTAR 280
PHA03132 PHA03132
thymidine kinase; Provisional
425-652 2.82e-03

thymidine kinase; Provisional


Pssm-ID: 222997 [Multi-domain]  Cd Length: 580  Bit Score: 41.29  E-value: 2.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 425 AGAFYAEVEAHNTVTSGTATTVIEiqvsEQEppstDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHP 504
Cdd:PHA03132   26 DENFDAERDDFLTPLGSTSEATSE----DDD----DLYPPRETGSGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASREL 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 505 PSGTTLRPPTSSTP-GGPPGAENSTSHQPATPGGDTAQtpkPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMP 583
Cdd:PHA03132   98 PPGPTPVPPGGFRGaSSPRLGADSTSPRFLYQVNFPVI---LAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYP 174
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 584 PSMGTSTSHQPATPGGGT--AQTPEAGTSQPMPPG-----------MGTSTSHQPTTPgggTAQTPEP-GTSQPMPLSKS 649
Cdd:PHA03132  175 KGFSKHKTHKRSEFSGLTkkAARKRKGSFVFKPSQlkelsgslknlLHLDDSAETDPA---TRQVPVPvHVLYPPLLTEY 251

                  ...
gi 1695091487 650 TPS 652
Cdd:PHA03132  252 VPY 254
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
451-645 2.89e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 40.57  E-value: 2.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 451 VSEQEPPSTDVPP--SPEAGGTTGPWTSTTSE----VPRPPEPSQGPSTTSSGGGTGPHPPsgtTLRPPTSSTPGGPPGA 524
Cdd:pfam15279  91 ESVSPGPSSSASPssSPTSSNSSKPLISVASSskllAPKPHEPPSLPPPPLPPKKGRRHRP---GLHPPLGRPPGSPPMS 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 525 EnsTSHQPATPGGDTAQTPK-----PGTSQPMPPgvgTSTSHQPATPSGGTAQTPEPGT-SQPMPPSMGTSTSHQPATPG 598
Cdd:pfam15279 168 M--TPRGLLGKPQQHPPPSPlpafmEPSSMPPPF---LRPPPSIPQPNSPLSNPMLPGIgPPPKPPRNLGPPSNPMHRPP 242
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 1695091487 599 GGTAQTPEAGTSQPMPPGMgtstshQPTTPGGGTAQTPEPGTSQPMP 645
Cdd:pfam15279 243 FSPHHPPPPPTPPGPPPGL------PPPPPRGFTPPFGPPFPPVNMM 283
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
465-637 3.55e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 40.80  E-value: 3.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 465 PEAGGTTGPWTSTTSEVPRPPEPSQGPSTTssgggtgpHPPSG--TTLRPPTSSTPGGPPGAENSTSHQPATPggdtAQT 542
Cdd:pfam05539 168 PKTAVTTSKTTSWPTEVSHPTYPSQVTPQS--------QPATQghQTATANQRLSSTEPVGTQGTTTSSNPEP----QTE 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 543 PKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSmGTSTSHQPATPgGGTAQTPEAGTSQPMPPGMGTSTS 622
Cdd:pfam05539 236 PPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATS-NRRSPHSTATP-PPTTKRQETGRPTPRPTATTQSGS 313
                         170
                  ....*....|....*
gi 1695091487 623 HQPTTPGGGTAQTPE 637
Cdd:pfam05539 314 SPPHSSPPGVQANPT 328
PRK12495 PRK12495
hypothetical protein; Provisional
513-634 3.90e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 39.85  E-value: 3.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 513 PTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPePGTSQPMPPSMGTSTSH 592
Cdd:PRK12495   62 PTCQQPVTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSAT-DEAATDPPATAAARDGP 140
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1695091487 593 QPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQ 634
Cdd:PRK12495  141 TPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQ 182
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
513-655 3.99e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 40.76  E-value: 3.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 513 PTSSTPGGPPGAE--------------NSTSHQPATPGGDTAQT------PKPGTSQPMPPGVGTSTSHQPATPSGGTAQ 572
Cdd:pfam09606 101 PMGPGPGGPMGQQmggpgtasnllaslGRPQMPMGGAGFPSQMSrvgrmqPGGQAGGMMQPSSGQPGSGTPNQMGPNGGP 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 573 ------TPEPGTSQPM----PPSMGTSTSHQPATPGGGTAQtpEAGTSQPMPPG------MGTSTSHQPTTPGGGTAQTp 636
Cdd:pfam09606 181 gqgqagGMNGGQQGPMggqmPPQMGVPGMPGPADAGAQMGQ--QAQANGGMNPQqmggapNQVAMQQQQPQQQGQQSQL- 257
                         170
                  ....*....|....*....
gi 1695091487 637 EPGTSQPMPLSKSTPSSGG 655
Cdd:pfam09606 258 GMGINQMQQMPQGVGGGAG 276
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
504-657 4.53e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.79  E-value: 4.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 504 PPSGTTLRPPTSSTPGGPPGAEN-------------STSHQPAT-PGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGG 569
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamrAQAKKPAQqPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQ 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 570 TAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTS--HQPTTPGGGTAQTPEPGT----SQP 643
Cdd:pfam09770 249 QPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVGYPQNPQpgvqPAP 328
                         170
                  ....*....|....
gi 1695091487 644 MPLSKSTPSSGGGP 657
Cdd:pfam09770 329 AHQAHRQQGSFGRQ 342
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
465-602 5.49e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.35  E-value: 5.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 465 PEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSsgggtgphPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPK 544
Cdd:PRK07764  386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA--------PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPP 457
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1695091487 545 PGTSQPMPPG-VGTSTSHQPATPSGGTAQTPEPGTSQPMPPsmgtstshQPATPGGGTA 602
Cdd:PRK07764  458 PAAAPSAQPApAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA--------APAAPAGADD 508
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
504-662 6.16e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.54  E-value: 6.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  504 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATP---GGDTAQTPKPGTSQPMPPGvgtstSHQPATPSGGTAQTPEPGTSQ 580
Cdd:PHA03307    71 PPPGPGTEAPANESRSTPTWSLSTLAPASPARegsPTPPGPSSPDPPPPTPPPA-----SPPPSPAPDLSEMLRPVGSPG 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  581 PMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 660
Cdd:PHA03307   146 PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPG 225

                   ..
gi 1695091487  661 KR 662
Cdd:PHA03307   226 RS 227
motB PRK12799
flagellar motor protein MotB; Reviewed
523-653 6.94e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 39.70  E-value: 6.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 523 GAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTStshQPATpSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTA 602
Cdd:PRK12799  289 GLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIP---SPAV-IPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTV 364
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1695091487 603 QTPEAGTSQPMPPGMGTSTSHQPTTpGGGTAQTPEPGTSQP-MPLSKSTPSS 653
Cdd:PRK12799  365 ALPAAEPVNMQPQPMSTTETQQSST-GNITSTANGPTTSLPaAPASNIPVSP 415
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
476-612 7.19e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 39.90  E-value: 7.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 476 STTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSST-PGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmppG 554
Cdd:pfam05109 695 STSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATsPQAPSGQKTAVPTVTSTGGKANSTTGGKHTT-----G 769
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1695091487 555 VGTSTSHQPATPSGGTAQTPEP--GTSQPMPPSmgTSTSHQP----ATPGGGTAQT--PEAGTSQP 612
Cdd:pfam05109 770 HGARTSTEPTTDYGGDSTTPRTryNATTYLPPS--TSSKLRPrwtfTSPPVTTAQAtvPVPPTSQP 833
PHA03378 PHA03378
EBNA-3B; Provisional
450-712 7.22e-03

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.05  E-value: 7.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 450 QVSEQEPPSTDVPPSPEA--GGTTGPWTSTT-----SEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPP 522
Cdd:PHA03378  514 EDMEQRVMATLLPPSPPQprAGRRAPCVYTEdldieSDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYA 593
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 523 GAENSTSHQPATPGGDTAQTPKPGTSQP----MPPG----------------VGTSTSHQP----ATPSGGTAQTPEPGT 578
Cdd:PHA03378  594 QTPWPVPHPSQTPEPPTTQSHIPETSAPrqwpMPLRpipmrplrmqpitfnvLVFPTPHQPpqveITPYKPTWTQIGHIP 673
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 579 SQPMPPSMGTSTSHQPATpggGTAQTPEAGTSqPMPPGMGTSTSHQPttPGGGTAQTPEP-GTSQPMPLSKSTPSSGGGP 657
Cdd:PHA03378  674 YQPSPTGANTMLPIQWAP---GTMQPPPRAPT-PMRPPAAPPGRAQR--PAAATGRARPPaAAPGRARPPAAAPGRARPP 747
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1695091487 658 SEDKRFSVVDMAALGGVLGALLLLALLGLAVLVHKHYGPRLKCCCGKAPEPQPQG 712
Cdd:PHA03378  748 AAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQA 802
PRK10263 PRK10263
DNA translocase FtsK; Provisional
512-644 7.72e-03

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.07  E-value: 7.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  512 PPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS 591
Cdd:PRK10263   751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY 830
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1695091487  592 HQPATPgggTAQTPEAGTSQPMPPGMGTSTS-HQPTTP-GGGTAQTPEPGTSQPM 644
Cdd:PRK10263   831 QQPQQP---VAPQPQDTLLHPLLMRNGDSRPlHKPTTPlPSLDLLTPPPSEVEPV 882
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
456-666 8.08e-03

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.15  E-value: 8.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  456 PPSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTShQPATP 535
Cdd:PHA03307   194 PPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPT-RIWEA 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487  536 GGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEP-----GTSQPMPPSMGTSTSHQPATPGGGTAQTP-EAGT 609
Cdd:PHA03307   273 SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRassssSSSRESSSSSTSSSSESSRGAAVSPGPSPsRSPS 352
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1695091487  610 SQPMPP---GMGTSTSHQPTTPGGGTAQT---PEPGTSQPMPLSKSTPSSGGGPSEDKRFSVV 666
Cdd:PHA03307   353 PSRPPPpadPSSPRKRPRPSRAPSSPAASagrPTRRRARAAVAGRARRRDATGRFPAGRPRPS 415
PHA03269 PHA03269
envelope glycoprotein C; Provisional
457-576 8.79e-03

Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 39.71  E-value: 8.79e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 457 PSTDVPPSPEAGGTTGPWTSTTSEVPRPPEPSQGpsttssgggtgphPPSGTTLRP-----PTSSTPGGPPGAENSTSHQ 531
Cdd:PHA03269   40 PDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQA-------------PTPAASEKFdpapaPHQAASRAPDPAVAPQLAA 106
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1695091487 532 PATPGGDTAQTPKPgTSQPMPPGVGTSTSHQPATPSGGTAQTPEP 576
Cdd:PHA03269  107 APKPDAAEAFTSAA-QAHEAPADAGTSAASKKPDPAAHTQHSPPP 150
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
511-641 9.23e-03

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 39.70  E-value: 9.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 511 RPPTSStpGGPPGAENSTSHQPATPGGDTA---QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMG 587
Cdd:PRK14951  365 KPAAAA--EAAAPAEKKTPARPEAAAPAAApvaQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1695091487 588 TSTSHQPATPgggtAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTS 641
Cdd:PRK14951  443 AAVALAPAPP----AQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-643 9.92e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.37  E-value: 9.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 482 PRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG-----GPPGAENSTSHQPATPGGDT---AQTPKPGTsQPMPP 553
Cdd:NF033839  292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpevkpQPEKPKPEVKPQLETPKPEVkpqPEKPKPEV-KPQPE 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1695091487 554 GVGTSTSHQPATPSGGTAQTPEPGTS--QPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQ--PMPPGMGTSTSHQPTTPg 629
Cdd:NF033839  371 KPKPEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQPEKP- 449
                         170
                  ....*....|....
gi 1695091487 630 gGTAQTPEPGTSQP 643
Cdd:NF033839  450 -KPEVKPQPETPKP 462
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
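
The per-hit summary lines in this listing (the ones beginning with "Pssm-ID:") can be read back and compared against the E-value threshold given in the search parameters. The following is a minimal sketch, not an NCBI tool: the function names `parse_summary` and `passes_threshold` are hypothetical, the regular expression assumes the summary line is laid out exactly as displayed on this page, and treating the threshold as inclusive is an assumption.

```python
import re

# Matches a per-hit summary line as displayed on this page, e.g.
# "Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 39.70  E-value: 6.94e-03"
# (the layout is assumed from the page text, not from a documented format).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of a summary line, or None if it is not one."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "evalue": float(match.group("evalue")),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (inclusive by assumption)."""
    return hit["evalue"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 39.70  E-value: 6.94e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit shown in this section reports an E-value below 0.01, which is consistent with the threshold listed above.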
