| Name | Accession | Description | Interval | E-value |
| WW | pfam00397 | WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ... | 133-160 | 3.94e-09 |
|
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Pssm-ID: 459800 [Multi-domain] Cd Length: 30 Bit Score: 52.12 E-value: 3.94e-09
10 20
....*....|....*....|....*...
gi 2462519535 133 DDWSEHISSSGKKYYYNCRTEVSQWEKP 160
Cdd:pfam00397 3 PGWEERWDPDGRVYYYNHETGETQWEKP 30
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 235-527 | 5.44e-07 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 5.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 235 PRAETHSSSTPVQHPIKPvvhPTATPSTVPSSPFTLQSDHQPKKSFDANGASTLSKLPTPTSSVPAQKTERKESTSGDKP 314
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDP---PPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR 2685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 315 VSHSCTTPSTSSASglNPTSAPPTSASAvPVSPVPQSPIPPLLQDPNLLRQLLPALQATLQLNNSNVDISKINEVLTAAV 394
Cdd:PHA03247 2686 RAARPTVGSLTSLA--DPPPPPPTPEPA-PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 395 TQASLQSIIHKFLTAGPSAfNITSLISQAAQLSTQAQPSNQSPMSLTSDASSPRSYVSPRIS-----TPQTNTVPIKPLI 469
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpagplPPPTSAQPTAPPP 2841
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 2462519535 470 STPPVSSQPKVSTPVVKQGPVSQSA-TQQPVTADKQQGHEPVSpRSLQRSSQRSPSPGP 527
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPpSRSPAAKPAAPARPPVR-RLARPAVSRSTESFA 2899
|
|
| PRP40 super family | cl34905 | Splicing factor [RNA processing and modification]; | 131-198 | 3.65e-03 |
|
Splicing factor [RNA processing and modification]; The actual alignment was detected with superfamily member COG5104:
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 40.45 E-value: 3.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 131 SADDWSEHISSSGKKYYYNCRTEVSQWEKPKEW--LEREQRQKEA---------NKMAVNSFPKD---RDYRREVMQATA 196
Cdd:COG5104 54 DVDPWKECRTADGKVYYYNSITRESRWKIPPERkkVEPIAEQKHDersmiggngNDMAITDHETSepkYLLGRLMSQYGI 133
|
..
gi 2462519535 197 TS 198
Cdd:COG5104 134 TS 135
|
|
|
| Name | Accession | Description | Interval | E-value |
| WW | pfam00397 | WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ... | 133-160 | 3.94e-09 |
|
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Pssm-ID: 459800 [Multi-domain] Cd Length: 30 Bit Score: 52.12 E-value: 3.94e-09
10 20
....*....|....*....|....*...
gi 2462519535 133 DDWSEHISSSGKKYYYNCRTEVSQWEKP 160
Cdd:pfam00397 3 PGWEERWDPDGRVYYYNHETGETQWEKP 30
|
|
| WW | cd00201 | Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ... | 133-162 | 1.45e-08 |
|
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs; 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Pssm-ID: 238122 [Multi-domain] Cd Length: 31 Bit Score: 50.60 E-value: 1.45e-08
10 20 30
....*....|....*....|....*....|
gi 2462519535 133 DDWSEHISSSGKKYYYNCRTEVSQWEKPKE 162
Cdd:cd00201 2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 235-527 | 5.44e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 5.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 235 PRAETHSSSTPVQHPIKPvvhPTATPSTVPSSPFTLQSDHQPKKSFDANGASTLSKLPTPTSSVPAQKTERKESTSGDKP 314
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDP---PPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR 2685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 315 VSHSCTTPSTSSASglNPTSAPPTSASAvPVSPVPQSPIPPLLQDPNLLRQLLPALQATLQLNNSNVDISKINEVLTAAV 394
Cdd:PHA03247 2686 RAARPTVGSLTSLA--DPPPPPPTPEPA-PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 395 TQASLQSIIHKFLTAGPSAfNITSLISQAAQLSTQAQPSNQSPMSLTSDASSPRSYVSPRIS-----TPQTNTVPIKPLI 469
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpagplPPPTSAQPTAPPP 2841
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 2462519535 470 STPPVSSQPKVSTPVVKQGPVSQSA-TQQPVTADKQQGHEPVSpRSLQRSSQRSPSPGP 527
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPpSRSPAAKPAAPARPPVR-RLARPAVSRSTESFA 2899
|
|
| WW | smart00456 | Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ... | 135-162 | 1.25e-06 |
|
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Pssm-ID: 197736 [Multi-domain] Cd Length: 33 Bit Score: 45.28 E-value: 1.25e-06
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 234-561 | 1.41e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 1.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 234 LPRAETHSSSTPVQHPIKPVVHPTATPSTVPSSPFTLQSDHQPKKSFDANGASTLSKLPTPTSSVPAQKTERKESTSGDK 313
Cdd:PHA03247 2721 LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 314 PVSHSCTTPSTSSASGLNPTSAPPTSASAVPVSPVPQSPIPPLlqdpnllrqllPALQATLQLNNSNV---DISKINEVL 390
Cdd:PHA03247 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP-----------GPPPPSLPLGGSVApggDVRRRPPSR 2869
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 391 TAAVTQASlqsiihkfltagPSAFNITSLISQAAQLSTQ--AQPSNQSPMSLTSDASSPrsyvspristPQTNTVPIKPL 468
Cdd:PHA03247 2870 SPAAKPAA------------PARPPVRRLARPAVSRSTEsfALPPDQPERPPQPQAPPP----------PQPQPQPPPPP 2927
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 469 ISTPPVSSQPKVSTPVVKQGPVSQSATQQPVTADKQQGHepVSPRSLQRSSQRSPSPGPNHTSNSSnasnATVVPQNSSA 548
Cdd:PHA03247 2928 QPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA--LVPGRVAVPRFRVPQPAPSREAPAS----STPPLTGHSL 3001
|
330
....*....|...
gi 2462519535 549 RSTCSLTPALAAH 561
Cdd:PHA03247 3002 SRVSSWASSLALH 3014
|
|
| DUF5585 | pfam17823 | Family of unknown function (DUF5585); This is a family of unknown function found in chordata. | 252-523 | 3.00e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.80 E-value: 3.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 252 PVVHPTATPSTVPSSpFTLQSDHQPKKSFDANGASTLSklpTPTSSVPAQKTERKESTSGDKPVSHSCTTPSTSSASGLN 331
Cdd:pfam17823 115 LAAAASSSPSSAAQS-LPAAIAALPSEAFSAPRAAACR---ANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTA 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 332 PTSAPPTSASAVPVSPVPQSPIppllqDPNLLRQLLPALQATLQLNNSNVDISKINEVLTAAVTQASLQSIIHKFLTAGP 411
Cdd:pfam17823 191 ASSAPTTAASSAPATLTPARGI-----STAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVAS 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 412 SAfnitSLISQAAQLSTQAQPSNQSPMSltSDASSPRSYVSPRISTPQTNTVPIKPLISTPPVSSQPKVSTPVVKQGPVS 491
Cdd:pfam17823 266 AA----GTINMGDPHARRLSPAKHMPSD--TMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKS 339
|
250 260 270
....*....|....*....|....*....|...
gi 2462519535 492 QSATQQP-VTADKQQGHEPVSPRSLQRSSQRSP 523
Cdd:pfam17823 340 VASTNLAvVTTTKAQAKEPSASPVPVLHTSMIP 372
|
|
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 427-589 | 5.13e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 5.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 427 STQAQPSNQSPMSLTSDASSPRSYVSPRISTPQTNTVPIKPLISTPPV-SSQPKVSTPVVKQGPVSQSATQQPVTADKQQ 505
Cdd:PRK10263 299 ATQPEYDEYDPLLNGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVaSVDVPPAQPTVAWQPVPGPQTGEPVIAPAPE 378
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 506 GHEPVSPRSLQRSSQRSPSPGPNHTSNSSNASNATVVPQNSSARSTCSlTPALAAHFSENLIKHVQGWPAdHAEKQASRL 585
Cdd:PRK10263 379 GYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPE-QPAQQPYYAPAPEQPVAGNAW-QAEEQQSTF 456
|
....
gi 2462519535 586 REEA 589
Cdd:PRK10263 457 APQS 460
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 259-556 | 5.83e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 5.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 259 TPSTVPSSPFTLQSDHQPKKSFDANGASTLSKLPTPTSSVPAQKTE---RKESTSGDKPVSH--------SCTTPSTSSA 327
Cdd:pfam05109 392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTgfaAPNTTTGLPSSTHvptnltapASTGPTVSTA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 328 SGLNPTSAPPTSAsAVPVSPVPqSPippllQDPNLLRQLLPALQATLQLNNSNVDISKINEVLTAAVTQASLQSI----- 402
Cdd:pfam05109 472 DVTSPTPAGTTSG-ASPVTPSP-SP-----RDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLgktsp 544
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 403 IHKFLTAGPSAFNITSLISQAAQLSTQAQPSNQSPMSLTSDASSPRSYVSPRISTPQTNTV--PIKPLISTPPVSSQPKV 480
Cdd:pfam05109 545 TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTnhTLGGTSSTPVVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 481 STPVVKQG---------------PVSQSATQQPVTADKQQGHEP---------------VSP--RSLQRSSQRSPSPGPN 528
Cdd:pfam05109 625 ATSAVTTGqhnitssstssmslrPSSISETLSPSTSDNSTSHMPlltsahptggenitqVTPasTSTHHVSTSSPAPRPG 704
|
330 340
....*....|....*....|....*...
gi 2462519535 529 HTSNSSNASNATVVPQNSSARSTCSLTP 556
Cdd:pfam05109 705 TTSQASGPGNSSTSTKPGEVNVTKGTPP 732
|
|
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 132-168 | 8.11e-04 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 42.37 E-value: 8.11e-04
10 20 30
....*....|....*....|....*....|....*..
gi 2462519535 132 ADDWSEHISSSGKKYYYNCRTEVSQWEKPKEWLEREQ 168
Cdd:COG5104 14 RSEWEELKAPDGRIYYYNKRTGKSSWEKPKELLKGSE 50
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 234-539 | 1.35e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.06 E-value: 1.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 234 LPRAETHSSSTPVQHPIKpvVHPTATPSTVPSSPFTLQSDhqpkksfdaNGASTLSKLPTPTSSVPAQkterkeSTSGDK 313
Cdd:pfam03154 266 LPQPSLHGQMPPMPHSLQ--TGPSHMQHPVPPQPFPLTPQ---------SSQSQVPPGPSPAAPGQSQ------QRIHTP 328
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 314 PVSHSCTTPSTSSASGLNPTSAPPTSASAVPVSPVPQSPIPPLLQDPNLLRQLLP-ALQATLQLNNSNVDISKINEVLTA 392
Cdd:pfam03154 329 PSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPfQMNSNLPPPPALKPLSSLSTHHPP 408
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 393 AVTQASLQSIIHKFLTAGPSAFNITSLISQAAQLSTQAQPSNQSPMSLTSDASSPRSYVSPRISTPQTNTVPIKPLISTP 472
Cdd:pfam03154 409 SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSA 488
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462519535 473 PVSSQPKVSTPVVKQGPV--SQSATQQPVTADKQQGHEPVSPRSlQRSSQRSPSPGPNHTSNSSNASNA 539
Cdd:pfam03154 489 MPGIQPPSSASVSSSGPVpaAVSCPLPPVQIKEEALDEAEEPES-PPPPPRSPSPEPTVVNTPSHASQS 556
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 234-549 | 1.52e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 234 LPRAETHSSSTPVQHPIKPVVHPTATPSTVPSSPFTlqsdhqPKKSFDANGasTLSKLPTPTSSVPAQKTERKESTSGDK 313
Cdd:pfam05109 454 VPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVT------PSPSPRDNG--TESKAPDMTSPTSAVTTPTPNATSPTP 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 314 PVSHSCTTPSTSSASGLNPTSA----PPTSASAVP--VSPVPQSPIPPLLQDPNLLRQLLPALQATL--------QLNNS 379
Cdd:pfam05109 526 AVTTPTPNATSPTLGKTSPTSAvttpTPNATSPTPavTTPTPNATIPTLGKTSPTSAVTTPTPNATSptvgetspQANTT 605
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 380 NVDISKINEV---------LTAAVT--QASLQSIIHKFLTAGPSAFN-----------------ITSLISQAAQLSTQAQ 431
Cdd:pfam05109 606 NHTLGGTSSTpvvtsppknATSAVTtgQHNITSSSTSSMSLRPSSISetlspstsdnstshmplLTSAHPTGGENITQVT 685
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 432 PSNQSPMSLTSDASSPRSYVSPRISTPQTNTVPIKPLISTPPVSSQPKVSTPvvKQGPVSQSATQQPVT-----ADKQQG 506
Cdd:pfam05109 686 PASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS--PQAPSGQKTAVPTVTstggkANSTTG 763
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 2462519535 507 HEPVSPRSLQRSSQRSPSPGPNHTSNSSNASNATVVPQNSSAR 549
Cdd:pfam05109 764 GKHTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSK 806
|
|
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 131-198 | 3.65e-03 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 40.45 E-value: 3.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 131 SADDWSEHISSSGKKYYYNCRTEVSQWEKPKEW--LEREQRQKEA---------NKMAVNSFPKD---RDYRREVMQATA 196
Cdd:COG5104 54 DVDPWKECRTADGKVYYYNSITRESRWKIPPERkkVEPIAEQKHDersmiggngNDMAITDHETSepkYLLGRLMSQYGI 133
|
..
gi 2462519535 197 TS 198
Cdd:COG5104 134 TS 135
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 233-561 | 6.96e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 39.77 E-value: 6.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 233 RLPRAETHSSSTPVQHPIKPVVHPTATPSTVPSSPFTLQSDHQPKKSFDA---NGASTLSKLPTPTSSVPAQKTERKEST 309
Cdd:PHA03307 66 EPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDpppPTPPPASPPPSPAPDLSEMLRPVGSPG 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 310 SGDKPVSHSCTTPSTSSASGLnPTSAPPTSASAVPVS----PVPQSPIPPLLQDPNLLRQLLPALQATLQLNNSNVDISK 385
Cdd:PHA03307 146 PPPAASPPAAGASPAAVASDA-ASSRQAALPLSSPEEtaraPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 386 I-NEVLTAAVTQASLQSIIHKFLTAGPSAFNITSLISQAAQLSTQAQPSNQSPMSLTSDASSPRSyvSPRISTPqtntvp 464
Cdd:PHA03307 225 GrSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSS--SPRERSP------ 296
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462519535 465 iKPLISTPPVSSQPKVSTPVVKQGPVSQSATQQPVTADKQQGHEPVSP-RSLQRSSQRSPSPGPNHTSNSSNASNATVVP 543
Cdd:PHA03307 297 -SPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPgPSPSRSPSPSRPPPPADPSSPRKRPRPSRAP 375
|
330 340
....*....|....*....|.
gi 2462519535 544 QN---SSARSTCSLTPALAAH 561
Cdd:PHA03307 376 SSpaaSAGRPTRRRARAAVAG 396
|
|
|