NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1533911195|ref|NP_001354553|]
View 

zinc finger protein 469 [Homo sapiens]

Protein Classification

C2H2-type zinc finger protein( domain architecture ID 10442881)

Cys2His2 (C2H2)-type zinc finger protein may be involved in transcriptional regulation

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2-422 4.56e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 4.56e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195    2 PGERPRGAPPPTMTGDLQ--PRQVASSPGHPSQPPLEDNTPATRTTkgAREAGGQAQAMELPEAQPRQARDGELKPPSLR 79
Cdd:PHA03247  2589 PDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPSPSPA--ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195   80 GQAPSSTPGKRGSPQTPPGRSPLQAPSRLAGRAEGSPPQRyilgIASSRTKPTLDETPENPQLEAAQLPEVDTPQGPGT- 158
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPP----TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPp 2742
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  159 ---------GAPLRPGLPRTEAQPAAEELGFHRCfQEPPSSFTSTNYTSPSATPRPPAPGPPQSRGTSPLQP--GSYPEY 227
Cdd:PHA03247  2743 avpagpatpGGPARPARPPTTAGPPAPAPPAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApaAALPPA 2821
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  228 QASGADSWPPAAENSFPGANFGVPPAEPEPIPKGSRPGGSPRGVSFQFPFPALHGASTKPFPADVAGHAFTNGPLVFAFH 307
Cdd:PHA03247  2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP 2901
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  308 QPQGAWPEEAVGTGPAYPLPT-QPAPSPLPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKILPERPPS 386
Cdd:PHA03247  2902 PDQPERPPQPQAPPPPQPQPQpPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1533911195  387 AQDGLGSTRGPPSSLPQRHFPGQAYRASG----VDTSPGP 422
Cdd:PHA03247  2982 PAPSREAPASSTPPLTGHSLSRVSSWASSlalhEETDPPP 3021
PHA03247 super family cl33720
large tegument protein UL36; Provisional
3456-3938 1.22e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 1.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3456 PGAPGQKaRALEGTLPSKRRRVAMPGSAPGPGEDRPPPRGSSPilsegslPALL--HLCSEVAPSTTKGWPETLErpvdp 3533
Cdd:PHA03247  2475 PGAPVYR-RPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLA-------PAILpdEPVGEPVHPRMLTWIRGLE----- 2541
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3534 vthpirgcELPSNHQECPPPSLSPFPAALADGRgdcaldgalERPENEASPGSPGPllqqalPLGASLPRPGARGQDAEG 3613
Cdd:PHA03247  2542 --------ELASDDAGDPPPPLPPAAPPAAPDR---------SVPPPRPAPRPSEP------AVTSRARRPDAPPQSARP 2598
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3614 kRAPLVFSGKRRAPGARGRCAPDhfqeDHLLQKEKEVSSSHMVSEGGPRGAF---HKGSATKPAGCQSSSKDR-SAASTP 3689
Cdd:PHA03247  2599 -RAPVDDRGDPRGPAPPSPLPPD----THAPDPPPPSPSPAANEPDPHPPPTvppPERPRDDPAPGRVSRPRRaRRLGRA 2673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3690 SKALKFPVHPRKAVGSLAPGELARGTENGMKPATPKAKPGPSSQGSGSPrPGTKTGGGSQPQPASGQLQSETATTPAKPS 3769
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLP-PGPAAARQASPALPAAPAPPAVPAGPATPG 2752
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3770 FPSRSPAPErLPARAQAKSCTKGPreagEQGPHGSLGPKEKGESSTKRKKGQVPGPARSESVGSFGRAPSAPdkpprtpr 3849
Cdd:PHA03247  2753 GPARPARPP-TTAGPPAPAPPAAP----AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP-------- 2819
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3850 KQATPSRVLPTKPKPNSQNKPRPPPSEQRKAEPGhtqrkdrlGKAFPQGRPLLRPPKRGTAVHGAEPAEPHTHRTAEAQS 3929
Cdd:PHA03247  2820 PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG--------GSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAV 2891

                   ....*....
gi 1533911195 3930 DLLSQLFGQ 3938
Cdd:PHA03247  2892 SRSTESFAL 2900
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2147-2594 1.04e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 1.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2147 GGQLPASPSCRDPPGPQQLLACS-PAWAPLEEADGVQATtdtgaedSPVAPPSLTT--SPCDPKEALAGCLLQGEGSPLE 2223
Cdd:PHA03247  2549 GDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRAR-------RPDAPPQSARprAPVDDRGDPRGPAPPSPLPPDT 2621
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2224 DPSSWPPGSVSAvtctHSGDTPKDSTLRIPEDSRKEKLwESPGRATSPPLAGAVSpsvavRATGLSSTPTGDEAQAGRGL 2303
Cdd:PHA03247  2622 HAPDPPPPSPSP----AANEPDPHPPPTVPPPERPRDD-PAPGRVSRPRRARRLG-----RAAQASSPPQRPRRRAARPT 2691
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2304 PGP-DPQSRGAPPHTNPDRMPRGHSSYSPSNTARLGHREGQ-AVTAVPTEPPTLQG----AGPDSPACLEGEMGTSSKEP 2377
Cdd:PHA03247  2692 VGSlTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASpALPAAPAPPAVPAGpatpGGPARPARPPTTAGPPAPAP 2771
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2378 edPGTPETGRSGATKMPRVTcpSTGLGLGRTTAPSSTASDFQSDSPQSHRNASHQTPQGDPLGPQDLKQRSRGYKKKPAS 2457
Cdd:PHA03247  2772 --PAAPAAGPPRRLTRPAVA--SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2458 TENGQWKGQAPHGPVTCEVCAASFRSGPGLSRHKARKHRPHPGAPAEPSPAALPAQQPLEPLAQKCQPPRKKSHRVSGKE 2537
Cdd:PHA03247  2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1533911195 2538 RPNHS-RGDPSHVTQPPPAQGSKEVLRAPGSPHSQQLHPPSPTEHEVdVKTPASKPRP 2594
Cdd:PHA03247  2928 QPQPPpPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV-PRFRVPQPAP 2984
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1130-1543 1.30e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 1.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1130 PREDEPQKPRKAARQEAGGDGAPANPEEPGGSRP--------GPGRSPQARGPSRSLETGAAAREGGPKCADRPSVAPKD 1201
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAA 2563
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1202 PLQ-VPTntetseeTRPSldfPQEAKEPETAEESAPDSTEFTEALRSPpaacAGEMGASPGLLIPEQPPPSRHDTGTPKP 1280
Cdd:PHA03247  2564 PDRsVPP-------PRPA---PRPSEPAVTSRARRPDAPPQSARPRAP----VDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1281 SGSlantaPHGSSPtPGVGSLLGGPGGTQAPVSHNSKDPPARQPGEFLAPVANPSSTACPKPSVLSSKISSFgcdpAGFN 1360
Cdd:PHA03247  2630 SPS-----PAANEP-DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL----TSLA 2699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1361 RDPlgvpvaKKGPQPYSSPHSELFLGPKDLAGCFLEELHPKPSARDAPPASSSCLCQDGEDAGSLEPQLPRSPP------ 1434
Cdd:PHA03247  2700 DPP------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappa 2773
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1435 GTAETEPGRAASPPTLESSSLFPDLPV--DRFDPP--LYGSLSANRDSGLPFACADPPQKTVPSDPPYPSflllEEVSPM 1510
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSpwDPADPPaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP----GPPPPS 2849
                          410       420       430
                   ....*....|....*....|....*....|...
gi 1533911195 1511 LPSHFPDLSGGKVlSKTCPPERTVVPGAAPSLP 1543
Cdd:PHA03247  2850 LPLGGSVAPGGDV-RRRPPSRSPAAKPAAPARP 2881
PHA03247 super family cl33720
large tegument protein UL36; Provisional
255-723 3.02e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 3.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  255 PEPIPKGSRPGGSPRGVsfqfpfPALHGASTKPFPADVAGHAFTNGPLVFAFHQPQGAWPEEAVGTGPAYPLPTQPAPSP 334
Cdd:PHA03247  2552 PPPLPPAAPPAAPDRSV------PPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  335 LPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKilPERPPSAqdglgstrgppSSLPQRHFPGQAYRAS 414
Cdd:PHA03247  2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARR--LGRAAQA-----------SSPPQRPRRRAARPTV 2692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  415 GVDTSPGPPdtelAAPGPPPARLPQLWDPTAAPYPTPPGGPLAAtrsmffngqpspgqrlclPQSAPLPWPQVLPTARPS 494
Cdd:PHA03247  2693 GSLTSLADP----PPPPPTPEPAPHALVSATPLPPGPAAARQAS------------------PALPAAPAPPAVPAGPAT 2750
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  495 PHGMEMLSRLPFPAGGPewqggSQGALGTAGKTPGPREKLPAVRSSQGGSPALFTYNGMTDPGAQPLFFGVAQPQVSPHG 574
Cdd:PHA03247  2751 PGGPARPARPPTTAGPP-----APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  575 TPSLPPPRVVGASPSESPLPSPATNTAG-------------STCSSLSPMSSSPANPSSEESQLPGPLGPSAFFHPPTHP 641
Cdd:PHA03247  2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  642 QETGSPFPSPEP-------PHSLPTHYQPEPAKAFPFPADGLGAEGAFQCLEETPFPHEGPEVGRGGLQGFPRAPPPYPT 714
Cdd:PHA03247  2906 ERPPQPQAPPPPqpqpqppPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

                   ....*....
gi 1533911195  715 HHFSLSSAS 723
Cdd:PHA03247  2986 REAPASSTP 2994
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1635-2272 3.01e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1635 AHREGAESAVATVEAVQGRPGGTWPCPASFHPGHAALLPCAQEDLVSGAPFSPRGANFHFQPVQKAGASKtglcqaeGDS 1714
Cdd:PHA03247  2479 VYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDA-------GDP 2551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1715 RPPQDVCLPEPskqpgpqldagslakcSPDQelSFPKNKEAASSQESEDSLRllpcEQRGGFLPEPGTADQPhrGAPAPE 1794
Cdd:PHA03247  2552 PPPLPPAAPPA----------------APDR--SVPPPRPAPRPSEPAVTSR----ARRPDAPPQSARPRAP--VDDRGD 2607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1795 AFGSPAVHLAPDLAFQGDGAPPLDATWPFGASPSHAAQGHSAGRAGGHLHPT-AGRPGFEGNEFAPAGASSLTAPRGREA 1873
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1874 WLVPVPSPACVSNTHPSRRSQDPAlsPPIRQLQLPGPGVAKSKDGILGLQELTPAAQSPPRVNPSGLEGGTVEGGKVACG 1953
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPA--PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1954 PAQGSPGGVQVTTLPAVAghqlgleadghwgllgqaekTQGQGTANQLQPENGVSPGGTDNHASVNASPKTALTGPTEGA 2033
Cdd:PHA03247  2766 PPAPAPPAAPAAGPPRRL--------------------TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2034 VLLEkckgsrAAMSLQEEAEPTPSPPSPNRESL--ALALTAAHSRSGSEGRTPERASSPglnkpllatgdsPAPSVGDLA 2111
Cdd:PHA03247  2826 GPLP------PPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPPSRSPAAKPAAP------------ARPPVRRLA 2887
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2112 ACAPS-PTSAAHMPCSLGPLPREDPLTSPSRAQGGLGGQLPASPSCRDPPGPQQLLACSPAWAPLEEADGVQATTDTGA- 2189
Cdd:PHA03247  2888 RPAVSrSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAl 2967
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2190 -EDSPVAPPSLTTSPCDPKEALAGCLLQGEGSPLEDPSSWP----------PGSVSAVTCTHSGDTPKDST-----LRIP 2253
Cdd:PHA03247  2968 vPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWAsslalheetdPPPVSLKQTLWPPDDTEDSDadslfDSDS 3047
                          650
                   ....*....|....*....
gi 1533911195 2254 EDSRKEKLWESPGRATSPP 2272
Cdd:PHA03247  3048 ERSDLEALDPLPPEPHDPF 3066
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3339-3359 1.83e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


:

Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 38.05  E-value: 1.83e-03
                           10        20
                   ....*....|....*....|.
gi 1533911195 3339 CHHCGKRFPKPFKLQRHLAVH 3359
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-422 4.56e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 4.56e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195    2 PGERPRGAPPPTMTGDLQ--PRQVASSPGHPSQPPLEDNTPATRTTkgAREAGGQAQAMELPEAQPRQARDGELKPPSLR 79
Cdd:PHA03247  2589 PDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPSPSPA--ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195   80 GQAPSSTPGKRGSPQTPPGRSPLQAPSRLAGRAEGSPPQRyilgIASSRTKPTLDETPENPQLEAAQLPEVDTPQGPGT- 158
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPP----TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPp 2742
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  159 ---------GAPLRPGLPRTEAQPAAEELGFHRCfQEPPSSFTSTNYTSPSATPRPPAPGPPQSRGTSPLQP--GSYPEY 227
Cdd:PHA03247  2743 avpagpatpGGPARPARPPTTAGPPAPAPPAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApaAALPPA 2821
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  228 QASGADSWPPAAENSFPGANFGVPPAEPEPIPKGSRPGGSPRGVSFQFPFPALHGASTKPFPADVAGHAFTNGPLVFAFH 307
Cdd:PHA03247  2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP 2901
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  308 QPQGAWPEEAVGTGPAYPLPT-QPAPSPLPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKILPERPPS 386
Cdd:PHA03247  2902 PDQPERPPQPQAPPPPQPQPQpPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1533911195  387 AQDGLGSTRGPPSSLPQRHFPGQAYRASG----VDTSPGP 422
Cdd:PHA03247  2982 PAPSREAPASSTPPLTGHSLSRVSSWASSlalhEETDPPP 3021
PHA03247 PHA03247
large tegument protein UL36; Provisional
3456-3938 1.22e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 1.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3456 PGAPGQKaRALEGTLPSKRRRVAMPGSAPGPGEDRPPPRGSSPilsegslPALL--HLCSEVAPSTTKGWPETLErpvdp 3533
Cdd:PHA03247  2475 PGAPVYR-RPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLA-------PAILpdEPVGEPVHPRMLTWIRGLE----- 2541
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3534 vthpirgcELPSNHQECPPPSLSPFPAALADGRgdcaldgalERPENEASPGSPGPllqqalPLGASLPRPGARGQDAEG 3613
Cdd:PHA03247  2542 --------ELASDDAGDPPPPLPPAAPPAAPDR---------SVPPPRPAPRPSEP------AVTSRARRPDAPPQSARP 2598
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3614 kRAPLVFSGKRRAPGARGRCAPDhfqeDHLLQKEKEVSSSHMVSEGGPRGAF---HKGSATKPAGCQSSSKDR-SAASTP 3689
Cdd:PHA03247  2599 -RAPVDDRGDPRGPAPPSPLPPD----THAPDPPPPSPSPAANEPDPHPPPTvppPERPRDDPAPGRVSRPRRaRRLGRA 2673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3690 SKALKFPVHPRKAVGSLAPGELARGTENGMKPATPKAKPGPSSQGSGSPrPGTKTGGGSQPQPASGQLQSETATTPAKPS 3769
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLP-PGPAAARQASPALPAAPAPPAVPAGPATPG 2752
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3770 FPSRSPAPErLPARAQAKSCTKGPreagEQGPHGSLGPKEKGESSTKRKKGQVPGPARSESVGSFGRAPSAPdkpprtpr 3849
Cdd:PHA03247  2753 GPARPARPP-TTAGPPAPAPPAAP----AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP-------- 2819
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3850 KQATPSRVLPTKPKPNSQNKPRPPPSEQRKAEPGhtqrkdrlGKAFPQGRPLLRPPKRGTAVHGAEPAEPHTHRTAEAQS 3929
Cdd:PHA03247  2820 PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG--------GSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAV 2891

                   ....*....
gi 1533911195 3930 DLLSQLFGQ 3938
Cdd:PHA03247  2892 SRSTESFAL 2900
PHA03247 PHA03247
large tegument protein UL36; Provisional
2147-2594 1.04e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 1.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2147 GGQLPASPSCRDPPGPQQLLACS-PAWAPLEEADGVQATtdtgaedSPVAPPSLTT--SPCDPKEALAGCLLQGEGSPLE 2223
Cdd:PHA03247  2549 GDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRAR-------RPDAPPQSARprAPVDDRGDPRGPAPPSPLPPDT 2621
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2224 DPSSWPPGSVSAvtctHSGDTPKDSTLRIPEDSRKEKLwESPGRATSPPLAGAVSpsvavRATGLSSTPTGDEAQAGRGL 2303
Cdd:PHA03247  2622 HAPDPPPPSPSP----AANEPDPHPPPTVPPPERPRDD-PAPGRVSRPRRARRLG-----RAAQASSPPQRPRRRAARPT 2691
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2304 PGP-DPQSRGAPPHTNPDRMPRGHSSYSPSNTARLGHREGQ-AVTAVPTEPPTLQG----AGPDSPACLEGEMGTSSKEP 2377
Cdd:PHA03247  2692 VGSlTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASpALPAAPAPPAVPAGpatpGGPARPARPPTTAGPPAPAP 2771
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2378 edPGTPETGRSGATKMPRVTcpSTGLGLGRTTAPSSTASDFQSDSPQSHRNASHQTPQGDPLGPQDLKQRSRGYKKKPAS 2457
Cdd:PHA03247  2772 --PAAPAAGPPRRLTRPAVA--SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2458 TENGQWKGQAPHGPVTCEVCAASFRSGPGLSRHKARKHRPHPGAPAEPSPAALPAQQPLEPLAQKCQPPRKKSHRVSGKE 2537
Cdd:PHA03247  2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1533911195 2538 RPNHS-RGDPSHVTQPPPAQGSKEVLRAPGSPHSQQLHPPSPTEHEVdVKTPASKPRP 2594
Cdd:PHA03247  2928 QPQPPpPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV-PRFRVPQPAP 2984
PHA03247 PHA03247
large tegument protein UL36; Provisional
1130-1543 1.30e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 1.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1130 PREDEPQKPRKAARQEAGGDGAPANPEEPGGSRP--------GPGRSPQARGPSRSLETGAAAREGGPKCADRPSVAPKD 1201
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAA 2563
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1202 PLQ-VPTntetseeTRPSldfPQEAKEPETAEESAPDSTEFTEALRSPpaacAGEMGASPGLLIPEQPPPSRHDTGTPKP 1280
Cdd:PHA03247  2564 PDRsVPP-------PRPA---PRPSEPAVTSRARRPDAPPQSARPRAP----VDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1281 SGSlantaPHGSSPtPGVGSLLGGPGGTQAPVSHNSKDPPARQPGEFLAPVANPSSTACPKPSVLSSKISSFgcdpAGFN 1360
Cdd:PHA03247  2630 SPS-----PAANEP-DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL----TSLA 2699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1361 RDPlgvpvaKKGPQPYSSPHSELFLGPKDLAGCFLEELHPKPSARDAPPASSSCLCQDGEDAGSLEPQLPRSPP------ 1434
Cdd:PHA03247  2700 DPP------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappa 2773
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1435 GTAETEPGRAASPPTLESSSLFPDLPV--DRFDPP--LYGSLSANRDSGLPFACADPPQKTVPSDPPYPSflllEEVSPM 1510
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSpwDPADPPaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP----GPPPPS 2849
                          410       420       430
                   ....*....|....*....|....*....|...
gi 1533911195 1511 LPSHFPDLSGGKVlSKTCPPERTVVPGAAPSLP 1543
Cdd:PHA03247  2850 LPLGGSVAPGGDV-RRRPPSRSPAAKPAAPARP 2881
PHA03247 PHA03247
large tegument protein UL36; Provisional
255-723 3.02e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 3.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  255 PEPIPKGSRPGGSPRGVsfqfpfPALHGASTKPFPADVAGHAFTNGPLVFAFHQPQGAWPEEAVGTGPAYPLPTQPAPSP 334
Cdd:PHA03247  2552 PPPLPPAAPPAAPDRSV------PPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  335 LPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKilPERPPSAqdglgstrgppSSLPQRHFPGQAYRAS 414
Cdd:PHA03247  2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARR--LGRAAQA-----------SSPPQRPRRRAARPTV 2692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  415 GVDTSPGPPdtelAAPGPPPARLPQLWDPTAAPYPTPPGGPLAAtrsmffngqpspgqrlclPQSAPLPWPQVLPTARPS 494
Cdd:PHA03247  2693 GSLTSLADP----PPPPPTPEPAPHALVSATPLPPGPAAARQAS------------------PALPAAPAPPAVPAGPAT 2750
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  495 PHGMEMLSRLPFPAGGPewqggSQGALGTAGKTPGPREKLPAVRSSQGGSPALFTYNGMTDPGAQPLFFGVAQPQVSPHG 574
Cdd:PHA03247  2751 PGGPARPARPPTTAGPP-----APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  575 TPSLPPPRVVGASPSESPLPSPATNTAG-------------STCSSLSPMSSSPANPSSEESQLPGPLGPSAFFHPPTHP 641
Cdd:PHA03247  2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  642 QETGSPFPSPEP-------PHSLPTHYQPEPAKAFPFPADGLGAEGAFQCLEETPFPHEGPEVGRGGLQGFPRAPPPYPT 714
Cdd:PHA03247  2906 ERPPQPQAPPPPqpqpqppPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

                   ....*....
gi 1533911195  715 HHFSLSSAS 723
Cdd:PHA03247  2986 REAPASSTP 2994
PHA03247 PHA03247
large tegument protein UL36; Provisional
1635-2272 3.01e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1635 AHREGAESAVATVEAVQGRPGGTWPCPASFHPGHAALLPCAQEDLVSGAPFSPRGANFHFQPVQKAGASKtglcqaeGDS 1714
Cdd:PHA03247  2479 VYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDA-------GDP 2551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1715 RPPQDVCLPEPskqpgpqldagslakcSPDQelSFPKNKEAASSQESEDSLRllpcEQRGGFLPEPGTADQPhrGAPAPE 1794
Cdd:PHA03247  2552 PPPLPPAAPPA----------------APDR--SVPPPRPAPRPSEPAVTSR----ARRPDAPPQSARPRAP--VDDRGD 2607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1795 AFGSPAVHLAPDLAFQGDGAPPLDATWPFGASPSHAAQGHSAGRAGGHLHPT-AGRPGFEGNEFAPAGASSLTAPRGREA 1873
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1874 WLVPVPSPACVSNTHPSRRSQDPAlsPPIRQLQLPGPGVAKSKDGILGLQELTPAAQSPPRVNPSGLEGGTVEGGKVACG 1953
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPA--PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1954 PAQGSPGGVQVTTLPAVAghqlgleadghwgllgqaekTQGQGTANQLQPENGVSPGGTDNHASVNASPKTALTGPTEGA 2033
Cdd:PHA03247  2766 PPAPAPPAAPAAGPPRRL--------------------TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2034 VLLEkckgsrAAMSLQEEAEPTPSPPSPNRESL--ALALTAAHSRSGSEGRTPERASSPglnkpllatgdsPAPSVGDLA 2111
Cdd:PHA03247  2826 GPLP------PPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPPSRSPAAKPAAP------------ARPPVRRLA 2887
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2112 ACAPS-PTSAAHMPCSLGPLPREDPLTSPSRAQGGLGGQLPASPSCRDPPGPQQLLACSPAWAPLEEADGVQATTDTGA- 2189
Cdd:PHA03247  2888 RPAVSrSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAl 2967
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2190 -EDSPVAPPSLTTSPCDPKEALAGCLLQGEGSPLEDPSSWP----------PGSVSAVTCTHSGDTPKDST-----LRIP 2253
Cdd:PHA03247  2968 vPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWAsslalheetdPPPVSLKQTLWPPDDTEDSDadslfDSDS 3047
                          650
                   ....*....|....*....
gi 1533911195 2254 EDSRKEKLWESPGRATSPP 2272
Cdd:PHA03247  3048 ERSDLEALDPLPPEPHDPF 3066
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3339-3359 1.83e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 38.05  E-value: 1.83e-03
                           10        20
                   ....*....|....*....|.
gi 1533911195 3339 CHHCGKRFPKPFKLQRHLAVH 3359
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-422 4.56e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 4.56e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195    2 PGERPRGAPPPTMTGDLQ--PRQVASSPGHPSQPPLEDNTPATRTTkgAREAGGQAQAMELPEAQPRQARDGELKPPSLR 79
Cdd:PHA03247  2589 PDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPSPSPA--ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195   80 GQAPSSTPGKRGSPQTPPGRSPLQAPSRLAGRAEGSPPQRyilgIASSRTKPTLDETPENPQLEAAQLPEVDTPQGPGT- 158
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPP----TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPp 2742
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  159 ---------GAPLRPGLPRTEAQPAAEELGFHRCfQEPPSSFTSTNYTSPSATPRPPAPGPPQSRGTSPLQP--GSYPEY 227
Cdd:PHA03247  2743 avpagpatpGGPARPARPPTTAGPPAPAPPAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApaAALPPA 2821
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  228 QASGADSWPPAAENSFPGANFGVPPAEPEPIPKGSRPGGSPRGVSFQFPFPALHGASTKPFPADVAGHAFTNGPLVFAFH 307
Cdd:PHA03247  2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP 2901
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  308 QPQGAWPEEAVGTGPAYPLPT-QPAPSPLPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKILPERPPS 386
Cdd:PHA03247  2902 PDQPERPPQPQAPPPPQPQPQpPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1533911195  387 AQDGLGSTRGPPSSLPQRHFPGQAYRASG----VDTSPGP 422
Cdd:PHA03247  2982 PAPSREAPASSTPPLTGHSLSRVSSWASSlalhEETDPPP 3021
PHA03247 PHA03247
large tegument protein UL36; Provisional
3456-3938 1.22e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 1.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3456 PGAPGQKaRALEGTLPSKRRRVAMPGSAPGPGEDRPPPRGSSPilsegslPALL--HLCSEVAPSTTKGWPETLErpvdp 3533
Cdd:PHA03247  2475 PGAPVYR-RPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLA-------PAILpdEPVGEPVHPRMLTWIRGLE----- 2541
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3534 vthpirgcELPSNHQECPPPSLSPFPAALADGRgdcaldgalERPENEASPGSPGPllqqalPLGASLPRPGARGQDAEG 3613
Cdd:PHA03247  2542 --------ELASDDAGDPPPPLPPAAPPAAPDR---------SVPPPRPAPRPSEP------AVTSRARRPDAPPQSARP 2598
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3614 kRAPLVFSGKRRAPGARGRCAPDhfqeDHLLQKEKEVSSSHMVSEGGPRGAF---HKGSATKPAGCQSSSKDR-SAASTP 3689
Cdd:PHA03247  2599 -RAPVDDRGDPRGPAPPSPLPPD----THAPDPPPPSPSPAANEPDPHPPPTvppPERPRDDPAPGRVSRPRRaRRLGRA 2673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3690 SKALKFPVHPRKAVGSLAPGELARGTENGMKPATPKAKPGPSSQGSGSPrPGTKTGGGSQPQPASGQLQSETATTPAKPS 3769
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLP-PGPAAARQASPALPAAPAPPAVPAGPATPG 2752
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3770 FPSRSPAPErLPARAQAKSCTKGPreagEQGPHGSLGPKEKGESSTKRKKGQVPGPARSESVGSFGRAPSAPdkpprtpr 3849
Cdd:PHA03247  2753 GPARPARPP-TTAGPPAPAPPAAP----AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP-------- 2819
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3850 KQATPSRVLPTKPKPNSQNKPRPPPSEQRKAEPGhtqrkdrlGKAFPQGRPLLRPPKRGTAVHGAEPAEPHTHRTAEAQS 3929
Cdd:PHA03247  2820 PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG--------GSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAV 2891

                   ....*....
gi 1533911195 3930 DLLSQLFGQ 3938
Cdd:PHA03247  2892 SRSTESFAL 2900
PHA03247 PHA03247
large tegument protein UL36; Provisional
3479-3935 2.40e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 2.40e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3479 MPGSAPGPGEDR--PPPRgSSPILSEGSLPALLHLCSEVAPSTTkgwPETLERPVDPVTHPIRGCELPSNHQECPPPSLS 3556
Cdd:PHA03247  2555 LPPAAPPAAPDRsvPPPR-PAPRPSEPAVTSRARRPDAPPQSAR---PRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3557 PFPAALADGRGDCALDGALERPENEASPGS---PGPLLQQALPLGASLPRPGARGQDAEGKRAPLVFSGKRRAPGARGRC 3633
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRvsrPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3634 APDHFQEDHLLQKEKEVSSSHM-------------VSEGGPRGAFHKGSATKPAGCQSSSKDRSAASTPSKALkfpvhPR 3700
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASpalpaapappavpAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL-----TR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3701 KAVGSLAPGelargTENGMKPATPKAKPGPSSQGSGSPRPGTKTGGGSQPQPASGQLQSETATTPAKPSFPSRSPAPERL 3780
Cdd:PHA03247  2786 PAVASLSES-----RESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3781 PARAQAKSCTKGPREAGEQGPHGSLGPKEKGESSTKRKKGQVPGPARSesvgsfgRAPSAPDKPPRTPRKQATPSRVLPT 3860
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-------PQPQAPPPPQPQPQPPPPPQPQPPP 2933
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1533911195 3861 KPKPNSQNKPRPPPSEQRKAEPGHTQRKDRLGKAFPQGRPLLR---PPKRGTAVHGAEPAEPHTHRTAEAQSDLLSQL 3935
Cdd:PHA03247  2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvPQPAPSREAPASSTPPLTGHSLSRVSSWASSL 3011
PHA03247 PHA03247
large tegument protein UL36; Provisional
2147-2594 1.04e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 1.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2147 GGQLPASPSCRDPPGPQQLLACS-PAWAPLEEADGVQATtdtgaedSPVAPPSLTT--SPCDPKEALAGCLLQGEGSPLE 2223
Cdd:PHA03247  2549 GDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRAR-------RPDAPPQSARprAPVDDRGDPRGPAPPSPLPPDT 2621
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2224 DPSSWPPGSVSAvtctHSGDTPKDSTLRIPEDSRKEKLwESPGRATSPPLAGAVSpsvavRATGLSSTPTGDEAQAGRGL 2303
Cdd:PHA03247  2622 HAPDPPPPSPSP----AANEPDPHPPPTVPPPERPRDD-PAPGRVSRPRRARRLG-----RAAQASSPPQRPRRRAARPT 2691
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2304 PGP-DPQSRGAPPHTNPDRMPRGHSSYSPSNTARLGHREGQ-AVTAVPTEPPTLQG----AGPDSPACLEGEMGTSSKEP 2377
Cdd:PHA03247  2692 VGSlTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASpALPAAPAPPAVPAGpatpGGPARPARPPTTAGPPAPAP 2771
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2378 edPGTPETGRSGATKMPRVTcpSTGLGLGRTTAPSSTASDFQSDSPQSHRNASHQTPQGDPLGPQDLKQRSRGYKKKPAS 2457
Cdd:PHA03247  2772 --PAAPAAGPPRRLTRPAVA--SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2458 TENGQWKGQAPHGPVTCEVCAASFRSGPGLSRHKARKHRPHPGAPAEPSPAALPAQQPLEPLAQKCQPPRKKSHRVSGKE 2537
Cdd:PHA03247  2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1533911195 2538 RPNHS-RGDPSHVTQPPPAQGSKEVLRAPGSPHSQQLHPPSPTEHEVdVKTPASKPRP 2594
Cdd:PHA03247  2928 QPQPPpPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV-PRFRVPQPAP 2984
PHA03247 PHA03247
large tegument protein UL36; Provisional
1130-1543 1.30e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 1.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1130 PREDEPQKPRKAARQEAGGDGAPANPEEPGGSRP--------GPGRSPQARGPSRSLETGAAAREGGPKCADRPSVAPKD 1201
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAA 2563
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1202 PLQ-VPTntetseeTRPSldfPQEAKEPETAEESAPDSTEFTEALRSPpaacAGEMGASPGLLIPEQPPPSRHDTGTPKP 1280
Cdd:PHA03247  2564 PDRsVPP-------PRPA---PRPSEPAVTSRARRPDAPPQSARPRAP----VDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1281 SGSlantaPHGSSPtPGVGSLLGGPGGTQAPVSHNSKDPPARQPGEFLAPVANPSSTACPKPSVLSSKISSFgcdpAGFN 1360
Cdd:PHA03247  2630 SPS-----PAANEP-DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL----TSLA 2699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1361 RDPlgvpvaKKGPQPYSSPHSELFLGPKDLAGCFLEELHPKPSARDAPPASSSCLCQDGEDAGSLEPQLPRSPP------ 1434
Cdd:PHA03247  2700 DPP------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappa 2773
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1435 GTAETEPGRAASPPTLESSSLFPDLPV--DRFDPP--LYGSLSANRDSGLPFACADPPQKTVPSDPPYPSflllEEVSPM 1510
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSpwDPADPPaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP----GPPPPS 2849
                          410       420       430
                   ....*....|....*....|....*....|...
gi 1533911195 1511 LPSHFPDLSGGKVlSKTCPPERTVVPGAAPSLP 1543
Cdd:PHA03247  2850 LPLGGSVAPGGDV-RRRPPSRSPAAKPAAPARP 2881
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3482-3919 2.75e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 57.10  E-value: 2.75e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3482 SAPGPGEDRP--PPRGSSPILSEGSLPALLHLCSEVAPSTTKGWPETLERPVDPVTHPIRGCELPSNHQECPPPSLSPFP 3559
Cdd:PHA03307    23 RPPATPGDAAddLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3560 AALADGRGDCALDGALERPENEASPGSPGPLLQQALPLGASLPRPGARGQDAEGKRAPLVFSGKRRAPGARGRCApdhfq 3639
Cdd:PHA03307   103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLS----- 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3640 edhLLQKEKEVSSSHMVSEGGPRGAFHKGSATKPAGCQSSSKDRSAASTPSKALKFPVHPRKAVGSLAPGelaRGTENGM 3719
Cdd:PHA03307   178 ---SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES---SGCGWGP 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3720 KPATPKAKPGPSSQGSGSPRPGTKTGGGSQPQPASgqlqsetattpakPSFPSRSPAPERLPARAQAKSCTKGPREAGEQ 3799
Cdd:PHA03307   252 ENECPLPRPAPITLPTRIWEASGWNGPSSRPGPAS-------------SSSSPRERSPSPSPSSPGSGPAPSSPRASSSS 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3800 GPHGSLGPKEKGESSTKRKKGQVPGPARSESVGSFGRAPSAPDKPPrtPRKQATPSRVLPTKpkpnSQNKPRPPPSEQRK 3879
Cdd:PHA03307   319 SSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS--PRKRPRPSRAPSSP----AASAGRPTRRRARA 392
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1533911195 3880 AEPGHTQRKDRLGkAFPQGRPLLRPPKRGTAVHGAEPAEP 3919
Cdd:PHA03307   393 AVAGRARRRDATG-RFPAGRPRPSPLDAGAASGAFYARYP 431
PHA03247 PHA03247
large tegument protein UL36; Provisional
255-723 3.02e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 3.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  255 PEPIPKGSRPGGSPRGVsfqfpfPALHGASTKPFPADVAGHAFTNGPLVFAFHQPQGAWPEEAVGTGPAYPLPTQPAPSP 334
Cdd:PHA03247  2552 PPPLPPAAPPAAPDRSV------PPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  335 LPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKilPERPPSAqdglgstrgppSSLPQRHFPGQAYRAS 414
Cdd:PHA03247  2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARR--LGRAAQA-----------SSPPQRPRRRAARPTV 2692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  415 GVDTSPGPPdtelAAPGPPPARLPQLWDPTAAPYPTPPGGPLAAtrsmffngqpspgqrlclPQSAPLPWPQVLPTARPS 494
Cdd:PHA03247  2693 GSLTSLADP----PPPPPTPEPAPHALVSATPLPPGPAAARQAS------------------PALPAAPAPPAVPAGPAT 2750
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  495 PHGMEMLSRLPFPAGGPewqggSQGALGTAGKTPGPREKLPAVRSSQGGSPALFTYNGMTDPGAQPLFFGVAQPQVSPHG 574
Cdd:PHA03247  2751 PGGPARPARPPTTAGPP-----APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  575 TPSLPPPRVVGASPSESPLPSPATNTAG-------------STCSSLSPMSSSPANPSSEESQLPGPLGPSAFFHPPTHP 641
Cdd:PHA03247  2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  642 QETGSPFPSPEP-------PHSLPTHYQPEPAKAFPFPADGLGAEGAFQCLEETPFPHEGPEVGRGGLQGFPRAPPPYPT 714
Cdd:PHA03247  2906 ERPPQPQAPPPPqpqpqppPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

                   ....*....
gi 1533911195  715 HHFSLSSAS 723
Cdd:PHA03247  2986 REAPASSTP 2994
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-666 7.47e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 7.47e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195    6 PRGAPPPTMTGDLQPrQVASSPGHPSQPPLEDNTPAtrttkgarEAGGQAQAMELPEAQPRQARDGELKPPSLRGQAPSS 85
Cdd:PHA03247  2492 AGAAPDPGGGGPPDP-DAPPAPSRLAPAILPDEPVG--------EPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPA 2562
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195   86 TPGKRGSPQTPPGRSPLQAPSRLAGRAeGSPPQryilgiassrtkPTLDETPENPQLEaaqlPEVDTPQGPGTGAPLRPG 165
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRP-DAPPQ------------SARPRAPVDDRGD----PRGPAPPSPLPPDTHAPD 2625
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  166 LPRTEAQPAAEELGFHRCFQEPPSSFTSTNYTSPSATPRPPAPGppQSRGTSPLQPGSYPEYQASgadswPPAAENSFPG 245
Cdd:PHA03247  2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARR--LGRAAQASSPPQRPRRRAA-----RPTVGSLTSL 2698
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  246 ANFGVPPAEPEPIPKGSRPGGSPRgvsfqfPFPALHGASTKPFPADVAGHAFTNGPLVFAFHQPQGAWPEEAVGTGPAYP 325
Cdd:PHA03247  2699 ADPPPPPPTPEPAPHALVSATPLP------PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP 2772
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  326 -LPTQPAPSPLPCYQGQPGGLNRHSDLSGALSSPGAAHSAPRPFSDSLHKSLTKILPErPPSAQdglgstrgpPSSLPQR 404
Cdd:PHA03247  2773 aAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP-PTSAQ---------PTAPPPP 2842
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  405 HFPGQAYRASGVDTSPGPPDTELAAPGPPPARlpqlwdptaapyptPPGGPLAATRSMFFNGQPSPGQRLCLPQSAPLPW 484
Cdd:PHA03247  2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAK--------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  485 PQVLPTARPSPHGMEMLSRLPFPAggPEWQGGSQGALGTAGKTPGPREKLPAVRSSQGGSPAlftyngmtdpgaqPLFFG 564
Cdd:PHA03247  2909 PQPQAPPPPQPQPQPPPPPQPQPP--PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV-------------PGRVA 2973
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  565 VAQPQVsphgtpslPPPRvvgaspseSPLPSPATNTAGSTCSSLSPMSSSPANPSSEESQLPGPLGPSAFFHPP------ 638
Cdd:PHA03247  2974 VPRFRV--------PQPA--------PSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPddteds 3037
                          650       660       670
                   ....*....|....*....|....*....|....*..
gi 1533911195  639 ---------THPQETGSPFPSPEPPHSLPTHyQPEPA 666
Cdd:PHA03247  3038 dadslfdsdSERSDLEALDPLPPEPHDPFAH-EPDPA 3073
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
3709-3926 9.95e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.00  E-value: 9.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3709 GELARGTE-NGMKPATPKAKPGPS-----SQGSGSPRPGTKTG------GGSQPQPASGQLQSETATTPAKPSFPSRSPA 3776
Cdd:PTZ00449   508 DEPPEGPEaSGLPPKAPGDKEGEEgehedSKESDEPKEGGKPGetkegeVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKH 587
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3777 PERLPARAQAKSctkgPREAgeQGPHGSLGPKeKGESSTKRKKGQVPGPARSEsvgsfgRAPSAPDKPPRTPRKQATPSr 3856
Cdd:PTZ00449   588 PKDPEEPKKPKR----PRSA--QRPTRPKSPK-LPELLDIPKSPKRPESPKSP------KRPPPPQRPSSPERPEGPKI- 653
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1533911195 3857 vlPTKPKPNSQNKPRPPPSEQRK---------AEPGHTQRKDRLGKAFPQGRPLLRPPKRGTAVHGAEPAEPHTHRTAE 3926
Cdd:PTZ00449   654 --IKSPKPPKSPKPPFDPKFKEKfyddyldaaAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEE 730
PHA03247 PHA03247
large tegument protein UL36; Provisional
3451-3854 2.62e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 2.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3451 RGVRRPGAPGQKARALEGTlpskRRRVAMPGSAPGPGEDRPPPRGSSPilsegslpallhlcsEVAPSttkgwPETLERP 3530
Cdd:PHA03247  2665 RRARRLGRAAQASSPPQRP----RRRAARPTVGSLTSLADPPPPPPTP---------------EPAPH-----ALVSATP 2720
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3531 VDPVTHPIRgcelpsnhQECPPPSLSPFPAALADGRGDCALDGALERPENEASPGSPGPLLQQALPLGASLPRPGARGQD 3610
Cdd:PHA03247  2721 LPPGPAAAR--------QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3611 AEGKRAPLvfsgkrrapgargrcAPDhfqedhllqkekevSSSHMVSEGGPRGAFHKGSATKPAGCQSSSkdrSAASTPS 3690
Cdd:PHA03247  2793 ESRESLPS---------------PWD--------------PADPPAAVLAPAAALPPAASPAGPLPPPTS---AQPTAPP 2840
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3691 KALKFPVHPRKAVGSLAP-GELARGTENGMKPATPKAKPGPSSQGSGSPRPGTKTGGGSQPQ-----PASGQLQSETATT 3764
Cdd:PHA03247  2841 PPPGPPPPSLPLGGSVAPgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPdqperPPQPQAPPPPQPQ 2920
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3765 PAKPSFPSRSPAPE---RLPARAQAKSCTKGPREAGEQGPHGSLGPKEKGESSTKRKKGQVPGPARsesvgsfgrapSAP 3841
Cdd:PHA03247  2921 PQPPPPPQPQPPPPpppRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR-----------EAP 2989
                          410
                   ....*....|...
gi 1533911195 3842 DKPPRTPRKQATP 3854
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
25-251 4.17e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.87  E-value: 4.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195   25 SSPGHPSQPPLEDNTPATRTTKGAREAGGQAQAMELPEAQPRQARDGELKPPSLRGQAPSSTPGKRGSPQTPPGRSPLQA 104
Cdd:PRK12323   372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195  105 PSRLAGRAEGSPPQryilgIASSRTKPTLDETPENPQLEAAQLPEVDTPQGPGTGAPLRPGLPrteaQPAAEELGFHRCF 184
Cdd:PRK12323   452 PAPAAAPAAAARPA-----AAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASP----APAQPDAAPAGWV 522
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1533911195  185 QEPPSSFTSTNYTSPSATPRPPAPGPPQSRGTSPLQPGSYPEYQASGADSWPPAAENSFPGANFGVP 251
Cdd:PRK12323   523 AESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLP 589
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1124-1326 1.18e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 48.28  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1124 VELTQGPREDEPQKPRKAARQEAGGDGaPANPEEPGGSRPGPGRSPQARGPSRSlETGAAAREGGPKCADRPsvapkDPL 1203
Cdd:PRK14086    84 IAITVDPSAGEPAPPPPHARRTSEPEL-PRPGRRPYEGYGGPRADDRPPGLPRQ-DQLPTARPAYPAYQQRP-----EPG 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1204 QVPTNTETSEETRPSLDFPQEAKEPETAEESAPdstefTEALRSPPAACAGEMGA---SPGLLIPEQPPPSRHDTGTPKP 1280
Cdd:PRK14086   157 AWPRAADDYGWQQQRLGFPPRAPYASPASYAPE-----QERDREPYDAGRPEYDQrrrDYDHPRPDWDRPRRDRTDRPEP 231
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1533911195 1281 SGSlANTAPHGSSPTPGVGSLLGGPGGTQAPVSHNSKDPPARQPGE 1326
Cdd:PRK14086   232 PPG-AGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGE 276
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
3664-3910 2.22e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.18  E-value: 2.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3664 AFHKGSATKPAGCQSSSKDRSAASTPSKALKFPVHPRKAVGSLAPGELARGTENGMKPATPKAKPGPSSQGSGSPRPGTK 3743
Cdd:PRK12323   362 AFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3744 TGGGSQPQPASGQLQSETATTPAKPSFPSRSPAPERLPARAQAKSCTkgPREAGEQGPHGSLGPKEKGesstkrkkgqVP 3823
Cdd:PRK12323   442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAA--PAPADDDPPPWEELPPEFA----------SP 509
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3824 GPARSESVGSFGRAPSAPD---KPPRTPRKQATPSRVLPTKPKPNSQNKPRPPPSEQRKAEpghtqrkDRLGKAFPQGRP 3900
Cdd:PRK12323   510 APAQPDAAPAGWVAESIPDpatADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASA-------SGLPDMFDGDWP 582
                          250
                   ....*....|..
gi 1533911195 3901 LL--RPPKRGTA 3910
Cdd:PRK12323   583 ALaaRLPVRGLA 594
PHA03247 PHA03247
large tegument protein UL36; Provisional
1635-2272 3.01e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1635 AHREGAESAVATVEAVQGRPGGTWPCPASFHPGHAALLPCAQEDLVSGAPFSPRGANFHFQPVQKAGASKtglcqaeGDS 1714
Cdd:PHA03247  2479 VYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDA-------GDP 2551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1715 RPPQDVCLPEPskqpgpqldagslakcSPDQelSFPKNKEAASSQESEDSLRllpcEQRGGFLPEPGTADQPhrGAPAPE 1794
Cdd:PHA03247  2552 PPPLPPAAPPA----------------APDR--SVPPPRPAPRPSEPAVTSR----ARRPDAPPQSARPRAP--VDDRGD 2607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1795 AFGSPAVHLAPDLAFQGDGAPPLDATWPFGASPSHAAQGHSAGRAGGHLHPT-AGRPGFEGNEFAPAGASSLTAPRGREA 1873
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1874 WLVPVPSPACVSNTHPSRRSQDPAlsPPIRQLQLPGPGVAKSKDGILGLQELTPAAQSPPRVNPSGLEGGTVEGGKVACG 1953
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPA--PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1954 PAQGSPGGVQVTTLPAVAghqlgleadghwgllgqaekTQGQGTANQLQPENGVSPGGTDNHASVNASPKTALTGPTEGA 2033
Cdd:PHA03247  2766 PPAPAPPAAPAAGPPRRL--------------------TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2034 VLLEkckgsrAAMSLQEEAEPTPSPPSPNRESL--ALALTAAHSRSGSEGRTPERASSPglnkpllatgdsPAPSVGDLA 2111
Cdd:PHA03247  2826 GPLP------PPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPPSRSPAAKPAAP------------ARPPVRRLA 2887
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2112 ACAPS-PTSAAHMPCSLGPLPREDPLTSPSRAQGGLGGQLPASPSCRDPPGPQQLLACSPAWAPLEEADGVQATTDTGA- 2189
Cdd:PHA03247  2888 RPAVSrSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAl 2967
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2190 -EDSPVAPPSLTTSPCDPKEALAGCLLQGEGSPLEDPSSWP----------PGSVSAVTCTHSGDTPKDST-----LRIP 2253
Cdd:PHA03247  2968 vPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWAsslalheetdPPPVSLKQTLWPPDDTEDSDadslfDSDS 3047
                          650
                   ....*....|....*....
gi 1533911195 2254 EDSRKEKLWESPGRATSPP 2272
Cdd:PHA03247  3048 ERSDLEALDPLPPEPHDPF 3066
PHA03247 PHA03247
large tegument protein UL36; Provisional
2041-2353 3.08e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2041 GSRAAMSLQEEAEPTPSPPSPNRESLALALTAAHSRSGSEGRTPERASSPglnkpllATGDSPAPSVGDLAACAPSPTSA 2120
Cdd:PHA03247  2693 GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPP-------AVPAGPATPGGPARPARPPTTAG 2765
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2121 AHMPCSLGPLPREDPLTSPSRAQGGLGGQLPASPSCRDPPGPQqllACSPAWAPLEEADGVQATTDTGAEDSPVAPPSLT 2200
Cdd:PHA03247  2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPP---AAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2201 TSPCDPKEALAGCLLQGEGSPLEDPSSWPPGSVSAVTcthsgdTPKDSTLRIPEDSRKEKLWESPGRATSPPLAGAVSPs 2280
Cdd:PHA03247  2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPA------RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPP- 2915
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1533911195 2281 vavratglSSTPTGDEAQAGRGLPGPDPQSRGAPPhTNPDRMPRGHSSYSPSNTA-RLGHREGQAVTAVPTEPP 2353
Cdd:PHA03247  2916 --------PPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQpWLGALVPGRVAVPRFRVP 2980
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
3725-3882 8.49e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.45  E-value: 8.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3725 KAKPGPSSQGSGSPRPGTKTGGGSQPQPASGQLQSETATTPAKPSFPSRSPAPERLPARAQAKSCTKGPREAGEQGPHGS 3804
Cdd:PTZ00449   493 KKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKI 572
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1533911195 3805 LGPKEKGESSTKRKKGQVPGPARSESVGSFGRAPSAPdKPPRTPRkqatpSRVLPTKPK-PNSQNKPRPPPSEQRKAEP 3882
Cdd:PTZ00449   573 PTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRP-KSPKLPE-----LLDIPKSPKrPESPKSPKRPPPPQRPSSP 645
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3456-3856 8.55e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 8.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3456 PGAPGQKARALEGTLPSKRRRVAMPGSAPGPGEDRPPPRGSSPILSEGSLPAllhlcsEVAPSTTKGWPETLeRPVDPVT 3535
Cdd:PHA03307    73 PGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPA------SPPPSPAPDLSEML-RPVGSPG 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3536 HPIRGCELPSNHQECPPPSLSPFPAALADgrgdcALDGAlerPENEASPGSPGPLLQQALPLGASLPRPGARGQDAEGKR 3615
Cdd:PHA03307   146 PPPAASPPAAGASPAAVASDAASSRQAAL-----PLSSP---EETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3616 APLVFSGKRRAPGARGRCAPDhfqedhllqkekevssshmvseggprgafhkGSATKPAGCQSSSKDRSAASTPSKaLKF 3695
Cdd:PHA03307   218 SSPAPAPGRSAADDAGASSSD-------------------------------SSSSESSGCGWGPENECPLPRPAP-ITL 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3696 PVHPRKAVGSLAPGELARGTENGMKPATPKAKPGPSSQGSGSPRPGTKTGGGSQPQPASGQLQSETATTPAKPSFPSRSP 3775
Cdd:PHA03307   266 PTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3776 APERLPARAQAKSCTKG--PREAGEQGPHGSLGPKEKGESSTKRKKGQVPGPAR-SESVGSFGRAPSAPDKPPRTPRKQA 3852
Cdd:PHA03307   346 SPSRSPSPSRPPPPADPssPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARrRDATGRFPAGRPRPSPLDAGAASGA 425

                   ....
gi 1533911195 3853 TPSR 3856
Cdd:PHA03307   426 FYAR 429
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3339-3359 1.83e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 38.05  E-value: 1.83e-03
                           10        20
                   ....*....|....*....|.
gi 1533911195 3339 CHHCGKRFPKPFKLQRHLAVH 3359
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3456-3883 1.89e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3456 PGAPGQkARALEGTLPSKRRRVAMPGSAPGPGEDRPPPRGSSPILSEGSLPALLHLCSEVAPSTTK---GWPETLERPVD 3532
Cdd:PRK07764   365 PSASDD-ERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPqpaPAPAPAPAPPS 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3533 PVTHPIRGCELPSNHQECPPPSLSPFPAAlADGRGDCALDGALERPENEASPGSPGPLLQQALPLGASLPR--------- 3603
Cdd:PRK07764   444 PAGNAPAGGAPSPPPAAAPSAQPAPAPAA-APEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRerwpeilaa 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3604 -------------PGARGQDAEGKRAPLVFSgkrrAPGARGRCApdhfqedhlLQKEKEVSSSHMVSEGGPRGAFHKGSA 3670
Cdd:PRK07764   523 vpkrsrktwaillPEATVLGVRGDTLVLGFS----TGGLARRFA---------SPGNAEVLVTALAEELGGDWQVEAVVG 589
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3671 TKPAGCQSSSKDRSAASTPSKALKFPVHPRKAVGSLAPGELARGTENGmKPATPKAKPGPSSQGSGSPRPGTKTGGGSQP 3750
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-EASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 3751 QPASGQLQSETATTPAKPSFPSRSPAPERLPARAQAKSCTKGPREAGEQGPHGSLGPKEKGESSTKRKKGQVPGPARSES 3830
Cdd:PRK07764   669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP 748
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1533911195 3831 VGSFGRAPSAPDKPPRTPRKQ-ATPSRVLPTKPKPNSQNKPRPPPSEQRKAEPG 3883
Cdd:PRK07764   749 PDPAGAPAQPPPPPAPAPAAApAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAE 802
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
2-177 3.25e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 3.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195    2 PGERPRGAPPPTMTGDLQPRQVASSPGHPSQPPLEDNTPATRTTKGAREAGGQAQAMELPEAQPRQARDGelkppslRGQ 81
Cdd:PRK07764   623 APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAA-------PAG 695
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195   82 APSSTPGKRGSPQTPPGRSPLQAPSRLAGRAEGSPPQRYILGIASSRTKPTLDETPENPQLEAAQLPEVDTPQGPGTGAP 161
Cdd:PRK07764   696 AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPP 775
                          170
                   ....*....|....*.
gi 1533911195  162 LRPGLPRTEAQPAAEE 177
Cdd:PRK07764   776 PSPPSEEEEMAEDDAP 791
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2077-2468 4.30e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.24  E-value: 4.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2077 SGSEGRTPERASSPGLNKPLLATGdSPAPSVGDLAACAPSPTSAAHMPCSLGPLPREDPLTSPSRAqgglGGQLPASPSC 2156
Cdd:PHA03307    75 PGTEAPANESRSTPTWSLSTLAPA-SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLR----PVGSPGPPPA 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2157 RDPPGPQQLLACSPAWAPLEEADGVQATTDTGAEDSPVAPPSLTTSPCDPKEALAGCllQGEGSPLEDPSSWPPGSvsav 2236
Cdd:PHA03307   150 ASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP--PRRSSPISASASSPAPA---- 223
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2237 tcthSGDTPKDSTLRIPEDSRKEKLWESPGRATSPPLAGAVSPSVAVRATGLSSTPTGDEAQAGRGLPGPDPQSRGAPPh 2316
Cdd:PHA03307   224 ----PGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSP- 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 2317 tnpdrmPRGHSSYSPSNTARLGHREGQAVTAVPTEPPTLQGAGPDSPACLEGEMGTSSKEPEDPGTPETGRSGATKMPRV 2396
Cdd:PHA03307   299 ------SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPS 372
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1533911195 2397 TCPSTGLGLGRTTAPSSTASDFQSDSPQSHRNASHQTPQGDPLGPQDLKQRSRGYKKKPASTENGQ-WKGQAP 2468
Cdd:PHA03307   373 RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEpWPGSPP 445
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1247-1462 8.12e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 8.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1247 SPPAACAGEMGASPGLLIPEQ---PPPSRHDTGTPKPSGSLANTAPHGSSPTPGVGSLLGGPGGTQAPvshnskdppARQ 1323
Cdd:PRK12323   373 GPATAAAAPVAQPAPAAAAPAaaaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQAS---------ARG 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1533911195 1324 PGEFLAPVANPSSTacPKPSVLSSKISSFGCDPAGFNRDPLGVPVAKKGPQPYSSPHSELFlgPKDLAGCFLEELHPKPS 1403
Cdd:PRK12323   444 PGGAPAPAPAPAAA--PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEEL--PPEFASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1533911195 1404 ARDAPPASSSCLCQDGEDAGSLEPQLPRS--PPGTAETEPGRAASPPTLESSSLFPDLPVD 1462
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAApaPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH