NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|238908524|ref|NP_001155018|]
View 

proline-rich basic protein 1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4585 super family cl21094
Domain of unknown function (DUF4585); The function of this protein domain family is yet to be ...
874-941 1.20e-16

Domain of unknown function (DUF4585); The function of this protein domain family is yet to be characterized. It is putatively thought to lie in the C-terminal domain of the DNA nucleotide repair protein, Xeroderma pigmentosa complementation group A (XPA). The function of XPA is to bind to DNA and repair any mismatched base pairs. This domain family is often found in eukaryotes, and is approximately 70 amino acids in length. There is a conserved DPE sequence motif. In humans, this protein is encoded for in the chromosomal position, Chromosome 5 open reading frame 65. Mutations in the gene lead to myelodysplastic syndromes, where there is inefficient stem cell production in the bone marrow. This suggests that the protein may have a role in forming blood cells.


The actual alignment was detected with superfamily member pfam15232:

Pssm-ID: 464574 [Multi-domain]  Cd Length: 73  Bit Score: 75.49  E-value: 1.20e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 238908524   874 PGAAPLGKVLVDPESGRYYFVEAPRQPRLRVLFDPESGQYVEVL-----LPPSSPGPPHRVYTPLALGLGLYP 941
Cdd:pfam15232    1 PYPATQRKVLLDPESGQYYVVDAPLQPQLKTLFDPETGQYVEVSipsseSNVSPPTPPELLYSPLALYPGAYP 73
PHA03247 super family cl33720
large tegument protein UL36; Provisional
83-624 1.16e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.57  E-value: 1.16e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   83 GPGSGFPRGPGSGPRPPQPQLRTLPSGEMEVIFGVGPLFGCSGADDREAQQQFTEPAFISPLPPGPASPAAVPRQSQVPd 162
Cdd:PHA03247 2497 DPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAP- 2575
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  163 ggsrwatylelRPRGPSPAAPAQFECVEVALEEGAAPARPRTVPKRQielrPRPQSPPRAAGAPRPRLLLRTGSLDESLG 242
Cdd:PHA03247 2576 -----------RPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP----APPSPLPPDTHAPDPPPPSPSPAANEPDP 2640
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  243 PLQAAAgfvqtalarklsPEAPAPSSATFGSTGRSEPETRETARSTHVVLEKAKSRPLRVRDNSAP-AKAPRPWPSLRER 321
Cdd:PHA03247 2641 HPPPTV------------PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSlTSLADPPPPPPTP 2708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  322 AIRRDKPAPGTePLGPVSSSiflqSEEKIQEARKTRFPREAPDRTVQRARSPPFECRIPSEVPSRAVRPRSPS--PPRQT 399
Cdd:PHA03247 2709 EPAPHALVSAT-PLPPGPAA----ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagPPRRL 2783
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  400 PNGAVRGPRCPSPQNLSPWDRTTrrVSSPLFPEASSEWENQNPAVEEtvsrrsPSPPILSQWNQCVAGERSPSLEAPSLW 479
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPAD--PPAAVLAPAAALPPAASPAGPL------PPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  480 EIPHSAVADAVEPRSSPSPPAffpweAPDRPIGTWGPSPQetwdPMGPGSSIAFTQEAQNgltqeelAPPTPSAPGTPEP 559
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPA-----APARPPVRRLARPA----VSRSTESFALPPDQPE-------RPPQPQAPPPPQP 2919
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 238908524  560 TEMQSPSTREISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKCPEVLGPGEAASGRPRMAIPRP 624
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
PHA03247 super family cl33720
large tegument protein UL36; Provisional
548-1002 4.73e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 4.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  548 PPTPSAPGTPEPTEMQSPSTRE------------ISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKC-PEVLGPGEAAS 614
Cdd:PHA03247 2506 PDAPPAPSRLAPAILPDEPVGEpvhprmltwirgLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPaPRPSEPAVTSR 2585
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  615 GRPRMAIPRPRDVRklvkTTYAPGFPAGAQGSGLPAPPADPCGEEGGESKTQEPPALGPPAPAHYTSVFIKDFLPVVPHP 694
Cdd:PHA03247 2586 ARRPDAPPQSARPR----APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  695 YEPPEPSFDTVARDASQPNGVLRRRAENSTAKPFKRTEIRLPGALALGRRPEVTSRVRARGPGGENRDVEAQRLVPDGDG 774
Cdd:PHA03247 2662 SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP 2741
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  775 RTSPLGGARSSSQRSPVGPAGVRSP-RPGSPQMQASPSPGIAPKPKTPPTAPEPAAAVQAPLPREPLALAGRTAPAQPRA 853
Cdd:PHA03247 2742 PAVPAGPATPGGPARPARPPTTAGPpAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  854 AS-----APPTDRSPQSPSQGARRQPGAAPLGKVLVD--------PESGRYYFVEAPRQPRLRVLFDPESGQYVEVLLPP 920
Cdd:PHA03247 2822 ASpagplPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP 2901
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  921 SSPGPPHRVYTPLALGLGLYPPAYGPIPSLSLPPSPGPQALGSPQLPWVSEAGP------------LDGTYYLPVSGTPN 988
Cdd:PHA03247 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsgavpqpwlgalVPGRVAVPRFRVPQ 2981
                         490
                  ....*....|....
gi 238908524  989 PAPPLLLCAPPSSS 1002
Cdd:PHA03247 2982 PAPSREAPASSTPP 2995
 
Name Accession Description Interval E-value
DUF4585 pfam15232
Domain of unknown function (DUF4585); The function of this protein domain family is yet to be ...
874-941 1.20e-16

Domain of unknown function (DUF4585); The function of this protein domain family is yet to be characterized. It is putatively thought to lie in the C-terminal domain of the DNA nucleotide repair protein, Xeroderma pigmentosa complementation group A (XPA). The function of XPA is to bind to DNA and repair any mismatched base pairs. This domain family is often found in eukaryotes, and is approximately 70 amino acids in length. There is a conserved DPE sequence motif. In humans, this protein is encoded for in the chromosomal position, Chromosome 5 open reading frame 65. Mutations in the gene lead to myelodysplastic syndromes, where there is inefficient stem cell production in the bone marrow. This suggests that the protein may have a role in forming blood cells.


Pssm-ID: 464574 [Multi-domain]  Cd Length: 73  Bit Score: 75.49  E-value: 1.20e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 238908524   874 PGAAPLGKVLVDPESGRYYFVEAPRQPRLRVLFDPESGQYVEVL-----LPPSSPGPPHRVYTPLALGLGLYP 941
Cdd:pfam15232    1 PYPATQRKVLLDPESGQYYVVDAPLQPQLKTLFDPETGQYVEVSipsseSNVSPPTPPELLYSPLALYPGAYP 73
PHA03247 PHA03247
large tegument protein UL36; Provisional
83-624 1.16e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.57  E-value: 1.16e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   83 GPGSGFPRGPGSGPRPPQPQLRTLPSGEMEVIFGVGPLFGCSGADDREAQQQFTEPAFISPLPPGPASPAAVPRQSQVPd 162
Cdd:PHA03247 2497 DPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAP- 2575
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  163 ggsrwatylelRPRGPSPAAPAQFECVEVALEEGAAPARPRTVPKRQielrPRPQSPPRAAGAPRPRLLLRTGSLDESLG 242
Cdd:PHA03247 2576 -----------RPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP----APPSPLPPDTHAPDPPPPSPSPAANEPDP 2640
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  243 PLQAAAgfvqtalarklsPEAPAPSSATFGSTGRSEPETRETARSTHVVLEKAKSRPLRVRDNSAP-AKAPRPWPSLRER 321
Cdd:PHA03247 2641 HPPPTV------------PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSlTSLADPPPPPPTP 2708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  322 AIRRDKPAPGTePLGPVSSSiflqSEEKIQEARKTRFPREAPDRTVQRARSPPFECRIPSEVPSRAVRPRSPS--PPRQT 399
Cdd:PHA03247 2709 EPAPHALVSAT-PLPPGPAA----ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagPPRRL 2783
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  400 PNGAVRGPRCPSPQNLSPWDRTTrrVSSPLFPEASSEWENQNPAVEEtvsrrsPSPPILSQWNQCVAGERSPSLEAPSLW 479
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPAD--PPAAVLAPAAALPPAASPAGPL------PPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  480 EIPHSAVADAVEPRSSPSPPAffpweAPDRPIGTWGPSPQetwdPMGPGSSIAFTQEAQNgltqeelAPPTPSAPGTPEP 559
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPA-----APARPPVRRLARPA----VSRSTESFALPPDQPE-------RPPQPQAPPPPQP 2919
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 238908524  560 TEMQSPSTREISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKCPEVLGPGEAASGRPRMAIPRP 624
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
PHA03247 PHA03247
large tegument protein UL36; Provisional
548-1002 4.73e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 4.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  548 PPTPSAPGTPEPTEMQSPSTRE------------ISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKC-PEVLGPGEAAS 614
Cdd:PHA03247 2506 PDAPPAPSRLAPAILPDEPVGEpvhprmltwirgLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPaPRPSEPAVTSR 2585
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  615 GRPRMAIPRPRDVRklvkTTYAPGFPAGAQGSGLPAPPADPCGEEGGESKTQEPPALGPPAPAHYTSVFIKDFLPVVPHP 694
Cdd:PHA03247 2586 ARRPDAPPQSARPR----APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  695 YEPPEPSFDTVARDASQPNGVLRRRAENSTAKPFKRTEIRLPGALALGRRPEVTSRVRARGPGGENRDVEAQRLVPDGDG 774
Cdd:PHA03247 2662 SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP 2741
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  775 RTSPLGGARSSSQRSPVGPAGVRSP-RPGSPQMQASPSPGIAPKPKTPPTAPEPAAAVQAPLPREPLALAGRTAPAQPRA 853
Cdd:PHA03247 2742 PAVPAGPATPGGPARPARPPTTAGPpAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  854 AS-----APPTDRSPQSPSQGARRQPGAAPLGKVLVD--------PESGRYYFVEAPRQPRLRVLFDPESGQYVEVLLPP 920
Cdd:PHA03247 2822 ASpagplPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP 2901
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  921 SSPGPPHRVYTPLALGLGLYPPAYGPIPSLSLPPSPGPQALGSPQLPWVSEAGP------------LDGTYYLPVSGTPN 988
Cdd:PHA03247 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsgavpqpwlgalVPGRVAVPRFRVPQ 2981
                         490
                  ....*....|....
gi 238908524  989 PAPPLLLCAPPSSS 1002
Cdd:PHA03247 2982 PAPSREAPASSTPP 2995
 
Name Accession Description Interval E-value
DUF4585 pfam15232
Domain of unknown function (DUF4585); The function of this protein domain family is yet to be ...
874-941 1.20e-16

Domain of unknown function (DUF4585); The function of this protein domain family is yet to be characterized. It is putatively thought to lie in the C-terminal domain of the DNA nucleotide repair protein, Xeroderma pigmentosa complementation group A (XPA). The function of XPA is to bind to DNA and repair any mismatched base pairs. This domain family is often found in eukaryotes, and is approximately 70 amino acids in length. There is a conserved DPE sequence motif. In humans, this protein is encoded for in the chromosomal position, Chromosome 5 open reading frame 65. Mutations in the gene lead to myelodysplastic syndromes, where there is inefficient stem cell production in the bone marrow. This suggests that the protein may have a role in forming blood cells.


Pssm-ID: 464574 [Multi-domain]  Cd Length: 73  Bit Score: 75.49  E-value: 1.20e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 238908524   874 PGAAPLGKVLVDPESGRYYFVEAPRQPRLRVLFDPESGQYVEVL-----LPPSSPGPPHRVYTPLALGLGLYP 941
Cdd:pfam15232    1 PYPATQRKVLLDPESGQYYVVDAPLQPQLKTLFDPETGQYVEVSipsseSNVSPPTPPELLYSPLALYPGAYP 73
PHA03247 PHA03247
large tegument protein UL36; Provisional
83-624 1.16e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.57  E-value: 1.16e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   83 GPGSGFPRGPGSGPRPPQPQLRTLPSGEMEVIFGVGPLFGCSGADDREAQQQFTEPAFISPLPPGPASPAAVPRQSQVPd 162
Cdd:PHA03247 2497 DPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAP- 2575
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  163 ggsrwatylelRPRGPSPAAPAQFECVEVALEEGAAPARPRTVPKRQielrPRPQSPPRAAGAPRPRLLLRTGSLDESLG 242
Cdd:PHA03247 2576 -----------RPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP----APPSPLPPDTHAPDPPPPSPSPAANEPDP 2640
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  243 PLQAAAgfvqtalarklsPEAPAPSSATFGSTGRSEPETRETARSTHVVLEKAKSRPLRVRDNSAP-AKAPRPWPSLRER 321
Cdd:PHA03247 2641 HPPPTV------------PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSlTSLADPPPPPPTP 2708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  322 AIRRDKPAPGTePLGPVSSSiflqSEEKIQEARKTRFPREAPDRTVQRARSPPFECRIPSEVPSRAVRPRSPS--PPRQT 399
Cdd:PHA03247 2709 EPAPHALVSAT-PLPPGPAA----ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagPPRRL 2783
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  400 PNGAVRGPRCPSPQNLSPWDRTTrrVSSPLFPEASSEWENQNPAVEEtvsrrsPSPPILSQWNQCVAGERSPSLEAPSLW 479
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPAD--PPAAVLAPAAALPPAASPAGPL------PPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  480 EIPHSAVADAVEPRSSPSPPAffpweAPDRPIGTWGPSPQetwdPMGPGSSIAFTQEAQNgltqeelAPPTPSAPGTPEP 559
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPA-----APARPPVRRLARPA----VSRSTESFALPPDQPE-------RPPQPQAPPPPQP 2919
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 238908524  560 TEMQSPSTREISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKCPEVLGPGEAASGRPRMAIPRP 624
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-593 4.97e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 4.97e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524    7 PPALPgiPRQLPTAPARrqdssgssgsyYTAPGSPEPPDVGPDAKGPANWPWVAPgrgAGAQPRLSVSAQNSRQRHGPGS 86
Cdd:PHA03247 2552 PPPLP--PAAPPAAPDR-----------SVPPPRPAPRPSEPAVTSRARRPDAPP---QSARPRAPVDDRGDPRGPAPPS 2615
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   87 GFPRGPgSGPRPPQPQLRTLPSGEMEVIFGVGPLFGCSGADDREAQQQFTEPAFISPLPPGPASPAAVPRQSQVPDGGSR 166
Cdd:PHA03247 2616 PLPPDT-HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  167 WATYLELRPRGPSPAAPAQFECVEVALEEGAAPARPRTVPKRQIELRPRPQSPPRAAGAPRPRLLLRTGSLDESLGPLQA 246
Cdd:PHA03247 2695 LTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  247 AAGFVQtalarklsPEAPAPSSATFGSTGRSEPETRETARSTHVVLEKAKSRPlrvrDNSAPAKAPRPWPSLrerairrd 326
Cdd:PHA03247 2775 PAAGPP--------RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP----PAASPAGPLPPPTSA-------- 2834
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  327 KPAPGTEPLGPVSSSIFLQSeekiQEARKTRFPREAPDRtvQRARSPPFECRIPSEVPSRAVRPRSPSPPRQTPNGAVRG 406
Cdd:PHA03247 2835 QPTAPPPPPGPPPPSLPLGG----SVAPGGDVRRRPPSR--SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  407 PRCPSPQNLSPWDRTTRRVSSPLFPEASSEWENQNPAVEETVSRRSPSPPILSQWNQCVagerspsleAPSLWEIPHSAV 486
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL---------VPGRVAVPRFRV 2979
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  487 ADAVEPRSSPSPPAFFPWEAPDRPIGTWGPSPQETWDPMGPGSSIAFTQEAQNGLTQEELAPPTPSAPGTPEPTEMQSPS 566
Cdd:PHA03247 2980 PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLP 3059
                         570       580
                  ....*....|....*....|....*..
gi 238908524  567 TREISDLAFGGSQQSPEVAAPEPPGSH 593
Cdd:PHA03247 3060 PEPHDPFAHEPDPATPEAGARESPSSQ 3086
PHA03247 PHA03247
large tegument protein UL36; Provisional
548-1002 4.73e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 4.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  548 PPTPSAPGTPEPTEMQSPSTRE------------ISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKC-PEVLGPGEAAS 614
Cdd:PHA03247 2506 PDAPPAPSRLAPAILPDEPVGEpvhprmltwirgLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPaPRPSEPAVTSR 2585
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  615 GRPRMAIPRPRDVRklvkTTYAPGFPAGAQGSGLPAPPADPCGEEGGESKTQEPPALGPPAPAHYTSVFIKDFLPVVPHP 694
Cdd:PHA03247 2586 ARRPDAPPQSARPR----APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  695 YEPPEPSFDTVARDASQPNGVLRRRAENSTAKPFKRTEIRLPGALALGRRPEVTSRVRARGPGGENRDVEAQRLVPDGDG 774
Cdd:PHA03247 2662 SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP 2741
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  775 RTSPLGGARSSSQRSPVGPAGVRSP-RPGSPQMQASPSPGIAPKPKTPPTAPEPAAAVQAPLPREPLALAGRTAPAQPRA 853
Cdd:PHA03247 2742 PAVPAGPATPGGPARPARPPTTAGPpAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  854 AS-----APPTDRSPQSPSQGARRQPGAAPLGKVLVD--------PESGRYYFVEAPRQPRLRVLFDPESGQYVEVLLPP 920
Cdd:PHA03247 2822 ASpagplPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP 2901
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  921 SSPGPPHRVYTPLALGLGLYPPAYGPIPSLSLPPSPGPQALGSPQLPWVSEAGP------------LDGTYYLPVSGTPN 988
Cdd:PHA03247 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsgavpqpwlgalVPGRVAVPRFRVPQ 2981
                         490
                  ....*....|....
gi 238908524  989 PAPPLLLCAPPSSS 1002
Cdd:PHA03247 2982 PAPSREAPASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
359-884 5.36e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 5.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  359 PREAPDRTVQRARSPPFecriPSE--VPSRAVRPRSPSPPR--QTPNGAVRGPRCPSPQNLSPWDRTTRRVSSPLFPEAS 434
Cdd:PHA03247 2560 PPAAPDRSVPPPRPAPR----PSEpaVTSRARRPDAPPQSArpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA 2635
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  435 SEWENQNPAV--EETVSRRSPSPPILSQWNQCVAGERSPSLEAPSLweiphsavadavEPRSSPSPPAFFPWEAPDRPig 512
Cdd:PHA03247 2636 NEPDPHPPPTvpPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ------------RPRRRAARPTVGSLTSLADP-- 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  513 twgPSPQETWDPmgpgssiaftqeaqngltqeelaPPTPSAPGTPEPTEMQSpstreisdlAFGGSQQSPEVAAPEPPGS 592
Cdd:PHA03247 2702 ---PPPPPTPEP-----------------------APHALVSATPLPPGPAA---------ARQASPALPAAPAPPAVPA 2746
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  593 HPVgtldADKCPEVLGPGEAASGRPRMAIPRPRDVRKLVKTTYAPGFPAGAQGSGLPAP--PADPCGEEGGESKTQEPPA 670
Cdd:PHA03247 2747 GPA----TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPwdPADPPAAVLAPAAALPPAA 2822
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  671 LGPPAPAHYTSVfikdfLPVVPHPYEPPEPSFDTVArDASQPNGVLRRRAENSTAkpfkrteirlPGALALGRRPEVTSR 750
Cdd:PHA03247 2823 SPAGPLPPPTSA-----QPTAPPPPPGPPPPSLPLG-GSVAPGGDVRRRPPSRSP----------AAKPAAPARPPVRRL 2886
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  751 VRARGPggenRDVEAQRLVPDGdgrtsplggarsssqrspvgpagvrSPRPGSPQMQASPSPgiapkpktpptapepaaa 830
Cdd:PHA03247 2887 ARPAVS----RSTESFALPPDQ-------------------------PERPPQPQAPPPPQP------------------ 2919
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 238908524  831 vQAPLPREPLALAGRTAPAQPRAASAPPTDRSPQSPSQGARRQP--GAAPLGKVLV 884
Cdd:PHA03247 2920 -QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlGALVPGRVAV 2974
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-287 9.85e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 9.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524    4 ALAPPALPGIPRQLPTAPARRQDSSGSSGSYYTAPGSPEPPDvgpdakgPANWPWVAPGRGAGAqprlsvsaqnsrqrhg 83
Cdd:PHA03247 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD-------PADPPAAVLAPAAAL---------------- 2818
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   84 PGSGFPRGPGSGPRPPQPQLRTLPSGEMEVIFGVGPLFGCSGADDREAQQQftepafisPLPPGPASPAAVPRQSQVPDG 163
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSR--------SPAAKPAAPARPPVRRLARPA 2890
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  164 GSRWATYLELRPRGPSPAAPAQFECVEVALEEGAAPARPRTVPKRQielrPRPQSPPRAAGAPRPRLLLRTGSLDESLGP 243
Cdd:PHA03247 2891 VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP----PRPQPPLAPTTDPAGAGEPSGAVPQPWLGA 2966
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 238908524  244 LQAAAGFVQTALARKLSPEAPAPSSATFGSTGRSEPETRETARS 287
Cdd:PHA03247 2967 LVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS 3010
PHA03247 PHA03247
large tegument protein UL36; Provisional
832-1008 1.05e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 1.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  832 QAPLPREPLALAGrTAPAQPRAASAPPTDRSPQSPSQGARRQPGAAPLGKVLVDPESGRYYFVEAPRQPRLRvlfdpesg 911
Cdd:PHA03247 2594 QSARPRAPVDDRG-DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRP-------- 2664
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  912 QYVEVLLPPSSPGPPHRVYTPLALglglyPPAYGPIPSLSLPPSPGPQALGSPQlPWVSEAGPLDGTyylPVSGTPNPAP 991
Cdd:PHA03247 2665 RRARRLGRAAQASSPPQRPRRRAA-----RPTVGSLTSLADPPPPPPTPEPAPH-ALVSATPLPPGP---AAARQASPAL 2735
                         170
                  ....*....|....*..
gi 238908524  992 PLLLCAPPSSSGPTQPG 1008
Cdd:PHA03247 2736 PAAPAPPAVPAGPATPG 2752
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
137-419 2.13e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 2.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  137 EPAFISPLPPGPASPAAVPRqsQVPDGGSRWATYLELRPRGPSPAAPAQfecvevaleeGAAPARPRTVPKRQIElrpRP 216
Cdd:PRK07003  359 EPAVTGGGAPGGGVPARVAG--AVPAPGARAAAAVGASAVPAVTAVTGA----------AGAALAPKAAAAAAAT---RA 423
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  217 QSPPRAAGAPRPRLLLRTGSLDESLGPLQAAAGFVQTALARKLSPEAPAPSSATFGSTGRSEPETRETARSTHVvlEKAK 296
Cdd:PRK07003  424 EAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAA--APSA 501
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  297 SRPLRVRDNSAPAKAPRPwpslrERAIRRDKPAPGTEPLGPVSSS-------------IFLQSEEKIQEARKTRFPREAP 363
Cdd:PRK07003  502 ATPAAVPDARAPAAASRE-----DAPAAAAPPAPEARPPTPAAAApaaraggaaaaldVLRNAGMRVSSDRGARAAAAAK 576
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 238908524  364 DRTVQRARSPPFECRIPSEVPSravrPRSP-SPPRQTPNGAVRGP-RCPSPQNLSPWD 419
Cdd:PRK07003  577 PAAAPAAAPKPAAPRVAVQVPT----PRARaATGDAPPNGAARAEqAAESRGAPPPWE 630
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
6-203 2.84e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 2.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524    6 APPALPGIPRQLPTAPARRQdssgsSGSYYTAPGSPEPPDVGPDAKGPANWPWVAPGRGAGAQPRLSVSAQNSRQRHGPG 85
Cdd:PRK12323  373 GPATAAAAPVAQPAPAAAAP-----AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   86 SGFPRGPGSGP----RPPQPQLRTLPSGEMEVIFGVGPLFGCSGADDREAQQQFTEPAFISPLPPGPASPAAVPRQSQVP 161
Cdd:PRK12323  448 PAPAPAPAAAPaaaaRPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIP 527
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 238908524  162 DGGSRWATylELRPRGPSPAAPAQFECVEVALEEGAAPARPR 203
Cdd:PRK12323  528 DPATADPD--DAFETLAPAPAAAPAPRAAAATEPVVAPRPPR 567
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
261-594 3.48e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 3.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  261 PEAPAPSSATFGSTGRSEPETRETARSTHVVLEKAKSRPLRVRDNSApAKAPRPWPSLRERAIrrdkPAPGTEPLGPVSS 340
Cdd:PTZ00449  511 PEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV-GKKPGPAKEHKPSKI----PTLSKKPEFPKDP 585
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  341 siflQSEEKIQEARKTRFPReapdrTVQRARSPPfECRIP--SEVPSRAVRPRSPSPPRqtpngavrgpRCPSPQnlspw 418
Cdd:PTZ00449  586 ----KHPKDPEEPKKPKRPR-----SAQRPTRPK-SPKLPelLDIPKSPKRPESPKSPK----------RPPPPQ----- 640
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  419 drttrRVSSPLFPEASSEWENQNPAveetvsrRSPSPPILSQWNQ-------------------CVAGERSPSLEAPSLW 479
Cdd:PTZ00449  641 -----RPSSPERPEGPKIIKSPKPP-------KSPKPPFDPKFKEkfyddyldaaaksketkttVVLDESFESILKETLP 708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  480 EIPHSAVADAVE-PRSSPSPPAFfPWEAPDRPIGTwGPSPQETWDPmgPGSSIAFTQEAQN-----GLTQEELAPPTPSA 553
Cdd:PTZ00449  709 ETPGTPFTTPRPlPPKLPRDEEF-PFEPIGDPDAE-QPDDIEFFTP--PEEERTFFHETPAdtplpDILAEEFKEEDIHA 784
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 238908524  554 PgTPEPTE-MQSPstreisdlafggsqQSPEVAAPEPPGSHP 594
Cdd:PTZ00449  785 E-TGEPDEaMKRP--------------DSPSEHEDKPPGDHP 811
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
196-400 4.49e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 4.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  196 GAAPARPRTVPKRQIE---LRPRPQSPPRAAGAPRPrlllrtgsldeSLGPLQAAAGFVQTALARKLSPEAPAPSSATFG 272
Cdd:PRK12323  371 GAGPATAAAAPVAQPApaaAAPAAAAPAPAAPPAAP-----------AAAPAAAAAARAVAAAPARRSPAPEALAAARQA 439
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  273 STGRSEPETRETARSTHVVLEKAKSRPLRVRDNSAPA-KAPRPWPSLRERAIRRDKPAPGTEPLGPVSSSIFLQSEEKIQ 351
Cdd:PRK12323  440 SARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAaAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPA 519
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 238908524  352 EARKTRFPREApdrtvQRARSPPFECRIPSEVPSRAVRPRSPSPPRQTP 400
Cdd:PRK12323  520 GWVAESIPDPA-----TADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
380-606 5.19e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 40.63  E-value: 5.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  380 PSEVPSRAVRPRSPSPPRQTPNGAVRGPRCPSPqnlspwdrttRRVSSPLFPEASSEWENQNPAVEETVSRRSPSPPILS 459
Cdd:PRK12323  365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAP----------APAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  460 QWNQCVAGERSPSLE-APSLWEIPHSAVADAVEPrSSPSPPAFFPWEAPDRPIGTWGPSPQET--WDPMGPgssiaftQE 536
Cdd:PRK12323  435 AARQASARGPGGAPApAPAPAAAPAAAARPAAAG-PRPVAAAAAAAPARAAPAAAPAPADDDPppWEELPP-------EF 506
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  537 AQNGLTQEELAPPTPSAPGTPEPTEMQSPSTREISDLAFGGSQQSPEVAAPEPPGSHPVGTLDADKCPEV 606
Cdd:PRK12323  507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-321 7.84e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 7.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524    6 APPALPGIPRQLPTAPARRQDSSGSSGSYYTA-PGSPEPPDVGPDAKGPANwPWVAPGRGAGAQPRLSVSAQNSRQRHGP 84
Cdd:PHA03247 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPAtPGGPARPARPPTTAGPPA-PAPPAAPAAGPPRRLTRPAVASLSESRE 2796
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524   85 GSGFPRGPGSGPRPPQPQLRTLPSgemevifgvgplfgcsgaddrEAQQQFTEPAFISPLPPGPASPAAVPRQSQVPDGG 164
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPP---------------------AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 238908524  165 SrwATYLELRPRGPSPAAPAQfecvevaleegaaPARPRTVPKRQIELRPRPQSPPRAAGAPRPRLLLRTGSLDESLGPL 244
Cdd:PHA03247 2856 V--APGGDVRRRPPSRSPAAK-------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ 2920
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 238908524  245 QAAAGFVQTALARKLSPEAPAPSSATFGSTGRSEPETRETA-RSTHVVLEKAKSRPLRVRDNSAPAKAPRPWPSLRER 321
Cdd:PHA03247 2921 PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQpWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTG 2998
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH