NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|578817790|ref|XP_006717305|]
View 

snRNA-activating protein complex subunit 4 isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
817-1215 1.91e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.21  E-value: 1.91e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  817 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 896
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  897 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 970
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  971 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1050
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1051 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAGLLATLLPPLTEtRAAQGPR 1126
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTESFALPPDQPERP-PQPQAPP 2915
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1127 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*....
gi 578817790 1207 PAFGGVIPA 1215
Cdd:PHA03247 2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.00e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.00e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 578817790    401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.30e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.93  E-value: 2.30e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578817790   297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
PLN03091 super family cl33633
hypothetical protein; Provisional
399-502 2.69e-08

hypothetical protein; Provisional


The actual alignment was detected with superfamily member PLN03091:

Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.06  E-value: 2.69e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100
                  ....*....|....*....|....*....
gi 578817790  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.44e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.44e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 578817790    346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT super family cl21498
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
262-305 1.36e-04

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


The actual alignment was detected with superfamily member pfam13921:

Pssm-ID: 473887 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 578817790   262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
817-1215 1.91e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.21  E-value: 1.91e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  817 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 896
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  897 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 970
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  971 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1050
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1051 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAGLLATLLPPLTEtRAAQGPR 1126
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTESFALPPDQPERP-PQPQAPP 2915
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1127 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*....
gi 578817790 1207 PAFGGVIPA 1215
Cdd:PHA03247 2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.00e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.00e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 578817790    401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
404-447 1.15e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 66.44  E-value: 1.15e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 578817790  404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167     2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
401-447 2.37e-12

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 62.52  E-value: 2.37e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 578817790   401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.30e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.93  E-value: 2.30e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578817790   297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
PLN03091 PLN03091
hypothetical protein; Provisional
399-502 2.69e-08

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.06  E-value: 2.69e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100
                  ....*....|....*....|....*....
gi 578817790  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
294-342 2.73e-08

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 51.46  E-value: 2.73e-08
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 578817790    294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.44e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.44e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 578817790    346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
290-339 1.64e-07

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 49.23  E-value: 1.64e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 578817790  290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
256-357 2.99e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.79  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147    29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                          90       100
                  ....*....|....*....|....*...
gi 578817790  330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147   106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
836-1230 4.44e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 4.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   836 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 908
Cdd:pfam17823   11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   909 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 982
Cdd:pfam17823   90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   983 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1062
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1063 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1142
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1143 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSShadPPEAEPPWSGRLPAfggVIPATEPRGTP 1222
Cdd:pfam17823  322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSM---IPEVEATSPTTQPS---PLLPTQGAAGP 395

                   ....*...
gi 578817790  1223 GSPSGTQE 1230
Cdd:pfam17823  396 GILLAPEQ 403
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
334-453 5.18e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.02  E-value: 5.18e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147     6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 578817790  406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147    77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
349-412 3.31e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 45.76  E-value: 3.31e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578817790   349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
470-496 4.31e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 44.87  E-value: 4.31e-06
                          10        20
                  ....*....|....*....|....*..
gi 578817790  470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167    16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
470-496 5.70e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 44.52  E-value: 5.70e-06
                            10        20
                    ....*....|....*....|....*..
gi 578817790    470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:smart00717   18 KYGKNNWEKIAKELPGRTAEQCRERWR 44
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
399-495 4.68e-05

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 47.86  E-value: 4.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                          90
                  ....*....|....*..
gi 578817790  479 IASELPHRSGSQCLSKW 495
Cdd:COG5147    97 IADYKDRRTAQQCVERY 113
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
262-305 1.36e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 578817790   262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
193-359 1.91e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618  220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618  298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
                          170
                   ....*....|....*
gi 578817790   349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618  370 iscqQHTLTQHIHTL 384
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
817-1215 1.91e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.21  E-value: 1.91e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  817 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 896
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  897 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 970
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  971 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1050
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1051 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAGLLATLLPPLTEtRAAQGPR 1126
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTESFALPPDQPERP-PQPQAPP 2915
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1127 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*....
gi 578817790 1207 PAFGGVIPA 1215
Cdd:PHA03247 2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.00e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.00e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 578817790    401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
404-447 1.15e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 66.44  E-value: 1.15e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 578817790  404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167     2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
PHA03247 PHA03247
large tegument protein UL36; Provisional
812-1232 3.54e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.97  E-value: 3.54e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  812 RKALPPRLPQAGARD--PPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPK 889
Cdd:PHA03247 2572 RPAPRPSEPAVTSRArrPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPER 2651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  890 PKTVSELLQEKRLQEAR-------AREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAA 961
Cdd:PHA03247 2652 PRDDPAPGRVSRPRRARrlgraaqASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPpGPAAARQA 2731
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  962 KPGTSGSWQEAGTSAKdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLgqSQAPAASRKQGLPEAP 1041
Cdd:PHA03247 2732 SPALPAAPAPPAVPAG----------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL--TRPAVASLSESRESLP 2799
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1042 pfLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSlprPAGTPGPAGLLATLLPPLTETRA 1121
Cdd:PHA03247 2800 --SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP---LGGSVAPGGDVRRRPPSRSPAAK 2874
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1122 AQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAdgsvafvPGEAQVAREIPEPRTSSHADPPEAEPP 1201
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ-------PQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
                         410       420       430
                  ....*....|....*....|....*....|.
gi 578817790 1202 WSGRLPAFGGVIPAtePRGTPGSPSGTQEPR 1232
Cdd:PHA03247 2948 DPAGAGEPSGAVPQ--PWLGALVPGRVAVPR 2976
PHA03247 PHA03247
large tegument protein UL36; Provisional
812-1307 8.42e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.82  E-value: 8.42e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  812 RKALPPRLPQA--GARDPPvHLLQASSSAQSTPGHLFPNVPAQEASKSASH-------KGSRRLASSRV---ERTLPQAS 879
Cdd:PHA03247 2481 RRPAEARFPFAagAAPDPG-GGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwiRGLEELASDDAgdpPPPLPPAA 2559
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  880 LLASTG-----PRPKPKTVSELLQEKRLQEARAREATRG--PVVLPSQLLVSSSVILQPPLPHTPHgRPAPGPTVLNVPL 952
Cdd:PHA03247 2560 PPAAPDrsvppPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRGPAPPSPLPPDTHAPD-PPPPSPSPAANEP 2638
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  953 SGPGaPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASqaPALGPGQISVSCPESGLGQSQAP-AA 1031
Cdd:PHA03247 2639 DPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR--PTVGSLTSLADPPPPPPTPEPAPhAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1032 SRKQGLPEAP-------PFLPAAPSPTPLPVQPLSlthIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVvSLPRPAGTP- 1103
Cdd:PHA03247 2716 VSATPLPPGPaaarqasPALPAAPAPPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASl 2791
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1104 GPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANmnrEPEPSCRTDTPAPPTHALSQSPAEADGSVAfvPGeAQVARE 1183
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP---LPPPTSAQPTAPPPPPGPPPPSLPLGGSVA--PG-GDVRRR 2865
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1184 IP------------EPRTSSHADPPEAEPPWSGRLPAFGgviPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGA 1251
Cdd:PHA03247 2866 PPsrspaakpaapaRPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 578817790 1252 LDLEKPPLPQPGPEKGALDlgllsqegeaatqQWLGGQRGVRVPLLGSRLPYQPPA 1307
Cdd:PHA03247 2943 LAPTTDPAGAGEPSGAVPQ-------------PWLGALVPGRVAVPRFRVPQPAPS 2985
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
401-447 2.37e-12

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 62.52  E-value: 2.37e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 578817790   401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
404-457 3.28e-11

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 60.02  E-value: 3.28e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 578817790   404 WAPEEDAKLLQAVAKYGeQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWN 457
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWS 53
PHA03378 PHA03378
EBNA-3B; Provisional
901-1267 5.12e-10

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 64.32  E-value: 5.12e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  901 RLQEARAREATRGPVVL----PSQLLVSSSVILQPPLPHTPHgRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSA 976
Cdd:PHA03378  437 RTEQPRATPHSQAPTVVlhrpPTQPLEGPTGPLSVQAPLEPW-QPLPHPQVTPVILHQPPAQGVQAHGSMLDLLEKDDED 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  977 KDKRLSTMQALPLAP----------VFSE---AEGTAPAASQA------PALGPGQISV-------------SCPESGLG 1024
Cdd:PHA03378  516 MEQRVMATLLPPSPPqpragrrapcVYTEdldIESDEPASTEPvhdqllPAPGLGPLQIqpltspttsqlasSAPSYAQT 595
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1025 QSQAPAASRKQGLPEAPPFLPA--APSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGT 1102
Cdd:PHA03378  596 PWPVPHPSQTPEPPTTQSHIPEtsAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQ 675
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1103 PGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSV---AFVPGEAQ 1179
Cdd:PHA03378  676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRArppAAAPGRAR 755
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1180 VAREIPEPRTSSHADPPEAEPpwsgRLPAFGGVIPATEPRgtpGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPL 1259
Cdd:PHA03378  756 PPAAAPGRARPPAAAPGAPTP----QPPPQAPPAPQQRPR---GAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828

                  ....*...
gi 578817790 1260 PQPGPEKG 1267
Cdd:PHA03378  829 LTGGVKRG 836
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.30e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.93  E-value: 2.30e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578817790   297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
PLN03091 PLN03091
hypothetical protein; Provisional
399-502 2.69e-08

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.06  E-value: 2.69e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100
                  ....*....|....*....|....*....
gi 578817790  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
294-342 2.73e-08

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 51.46  E-value: 2.73e-08
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 578817790    294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
814-1200 5.24e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.69  E-value: 5.24e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  814 ALPPRLPQAGARDPPVhlLQASSSAQSTPghlfPNVPAQEASKSASHKGSRRLASSrvertlPQASLLASTGPRPKPKTV 893
Cdd:PRK07764  380 RLERRLGVAGGAGAPA--AAAPSAAAAAP----AAAPAPAAAAPAAAAAPAPAAAP------QPAPAPAPAPAPPSPAGN 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  894 SELLQEKRLQEARAREATRGPVVLPSQllvssSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPgtSGSWQEAG 973
Cdd:PRK07764  448 APAGGAPSPPPAAAPSAQPAPAPAAAP-----EPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL--RERWPEIL 520
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  974 TSAKDKRLSTMQAL---------------------PLAPVFSEAE-----------------------GTAPAASQAPAL 1009
Cdd:PRK07764  521 AAVPKRSRKTWAILlpeatvlgvrgdtlvlgfstgGLARRFASPGnaevlvtalaeelggdwqveavvGPAPGAAGGEGP 600
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1010 GPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPV- 1088
Cdd:PRK07764  601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAa 680
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1089 PVPAVVSLPRPAGTPGPAGLLATLLPPlteTRAAQGPRAPAlsSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAD 1168
Cdd:PRK07764  681 PPPAPAPAAPAAPAGAAPAQPAPAPAA---TPPAGQADDPA--AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP 755
                         410       420       430
                  ....*....|....*....|....*....|..
gi 578817790 1169 GSVAFVPGEAQVAREIPEPRTSSHADPPEAEP 1200
Cdd:PRK07764  756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAE 787
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.44e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.44e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 578817790    346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
290-339 1.64e-07

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 49.23  E-value: 1.64e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 578817790  290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
820-1227 2.99e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.56  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  820 PQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRvERTLPQASLLASTGPRPKPKTVSELLQE 899
Cdd:PHA03307   24 PPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPG-PGTEAPANESRSTPTWSLSTLAPASPAR 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  900 KRLQEARAREATRGPVVLPSQLLVSSSVilQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDK 979
Cdd:PHA03307  103 EGSPTPPGPSSPDPPPPTPPPASPPPSP--APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  980 RLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQA---------PAASRKQGLPEAPPFLPAAPSP 1050
Cdd:PHA03307  181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAAddagasssdSSSSESSGCGWGPENECPLPRP 260
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1051 TPLPVQPLSLTHIgGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP-RAPA 1129
Cdd:PHA03307  261 APITLPTRIWEAS-GWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSEsSRGA 339
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1130 LSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLPAF 1209
Cdd:PHA03307  340 AVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
                         410
                  ....*....|....*...
gi 578817790 1210 GGVIPATEPRGTPGSPSG 1227
Cdd:PHA03307  420 GAASGAFYARYPLLTPSG 437
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
256-357 2.99e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.79  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147    29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                          90       100
                  ....*....|....*....|....*...
gi 578817790  330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147   106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
296-340 3.14e-07

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 47.96  E-value: 3.14e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 578817790  296 EWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPG-RTPKQCRERWRNL 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
836-1230 4.44e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 4.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   836 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 908
Cdd:pfam17823   11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   909 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 982
Cdd:pfam17823   90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   983 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1062
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1063 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1142
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1143 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSShadPPEAEPPWSGRLPAfggVIPATEPRGTP 1222
Cdd:pfam17823  322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSM---IPEVEATSPTTQPS---PLLPTQGAAGP 395

                   ....*...
gi 578817790  1223 GSPSGTQE 1230
Cdd:pfam17823  396 GILLAPEQ 403
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
930-1264 5.02e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 5.02e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  930 QPPLPHTPHGRPAPGPtvlnvplSGPGAPAAAKPGTSGSWQEAGT-SAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPA 1008
Cdd:PRK07764  397 AAPSAAAAAPAAAPAP-------AAAAPAAAAAPAPAAAPQPAPApAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1009 LGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAA---------------------------PSPTPLPVQP--LS 1059
Cdd:PRK07764  470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDaatlrerwpeilaavpkrsrktwaillPEATVLGVRGdtLV 549
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1060 LTH--------IGGPHVATSVplpVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALS 1131
Cdd:PRK07764  550 LGFstgglarrFASPGNAEVL---VT-ALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPA 625
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1132 SSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEA-----DGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PRK07764  626 APAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDasdggDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 578817790 1207 PAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGP 1264
Cdd:PRK07764  706 AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
876-1248 5.05e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.77  E-value: 5.05e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   876 PQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLpsqllVSSSVILQP---PLPHTP-HGRPAPGPTVLNVP 951
Cdd:pfam03154  189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTL-----IQQTPTLHPqrlPSPHPPlQPMTQPPPPSQVSP 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   952 LSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTmQALPLAPVFSEAEGTAPAASQAPalgpgqisvscpesglGQSQApaa 1031
Cdd:pfam03154  264 QPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPP-QPFPLTPQSSQSQVPPGPSPAAP----------------GQSQQ--- 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1032 srkqgLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQG-LLPVPVPAVVSLPRPAGTPGPAGLLA 1110
Cdd:pfam03154  324 -----RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1111 TLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPA-----PPTHALSQSPAEAD-GSVAFVPGEAQVAREI 1184
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaashPPTSGLHQVPSQSPfPQHPFVPGGPPPITPP 478
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1185 PEPRTSSHADPPEAEPPWSGRlPAFGGVIPATEPRGTPG------SPSGTQEPRGPlgleKLPLRQPGPE 1248
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSSAS-VSSSGPVPAAVSCPLPPvqikeeALDEAEEPESP----PPPPRSPSPE 543
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
334-453 5.18e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.02  E-value: 5.18e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147     6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 578817790  406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147    77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
878-1234 1.52e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.25  E-value: 1.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  878 ASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVIlqPPLPHTPHGRPAPGPTVLNVPLSGPGA 957
Cdd:PHA03307   57 AGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT--PPGPSSPDPPPPTPPPASPPPSPAPDL 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  958 PAAAKPGTSGSwqeagtsakdKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESG---LGQSQAPAASRK 1034
Cdd:PHA03307  135 SEMLRPVGSPG----------PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPpaePPPSTPPAAASP 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1035 QGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLP 1114
Cdd:PHA03307  205 RPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC-GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1115 PLTETRAAQGPRAPALSSSwqppanmnrepepSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHAD 1194
Cdd:PHA03307  284 PASSSSSPRERSPSPSPSS-------------PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 578817790 1195 PPEAEPPwsgrlPAFGGVIPATEPRGTPGSPSGTQEPRGP 1234
Cdd:PHA03307  351 PSPSRPP-----PPADPSSPRKRPRPSRAPSSPAASAGRP 385
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
349-412 3.31e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 45.76  E-value: 3.31e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578817790   349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
960-1170 4.24e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.42  E-value: 4.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  960 AAKPGTSGSWQEAGTSAKDKRLSTMQAL----PLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQ 1035
Cdd:PRK12323  362 AFRPGQSGGGAGPATAAAAPVAQPAPAAaapaAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1036 GLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGP----AGLLAT 1111
Cdd:PRK12323  442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpdAAPAGW 521
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578817790 1112 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGS 1170
Cdd:PRK12323  522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
470-496 4.31e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 44.87  E-value: 4.31e-06
                          10        20
                  ....*....|....*....|....*..
gi 578817790  470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167    16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1000-1270 4.96e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1000 APAASQAPAlGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVAtsvPLPVTWV 1079
Cdd:PRK07003  361 AVTGGGAPG-GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA---PAPPATA 436
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1080 LTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSswqppanmnREPEPSCrtdtpAPPTHA 1159
Cdd:PRK07003  437 DRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA---------FEPAPRA-----AAPSAA 502
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1160 LSQSPAEADGSVAFVPGEAQVAREIPEPRTSShADPPEAEPPWSGrlpafGGVIPATEPRGTPGSPSGTQEPRGPLGLEK 1239
Cdd:PRK07003  503 TPAAVPDARAPAAASREDAPAAAAPPAPEARP-PTPAAAAPAARA-----GGAAAALDVLRNAGMRVSSDRGARAAAAAK 576
                         250       260       270
                  ....*....|....*....|....*....|.
gi 578817790 1240 LPLRQPGPEKGALDLEKPPLPQPGPEKGALD 1270
Cdd:PRK07003  577 PAAAPAAAPKPAAPRVAVQVPTPRARAATGD 607
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
470-496 5.70e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 44.52  E-value: 5.70e-06
                            10        20
                    ....*....|....*....|....*..
gi 578817790    470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:smart00717   18 KYGKNNWEKIAKELPGRTAEQCRERWR 44
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
987-1215 8.81e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 8.81e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  987 LPLAPVFSEAEGTAPAASQAPALGPG----QISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1062
Cdd:PRK12323  361 LAFRPGQSGGGAGPATAAAAPVAQPApaaaAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQAS 440
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1063 IGGPhVATSVPLPVtwvltaqgllPVPVPAVVSLPRPAGTPGPAglLATLLPPLTETRAAQGPRAPALSSSWQPPANMNR 1142
Cdd:PRK12323  441 ARGP-GGAPAPAPA----------PAAAPAAAARPAAAGPRPVA--AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578817790 1143 EPEPSCRTDTPAPPTHALSQSPAEADGSVAF-VPGEAQVAREIPEPRTSSHADPPEAEPPWS--GRLPAFGGVIPA 1215
Cdd:PRK12323  508 SPAPAQPDAAPAGWVAESIPDPATADPDDAFeTLAPAPAAAPAPRAAAATEPVVAPRPPRASasGLPDMFDGDWPA 583
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
931-1138 9.92e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 9.92e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  931 PPLPHTPHGRPAPG---PTVLNVPLSGPGAPAAAKPGTSGSWQEAGtSAKDKRLSTMQALPLApvfSEAEGTAPAASQAP 1007
Cdd:PRK12323  375 ATAAAAPVAQPAPAaaaPAAAAPAPAAPPAAPAAAPAAAAAARAVA-AAPARRSPAPEALAAA---RQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1008 ALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAqglLP 1087
Cdd:PRK12323  451 APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES---IP 527
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 578817790 1088 VPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAqgPRAPALSSSWQPPA 1138
Cdd:PRK12323  528 DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA--PRPPRASASGLPDM 576
PHA03247 PHA03247
large tegument protein UL36; Provisional
930-1272 1.69e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 1.69e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  930 QPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGT---SAKDKRLSTMQALPLAPVFSEaegtaPAASQA 1006
Cdd:PHA03247 2414 QPDPPGPPDVRFVGSEEIEELPFVSPGGDVLAGLAADGDPFFARTilgAPFSLSLLLGELFPGAPVYRR-----PAEARF 2488
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1007 P-ALGPGqisvscPESGLGQSQAPAASRKQGLPeAPPFLPAAPSPTPLPVQPLSLTH---------IGGPhvatSVPLPv 1076
Cdd:PHA03247 2489 PfAAGAA------PDPGGGGPPDPDAPPAPSRL-APAILPDEPVGEPVHPRMLTWIRgleelasddAGDP----PPPLP- 2556
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1077 twvltaqgllPVPVPAVV--SLPRPAGTPGPAGLLAtllpplteTRAAQGPRAPALSSSWQPPANmNREPEPSCRTDTPA 1154
Cdd:PHA03247 2557 ----------PAAPPAAPdrSVPPPRPAPRPSEPAV--------TSRARRPDAPPQSARPRAPVD-DRGDPRGPAPPSPL 2617
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1155 PPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLP--AFGGVIPATEPR------------- 1219
Cdd:PHA03247 2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgrAAQASSPPQRPRrraarptvgslts 2697
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 578817790 1220 -GTPGSPSGTQEPRGPLGLEKLPLrQPGPEKGALDLEKPPL---PQPGPEKGALDLG 1272
Cdd:PHA03247 2698 lADPPPPPPTPEPAPHALVSATPL-PPGPAAARQASPALPAapaPPAVPAGPATPGG 2753
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
294-340 2.30e-05

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 42.88  E-value: 2.30e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 578817790   294 KQEWSREEEERLQAIAAAHGHlEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPG-RTDNQCKNRWQNY 45
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
903-1202 3.29e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.69  E-value: 3.29e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  903 QEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEaGTSAKDKRLS 982
Cdd:PRK07003  372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGD-DAADGDAPVP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  983 TMQALPLAPVFSEAEGTA-PAASQAPALGPGqisvscpesglgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1061
Cdd:PRK07003  451 AKANARASADSRCDERDAqPPADSGSASAPA-------------SDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAAS 517
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1062 HIGGPHVAtSVPLPvtwvltaqgLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMN 1141
Cdd:PRK07003  518 REDAPAAA-APPAP---------EARPPTPAAAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKP 587
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578817790 1142 REPEPSCRTDTPAPPTHALSQSPAEAdgsvafvpgeaqvareipePRTSSHADPPEAEPPW 1202
Cdd:PRK07003  588 AAPRVAVQVPTPRARAATGDAPPNGA-------------------ARAEQAAESRGAPPPW 629
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1064-1302 3.90e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.33  E-value: 3.90e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1064 GGPHVATSVPLPVTWVLtaqgllPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNRE 1143
Cdd:PRK12323  370 GGAGPATAAAAPVAQPA------PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1144 PEPSCRTDTPAPPTHALSQSPAEADgsvafvpgeaqvareiPEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPG 1223
Cdd:PRK12323  444 PGGAPAPAPAPAAAPAAAARPAAAG----------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1224 SPSGTQEPRGPLGLEKLPLRQPG---PEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSR 1300
Cdd:PRK12323  508 SPAPAQPDAAPAGWVAESIPDPAtadPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAAR 587

                  ..
gi 578817790 1301 LP 1302
Cdd:PRK12323  588 LP 589
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
399-495 4.68e-05

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 47.86  E-value: 4.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                          90
                  ....*....|....*..
gi 578817790  479 IASELPHRSGSQCLSKW 495
Cdd:COG5147    97 IADYKDRRTAQQCVERY 113
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
931-1216 4.74e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.99  E-value: 4.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   931 PPLPHTPHGRPAP---GPTVLNVPLSGP---GAPAAAKPGT-SGSWQEAGTSAKDKRLSTmqalplaPVFSEAEGTAPAA 1003
Cdd:pfam05109  449 PSSTHVPTNLTAPastGPTVSTADVTSPtpaGTTSGASPVTpSPSPRDNGTESKAPDMTS-------PTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1004 SQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQP-LSLThigGPHVATSVPLPVTWVLTA 1082
Cdd:pfam05109  522 SPTPAVTTPTPNATSPTLG---KTSPTSAVTTPTPNATSPTPAVTTPTPNATIPtLGKT---SPTSAVTTPTPNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1083 QGLLP------------VPVPAVVSLPRPAGTPGPAGLLATLLppltETRAAQGPRAPALSSSWQPPANMNR-------- 1142
Cdd:pfam05109  596 GETSPqanttnhtlggtSSTPVVTSPPKNATSAVTTGQHNITS----SSTSSMSLRPSSISETLSPSTSDNStshmpllt 671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  1143 EPEPS-----------------CRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPePRTSSHADPPEAEPPWSGR 1205
Cdd:pfam05109  672 SAHPTggenitqvtpaststhhVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTP-PKNATSPQAPSGQKTAVPT 750
                          330
                   ....*....|.
gi 578817790  1206 LPAFGGVIPAT 1216
Cdd:pfam05109  751 VTSTGGKANST 761
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
397-443 5.69e-05

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 41.91  E-value: 5.69e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 578817790  397 PGLKKGYWAPEEDAKLLQAVAKYGEQdWFKIREEVpGRSDAQCRDRY 443
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHLAKLLPTQ-WRTIAPIV-GRTAQQCLERY 45
PHA03247 PHA03247
large tegument protein UL36; Provisional
559-1059 6.24e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 6.24e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  559 LLSPQYMVPDMDLWVPARQSTSQPWRGGAGAWLGGPAAslsPPKGSSASQGGSKEASTTAAAPgeetsPVQVPARAHGPV 638
Cdd:PHA03247 2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDA---PPQSARPRAPVDDRGDPRGPAP-----PSPLPPDTHAPD 2625
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  639 PRSAQASHSADTRPAGAEKQALEGGRRLLTVPVETVLRVLRANTAARSCTQKEQLRQPPLPTSSPGVSSGDSVARSHVQw 718
Cdd:PHA03247 2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  719 lrHRATQSGQRRWRHALHRRLLNRRLLLAVTPWVGDVVVPCTQASqrPAVVQTQADGLREQLQQARLASTPvftlftqlf 798
Cdd:PHA03247 2705 --PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG--PATPGGPARPARPPTTAGPPAPAP--------- 2771
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  799 hidtagclevvrerKALPPRLPQAGARDPPVhllqaSSSAQSTPGHLFPNVPAqEASKSASHKGSRRLASSRVERTLPQA 878
Cdd:PHA03247 2772 --------------PAAPAAGPPRRLTRPAV-----ASLSESRESLPSPWDPA-DPPAAVLAPAAALPPAASPAGPLPPP 2831
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  879 SLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSvilQPPLPHTPhgRPAPGPTVLNVPLSGPGAP 958
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPA---RPPVRRLA--RPAVSRSTESFALPPDQPE 2906
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  959 AAAKPgtsgswqEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQ--SQAPAASRKQG 1036
Cdd:PHA03247 2907 RPPQP-------QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAlvPGRVAVPRFRV 2979
                         490       500
                  ....*....|....*....|...
gi 578817790 1037 LPEAPPFLPAAPSPTPLPVQPLS 1059
Cdd:PHA03247 2980 PQPAPSREAPASSTPPLTGHSLS 3002
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
938-1156 7.36e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 7.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  938 HGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRlstmqalplAPVFSEAEGTAPAASQAPALGPGQISVS 1017
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA---------AAAPAEASAAPAPGVAAPEHHPKHVAVP 660
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1018 CPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLP 1097
Cdd:PRK07764  661 DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578817790 1098 RPaGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPAnmnrEPEPSCRTDTPAPP 1156
Cdd:PRK07764  741 LP-PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS----EEEEMAEDDAPSMD 794
PHA03378 PHA03378
EBNA-3B; Provisional
874-1209 1.19e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 1.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  874 TLPQASLLASTGP----RPKPKTVSELLQEKRLQEARAREAT---RGPVVL---PSQLLVSSSVILQPPLPHTPHGRPAP 943
Cdd:PHA03378  578 TSPTTSQLASSAPsyaqTPWPVPHPSQTPEPPTTQSHIPETSaprQWPMPLrpiPMRPLRMQPITFNVLVFPTPHQPPQV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  944 GPTVLNV----PLSGPGAPAAAKPGTSGSWQEAGTsakdkrlsTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCP 1019
Cdd:PHA03378  658 EITPYKPtwtqIGHIPYQPSPTGANTMLPIQWAPG--------TMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAA 729
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1020 ESGLGQSQAPAASRKQGlPEAPPFLPAAPSPTPLPVQPLSlthiGGPHVATSVPLPVTWVLTAQ----GLLPVPVPAV-- 1093
Cdd:PHA03378  730 APGRARPPAAAPGRARP-PAAAPGRARPPAAAPGRARPPA----AAPGAPTPQPPPQAPPAPQQrprgAPTPQPPPQAgp 804
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1094 ----VSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSwQPPANMNREPEPSCRTDT-PAPPTHALSQSPAEAD 1168
Cdd:PHA03378  805 tsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALER-QAAAGPTPSPGSGTSDKIvQAPVFYPPVLQPIQVM 883
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578817790 1169 GSVAFV---------------PGEAQVA-----REIPEPRTSSHADPPEAEPPWSGRLPAF 1209
Cdd:PHA03378  884 RQLGSVraaaastvtqapteyTGERRGVgpmhpTDIPPSKRAKTDAYVESQPPHGGQSHSF 944
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
404-443 1.36e-04

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 41.01  E-value: 1.36e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 578817790  404 WAPEEDAKLLQAVAKYGEQDWFKIREE---VPGRSDAQCRDRY 443
Cdd:cd11660     3 WTDEEDEALVEGVEKYGVGNWAKILKDyffVNNRTSVDLKDKW 45
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
262-305 1.36e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 578817790   262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
292-411 3.11e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 3.11e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  292 INKQEWSREEEERLQAIAAAHGHLEWQKIAEELgTSRSAFQC-LQKFQQHNKALKRKEWTEEEDRMLTQLVQEMrvGSHI 370
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLL-ISSTGKQSsNRWNNHLNPQLKKKNWSEEEDEQLIDLDKEL--GTQW 94
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 578817790  371 pyRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAK 411
Cdd:COG5147    95 --STIADYKDRRTAQQCVERYVNTLEDLSSTHDSKLQRRNE 133
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
233-375 3.50e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 3.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  233 KQGREAEKEiQDINQLPE-----EALLGNRLDSHDWEKISNINFE----GSRSAEEIRKFWQNSEHPSINKQEWSREEEE 303
Cdd:COG5147   222 KKGETLALE-QEINEYKEkkglsRKQFCERIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQ 300
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578817790  304 RLQAIAAAHGHLeWQKIAEELGTSRSafQCLQKFQQHNK---ALKRKEWTEEEDRMLTQLVQEMRVGSHiPYRRI 375
Cdd:COG5147   301 ELAKLVVEHGGS-WTEIGKLLGRMPN--DCRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRI 371
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
792-1319 5.91e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 44.48  E-value: 5.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  792 TLFTQLFHIDTAGCLEVVRERKALPPRLP----QAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLA 867
Cdd:COG3321   839 QLWVAGVPVDWSALYPGRGRRRVPLPTYPfqreDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAA 918
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  868 SSRVERTLPQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTV 947
Cdd:COG3321   919 LALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAA 998
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  948 LNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQ 1027
Cdd:COG3321   999 AAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELA 1078
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1028 APAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAG 1107
Cdd:COG3321  1079 LAAAALALAAALAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAA 1158
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1108 LLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEP 1187
Cdd:COG3321  1159 LAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLAL 1238
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1188 RTSSHADPPEAE-PPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEK 1266
Cdd:COG3321  1239 AAAAAAVAALAAaAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAA 1318
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|...
gi 578817790 1267 GALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSRLPYQPPALCSLRALSGLLL 1319
Cdd:COG3321  1319 AALAAALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAA 1371
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
966-1235 7.10e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 7.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  966 SGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPflP 1045
Cdd:PHA03307   37 SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLS---TLAPASPAREGSPTPPG--P 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1046 AAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGllpVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP 1125
Cdd:PHA03307  112 SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA---SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSS 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1126 RAPALSSSWQPPANMNREPEP------SCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVARE----IPEPRTSSHADP 1195
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRsspisaSASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEnecpLPRPAPITLPTR 268
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 578817790 1196 PEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPL 1235
Cdd:PHA03307  269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPA 308
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1006-1307 1.06e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1006 APALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPVQplslthigGPHVATSVPLPVTWVLTAQGL 1085
Cdd:PRK07764  385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAA--APAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAP 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1086 LPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPAlSSSWQPPANMNREPEPS-----------CRTDTPA 1154
Cdd:PRK07764  455 SPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA-APAAPAGADDAATLRERwpeilaavpkrSRKTWAI 533
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1155 PPTHAlsqSPAEADGSV---AFV----------PGEAQVAREIPEPRT-------------SSHADPPEAEPPWSGRLPA 1208
Cdd:PRK07764  534 LLPEA---TVLGVRGDTlvlGFStgglarrfasPGNAEVLVTALAEELggdwqveavvgpaPGAAGGEGPPAPASSGPPE 610
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1209 FGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLG- 1287
Cdd:PRK07764  611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAp 690
                         330       340
                  ....*....|....*....|.
gi 578817790 1288 -GQRGVRVPLLGSRLPYQPPA 1307
Cdd:PRK07764  691 aAPAGAAPAQPAPAPAATPPA 711
PLN03212 PLN03212
Transcription repressor MYB5; Provisional
398-502 1.11e-03

Transcription repressor MYB5; Provisional


Pssm-ID: 178751 [Multi-domain]  Cd Length: 249  Bit Score: 42.37  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  398 GLKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVP-GRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGvGHW 476
Cdd:PLN03212   22 GMKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGlLRCGKSCRLRWMNYLRPSVKRGGITSDEEDLILRLHRLLG-NRW 100
                          90       100
                  ....*....|....*....|....*.
gi 578817790  477 AKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03212  101 SLIAGRIPGRTDNEIKNYWNTHLRKK 126
PHA03247 PHA03247
large tegument protein UL36; Provisional
942-1186 1.25e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  942 APGPTVLNVPLSGpGAPAAAKPGTSGSWQ-EAGTSAKDKRlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPE 1020
Cdd:PHA03247  254 APAPPPVVGEGAD-RAPETARGATGPPPPpEAAAPNGAAA-------PPDGVWGAALAGAPLALPAPPDPPPPAPAGDAE 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1021 SGLGQSQAPAASRKQGLPEA--PPFLPAAPSPTPLPvqPLSLTHI-GGPHVATSVPLPVTWVLTA--------------- 1082
Cdd:PHA03247  326 EEDDEDGAMEVVSPLPRPRQhyPLGFPKRRRPTWTP--PSSLEDLsAGRHHPKRASLPTRKRRSArhaatpfargpggdd 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1083 QGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTEtraAQGPRAPALSSSWQPPANMNREPEPSCRTDT---------- 1152
Cdd:PHA03247  404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE---PGSDDGPAPPPERQPPAPATEPAPDDPDDATrkaldalrer 480
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 578817790 1153 --PAPPTHALSQ----SPAEADGSVAFVPGEAQVAREIPE 1186
Cdd:PHA03247  481 rpPEPPGADLAEllgrHPDTAGTVVRLAAREAAIAREVAE 520
PHA03379 PHA03379
EBNA-3A; Provisional
676-1234 1.25e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.51  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  676 RVLRANTAARSCTQKEQLRQPPLPTSSpgvssgdsVARSHVQWLRHRATQSGQRRWRHALHRRLLNRRLLLAVTPWVGDV 755
Cdd:PHA03379  388 RLLLMRAGKLTERAREALEKASEPTYG--------TPRPPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQ 459
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  756 --VVPCTQASQRPAVVQTQADGLREQ--LQQARLASTPVFTLFTQLFHIDTAGCLEVvrERKALPPRLPQAGARDP-PVH 830
Cdd:PHA03379  460 hsMAPCPVAQLPPGPLQDLEPGDQLPgvVQDGRPACAPVPAPAGPIVRPWEASLSQV--PGVAFAPVMPQPMPVEPvPVP 537
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  831 LLQASSSAQSTPGHLFPNVPAQEAsksashkGSRRLAssrvERTLPqasllASTGPRPkPKTVSELLQEKRLQEARA-RE 909
Cdd:PHA03379  538 TVALERPVCPAPPLIAMQGPGETS-------GIVRVR----ERWRP-----APWTPNP-PRSPSQMSVRDRLARLRAeAQ 600
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  910 ATRGPV-VLPSQL-LVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTsakdkrlstmQAL 987
Cdd:PHA03379  601 PYQASVeVQPPQLtQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDLPLQQPIS----------QGA 670
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  988 PLAPVFSEAEGTAPAASQAPALgpgqisvscPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPH 1067
Cdd:PHA03379  671 PLAPLRASMGPVPPVPATQPQY---------FDIPLTEPINQGASAAHFLPQQPMEGPLVPERWMFQGATLSQSVRPGVA 741
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1068 VATSVPLPVTWVLTAQGllpvpvPAVVSLPRPAgTPGP-AGLLATLLPPLTETRAAQGPRapALSSSWQPPANMNREPEP 1146
Cdd:PHA03379  742 QSQYFDLPLTQPINHGA------PAAHFLHQPP-MEGPwVPEQWMFQGAPPSQGTDVVQH--QLDALGYVLHVLNHPGVP 812
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1147 ScrtdTPAPPTHALSQS----PAEADGSvafvpGEAQVAREIPEP-RTSSHADPPEAEPPWSGRLPafgGVIPATEPRGT 1221
Cdd:PHA03379  813 V----SPAVNQYHVSQAafglPIDEDES-----GEGSDTSEPCEAlDLSIHGRPCPQAPEWPVQGE---GGQDATEVLDL 880
                         570
                  ....*....|...
gi 578817790 1222 pgSPSGTQEPRGP 1234
Cdd:PHA03379  881 --SIHGRPRPRTP 891
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
193-359 1.91e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618  220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790   269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618  298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
                          170
                   ....*....|....*
gi 578817790   349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618  370 iscqQHTLTQHIHTL 384
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
954-1234 3.17e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 3.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  954 GPGAPAAAKPGtsgswqeagtsakdkrlstmqALPlapvfseaegtAPAASQAPALGPGQISVSCPESGlgqsQAPAASR 1033
Cdd:PRK07003  368 PGGGVPARVAG---------------------AVP-----------APGARAAAAVGASAVPAVTAVTG----AAGAALA 411
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1034 KQGLPEAPPFLPAAPSPTPLPVQplslthiggphVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTP--GPAGLLAT 1111
Cdd:PRK07003  412 PKAAAAAAATRAEAPPAAPAPPA-----------TADRGDDAADGDAPVPAKANARASADSRCDERDAQPpaDSGSASAP 480
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1112 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHA--LSQSPAEADGSVAFVPGEAQVAREI----- 1184
Cdd:PRK07003  481 ASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPapEARPPTPAAAAPAARAGGAAAALDVlrnag 560
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 578817790 1185 -----PEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGP 1234
Cdd:PRK07003  561 mrvssDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAAR 615
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
470-499 4.38e-03

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 36.78  E-value: 4.38e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 578817790  470 KYGVGHWAKIASELP---HRSGSQCLSKWKIMM 499
Cdd:cd11660    17 KYGVGNWAKILKDYFfvnNRTSVDLKDKWRNLK 49
PRK10263 PRK10263
DNA translocase FtsK; Provisional
905-1232 7.26e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 7.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  905 ARAREATRGPVVLPSQLLVSSSVILQP-------PLPHTPHGRPAPGPTvlnvplSGPGAPAAAKPGTSGS--WQEagts 975
Cdd:PRK10263  330 TQSWAAPVEPVTQTPPVASVDVPPAQPtvawqpvPGPQTGEPVIAPAPE------GYPQQSQYAQPAVQYNepLQQ---- 399
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  976 akdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPV 1055
Cdd:PRK10263  400 ------------PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS--TFAPQSTYQTE 465
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1056 QPlslthiggphvatsVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRapaLSSSWQ 1135
Cdd:PRK10263  466 QT--------------YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQ---LAAWYQ 528
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790 1136 PPANMNREPEPSCRTdtpAPPTHALSQSPAEAdgsvafVPGEAQVAREIPEPRTSSHADPPEAEPPWSgrlPAFGGVipa 1215
Cdd:PRK10263  529 PIPEPVKEPEPIKSS---LKAPSVAAVPPVEA------AAAVSPLASGVKKATLATGAAATVAAPVFS---LANSGG--- 593
                         330
                  ....*....|....*..
gi 578817790 1216 tePRGTPGSPSGTQEPR 1232
Cdd:PRK10263  594 --PRPQVKEGIGPQLPR 608
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
939-1061 9.20e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 9.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578817790  939 GRPAPGPTVLNVPLSGPGAPAAAKPGTSGSwQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSC 1018
Cdd:PRK14951  369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAA-AAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAL 447
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 578817790 1019 PESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1061
Cdd:PRK14951  448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH