NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|594190773|ref|NP_001277348|]
View 

snRNA-activating protein complex subunit 4 isoform b [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
314-374 4.00e-14

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 68.11  E-value: 4.00e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 594190773   314 WAPEEDAKLLQAVAKYGaQDWFKIREEVPGRSDAQCRDRYIRRLHFSLKKGRWNAKEEQQL 374
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
363-406 1.54e-13

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 66.09  E-value: 1.54e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 594190773    363 KGRWNAKEEQQLIQLIEKYGVGHWARIASELPHRSGSQCLSKWK 406
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWR 44
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
204-253 1.02e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.53  E-value: 1.02e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 594190773    204 KQEWSTEEVERLKAIAATHGHLEWHLVAEELGTsRSAFQCLQKFQQYNKT 253
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLKP 49
PHA03247 super family cl33720
large tegument protein UL36; Provisional
696-1055 2.37e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 2.37e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  696 RKSQPPALLQPGTRNTQPHLLQASSNAKNNTGCLPSMTGEQTAKRASHKG----RPRLGSCRTEATPFQVPVAAPR--GL 769
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsRPRRARRLGRAAQASSPPQRPRrrAA 2688
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  770 RPKPKTVSELLR---EKRLRESHAKKATQALGLNSQLLVSSPVILQPPLLPVPHGSPVvGPATSSVElSVPVAPVMVSSS 846
Cdd:PHA03247 2689 RPTVGSLTSLADpppPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGGP-ARPARPPTTAGP 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  847 PSGSWPVGGISAtdkqPPNLQTIslnPPHKGTQVAAPAAFRSLALAPGQVPTGGHLSTLGQTSTTSQkqSLPKVLPILRA 926
Cdd:PHA03247 2767 PAPAPPAAPAAG----PPRRLTR---PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG--PLPPPTSAQPT 2837
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  927 APSLTQLSVQPPVS-------GQPLATK--SSLPVNWVLTTQKllsvqvPAVVGLPQSVMTPETIGLQAKQLPSPAKTPA 997
Cdd:PHA03247 2838 APPPPPGPPPPSLPlggsvapGGDVRRRppSRSPAAKPAAPAR------PPVRRLARPAVSRSTESFALPPDQPERPPQP 2911
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 594190773  998 FLEQPPASTDTEPKGPQGQEIPPTPGPEKAAL----DLSLLSQESEAAIVTWLkgcqGAFVP 1055
Cdd:PHA03247 2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLapttDPAGAGEPSGAVPQPWL----GALVP 2969
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
259-322 4.71e-07

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 48.08  E-value: 4.71e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 594190773   259 WTEEEDHMLTQLVQEMrvGNHipYRKIVYFMEGRDSMQLIYRWTKSLDPSLKRGFWAPEEDAKL 322
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY--GND--WKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
REB1 super family cl34920
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
102-309 2.57e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


The actual alignment was detected with superfamily member COG5147:

Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  102 LRKSVVSDRLQRLLQP-KLLKLEYLHEKQSRVSSELERQALEKQIKEAEKEI---------QDINQLPE-----EALLGN 166
Cdd:COG5147   170 VPRVSKADVKPREKGEeNNPDIEDLQEMKELKSASITRHLILPSKSEINKAFkkgetlaleQEINEYKEkkglsRKQFCE 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  167 RLDSHDWEKISNINFE----GARSAEEIRKFWQSSEHPSISKQEWSTEEVERLKAIAATHGHLeWHLVAEELGTSRSafQ 242
Cdd:COG5147   250 RIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQELAKLVVEHGGS-WTEIGKLLGRMPN--D 326
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 594190773  243 CLQKFQQYNK---TLKRKEWTEEEDHMLTQLVQEMRVGNHiPYRKIVYF---MEGRDSMQLIYRWTKSLDPSL 309
Cdd:COG5147   327 CRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRILWLliaQNIRNRLQHHCRDKYGVLISN 398
 
Name Accession Description Interval E-value
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
314-374 4.00e-14

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 68.11  E-value: 4.00e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 594190773   314 WAPEEDAKLLQAVAKYGaQDWFKIREEVPGRSDAQCRDRYIRRLHFSLKKGRWNAKEEQQL 374
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
363-406 1.54e-13

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 66.09  E-value: 1.54e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 594190773    363 KGRWNAKEEQQLIQLIEKYGVGHWARIASELPHRSGSQCLSKWK 406
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWR 44
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
314-357 2.29e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 65.29  E-value: 2.29e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 594190773  314 WAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRL 357
Cdd:cd00167     2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
311-358 3.82e-13

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 64.94  E-value: 3.82e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 594190773    311 RGFWAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRLH 358
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
365-406 4.00e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 64.52  E-value: 4.00e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 594190773  365 RWNAKEEQQLIQLIEKYGVGHWARIASELPHRSGSQCLSKWK 406
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWR 42
PLN03091 PLN03091
hypothetical protein; Provisional
309-423 1.13e-12

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 71.93  E-value: 1.13e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  309 LKRGFWAPEEDAKLLQAVAKYGAQDWfkirEEVPGRSDAQ-----CRDRYIRRLHFSLKKGRWNAKEEQQLIQLIEKYGv 383
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 594190773  384 GHWARIASELPHRSGSQCLSKWKILARKKqhlQRKRGQRP 423
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK---LRQRGIDP 123
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
363-405 1.33e-10

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 57.51  E-value: 1.33e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 594190773   363 KGRWNAKEEQQLIQLIEKYGvGHWARIASELPHRSGSQCLSKW 405
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLG-NRWKKIAKLLPGRTDNQCKNRW 42
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
309-405 4.32e-08

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 57.49  E-value: 4.32e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  309 LKRGFWAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRLHFSLKKGRWNAKEEQQLIQLIEKYGVgHWAR 388
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                          90
                  ....*....|....*..
gi 594190773  389 IASELPHRSGSQCLSKW 405
Cdd:COG5147    97 IADYKDRRTAQQCVERY 113
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
204-253 1.02e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.53  E-value: 1.02e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 594190773    204 KQEWSTEEVERLKAIAATHGHLEWHLVAEELGTsRSAFQCLQKFQQYNKT 253
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLKP 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
696-1055 2.37e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 2.37e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  696 RKSQPPALLQPGTRNTQPHLLQASSNAKNNTGCLPSMTGEQTAKRASHKG----RPRLGSCRTEATPFQVPVAAPR--GL 769
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsRPRRARRLGRAAQASSPPQRPRrrAA 2688
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  770 RPKPKTVSELLR---EKRLRESHAKKATQALGLNSQLLVSSPVILQPPLLPVPHGSPVvGPATSSVElSVPVAPVMVSSS 846
Cdd:PHA03247 2689 RPTVGSLTSLADpppPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGGP-ARPARPPTTAGP 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  847 PSGSWPVGGISAtdkqPPNLQTIslnPPHKGTQVAAPAAFRSLALAPGQVPTGGHLSTLGQTSTTSQkqSLPKVLPILRA 926
Cdd:PHA03247 2767 PAPAPPAAPAAG----PPRRLTR---PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG--PLPPPTSAQPT 2837
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  927 APSLTQLSVQPPVS-------GQPLATK--SSLPVNWVLTTQKllsvqvPAVVGLPQSVMTPETIGLQAKQLPSPAKTPA 997
Cdd:PHA03247 2838 APPPPPGPPPPSLPlggsvapGGDVRRRppSRSPAAKPAAPAR------PPVRRLARPAVSRSTESFALPPDQPERPPQP 2911
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 594190773  998 FLEQPPASTDTEPKGPQGQEIPPTPGPEKAAL----DLSLLSQESEAAIVTWLkgcqGAFVP 1055
Cdd:PHA03247 2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLapttDPAGAGEPSGAVPQPWL----GALVP 2969
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
207-267 3.69e-07

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 48.08  E-value: 3.69e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 594190773   207 WSTEEVERLKAIAATHGhLEWHLVAEELGtSRSAFQCLQKFQQY-NKTLKRKEWTEEEDHML 267
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKlNPKISRGPWSKEEDQRL 60
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
259-322 4.71e-07

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 48.08  E-value: 4.71e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 594190773   259 WTEEEDHMLTQLVQEMrvGNHipYRKIVYFMEGRDSMQLIYRWTKSLDPSLKRGFWAPEEDAKL 322
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY--GND--WKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
206-250 7.18e-07

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 46.80  E-value: 7.18e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 594190773  206 EWSTEEVERLKAIAATHGHLEWHLVAEELGTsRSAFQCLQKFQQY 250
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPG-RTPKQCRERWRNL 44
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
256-307 1.04e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 46.45  E-value: 1.04e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 594190773    256 RKEWTEEEDHMLTQLVQEMRVGNhipYRKIVYFMEGRDSMQLIYRWTKSLDP 307
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
634-1044 3.16e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 51.50  E-value: 3.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   634 PWVGDINLACTQAPRRPATvqtkaDSIRMQLECARLASTPVftLLIQLLQIDTAGCMEVVRERKSQPPALLQPGTRN--- 710
Cdd:pfam17823   35 NGAGKQNASGDAVPRADNK-----SSEQ*NFCAATAAPAPV--TLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREgaa 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   711 ----TQPHLLQASSNAKNNTGCLPSMTGEQTAKRAShkgRPRLGSCRTEATpfqVPVAAPRGLRPKPKTVSELLREKRLR 786
Cdd:pfam17823  108 dgaaSRALAAAASSSPSSAAQSLPAAIAALPSEAFS---APRAAACRANAS---AAPRAAIAAASAPHAASPAPRTAASS 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   787 ESHAKKATQALGLNSQLLVSSPVILQP--PLL--PVPHGSPVVGPATSSVELSVPVAPVMVSSSPSGSWPVGGISATDKQ 862
Cdd:pfam17823  182 TTAASSTTAASSAPTTAASSAPATLTParGIStaATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAG 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   863 --PPNLQTISLNPPHkGTQVAAPAAFRSLALAPGQVPTGGHlSTLGQTSTTSQKQSLPKVLPilRAAPSLTQLSVQPPVS 940
Cdd:pfam17823  262 tvASAAGTINMGDPH-ARRLSPAKHMPSDTMARNPAAPMGA-QAQGPIIQVSTDQPVHNTAG--EPTPSPSNTTLEPNTP 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   941 GQPLATKSSLpvnwVLTTQKLLSVQVPAVVGLPQSVMTPETIGLQAKQLPSP------AKTPAFLEQP-----PASTDTE 1009
Cdd:pfam17823  338 KSVASTNLAV----VTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPllptqgAAGPGILLAPeqvatEATAGTA 413
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 594190773  1010 PKGPQGQEippTPGPEKAALDLSLLSQESEAAIVT 1044
Cdd:pfam17823  414 SAGPTPRS---SGDPKTLAMASCQLSTQGQYLVVT 445
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
102-309 2.57e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  102 LRKSVVSDRLQRLLQP-KLLKLEYLHEKQSRVSSELERQALEKQIKEAEKEI---------QDINQLPE-----EALLGN 166
Cdd:COG5147   170 VPRVSKADVKPREKGEeNNPDIEDLQEMKELKSASITRHLILPSKSEINKAFkkgetlaleQEINEYKEkkglsRKQFCE 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  167 RLDSHDWEKISNINFE----GARSAEEIRKFWQSSEHPSISKQEWSTEEVERLKAIAATHGHLeWHLVAEELGTSRSafQ 242
Cdd:COG5147   250 RIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQELAKLVVEHGGS-WTEIGKLLGRMPN--D 326
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 594190773  243 CLQKFQQYNK---TLKRKEWTEEEDHMLTQLVQEMRVGNHiPYRKIVYF---MEGRDSMQLIYRWTKSLDPSL 309
Cdd:COG5147   327 CRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRILWLliaQNIRNRLQHHCRDKYGVLISN 398
PLN03212 PLN03212
Transcription repressor MYB5; Provisional
254-353 2.72e-04

Transcription repressor MYB5; Provisional


Pssm-ID: 178751 [Multi-domain]  Cd Length: 249  Bit Score: 43.91  E-value: 2.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  254 LKRKEWTEEEDHMLTQLVQEMRVGNHIPYRKIVYFMEGRDSMQLiyRWTKSLDPSLKRGFWAPEEDAKLLQAVAKYGAQd 333
Cdd:PLN03212   23 MKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGLLRCGKSCRL--RWMNYLRPSVKRGGITSDEEDLILRLHRLLGNR- 99
                          90       100
                  ....*....|....*....|
gi 594190773  334 WFKIREEVPGRSDAQCRDRY 353
Cdd:PLN03212  100 WSLIAGRIPGRTDNEIKNYW 119
 
Name Accession Description Interval E-value
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
314-374 4.00e-14

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 68.11  E-value: 4.00e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 594190773   314 WAPEEDAKLLQAVAKYGaQDWFKIREEVPGRSDAQCRDRYIRRLHFSLKKGRWNAKEEQQL 374
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
363-406 1.54e-13

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 66.09  E-value: 1.54e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 594190773    363 KGRWNAKEEQQLIQLIEKYGVGHWARIASELPHRSGSQCLSKWK 406
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWR 44
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
314-357 2.29e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 65.29  E-value: 2.29e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 594190773  314 WAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRL 357
Cdd:cd00167     2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
311-358 3.82e-13

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 64.94  E-value: 3.82e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 594190773    311 RGFWAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRLH 358
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
365-406 4.00e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 64.52  E-value: 4.00e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 594190773  365 RWNAKEEQQLIQLIEKYGVGHWARIASELPHRSGSQCLSKWK 406
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWR 42
PLN03091 PLN03091
hypothetical protein; Provisional
309-423 1.13e-12

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 71.93  E-value: 1.13e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  309 LKRGFWAPEEDAKLLQAVAKYGAQDWfkirEEVPGRSDAQ-----CRDRYIRRLHFSLKKGRWNAKEEQQLIQLIEKYGv 383
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 594190773  384 GHWARIASELPHRSGSQCLSKWKILARKKqhlQRKRGQRP 423
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK---LRQRGIDP 123
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
311-357 7.93e-11

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 58.28  E-value: 7.93e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 594190773   311 RGFWAPEEDAKLLQAVAKYGAqDWFKIREEVPGRSDAQCRDRYIRRL 357
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
363-405 1.33e-10

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 57.51  E-value: 1.33e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 594190773   363 KGRWNAKEEQQLIQLIEKYGvGHWARIASELPHRSGSQCLSKW 405
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLG-NRWKKIAKLLPGRTDNQCKNRW 42
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
309-405 4.32e-08

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 57.49  E-value: 4.32e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  309 LKRGFWAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRLHFSLKKGRWNAKEEQQLIQLIEKYGVgHWAR 388
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                          90
                  ....*....|....*..
gi 594190773  389 IASELPHRSGSQCLSKW 405
Cdd:COG5147    97 IADYKDRRTAQQCVERY 113
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
204-253 1.02e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.53  E-value: 1.02e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 594190773    204 KQEWSTEEVERLKAIAATHGHLEWHLVAEELGTsRSAFQCLQKFQQYNKT 253
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLKP 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
696-1055 2.37e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 2.37e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  696 RKSQPPALLQPGTRNTQPHLLQASSNAKNNTGCLPSMTGEQTAKRASHKG----RPRLGSCRTEATPFQVPVAAPR--GL 769
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsRPRRARRLGRAAQASSPPQRPRrrAA 2688
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  770 RPKPKTVSELLR---EKRLRESHAKKATQALGLNSQLLVSSPVILQPPLLPVPHGSPVvGPATSSVElSVPVAPVMVSSS 846
Cdd:PHA03247 2689 RPTVGSLTSLADpppPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGGP-ARPARPPTTAGP 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  847 PSGSWPVGGISAtdkqPPNLQTIslnPPHKGTQVAAPAAFRSLALAPGQVPTGGHLSTLGQTSTTSQkqSLPKVLPILRA 926
Cdd:PHA03247 2767 PAPAPPAAPAAG----PPRRLTR---PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG--PLPPPTSAQPT 2837
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  927 APSLTQLSVQPPVS-------GQPLATK--SSLPVNWVLTTQKllsvqvPAVVGLPQSVMTPETIGLQAKQLPSPAKTPA 997
Cdd:PHA03247 2838 APPPPPGPPPPSLPlggsvapGGDVRRRppSRSPAAKPAAPAR------PPVRRLARPAVSRSTESFALPPDQPERPPQP 2911
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 594190773  998 FLEQPPASTDTEPKGPQGQEIPPTPGPEKAAL----DLSLLSQESEAAIVTWLkgcqGAFVP 1055
Cdd:PHA03247 2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLapttDPAGAGEPSGAVPQPWL----GALVP 2969
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
207-267 3.69e-07

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 48.08  E-value: 3.69e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 594190773   207 WSTEEVERLKAIAATHGhLEWHLVAEELGtSRSAFQCLQKFQQY-NKTLKRKEWTEEEDHML 267
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKlNPKISRGPWSKEEDQRL 60
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
365-410 3.72e-07

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 47.95  E-value: 3.72e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 594190773  365 RWNAKEEQQLIQLIEKYGVGHWARIASELP---HRSGSQCLSKWKILAR 410
Cdd:cd11660     2 KWTDEEDEALVEGVEKYGVGNWAKILKDYFfvnNRTSVDLKDKWRNLKK 50
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
259-322 4.71e-07

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 48.08  E-value: 4.71e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 594190773   259 WTEEEDHMLTQLVQEMrvGNHipYRKIVYFMEGRDSMQLIYRWTKSLDPSLKRGFWAPEEDAKL 322
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY--GND--WKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
206-250 7.18e-07

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 46.80  E-value: 7.18e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 594190773  206 EWSTEEVERLKAIAATHGHLEWHLVAEELGTsRSAFQCLQKFQQY 250
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPG-RTPKQCRERWRNL 44
PLN03212 PLN03212
Transcription repressor MYB5; Provisional
309-412 8.09e-07

Transcription repressor MYB5; Provisional


Pssm-ID: 178751 [Multi-domain]  Cd Length: 249  Bit Score: 52.00  E-value: 8.09e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  309 LKRGFWAPEEDAKLLQAVAKYGAQDWFKIREEVPG-RSDAQCRDRYIRRLHFSLKKGRWNAKEEQQLIQLIEKYGvGHWA 387
Cdd:PLN03212   23 MKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGLlRCGKSCRLRWMNYLRPSVKRGGITSDEEDLILRLHRLLG-NRWS 101
                          90       100
                  ....*....|....*....|....*
gi 594190773  388 RIASELPHRSGSQCLSKWKILARKK 412
Cdd:PLN03212  102 LIAGRIPGRTDNEIKNYWNTHLRKK 126
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
256-307 1.04e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 46.45  E-value: 1.04e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 594190773    256 RKEWTEEEDHMLTQLVQEMRVGNhipYRKIVYFMEGRDSMQLIYRWTKSLDP 307
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
244-405 1.09e-06

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 52.87  E-value: 1.09e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  244 LQKFQQYNKTLKRKE--WTEEEDHMLTQLVQEMRVGNhipYRKIVYFMEGRDSMQLIYRWTKSLDPSLKRGFWAPEEDAK 321
Cdd:COG5147     6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLGPNN---WSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQ 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  322 LLQAVAKYGAQdWFKIREEVPGRSDAQCRDRYIRRLHFSLKKgRWNAKEEQQLIQLIEKYGVGHwariaSELPHRSGSQC 401
Cdd:COG5147    83 LIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST-HDSKLQRRNEFDKIDPFNENS-----ARRPDIYEDEL 155

                  ....
gi 594190773  402 LSKW 405
Cdd:COG5147   156 LERE 159
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
634-1044 3.16e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 51.50  E-value: 3.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   634 PWVGDINLACTQAPRRPATvqtkaDSIRMQLECARLASTPVftLLIQLLQIDTAGCMEVVRERKSQPPALLQPGTRN--- 710
Cdd:pfam17823   35 NGAGKQNASGDAVPRADNK-----SSEQ*NFCAATAAPAPV--TLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREgaa 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   711 ----TQPHLLQASSNAKNNTGCLPSMTGEQTAKRAShkgRPRLGSCRTEATpfqVPVAAPRGLRPKPKTVSELLREKRLR 786
Cdd:pfam17823  108 dgaaSRALAAAASSSPSSAAQSLPAAIAALPSEAFS---APRAAACRANAS---AAPRAAIAAASAPHAASPAPRTAASS 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   787 ESHAKKATQALGLNSQLLVSSPVILQP--PLL--PVPHGSPVVGPATSSVELSVPVAPVMVSSSPSGSWPVGGISATDKQ 862
Cdd:pfam17823  182 TTAASSTTAASSAPTTAASSAPATLTParGIStaATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAG 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   863 --PPNLQTISLNPPHkGTQVAAPAAFRSLALAPGQVPTGGHlSTLGQTSTTSQKQSLPKVLPilRAAPSLTQLSVQPPVS 940
Cdd:pfam17823  262 tvASAAGTINMGDPH-ARRLSPAKHMPSDTMARNPAAPMGA-QAQGPIIQVSTDQPVHNTAG--EPTPSPSNTTLEPNTP 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   941 GQPLATKSSLpvnwVLTTQKLLSVQVPAVVGLPQSVMTPETIGLQAKQLPSP------AKTPAFLEQP-----PASTDTE 1009
Cdd:pfam17823  338 KSVASTNLAV----VTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPllptqgAAGPGILLAPeqvatEATAGTA 413
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 594190773  1010 PKGPQGQEippTPGPEKAALDLSLLSQESEAAIVT 1044
Cdd:pfam17823  414 SAGPTPRS---SGDPKTLAMASCQLSTQGQYLVVT 445
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
307-357 5.14e-06

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 44.61  E-value: 5.14e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 594190773  307 PSLKRGFWAPEEDAKLLQAVAKYGAQdWFKIREEVpGRSDAQCRDRYIRRL 357
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHLAKLLPTQ-WRTIAPIV-GRTAQQCLERYNKLL 49
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
166-420 9.14e-06

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 49.79  E-value: 9.14e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  166 NRLDShDWEKISNINFE------GARSAEEIRKFWQSSEHPSISKQEWSTEEVERLKAIAATHGHLeWHLVAEELGtSRS 239
Cdd:COG5147    29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  240 AFQClqkFQQYNKTLKR---KEWTEEED-------------HMLTQLVQEMRVGNHIPYRKIVYFMEG-RDSMQLIYR-- 300
Cdd:COG5147   106 AQQC---VERYVNTLEDlssTHDSKLQRrnefdkidpfnenSARRPDIYEDELLEREVNREASYRLRVpRVSKADVKPre 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  301 ----------------------WTKSLD-PSLKRGFWAP---EEDAK---LLQAVAKYGAQDW---------------FK 336
Cdd:COG5147   183 kgeennpdiedlqemkelksasITRHLIlPSKSEINKAFkkgETLALeqeINEYKEKKGLSRKqfceriwstdrdedkFW 262
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  337 --IREEVPGRSDAQCRDRYIRRLHFSLKKGRWNAKEEQQLIQLIEKYGvGHWARIaSELPHRSGSQCLSKWKILARKKQH 414
Cdd:COG5147   263 pnIYKKLPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQELAKLVVEHG-GSWTEI-GKLLGRMPNDCRDRWRDYVKCGDT 340

                  ....*.
gi 594190773  415 LQRKRG 420
Cdd:COG5147   341 LKRNRW 346
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
366-406 1.47e-05

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 43.84  E-value: 1.47e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 594190773   366 WNAKEEQQLIQLIEKYGvGHWARIASELPHRSGSQCLSKWK 406
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWR 40
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
204-250 7.28e-05

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 41.34  E-value: 7.28e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 594190773   204 KQEWSTEEVERLKAIAATHGHlEWHLVAEELGTsRSAFQCLQKFQQY 250
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPG-RTDNQCKNRWQNY 45
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
200-249 7.66e-05

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 41.53  E-value: 7.66e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 594190773  200 PSISKQEWSTEEVERLKAIaATHGHLEWHLVAEELGtsRSAFQCLQKFQQ 249
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
102-309 2.57e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  102 LRKSVVSDRLQRLLQP-KLLKLEYLHEKQSRVSSELERQALEKQIKEAEKEI---------QDINQLPE-----EALLGN 166
Cdd:COG5147   170 VPRVSKADVKPREKGEeNNPDIEDLQEMKELKSASITRHLILPSKSEINKAFkkgetlaleQEINEYKEkkglsRKQFCE 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  167 RLDSHDWEKISNINFE----GARSAEEIRKFWQSSEHPSISKQEWSTEEVERLKAIAATHGHLeWHLVAEELGTSRSafQ 242
Cdd:COG5147   250 RIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQELAKLVVEHGGS-WTEIGKLLGRMPN--D 326
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 594190773  243 CLQKFQQYNK---TLKRKEWTEEEDHMLTQLVQEMRVGNHiPYRKIVYF---MEGRDSMQLIYRWTKSLDPSL 309
Cdd:COG5147   327 CRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRILWLliaQNIRNRLQHHCRDKYGVLISN 398
PLN03212 PLN03212
Transcription repressor MYB5; Provisional
254-353 2.72e-04

Transcription repressor MYB5; Provisional


Pssm-ID: 178751 [Multi-domain]  Cd Length: 249  Bit Score: 43.91  E-value: 2.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  254 LKRKEWTEEEDHMLTQLVQEMRVGNHIPYRKIVYFMEGRDSMQLiyRWTKSLDPSLKRGFWAPEEDAKLLQAVAKYGAQd 333
Cdd:PLN03212   23 MKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGLLRCGKSCRL--RWMNYLRPSVKRGGITSDEEDLILRLHRLLGNR- 99
                          90       100
                  ....*....|....*....|
gi 594190773  334 WFKIREEVPGRSDAQCRDRY 353
Cdd:PLN03212  100 WSLIAGRIPGRTDNEIKNYW 119
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
314-353 2.93e-04

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 39.86  E-value: 2.93e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 594190773  314 WAPEEDAKLLQAVAKYGAQDWFKIREE---VPGRSDAQCRDRY 353
Cdd:cd11660     3 WTDEEDEALVEGVEKYGVGNWAKILKDyffVNNRTSVDLKDKW 45
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
250-381 6.33e-03

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 40.54  E-value: 6.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  250 YNKTLKRKEWTEEEDHMLTQLVQEmrvgnHIPYRKIVYFMEGRDSMQLIYRW--TKSLDPSLKRGFWAPEEDAKLLQAVA 327
Cdd:COG5147   285 YNIFEQRGKWTKEEEQELAKLVVE-----HGGSWTEIGKLLGRMPNDCRDRWrdYVKCGDTLKRNRWSIEEEELLDKVVN 359
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 594190773  328 K-------YGAQDWFKIREEVPGRSDAQCRDRYIRRLHFSlkkgrwNAKEEQQLIQLIEKY 381
Cdd:COG5147   360 EmrleaqqSSRILWLLIAQNIRNRLQHHCRDKYGVLISNS------SPFDAGAAIWLIERY 414
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
794-1216 8.37e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 8.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   794 TQALGLNSQllvSSPVILQPPLLPVPHGSPVVGPATSSVELSVPVAPVmVSSSPSGSWPVGGISATDKQPPNLQTISLNP 873
Cdd:pfam03154  169 TQPPVLQAQ---SGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPA-TSQPPNQTQSTAAPHTLIQQTPTLHPQRLPS 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   874 PHKGTQVAAPAAfrslalAPGQVPtgghLSTLGQTSTTSQKQSLPKVLpilRAAPSLTQLSVQP---PVSGQPLATKSSL 950
Cdd:pfam03154  245 PHPPLQPMTQPP------PPSQVS----PQPLPQPSLHGQMPPMPHSL---QTGPSHMQHPVPPqpfPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773   951 PVNWVLTTQKLLSVQVPAVVGLPQSVMTPETIGLQAKQLPSPAKTPafleqPPASTDTEPKGPQGQEIPP-TPGPEKAAL 1029
Cdd:pfam03154  312 GPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKP-----PPTTPIPQLPNPQSHKHPPhLSGPSPFQM 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  1030 DLSLLSQESeaaivtwlkgcqgafVPPLGSRMPYHPPSLCSLRALSSLllQKQDLEQKASSLAASQAAGAQPDPKAGA-L 1108
Cdd:pfam03154  387 NSNLPPPPA---------------LKPLSSLSTHHPPSAHPPPLQLMP--QSQQLPPPPAQPPVLTQSQSLPPPAASHpP 449
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 594190773  1109 QASLELVQRQ--FRDNPaylllktrFLAIFSLPAFLATLPPNSIP---TTLSPDVAVVSESDSEDLGDLELKDRARQLDC 1183
Cdd:pfam03154  450 TSGLHQVPSQspFPQHP--------FVPGGPPPITPPSGPPTSTSsamPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKE 521
                          410       420       430
                   ....*....|....*....|....*....|...
gi 594190773  1184 MACRVQASPAAPDPVQRAPSPGEVSAPSPLDAS 1216
Cdd:pfam03154  522 EALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH