NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|569005507|ref|XP_006531675|]
View 

pecanex-like protein 3 isoform X1 [Mus musculus]

Protein Classification

oligosaccharide repeat unit polymerase; pecanex family protein( domain architecture ID 10523572)

oligosaccharide repeat unit polymerase may act to polymerize the oligosaccharide repeat units of surface polysaccharides, including O-antigen in Gram-negative bacteria and capsular polysaccharide in Gram-positive bacteria; pecanex family protein similar to Drosophila melanogaster protein pecanex that is involved in neurogenesis

Gene Ontology:  GO:0016020

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1583-1809 1.81e-138

Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein homologs. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.


:

Pssm-ID: 461533  Cd Length: 227  Bit Score: 430.20  E-value: 1.81e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1583 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1662
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1663 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1742
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 569005507  1743 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1809
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
PHA03247 super family cl33720
large tegument protein UL36; Provisional
194-655 2.87e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  194 GDLPQTPPGVVPDPSLPSTDSSERSPmagdgvPWGGSGVADTPMSPLLKGSLSQELSKSFLTLTRpdralVRTSSR-REQ 272
Cdd:PHA03247 2602 VDDRGDPRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR-----VSRPRRaRRL 2670
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  273 CRGTGGYQPLDRrgsgdPMPQKAgssdscfsgtdRETLSSFKSEKtnsthlDSPPGGHAPEgsdtdPPSEAELPASPDAG 352
Cdd:PHA03247 2671 GRAAQASSPPQR-----PRRRAA-----------RPTVGSLTSLA------DPPPPPPTPE-----PAPHALVSATPLPP 2723
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  353 VPSDDTLRSFDTVIGAGTPPGQTEPLLVVRPKDLALLR-----PSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDTSEG 427
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  428 SELSPASSLRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARV 503
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPV 2883
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  504 LSMDGAGGDVLRAPLAGSKAELEAQPGMELAA--GEPAVLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPW 2963
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  581 NVR----RAQAIRRRhnaGSNPTPPASVMGSPPSSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNFD 655
Cdd:PHA03247 2964 LGAlvpgRVAVPRFR---VPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDAD 3040
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
1866-2035 4.25e-03

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507 1866 PPLAHPTPENAAGSSEQPLPPGPSWGPRPSLSGSGDGRPPPLLQWPPPRLPGPPPASP----APTEGPRPsRPSGPALLN 1941
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcgwgPENECPLP-RPAPITLPT 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507 1942 SEGPSGKWSLGGRK--------GLGGPDGEPASGSPKGGTPKSQAPLDLSLSPDVSSEASPARTTQDLP-CLDSSIPEGC 2012
Cdd:PHA03307  268 RIWEASGWNGPSSRpgpassssSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSP 347
                         170       180
                  ....*....|....*....|...
gi 569005507 2013 TPSGAPGDWPVPAEERESPAAQP 2035
Cdd:PHA03307  348 SRSPSPSRPPPPADPSSPRKRPR 370
 
Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1583-1809 1.81e-138

Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein homologs. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.


Pssm-ID: 461533  Cd Length: 227  Bit Score: 430.20  E-value: 1.81e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1583 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1662
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1663 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1742
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 569005507  1743 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1809
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
PHA03247 PHA03247
large tegument protein UL36; Provisional
194-655 2.87e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  194 GDLPQTPPGVVPDPSLPSTDSSERSPmagdgvPWGGSGVADTPMSPLLKGSLSQELSKSFLTLTRpdralVRTSSR-REQ 272
Cdd:PHA03247 2602 VDDRGDPRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR-----VSRPRRaRRL 2670
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  273 CRGTGGYQPLDRrgsgdPMPQKAgssdscfsgtdRETLSSFKSEKtnsthlDSPPGGHAPEgsdtdPPSEAELPASPDAG 352
Cdd:PHA03247 2671 GRAAQASSPPQR-----PRRRAA-----------RPTVGSLTSLA------DPPPPPPTPE-----PAPHALVSATPLPP 2723
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  353 VPSDDTLRSFDTVIGAGTPPGQTEPLLVVRPKDLALLR-----PSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDTSEG 427
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  428 SELSPASSLRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARV 503
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPV 2883
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  504 LSMDGAGGDVLRAPLAGSKAELEAQPGMELAA--GEPAVLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPW 2963
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  581 NVR----RAQAIRRRhnaGSNPTPPASVMGSPPSSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNFD 655
Cdd:PHA03247 2964 LGAlvpgRVAVPRFR---VPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDAD 3040
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
455-592 1.35e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 41.60  E-value: 1.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  455 PESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQrTPSTASAKTHARVLSM--DGAGGDVLRAPLAGSKAELEAQPGME 532
Cdd:cd21975    25 PEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD-SPGLVTAAPHLLAANVlaPLRGPSVEGSSLESGDADMGSDSDVA 103
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 569005507  533 LAAGEPAVLPPEARRGPAAN-QPGWrgeLQEEGAVGGAPEETGQRECTSNVRR-AQAIRRRH 592
Cdd:cd21975   104 PASGAAASTSPESSSDAASSpSPLS---LLHPGEAGLEPERPRPRVRRGVRRRgVTPAAKRH 162
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1866-2035 4.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507 1866 PPLAHPTPENAAGSSEQPLPPGPSWGPRPSLSGSGDGRPPPLLQWPPPRLPGPPPASP----APTEGPRPsRPSGPALLN 1941
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcgwgPENECPLP-RPAPITLPT 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507 1942 SEGPSGKWSLGGRK--------GLGGPDGEPASGSPKGGTPKSQAPLDLSLSPDVSSEASPARTTQDLP-CLDSSIPEGC 2012
Cdd:PHA03307  268 RIWEASGWNGPSSRpgpassssSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSP 347
                         170       180
                  ....*....|....*....|...
gi 569005507 2013 TPSGAPGDWPVPAEERESPAAQP 2035
Cdd:PHA03307  348 SRSPSPSRPPPPADPSSPRKRPR 370
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1859-2035 4.54e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 4.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1859 LTSLSNH-PPLAHPTPENAAGSSeQPLPPGPSWGP----RPSLSGSGDgrpppllqwpppRLPGPPPASPAPTEGPRPSR 1933
Cdd:pfam03154  399 LSSLSTHhPPSAHPPPLQLMPQS-QQLPPPPAQPPvltqSQSLPPPAA------------SHPPTSGLHQVPSQSPFPQH 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1934 PSGPallnsegpsgkwslGGRKGLGGPDGEPASGSPKGgtPKSQAPLDLSLSpdvSSEASPARTTQDLPcldssipegct 2013
Cdd:pfam03154  466 PFVP--------------GGPPPITPPSGPPTSTSSAM--PGIQPPSSASVS---SSGPVPAAVSCPLP----------- 515
                          170       180
                   ....*....|....*....|..
gi 569005507  2014 PSGAPGDWPVPAEERESPAAQP 2035
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPPPPP 537
 
Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1583-1809 1.81e-138

Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein homologs. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.


Pssm-ID: 461533  Cd Length: 227  Bit Score: 430.20  E-value: 1.81e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1583 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1662
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1663 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1742
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 569005507  1743 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1809
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
PHA03247 PHA03247
large tegument protein UL36; Provisional
194-655 2.87e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  194 GDLPQTPPGVVPDPSLPSTDSSERSPmagdgvPWGGSGVADTPMSPLLKGSLSQELSKSFLTLTRpdralVRTSSR-REQ 272
Cdd:PHA03247 2602 VDDRGDPRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR-----VSRPRRaRRL 2670
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  273 CRGTGGYQPLDRrgsgdPMPQKAgssdscfsgtdRETLSSFKSEKtnsthlDSPPGGHAPEgsdtdPPSEAELPASPDAG 352
Cdd:PHA03247 2671 GRAAQASSPPQR-----PRRRAA-----------RPTVGSLTSLA------DPPPPPPTPE-----PAPHALVSATPLPP 2723
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  353 VPSDDTLRSFDTVIGAGTPPGQTEPLLVVRPKDLALLR-----PSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDTSEG 427
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  428 SELSPASSLRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARV 503
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPV 2883
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  504 LSMDGAGGDVLRAPLAGSKAELEAQPGMELAA--GEPAVLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPW 2963
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  581 NVR----RAQAIRRRhnaGSNPTPPASVMGSPPSSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNFD 655
Cdd:PHA03247 2964 LGAlvpgRVAVPRFR---VPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDAD 3040
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
286-632 3.05e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.48  E-value: 3.05e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  286 GSGDPMPQKAGSSDSCFSGTDRETLSSFKSEKTNSTHLDSPPGGHAPEGSDTDPPSEA----ELPASPDAGVPSDDTLRS 361
Cdd:PHA03307   74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPApdlsEMLRPVGSPGPPPAASPP 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  362 FDTVIGAGTPPGQTEPLLVVRPkdLALLRPSKRRPPMRGHSPPGRTPrRPLLEGSGFFEDEDTSEGSELSPASSLRSQRR 441
Cdd:PHA03307  154 AAGASPAAVASDAASSRQAALP--LSSPEETARAPSSPPAEPPPSTP-PAAASPRPPRRSSPISASASSPAPAPGRSAAD 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  442 ystdsssstscySPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQRTPSTASAKTHARvlsmdgaggdvlRAPLAGS 521
Cdd:PHA03307  231 ------------DAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS------------SRPGPAS 286
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  522 KAELEAQPGmelaagePAVLPPEARRGPAANQPGWRGELqeegaVGGAPEETGQRECTSNVRRAQAIRR---RHNAGSNP 598
Cdd:PHA03307  287 SSSSPRERS-------PSPSPSSPGSGPAPSSPRASSSS-----SSSRESSSSSTSSSSESSRGAAVSPgpsPSRSPSPS 354
                         330       340       350
                  ....*....|....*....|....*....|....
gi 569005507  599 TPPASvmgSPPSSLQEAQRGRAASHSRALTLPSA 632
Cdd:PHA03307  355 RPPPP---ADPSSPRKRPRPSRAPSSPAASAGRP 385
PHA03247 PHA03247
large tegument protein UL36; Provisional
334-609 2.79e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 2.79e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  334 GSDTDPPSEAELPASPDAGVP-SDDTLRSFDTVIGA-----GTPPGQTEPLLVVRPKDLALLRPSKRRPPMRGHSP---- 403
Cdd:PHA03247 2549 GDPPPPLPPAAPPAAPDRSVPpPRPAPRPSEPAVTSrarrpDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPdppp 2628
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  404 PGRTPRRPLLEGSGFFEDEDTSEGSELSPASSLRSQRRYSTDSSSSTSCYSPEssqgaagGPRKRRAPHGAEEGTAV--- 480
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-------RPRRRAARPTVGSLTSLadp 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  481 -PPKRPYGTQRTPSTASAKTHARVLSMDGAGGDVLRAPLAGSKAELEAQPGME-------LAAGEPAVLPPEARRG--PA 550
Cdd:PHA03247 2702 pPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParparppTTAGPPAPAPPAAPAAgpPR 2781
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 569005507  551 ANQPGWRGELQEEGAVGGAPEETGQRECTSNVRRAQAIRRRHNAGSNPTPPASVMGSPP 609
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
197-438 6.38e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 6.38e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  197 PQTPPGVVPDPSLPSTDSSERSPMAGDGVPWGGSGVADTPMSPLLKGSLSQelsksfltltrpdralvrTSSRREQCrgt 276
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDS------------------SSSESSGC--- 247
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  277 gGYQPLDRRGSGDPMPQKA-GSSDSCFSGTDRETLSSFKSEKTNSTHLDSPPGGHAPEGSDTDPPSEAELPASPDAGVPS 355
Cdd:PHA03307  248 -GWGPENECPLPRPAPITLpTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS 326
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  356 DDTLRSFDTVIGAGTPPGQTE---PLLVVRPKDLALLRPSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDT--SEGSEL 430
Cdd:PHA03307  327 SSTSSSSESSRGAAVSPGPSPsrsPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRArrRDATGR 406

                  ....*...
gi 569005507  431 SPASSLRS 438
Cdd:PHA03307  407 FPAGRPRP 414
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
455-592 1.35e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 41.60  E-value: 1.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  455 PESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQrTPSTASAKTHARVLSM--DGAGGDVLRAPLAGSKAELEAQPGME 532
Cdd:cd21975    25 PEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD-SPGLVTAAPHLLAANVlaPLRGPSVEGSSLESGDADMGSDSDVA 103
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 569005507  533 LAAGEPAVLPPEARRGPAAN-QPGWrgeLQEEGAVGGAPEETGQRECTSNVRR-AQAIRRRH 592
Cdd:cd21975   104 PASGAAASTSPESSSDAASSpSPLS---LLHPGEAGLEPERPRPRVRRGVRRRgVTPAAKRH 162
PRK13863 PRK13863
T-DNA border endonuclease VirD2;
419-622 1.42e-03

T-DNA border endonuclease VirD2;


Pssm-ID: 237533 [Multi-domain]  Cd Length: 446  Bit Score: 43.40  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  419 FEDEDTSEGSelsPASSLRSQRRYSTDSSSSTSCYSPESSQGAAG----GPRKRRAPHGAEEGTAVPPKRPYGTQRTPST 494
Cdd:PRK13863  211 FEDADFEEFS---PGEDHREPSQSFDTSPGEAPQGEPESAERPEKlqneSEVRLQEPAGSSIKADARIRVSLESERRAQP 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  495 ASAKTharvlSMDGAGGDVLRAPLAGSKAELEAQPGMELAAGEPAVLPPEAR-------RGPAANQPGWRGELQEEGAVG 567
Cdd:PRK13863  288 SASKI-----PVADDFGIETSYVAEGDVRKLEGNSGTPRLATEVATHTTSERqqrrkrpRDDEGEPSGAKRTRLNGIAVG 362
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 569005507  568 gaPEET-GQRECTSNVRRAQAIRRRHNAGSNPTPPASVMGSPPSSLQEAQRGRAAS 622
Cdd:PRK13863  363 --PEANaGEQDGRDDPITSPAQPPRSNPLADPVRASIATDSLPATADRQQQREPSS 416
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1866-2035 4.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507 1866 PPLAHPTPENAAGSSEQPLPPGPSWGPRPSLSGSGDGRPPPLLQWPPPRLPGPPPASP----APTEGPRPsRPSGPALLN 1941
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcgwgPENECPLP-RPAPITLPT 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507 1942 SEGPSGKWSLGGRK--------GLGGPDGEPASGSPKGGTPKSQAPLDLSLSPDVSSEASPARTTQDLP-CLDSSIPEGC 2012
Cdd:PHA03307  268 RIWEASGWNGPSSRpgpassssSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSP 347
                         170       180
                  ....*....|....*....|...
gi 569005507 2013 TPSGAPGDWPVPAEERESPAAQP 2035
Cdd:PHA03307  348 SRSPSPSRPPPPADPSSPRKRPR 370
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1859-2035 4.54e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 4.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1859 LTSLSNH-PPLAHPTPENAAGSSeQPLPPGPSWGP----RPSLSGSGDgrpppllqwpppRLPGPPPASPAPTEGPRPSR 1933
Cdd:pfam03154  399 LSSLSTHhPPSAHPPPLQLMPQS-QQLPPPPAQPPvltqSQSLPPPAA------------SHPPTSGLHQVPSQSPFPQH 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  1934 PSGPallnsegpsgkwslGGRKGLGGPDGEPASGSPKGgtPKSQAPLDLSLSpdvSSEASPARTTQDLPcldssipegct 2013
Cdd:pfam03154  466 PFVP--------------GGPPPITPPSGPPTSTSSAM--PGIQPPSSASVS---SSGPVPAAVSCPLP----------- 515
                          170       180
                   ....*....|....*....|..
gi 569005507  2014 PSGAPGDWPVPAEERESPAAQP 2035
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPPPPP 537
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
197-502 5.20e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 5.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  197 PQTPPGVVPDPSLPSTDSSERSPMAGDGVPWGGSGVADTPMSPLLKgslsqelsksfltlTRPDRALVRTSSRREQCRGT 276
Cdd:PHA03307   84 SRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPS--------------PAPDLSEMLRPVGSPGPPPA 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  277 GGYQPLDRRGSGDPMPQKAGSSDSCFSGTDRETLSSfKSEKTNSTHLDSPPGGHAPEGSDTDPPSEAELPASPDAGVPSD 356
Cdd:PHA03307  150 ASPPAAGASPAAVASDAASSRQAALPLSSPEETARA-PSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  357 DTLRSFDTVIGAGTP-PGQTEPLLVVRPKDLALLRPSKRRPpmRGHSPPGRTPRRPLLEGSGFFEDEDTSEGSELSPASS 435
Cdd:PHA03307  229 ADDAGASSSDSSSSEsSGCGWGPENECPLPRPAPITLPTRI--WEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSG 306
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 569005507  436 LRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQRTPSTASAKTHAR 502
Cdd:PHA03307  307 PAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSR 373
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
455-630 8.15e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 8.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  455 PESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQRtPSTASAKTHARVlsmdGAGGDVLRAPLAGSKAELEAQPGMELA 534
Cdd:PRK12323  428 PAPEALAAARQASARGPGGAPAPAPAPAAAPAAAAR-PAAAGPRPVAAA----AAAAPARAAPAAAPAPADDDPPPWEEL 502
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569005507  535 AGEPAVLPPEArrgPAANQPGWRGELQEEGAVGGAPEEtgqrectsnvRRAQAirrrhnagsnPTPPASVMGSPPSSLQE 614
Cdd:PRK12323  503 PPEFASPAPAQ---PDAAPAGWVAESIPDPATADPDDA----------FETLA----------PAPAAAPAPRAAAATEP 559
                         170
                  ....*....|....*.
gi 569005507  615 AQRGRAASHSRALTLP 630
Cdd:PRK12323  560 VVAPRPPRASASGLPD 575
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH