NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|767914142|ref|XP_011530990|]
View 

methylcytosine dioxygenase TET3 isoform X8 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tet_JBP super family cl40427
oxygenase domain of ten-eleven translocation (TET) enzymes, J-binding proteins (JBPs), and ...
828-1049 4.43e-163

oxygenase domain of ten-eleven translocation (TET) enzymes, J-binding proteins (JBPs), and similar proteins; TET proteins are involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. Alterations in TET protein function have been linked to cancer, and TETs influence many cell differentiation processes. J binding protein (JBP) 1 and JBP2 are thymidine hydroxylases that catalyze the first step of base J biosynthesis: the hydroxylation of thymine in DNA to form 5-hydroxymethyluracil (hmU). Base J (beta-d-glucopyranosyloxymethyluracil) is a hyper-modified DNA base found in the DNA of kinetoplastids (Trypanosoma brucei, Trypanosoma cruzi, and Leishmania). JBP1 and JBP2 each contain a J-DNA binding domain and a thymidine hydroxylase domain. Members of this TET/JBP family of dioxygenases require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


The actual alignment was detected with superfamily member cd18897:

Pssm-ID: 394797  Cd Length: 452  Bit Score: 488.34  E-value: 4.43e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  828 CDCVEQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCL 907
Cdd:cd18897     1 CDCVEQILEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRSSEEEKLLCL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  908 VRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 987
Cdd:cd18897    81 VRHRAGHHCQNAVIVILILAWEGIPRALGDKLYQELTETLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767914142  988 MYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQTRQGDL 1049
Cdd:cd18897   161 MYFNGCKYARSKTPRKFRLIGDNPKEEENLRDNFQDLATEVAPLYKRLAPQAYQNQVTNEDI 222
zf-CXXC pfam02008
CXXC zinc finger domain; This domain contains eight conserved cysteine residues that bind to ...
50-89 5.62e-13

CXXC zinc finger domain; This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterized by two repeats, and shows a peculiar internal duplication in which the second unit is inserted into the first one. Each of these units is characterized by four conserved cysteines, displaying a CXXCXXCX(n)C motif that chelate a Zn+2 ion. The DNA binding interface has been identified by NMR. In eukaryotes, the CXXC domain is found in stramenopiles, plants and metazoans. Plants possess a mono-CXXC domain that is present in distinct chromatin proteins. Structural comparisons show that the mono-CXXC is homologous to the structural-zinc binding domain of medium chain dehydrogenases.


:

Pssm-ID: 366873  Cd Length: 48  Bit Score: 64.30  E-value: 5.62e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 767914142    50 GRKKRKRCGTCEPCRRLENCGACTSCTNR-------RTHQICKLRKC 89
Cdd:pfam02008    2 NRRKRRRCGVCEGCQRPEDCGQCSFCLDMpkfggpgKKKQKCRLRRC 48
PHA03247 super family cl33720
large tegument protein UL36; Provisional
383-752 2.21e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.34  E-value: 2.21e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  383 PLPEALSPPAPFRS--PQSYLRAPSWPVVPPEEHSSFAPDSSAFP--PATPRTEFP-EAWGTDTPPATPRSSWPMPRPSP 457
Cdd:PHA03247 2554 PLPPAAPPAAPDRSvpPPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSP 2633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  458 DPMAeleqlLGSASDYIQSVFKRPEALPTKPKVKVEAPSSSPAPAPSPVLQREAPTPSSEPDTHQKAQTALQQHLHHKRS 537
Cdd:PHA03247 2634 AANE-----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  538 LFLEQVHDTSFPAPSEPSAPGWWPPPSS---------PVPRLPDRPPKEKKKKLPTPAGGPVGTEKAAPGIKPSVRKPIQ 608
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPaapappavpAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  609 IKKSRPREAQPLfPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSP 688
Cdd:PHA03247 2789 ASLSESRESLPS-PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767914142  689 MtalQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPIQDPENQQTCLPAPESPFATR 752
Cdd:PHA03247 2868 S---RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
 
Name Accession Description Interval E-value
TET3 cd18897
oxygenase domain of ten-eleven translocation (TET)3 methylcytosine dioxygenase and similar ...
828-1049 4.43e-163

oxygenase domain of ten-eleven translocation (TET)3 methylcytosine dioxygenase and similar proteins; TET3 is involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. TET3 serves as a tumor suppressor; it acts as a suppressor of ovarian cancer by demethylating the miR-30d precursor gene promoter to block TGF-beta1 induced epithelial-mesenchymal transition (EMT). TET3 (and TET2) promoters are silenced in melanoma cells by mechanisms triggered by TGF-beta and mediated by DNA methyltransferase 3 alpha (DNMT3A), which play a functional role in the EMT process and metastasis. In addition, TET3 (and TET2) may be guardians of regulatory T cell stability and immune homeostasis. TET3 has been shown to prevent terminal differentiation of adult neural stem cells by a mechanism involving direct binding and repression of TET3 to the imprinted gene Snrpn. TET3 has also been shown to mediate the activation of hepatic stellate cells via modulation of the long non-coding RNA HIF1A-AS1 expression. TET1 belongs to the TET/JBP family of dioxygenases that require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


Pssm-ID: 380676  Cd Length: 452  Bit Score: 488.34  E-value: 4.43e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  828 CDCVEQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCL 907
Cdd:cd18897     1 CDCVEQILEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRSSEEEKLLCL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  908 VRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 987
Cdd:cd18897    81 VRHRAGHHCQNAVIVILILAWEGIPRALGDKLYQELTETLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767914142  988 MYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQTRQGDL 1049
Cdd:cd18897   161 MYFNGCKYARSKTPRKFRLIGDNPKEEENLRDNFQDLATEVAPLYKRLAPQAYQNQVTNEDI 222
Tet_JBP pfam12851
Oxygenase domain of the 2OGFeDO superfamily; A double-stranded beta helix (DSBH) fold domain ...
985-1044 8.48e-17

Oxygenase domain of the 2OGFeDO superfamily; A double-stranded beta helix (DSBH) fold domain of the 2-oxoglutarate (2OG)-Fe(II)-dependent dioxygenase (2OGFeDO) superfamily found in various eukaryotes, bacteria and bacteriophages. Members of this family catalyze nucleic acid modifications, such as thymidine hydroxylation during base J synthesis in kinetoplastids, and the conversion of 5 methyl-cytosine (5-mC) to 5-hydroxymethyl-cytosine (hmC), or further oxidation to 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). Metazoan TET proteins contain a cysteine-rich region inserted into the core of the DSBH fold. Vertebrate TET proteins are oncogenes that are mutated in various myeloid cancers. Fungal and algal versions of this family are linked to a predicted transposase and show lineage-specific expansions.


Pssm-ID: 372343  Cd Length: 166  Bit Score: 78.96  E-value: 8.48e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   985 SWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQT 1044
Cdd:pfam12851    1 SWSMYYDGCKFPGPRKPRKFSFTPRNPKEEIKLEDELQELAALLGAIYKQIAPDLYENQI 60
zf-CXXC pfam02008
CXXC zinc finger domain; This domain contains eight conserved cysteine residues that bind to ...
50-89 5.62e-13

CXXC zinc finger domain; This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterized by two repeats, and shows a peculiar internal duplication in which the second unit is inserted into the first one. Each of these units is characterized by four conserved cysteines, displaying a CXXCXXCX(n)C motif that chelate a Zn+2 ion. The DNA binding interface has been identified by NMR. In eukaryotes, the CXXC domain is found in stramenopiles, plants and metazoans. Plants possess a mono-CXXC domain that is present in distinct chromatin proteins. Structural comparisons show that the mono-CXXC is homologous to the structural-zinc binding domain of medium chain dehydrogenases.


Pssm-ID: 366873  Cd Length: 48  Bit Score: 64.30  E-value: 5.62e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 767914142    50 GRKKRKRCGTCEPCRRLENCGACTSCTNR-------RTHQICKLRKC 89
Cdd:pfam02008    2 NRRKRRRCGVCEGCQRPEDCGQCSFCLDMpkfggpgKKKQKCRLRRC 48
PHA03247 PHA03247
large tegument protein UL36; Provisional
383-752 2.21e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.34  E-value: 2.21e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  383 PLPEALSPPAPFRS--PQSYLRAPSWPVVPPEEHSSFAPDSSAFP--PATPRTEFP-EAWGTDTPPATPRSSWPMPRPSP 457
Cdd:PHA03247 2554 PLPPAAPPAAPDRSvpPPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSP 2633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  458 DPMAeleqlLGSASDYIQSVFKRPEALPTKPKVKVEAPSSSPAPAPSPVLQREAPTPSSEPDTHQKAQTALQQHLHHKRS 537
Cdd:PHA03247 2634 AANE-----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  538 LFLEQVHDTSFPAPSEPSAPGWWPPPSS---------PVPRLPDRPPKEKKKKLPTPAGGPVGTEKAAPGIKPSVRKPIQ 608
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPaapappavpAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  609 IKKSRPREAQPLfPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSP 688
Cdd:PHA03247 2789 ASLSESRESLPS-PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767914142  689 MtalQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPIQDPENQQTCLPAPESPFATR 752
Cdd:PHA03247 2868 S---RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
TALPID3 pfam15324
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for ...
364-459 2.63e-03

Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for Hedgehog signalling. Mutations in this gene noticed first in chickens lead to multiple abnormalities of development.


Pssm-ID: 434634 [Multi-domain]  Cd Length: 1288  Bit Score: 41.80  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   364 TQLSSALPQPSHSTPQASCPLPEALSP-PAPFRSPQ-SYLRAPSWPVVPPEEHSSFAPDSSAF--PPATPRTEFPEAwGT 439
Cdd:pfam15324  982 TLLPTPVPTPQPTPPCSPPSPLKEPSPvKTPDSSPCvSEHDFFPVKEIPPEKGADTGPAVSLVitPTVTPIATPPPA-AT 1060
                           90       100
                   ....*....|....*....|
gi 767914142   440 DTPPATPRSSWPMPRPSPDP 459
Cdd:pfam15324 1061 PTPPLSENSIDKLKSPSPEL 1080
 
Name Accession Description Interval E-value
TET3 cd18897
oxygenase domain of ten-eleven translocation (TET)3 methylcytosine dioxygenase and similar ...
828-1049 4.43e-163

oxygenase domain of ten-eleven translocation (TET)3 methylcytosine dioxygenase and similar proteins; TET3 is involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. TET3 serves as a tumor suppressor; it acts as a suppressor of ovarian cancer by demethylating the miR-30d precursor gene promoter to block TGF-beta1 induced epithelial-mesenchymal transition (EMT). TET3 (and TET2) promoters are silenced in melanoma cells by mechanisms triggered by TGF-beta and mediated by DNA methyltransferase 3 alpha (DNMT3A), which play a functional role in the EMT process and metastasis. In addition, TET3 (and TET2) may be guardians of regulatory T cell stability and immune homeostasis. TET3 has been shown to prevent terminal differentiation of adult neural stem cells by a mechanism involving direct binding and repression of TET3 to the imprinted gene Snrpn. TET3 has also been shown to mediate the activation of hepatic stellate cells via modulation of the long non-coding RNA HIF1A-AS1 expression. TET1 belongs to the TET/JBP family of dioxygenases that require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


Pssm-ID: 380676  Cd Length: 452  Bit Score: 488.34  E-value: 4.43e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  828 CDCVEQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCL 907
Cdd:cd18897     1 CDCVEQILEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRSSEEEKLLCL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  908 VRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 987
Cdd:cd18897    81 VRHRAGHHCQNAVIVILILAWEGIPRALGDKLYQELTETLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767914142  988 MYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQTRQGDL 1049
Cdd:cd18897   161 MYFNGCKYARSKTPRKFRLIGDNPKEEENLRDNFQDLATEVAPLYKRLAPQAYQNQVTNEDI 222
TET cd18892
oxygenase domain of ten-eleven translocation (TET)1, TET2, and TET3 methylcytosine ...
828-1044 1.02e-151

oxygenase domain of ten-eleven translocation (TET)1, TET2, and TET3 methylcytosine dioxygenases and similar proteins; TET proteins are involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. Alterations in TET protein function have been linked to cancer, and TETs influence many cell differentiation processes. TET family genes have been implicated as tumor suppressors, for example mutations/deletions of the TET2 gene frequently occur in multiple spectra of myeloid malignancies. TET3 acts as a suppressor of ovarian cancer by demethylating the miR-30d precursor gene promoter to block TGF-beta1 induced epithelial-mesenchymal transition (EMT). TET3 (and TET2) promoters are silenced in melanoma cells by mechanisms triggered by TGF-beta and mediated by DNA methyltransferase 3 alpha (DNMT3A). TET genes are downregulated in endometriosis. TET proteins belong to the TET/JBP family of dioxygenases that require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


Pssm-ID: 380671  Cd Length: 398  Bit Score: 456.75  E-value: 1.02e-151
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  828 CDCVEQIV-EKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLC 906
Cdd:cd18892     1 CGCFPPDEsPPEPGPYYTHLGAGPSLAALRELLEKRTGVTGKAIRIEKVIYTGKEGKTSQGCPIAKWIIRRSSLEEKYLV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  907 LVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSW 986
Cdd:cd18892    81 LVKHRPGHFCHSAFIVICIVAWEGVPQSNADELYSLLTDKLNKFGLPTKRRCGTNEERTCACQGLDPETCGASFSFGCSW 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 767914142  987 SMYFNGCKYARSKTPRKFRLAGdnPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQT 1044
Cdd:cd18892   161 SMYYNGCKFARSKTVRKFRLSD--KSEEEELEDKLQNLATHLAPLYKSLAPDSYKNQV 216
TET1 cd18895
oxygenase domain of ten-eleven translocation (TET)1 methylcytosine dioxygenase and similar ...
828-1050 3.52e-138

oxygenase domain of ten-eleven translocation (TET)1 methylcytosine dioxygenase and similar proteins; TET1 is involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). Human TET1 (and TET2) are more active on 5mC-DNA than 5hmC/5fC-DNA substrates. TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. TET1 plays multiple roles in in tumor development and progression. TET1 serves as a tumor suppressor gene; loss of TET1 is associated with tumorigenesis and can be used as a potential biomarker for cancer therapy. In addition to its dioxygenase activity, it can induce epithelial-mesenchymal transition and act as a coactivator to regulate gene transcription. The regulation of TET1 is also correlated with microRNA in a posttranscriptional modification process. TET1 belongs to the TET/JBP family of dioxygenases that require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


Pssm-ID: 380674  Cd Length: 410  Bit Score: 422.01  E-value: 3.52e-138
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  828 CDCVEQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCL 907
Cdd:cd18895     1 CDCVEQIIEKDEGPYYTHLGAGPSVAAVREIMENRYGEKGNAIRIEVVVYTGKEGKSSQGCPIAKWVIRRSSDEEKLLCL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  908 VRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWS 987
Cdd:cd18895    81 VRQRAGHHCQTAVIVILILAWEGIPRLLADRLYQELTQTLKKYGSPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767914142  988 MYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQTRQGDLG 1050
Cdd:cd18895   161 MYFNGCKFARSKYPRKFRLLTDDPKEEENLESNLQNLATDVAPVYKKLAPEAFQNQVENENVA 223
TET2 cd18896
oxygenase domain of ten-eleven translocation (TET)2 methylcytosine dioxygenase and similar ...
824-1043 8.26e-130

oxygenase domain of ten-eleven translocation (TET)2 methylcytosine dioxygenase and similar proteins; TET2 is involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). Human TET2 (and TET1) have been shown to be more active on 5mC-DNA than 5hmC/5fC-DNA substrates. TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. TET2 acts as a tumor suppressor in hematopoiesis; mutations/deletions of the TET2 gene frequently occur in multiple spectra of myeloid malignancies. TET2 (and TET3) promoters are silenced in melanoma cells by mechanisms triggered by TGF-beta and mediated by DNA methyltransferase 3 alpha (DNMT3A), which play a functional role in the epithelial-mesenchymal transition process and metastasis. In addition, TET2 (and TET3) may be guardians of regulatory T cell stability and immune homeostasis. TET2 belongs to the TET/JBP family of dioxygenases that require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


Pssm-ID: 380675  Cd Length: 434  Bit Score: 400.89  E-value: 8.26e-130
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  824 EFPTCDCVEQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEK 903
Cdd:cd18896     1 DFPSCSCVEQIIEKDEGPYYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEGKSSQGCPIAKWVIRRSSEEEK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  904 LLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFG 983
Cdd:cd18896    81 LLCLVRERAGHSCETAVIVILILVWEGIPISLADKLYSELTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFG 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  984 CSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQ 1043
Cdd:cd18896   161 CSWSMYYNGCKFARSKIPRKFKLLGDDPKEEEKLESNLQNLSTLMAPTYKKLAPDAYNNQ 220
Tet_JBP cd14946
oxygenase domain of ten-eleven translocation (TET) enzymes, J-binding proteins (JBPs), and ...
855-1043 8.41e-35

oxygenase domain of ten-eleven translocation (TET) enzymes, J-binding proteins (JBPs), and similar proteins; TET proteins are involved in DNA demethylation through iteratively oxidizing 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). TET proteins contain a C-terminal catalytic domain which consists of a cysteine-rich region and a double-stranded beta-helix (DSBH) fold. Alterations in TET protein function have been linked to cancer, and TETs influence many cell differentiation processes. J binding protein (JBP) 1 and JBP2 are thymidine hydroxylases that catalyze the first step of base J biosynthesis: the hydroxylation of thymine in DNA to form 5-hydroxymethyluracil (hmU). Base J (beta-d-glucopyranosyloxymethyluracil) is a hyper-modified DNA base found in the DNA of kinetoplastids (Trypanosoma brucei, Trypanosoma cruzi, and Leishmania). JBP1 and JBP2 each contain a J-DNA binding domain and a thymidine hydroxylase domain. Members of this TET/JBP family of dioxygenases require Fe2+ and alpha-ketoglutarate (also known as 2-oxoglutarate) for activity.


Pssm-ID: 380670  Cd Length: 264  Bit Score: 134.05  E-value: 8.41e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  855 IRELMEERYG-EKGKAIRIEKVIYTGKEGKSsRGCPIAKWVIRRhtleEKLLCLVRHRAGhhcqnavIVILILAWEGIPR 933
Cdd:cd14946     1 LLENMLSKCGtQQSFANANITLKYEGKEGKS-QGCPKALKNVRT----SKLAYFVCDHDG-------SVILAYVPEVLPK 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  934 SLGDTLYQELTDTLRKYGNptsrrcglnddrtcacqgKDPNTCGASFSFGCSWSMYFNGCKyarsktprkfRLAGDNPKE 1013
Cdd:cd14946    69 ELVEEFTEKLESIQTKRGT------------------LDPETKGDTGYSGILDNSMPFNYV----------TADLSQELG 120
                         170       180       190
                  ....*....|....*....|....*....|
gi 767914142 1014 EEVLRKSFQDLATEVAPLYKRLAPQAYQNQ 1043
Cdd:cd14946   121 QYLSEIVNPQISYYISKLLTCVSPRTINYL 150
Tet_JBP pfam12851
Oxygenase domain of the 2OGFeDO superfamily; A double-stranded beta helix (DSBH) fold domain ...
985-1044 8.48e-17

Oxygenase domain of the 2OGFeDO superfamily; A double-stranded beta helix (DSBH) fold domain of the 2-oxoglutarate (2OG)-Fe(II)-dependent dioxygenase (2OGFeDO) superfamily found in various eukaryotes, bacteria and bacteriophages. Members of this family catalyze nucleic acid modifications, such as thymidine hydroxylation during base J synthesis in kinetoplastids, and the conversion of 5 methyl-cytosine (5-mC) to 5-hydroxymethyl-cytosine (hmC), or further oxidation to 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). Metazoan TET proteins contain a cysteine-rich region inserted into the core of the DSBH fold. Vertebrate TET proteins are oncogenes that are mutated in various myeloid cancers. Fungal and algal versions of this family are linked to a predicted transposase and show lineage-specific expansions.


Pssm-ID: 372343  Cd Length: 166  Bit Score: 78.96  E-value: 8.48e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   985 SWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQT 1044
Cdd:pfam12851    1 SWSMYYDGCKFPGPRKPRKFSFTPRNPKEEIKLEDELQELAALLGAIYKQIAPDLYENQI 60
zf-CXXC pfam02008
CXXC zinc finger domain; This domain contains eight conserved cysteine residues that bind to ...
50-89 5.62e-13

CXXC zinc finger domain; This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterized by two repeats, and shows a peculiar internal duplication in which the second unit is inserted into the first one. Each of these units is characterized by four conserved cysteines, displaying a CXXCXXCX(n)C motif that chelate a Zn+2 ion. The DNA binding interface has been identified by NMR. In eukaryotes, the CXXC domain is found in stramenopiles, plants and metazoans. Plants possess a mono-CXXC domain that is present in distinct chromatin proteins. Structural comparisons show that the mono-CXXC is homologous to the structural-zinc binding domain of medium chain dehydrogenases.


Pssm-ID: 366873  Cd Length: 48  Bit Score: 64.30  E-value: 5.62e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 767914142    50 GRKKRKRCGTCEPCRRLENCGACTSCTNR-------RTHQICKLRKC 89
Cdd:pfam02008    2 NRRKRRRCGVCEGCQRPEDCGQCSFCLDMpkfggpgKKKQKCRLRRC 48
PHA03247 PHA03247
large tegument protein UL36; Provisional
383-752 2.21e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.34  E-value: 2.21e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  383 PLPEALSPPAPFRS--PQSYLRAPSWPVVPPEEHSSFAPDSSAFP--PATPRTEFP-EAWGTDTPPATPRSSWPMPRPSP 457
Cdd:PHA03247 2554 PLPPAAPPAAPDRSvpPPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSP 2633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  458 DPMAeleqlLGSASDYIQSVFKRPEALPTKPKVKVEAPSSSPAPAPSPVLQREAPTPSSEPDTHQKAQTALQQHLHHKRS 537
Cdd:PHA03247 2634 AANE-----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  538 LFLEQVHDTSFPAPSEPSAPGWWPPPSS---------PVPRLPDRPPKEKKKKLPTPAGGPVGTEKAAPGIKPSVRKPIQ 608
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPaapappavpAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  609 IKKSRPREAQPLfPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSP 688
Cdd:PHA03247 2789 ASLSESRESLPS-PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767914142  689 MtalQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPIQDPENQQTCLPAPESPFATR 752
Cdd:PHA03247 2868 S---RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
PHA03247 PHA03247
large tegument protein UL36; Provisional
359-734 7.21e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 7.21e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  359 AIEALTQLSSALPQPSHSTPQASCPLPEALSPPAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAwg 438
Cdd:PHA03247 2691 TVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA-- 2768
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  439 tDTPPATPRSSWP--MPRPSPDPMAELEQLLGSASDYIQSVFKRPEALPTKPKVKVEAPSsspapapspvlqrEAPTPSS 516
Cdd:PHA03247 2769 -PAPPAAPAAGPPrrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP-------------LPPPTSA 2834
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  517 EPdthqkaqtalqqhlhhkrslfleqvhdTSFPAPSEPSAPGWWPPPSSPVprlpdrppkekkkklptpaGGPV---GTE 593
Cdd:PHA03247 2835 QP---------------------------TAPPPPPGPPPPSLPLGGSVAP-------------------GGDVrrrPPS 2868
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  594 KAAPGI-----KPSVRKPIQIKKSRPREAQPLFPPvrqivleglrspaSQEVQAHPPAPLPASQGSAVPLPPEPSLALFA 668
Cdd:PHA03247 2869 RSPAAKpaapaRPPVRRLARPAVSRSTESFALPPD-------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP 2935
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767914142  669 PSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADdklEELIRQFEAEFGDSFGLPGPPSVPIQD 734
Cdd:PHA03247 2936 PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR---VAVPRFRVPQPAPSREAPASSTPPLTG 2998
PHA03247 PHA03247
large tegument protein UL36; Provisional
385-803 5.98e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 5.98e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  385 PEALSPPAPFRSPQSYLRAPSWPVVPPEEHS---SFAPDSSAFPPATPRT----------------EFPEAWGTDTPPAT 445
Cdd:PHA03247 2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRlapAILPDEPVGEPVHPRMltwirgleelasddagDPPPPLPPAAPPAA 2563
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  446 PRSSWPMPRPSPDPmaeleqllgsASDYIQSVFKRPEALP--TKPKVKVEAPSSSPapapspvlqREAPTPSSEPDTHQK 523
Cdd:PHA03247 2564 PDRSVPPPRPAPRP----------SEPAVTSRARRPDAPPqsARPRAPVDDRGDPR---------GPAPPSPLPPDTHAP 2624
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  524 AqtalqqhlhhkrslfleqvhdtsfPAPSEPSApgwwppPSSPVPRLPDRPPKEKKKKLPTPAGGPVGTEKAAPGIKPSV 603
Cdd:PHA03247 2625 D------------------------PPPPSPSP------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  604 RKPIQIKKSRPREAQPLFPPvrqivlegLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPtqem 683
Cdd:PHA03247 2675 QASSPPQRPRRRAARPTVGS--------LTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPP---- 2742
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  684 rsPSPMTALQPGstGPLPPADDKLEElirqfeaefgdSFGLPGPPSVPIQDPENQQT---CLPAPESPFATRSPKQIKIE 760
Cdd:PHA03247 2743 --AVPAGPATPG--GPARPARPPTTA-----------GPPAPAPPAAPAAGPPRRLTrpaVASLSESRESLPSPWDPADP 2807
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 767914142  761 SSGAVTVLSTTCFHSEEGGQEATPTKAENPLTPTLSGFLESPL 803
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSL 2850
PHA03247 PHA03247
large tegument protein UL36; Provisional
313-701 8.10e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 8.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  313 AGLPAPSTRP----LLSSEVPQISPQEGLPLSQSALSIAKEKNISLQTAIAIEALTQLSSALPQPSHSTPQASCPLPEAL 388
Cdd:PHA03247 2609 RGPAPPSPLPpdthAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAA 2688
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  389 SPPApfrSPQSYLRAPSWPVVPPEehssfaPDSSAFPPATPRTEFPEAWGTDTPPATprsSWPMPRPSPDpmaeleqllG 468
Cdd:PHA03247 2689 RPTV---GSLTSLADPPPPPPTPE------PAPHALVSATPLPPGPAAARQASPALP---AAPAPPAVPA---------G 2747
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  469 SASDYIQSVFKRPEAL-----PTKPKVKVEAPSSSPAPAPSPVLQREAPTPSSEPDTHQKAQTALQQHlhhkrslflEQV 543
Cdd:PHA03247 2748 PATPGGPARPARPPTTagppaPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA---------AAL 2818
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  544 HDTSFPAPSEPSAPGWWPPPSSPVPRLPDRPPKEkkkklptpaGGPVgtekaAPGIKPSVRKPIQIKKSRPreAQPLFPP 623
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL---------GGSV-----APGGDVRRRPPSRSPAAKP--AAPARPP 2882
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  624 VRQIVLEGL-RSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPsrdsllPPTQEMRSPSPM-----TALQPGST 697
Cdd:PHA03247 2883 VRRLARPAVsRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP------PPPPPPRPQPPLapttdPAGAGEPS 2956

                  ....
gi 767914142  698 GPLP 701
Cdd:PHA03247 2957 GAVP 2960
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
344-484 4.53e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.03  E-value: 4.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  344 LSIAKEKNISLQTAIaIEALTqlsSALPQPSHSTPQASCPLPeALSPPAPFRSPQSYLRAPSwPVVPPEEHSSFAPDSSA 423
Cdd:PRK14950  341 LRTTSYGQLPLELAV-IEALL---VPVPAPQPAKPTAAAPSP-VRPTPAPSTRPKAAAAANI-PPKEPVRETATPPPVPP 414
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767914142  424 FPPATPRTEFPEAWGTDTP-----PATPRSSWPMPRPSPD-----PMAELEQLLGSASDYIQSVFKRPEAL 484
Cdd:PRK14950  415 RPVAPPVPHTPESAPKLTRaaipvDEKPKYTPPAPPKEEEkaliaDGDVLEQLEAIWKQILRDVPPRSPAV 485
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
589-753 7.82e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 7.82e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  589 PVGTEKAAPGIKPSVRKPIQIKKSRPREAQPLfPPVRQIVLEGlRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFA 668
Cdd:PRK12323  402 PPAAPAAAPAAAAAARAVAAAPARRSPAPEAL-AAARQASARG-PGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  669 PSPSR--------DSLLPPTQEMRSPSPMTALQPGSTGPLP-PADDKLEELIRQFEAEFGDSfgLPGPPSVPIQDPENQQ 739
Cdd:PRK12323  480 PARAApaaapapaDDDPPPWEELPPEFASPAPAQPDAAPAGwVAESIPDPATADPDDAFETL--APAPAAAPAPRAAAAT 557
                         170
                  ....*....|....
gi 767914142  740 TCLPAPESPFATRS 753
Cdd:PRK12323  558 EPVVAPRPPRASAS 571
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
589-824 1.03e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  589 PVGTEKAAPGIKP-SVRKPIQIKKS-RPREAQ-PLFP--PVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAvPLPPEPS 663
Cdd:PTZ00449  573 PTLSKKPEFPKDPkHPKDPEEPKKPkRPRSAQrPTRPksPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSS-PERPEGP 651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  664 LALFAPSPSRDSLLP--PT--QEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEF-GDSFGLPGP----------- 727
Cdd:PTZ00449  652 KIIKSPKPPKSPKPPfdPKfkEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETpGTPFTTPRPlppklprdeef 731
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  728 PSVPIQDPENQQtclPAPESPFATRSPKQIKIESSGAVTVLSTTCfhSEEGGQEATPTKAENPLTPTLSGflESPLKYLD 807
Cdd:PTZ00449  732 PFEPIGDPDAEQ---PDDIEFFTPPEEERTFFHETPADTPLPDIL--AEEFKEEDIHAETGEPDEAMKRP--DSPSEHED 804
                         250
                  ....*....|....*..
gi 767914142  808 TPTKSLLDTPAKRAQAE 824
Cdd:PTZ00449  805 KPPGDHPSLPKKRHRLD 821
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
312-462 2.25e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 2.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  312 EAGLPAPSTRPLLSSEVPQISPQEGLPLSQSALSIAKEKNISLQTAIAIEALTQLSSALPQPSHSTPQASCPLPEALSPP 391
Cdd:PRK12323  399 PAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAA 478
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767914142  392 APFRSPQSYLRAPSWPVVPPEEH-----SSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAE 462
Cdd:PRK12323  479 APARAAPAAAPAPADDDPPPWEElppefASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAA 554
TALPID3 pfam15324
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for ...
364-459 2.63e-03

Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for Hedgehog signalling. Mutations in this gene noticed first in chickens lead to multiple abnormalities of development.


Pssm-ID: 434634 [Multi-domain]  Cd Length: 1288  Bit Score: 41.80  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   364 TQLSSALPQPSHSTPQASCPLPEALSP-PAPFRSPQ-SYLRAPSWPVVPPEEHSSFAPDSSAF--PPATPRTEFPEAwGT 439
Cdd:pfam15324  982 TLLPTPVPTPQPTPPCSPPSPLKEPSPvKTPDSSPCvSEHDFFPVKEIPPEKGADTGPAVSLVitPTVTPIATPPPA-AT 1060
                           90       100
                   ....*....|....*....|
gi 767914142   440 DTPPATPRSSWPMPRPSPDP 459
Cdd:pfam15324 1061 PTPPLSENSIDKLKSPSPEL 1080
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
326-745 2.73e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 2.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   326 SEVPQI-SPQEGLPLSQSAlsiaKEKNISLQTAIAIEALTQLSSALPQPSHSTPQASCPLPEALSPPAPFRSPQSYLRAP 404
Cdd:pfam03154  143 STSPSIpSPQDNESDSDSS----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPP 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   405 SWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPS-PDPMAELEQLLGSASDYIqsvfkrPEA 483
Cdd:pfam03154  219 NQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSlHGQMPPMPHSLQTGPSHM------QHP 292
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   484 LPTKPkvkveapssspAPAPSPVLQREAPTPSSEPDTHQKAQTalqQHLHHKRSLFLEQVHDTSFPAPSEPSAPGWWPPP 563
Cdd:pfam03154  293 VPPQP-----------FPLTPQSSQSQVPPGPSPAAPGQSQQR---IHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPP 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   564 SSPVPRLPDRPPKEKKKKLPTpagGPVGTEKAAPGIKPSVRKPI-QIKKSRPREAQP----LFPPVRQIVLEGLRSPASQ 638
Cdd:pfam03154  359 PTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPPPALKPLsSLSTHHPPSAHPpplqLMPQSQQLPPPPAQPPVLT 435
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   639 EVQAHPP----APLPASQGSAVPLPPEPSLAlFAPSPSRDSLLPPTQEMRSPSPMTALQP------GSTGPLPPADDKLE 708
Cdd:pfam03154  436 QSQSLPPpaasHPPTSGLHQVPSQSPFPQHP-FVPGGPPPITPPSGPPTSTSSAMPGIQPpssasvSSSGPVPAAVSCPL 514
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 767914142   709 ELIrQFEAEFGDSFGLPGPPSVPIQDPENQQTCLPAP 745
Cdd:pfam03154  515 PPV-QIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
367-765 4.30e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 4.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  367 SSALPQPSHSTPQAscPLPEALSPPAPFRSPQSYLRAPSWPVVP-PEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPAT 445
Cdd:PRK07764  419 AAAAPAPAAAPQPA--PAPAPAPAPPSPAGNAPAGGAPSPPPAAaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA 496
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  446 PRSswPMPRPSPDPMAELE----QLLGSASDYIQSVFkrpEALPTKPKVkveapssspAPAPSPVLQREAPTPS-----S 516
Cdd:PRK07764  497 PAA--PAAPAGADDAATLRerwpEILAAVPKRSRKTW---AILLPEATV---------LGVRGDTLVLGFSTGGlarrfA 562
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  517 EPDTHQKAQTALQQHLHhkRSLFLEQVHDTSFPAPSEPSAPGWWPPPSSPVPRLPDRPPKEKKKKLPTPAGGPVGTEKAA 596
Cdd:PRK07764  563 SPGNAEVLVTALAEELG--GDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEAS 640
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  597 PGIKPSVRKPIQIKKSRPREAQPLFPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPslalfAPSPSRDSL 676
Cdd:PRK07764  641 AAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA-----ATPPAGQAD 715
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  677 LPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPIQDPENQQTCLPAPESPFATRSPKQ 756
Cdd:PRK07764  716 DPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDD 795

                  ....*....
gi 767914142  757 IKIESSGAV 765
Cdd:PRK07764  796 EDRRDAEEV 804
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
347-464 5.21e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.85  E-value: 5.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  347 AKEKNISLQTAIAIEALTQLSSALPQPSHSTPQASCPLPEALSP----PAPFRSPQSYLRAPSWPVVPPEEH--SSFAPD 420
Cdd:PRK14951  369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASapaaPPAAAPPAPVAAPAAAAPAAAPAAapAAVALA 448
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 767914142  421 SSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELE 464
Cdd:PRK14951  449 PAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
589-754 6.43e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 6.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   589 PVGTEKAAPGIKPSVRKPIQIKKSRPREAQPlfpPVRQIVLEGLRSPASQEVQAHP---PAPLPASQGSAVPLPPEPSLA 665
Cdd:pfam03154  187 PPPGTTQAATAGPTPSAPSVPPQGSPATSQP---PNQTQSTAAPHTLIQQTPTLHPqrlPSPHPPLQPMTQPPPPSQVSP 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   666 LFAPSPSRDSLLPPTqemrsPSPMTALQPGSTGPLPPADDKLEELIRQFEaefgdsfgLPGPPSVPIQDPENQQTCLPAP 745
Cdd:pfam03154  264 QPLPQPSLHGQMPPM-----PHSLQTGPSHMQHPVPPQPFPLTPQSSQSQ--------VPPGPSPAAPGQSQQRIHTPPS 330

                   ....*....
gi 767914142   746 ESPFATRSP 754
Cdd:pfam03154  331 QSQLQSQQP 339
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
586-703 6.62e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 6.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142  586 AGGPVGTEKAAPgIKPSVRKPiqikkSRPREAQPLFPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLA 665
Cdd:PRK14951  370 AEAAAPAEKKTP-ARPEAAAP-----AAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 767914142  666 LFAPSPSrdSLLPPTQEMRSPSPMTALQPGSTGPLPPA 703
Cdd:PRK14951  444 AVALAPA--PPAQAAPETVAIPVRVAPEPAVASAAPAP 479
GGN pfam15685
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ...
588-753 7.34e-03

Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.


Pssm-ID: 434857 [Multi-domain]  Cd Length: 668  Bit Score: 40.14  E-value: 7.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   588 GPVGTEKAAPGIKPSVRKPIQIKKSRPREAQPLFPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPlPPEPSLALF 667
Cdd:pfam15685  388 GPWGSPPPPPGKAHPIPGPRRPAPALLAPPMFIFPAPTNGEPVRPGPPAPQALLPRPPPPTPPATPPPVP-PPIPQLPAL 466
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767914142   668 APSPsRDSLLPPTQEMRSPSPMTALQPGSTGPLPPAddkleELIRQFEAEFGDSFGLPGPPSVPIqdpenqqTCLPAPES 747
Cdd:pfam15685  467 QPMP-LAAARPPTPRPCPGHGESALAPAPTAPLPPA-----LAADQAPAPALAAAPAPSPAPAPA-------TADPLPPA 533

                   ....*.
gi 767914142   748 PFATRS 753
Cdd:pfam15685  534 PAPIKA 539
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH