NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462492226|ref|XP_054185699|]
View 

histone-lysine N-methyltransferase EHMT2 isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1133-1371 0e+00

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


:

Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 551.55  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1133 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNC 1212
Cdd:cd10533      1 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1213 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNI 1292
Cdd:cd10533     81 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNI 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462492226 1293 SRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEAIALEQ 1371
Cdd:cd10533    161 SRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEAIALEQ 239
EHMT_ZBD cd20905
Zinc-binding domain of euchromatic histone lysine methyltransferases EHMT1 and EHTM2; EHMT1 ...
598-727 8.83e-64

Zinc-binding domain of euchromatic histone lysine methyltransferases EHMT1 and EHTM2; EHMT1 (also known as GLP) and EHMT2 (also known as NG36 and G9a) are histone methyltransferases that methylate the K9 position of histone H3, marking genomic regions for transcriptional repression. They may play a role in the G0/G1 cell cycle transition and are associated with promoting various types of cancer. Mutations in EHMT1 are associated with the genetic disorder Kleefstra syndrome. A functional role for the zinc-binding domain has not been established.


:

Pssm-ID: 411018  Cd Length: 133  Bit Score: 212.64  E-value: 8.83e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  598 FEELPLCSCRMEAPKIDRISERAGHKCMATESVDGELSGC-NAAILKRETMRPSSRVALMVLCETHRARMVKHHCCPGCG 676
Cdd:cd20905      1 STELPLCSCRMESPLYASITELAPVYCQAIDSIDGKLIGCsNLPVSKQELLRPSPRVPFLVLCEDHRARLVKHQCCPGCG 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462492226  677 YFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDAS-EAQEVTIP 727
Cdd:cd20905     81 LFCTQGTFVQCSPDGSIKHLFHRECALLIGGKPYCPHCGEDSPpSAKEVFLP 132
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
806-1072 1.01e-51

Ankyrin repeat [Signal transduction mechanisms];


:

Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 184.39  E-value: 1.01e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  806 GREALEKALVIQESERRKKLRFHPRQLYLSVKQGELQKVILMLLDNLDPNFQSDQQSkrTPLHAAAQKGSVEICHVLLQA 885
Cdd:COG0666     32 LLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGN--TLLHAAARNGDLEIVKLLLEA 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  886 GANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWT 965
Cdd:COG0666    110 GADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAG-ADVNARDNDGET 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  966 PIIWAAEHKHIEVIRMLLTRGADVTLTDNvserlveEENICLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAAR 1045
Cdd:COG0666    189 PLHLAAENGHLEIVKLLLEAGADVNAKDN-------DGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAA 261
                          250       260
                   ....*....|....*....|....*..
gi 2462492226 1046 ESYHDCVLLFLSRGANPELRNKEGDTA 1072
Cdd:COG0666    262 AGAALIVKLLLLALLLLAAALLDLLTL 288
2A1904 super family cl36772
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
396-504 1.01e-08

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


The actual alignment was detected with superfamily member TIGR00927:

Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 60.01  E-value: 1.01e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  396 GKVTSDLAKRRKLNSGG--GLSEELGSARRSGEVTLTKGDPGSLEEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEalt 473
Cdd:TIGR00927  789 GEMKGDEGAEGKVEHEGetEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEE--- 865
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2462492226  474 eqlsEEEEEEEEEEEEEEEEEEEEEEEEDEE 504
Cdd:TIGR00927  866 ----EEEEEEEEEEEEEEEEEEEEEEEENEE 892
PHA03247 super family cl33720
large tegument protein UL36; Provisional
126-382 9.24e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 9.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  126 PRGRGLMRARGRGRAAPPGSRGRGRGGPHRGRGRPrSLLSLPRAQA-SWTPQLSTGLTSPPVPCLPSQGEAPAEMGALLL 204
Cdd:PHA03247  2659 GRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVG-SLTSLADPPPpPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  205 EKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVgdegadtPVGATPLIGDESENLEGDGDLRGGR 284
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR-------PAVASLSESRESLPSPWDPADPPAA 2810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  285 ILLGHATKSFPSSPSkGGSCPSRAKMSMTGAGKSPPSVQSLAMRLLSMPGAQGAAAAGSEPPPATTSPEGQPKVHR-ARK 363
Cdd:PHA03247  2811 VLAPAAALPPAASPA-GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlARP 2889
                          250       260
                   ....*....|....*....|.
gi 2462492226  364 TMSKPGN--GQPPVPEKRPPE 382
Cdd:PHA03247  2890 AVSRSTEsfALPPDQPERPPQ 2910
 
Name Accession Description Interval E-value
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1133-1371 0e+00

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 551.55  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1133 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNC 1212
Cdd:cd10533      1 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1213 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNI 1292
Cdd:cd10533     81 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNI 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462492226 1293 SRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEAIALEQ 1371
Cdd:cd10533    161 SRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEAIALEQ 239
EHMT_ZBD cd20905
Zinc-binding domain of euchromatic histone lysine methyltransferases EHMT1 and EHTM2; EHMT1 ...
598-727 8.83e-64

Zinc-binding domain of euchromatic histone lysine methyltransferases EHMT1 and EHTM2; EHMT1 (also known as GLP) and EHMT2 (also known as NG36 and G9a) are histone methyltransferases that methylate the K9 position of histone H3, marking genomic regions for transcriptional repression. They may play a role in the G0/G1 cell cycle transition and are associated with promoting various types of cancer. Mutations in EHMT1 are associated with the genetic disorder Kleefstra syndrome. A functional role for the zinc-binding domain has not been established.


Pssm-ID: 411018  Cd Length: 133  Bit Score: 212.64  E-value: 8.83e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  598 FEELPLCSCRMEAPKIDRISERAGHKCMATESVDGELSGC-NAAILKRETMRPSSRVALMVLCETHRARMVKHHCCPGCG 676
Cdd:cd20905      1 STELPLCSCRMESPLYASITELAPVYCQAIDSIDGKLIGCsNLPVSKQELLRPSPRVPFLVLCEDHRARLVKHQCCPGCG 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462492226  677 YFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDAS-EAQEVTIP 727
Cdd:cd20905     81 LFCTQGTFVQCSPDGSIKHLFHRECALLIGGKPYCPHCGEDSPpSAKEVFLP 132
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
806-1072 1.01e-51

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 184.39  E-value: 1.01e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  806 GREALEKALVIQESERRKKLRFHPRQLYLSVKQGELQKVILMLLDNLDPNFQSDQQSkrTPLHAAAQKGSVEICHVLLQA 885
Cdd:COG0666     32 LLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGN--TLLHAAARNGDLEIVKLLLEA 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  886 GANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWT 965
Cdd:COG0666    110 GADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAG-ADVNARDNDGET 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  966 PIIWAAEHKHIEVIRMLLTRGADVTLTDNvserlveEENICLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAAR 1045
Cdd:COG0666    189 PLHLAAENGHLEIVKLLLEAGADVNAKDN-------DGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAA 261
                          250       260
                   ....*....|....*....|....*..
gi 2462492226 1046 ESYHDCVLLFLSRGANPELRNKEGDTA 1072
Cdd:COG0666    262 AGAALIVKLLLLALLLLAAALLDLLTL 288
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
1223-1345 1.60e-40

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 145.56  E-value: 1.60e-40
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  1223 VRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRE--------DDSYLFDLDNKdgevYCIDARYYGNISR 1294
Cdd:smart00317    1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPkaydtdgaKAFYLFDIDSD----LCIDARRKGNLAR 76
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|.
gi 2462492226  1295 FINHLCDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWD 1345
Cdd:smart00317   77 FINHSCEPNCELLFVEVNGDD----RIVIFALRDIKPGEELTIDYGSDYAN 123
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
1234-1340 1.44e-30

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 116.85  E-value: 1.44e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1234 GWGVRALQTIPQGTFICEYVGE-LISDAEADVRED-----------DSYLFDLDNKDGevYCIDAR--YYGNISRFINHL 1299
Cdd:pfam00856    1 GRGLFATEDIPKGEFIGEYVEVlLITKEEADKRELlyydklelrlwGPYLFTLDEDSE--YCIDARalYYGNWARFINHS 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2462492226 1300 CDPNIIPVRVFMlhqdLRFPRIAFFSSRDIRTGEELGFDYG 1340
Cdd:pfam00856   79 CDPNCEVRVVYV----NGGPRIVIFALRDIKPGEELTIDYG 115
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
1221-1362 6.60e-29

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 112.75  E-value: 6.60e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1221 IKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDS-----YLFDLDnkDGEVycIDARYYGNISRF 1295
Cdd:COG2940      4 LHPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERREPHKeplhtYLFELD--DDGV--IDGALGGNPARF 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1296 INHLCDPNIIPVRvfmlhqdlRFPRIAFFSSRDIRTGEELGFDYGDRFWDiksKYFTCQCGseKCKH 1362
Cdd:COG2940     80 INHSCDPNCEADE--------EDGRIFIVALRDIAAGEELTYDYGLDYDE---EEYPCRCP--NCRG 133
Ank_2 pfam12796
Ankyrin repeats (3 copies);
900-993 2.30e-22

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 92.49  E-value: 2.30e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  900 LMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLStgQVDVNAQDSGgWTPIIWAAEHKHIEVI 979
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLE--HADVNLKDNG-RTALHYAARSGHLEIV 77
                           90
                   ....*....|....
gi 2462492226  980 RMLLTRGADVTLTD 993
Cdd:pfam12796   78 KLLLEKGADINVKD 91
PHA03100 PHA03100
ankyrin repeat protein; Provisional
840-1061 3.72e-20

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 94.73  E-value: 3.72e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  840 ELQKVILMLLDNLDPNFQSdqqsKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNH-----LEVARY 914
Cdd:PHA03100    16 KNIKYIIMEDDLNDYSYKK----PVLPLYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHYLSNIKYnltdvKEIVKL 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  915 MVQRGGCVYSKEEDGSTCLHHAA--KIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHI--EVIRMLLTRGADVT 990
Cdd:PHA03100    92 LLEYGANVNAPDNNGITPLLYAIskKSNSYSIVEYLLDNG-ANVNIKNSDGENLLHLYLESNKIdlKILKLLIDKGVDIN 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  991 LTDNVsERLVE------EENIC----LHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGA 1060
Cdd:PHA03100   171 AKNRV-NYLLSygvpinIKDVYgftpLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNGP 249

                   .
gi 2462492226 1061 N 1061
Cdd:PHA03100   250 S 250
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
396-504 1.01e-08

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 60.01  E-value: 1.01e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  396 GKVTSDLAKRRKLNSGG--GLSEELGSARRSGEVTLTKGDPGSLEEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEalt 473
Cdd:TIGR00927  789 GEMKGDEGAEGKVEHEGetEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEE--- 865
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2462492226  474 eqlsEEEEEEEEEEEEEEEEEEEEEEEEDEE 504
Cdd:TIGR00927  866 ----EEEEEEEEEEEEEEEEEEEEEEEENEE 892
trp TIGR00870
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ...
888-1093 2.91e-08

transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273311 [Multi-domain]  Cd Length: 743  Bit Score: 58.55  E-value: 2.91e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  888 NINAVDKQQRTPLMEAVV-NNHLEVARYMVQRGGCVYSkeedGSTCLHHAAK--IGNLEMVSLLLSTGQVD------VNA 958
Cdd:TIGR00870   44 NINCPDRLGRSALFVAAIeNENLELTELLLNLSCRGAV----GDTLLHAISLeyVDAVEAILLHLLAAFRKsgplelAND 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  959 QDSG----GWTPIIWAAEHKHIEVIRMLLTRGADVTLTDNVSERLVEEENICLHW-------ASFTGSAAIAEVLLNARC 1027
Cdd:TIGR00870  120 QYTSeftpGITALHLAAHRQNYEIVKLLLERGASVPARACGDFFVKSQGVDSFYHgesplnaAACLGSPSIVALLSEDPA 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1028 DLHAVNYHGDTPLHIAARESY---------HDCVLLFLSRGANP-------ELRNKEGDTAWDL-TPERSDVWFALQLNR 1090
Cdd:TIGR00870  200 DILTADSLGNTLLHLLVMENEfkaeyeelsCQMYNFALSLLDKLrdskeleVILNHQGLTPLKLaAKEGRIVLFRLKLAI 279

                   ...
gi 2462492226 1091 KLR 1093
Cdd:TIGR00870  280 KYK 282
TRPV5-6 cd22192
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ...
894-1042 3.60e-08

Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.


Pssm-ID: 411976 [Multi-domain]  Cd Length: 609  Bit Score: 58.10  E-value: 3.60e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  894 KQQR---TPLMEAVVNNHLEVARYMVQRGGC-VYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQVDVNAQDSG----GWT 965
Cdd:cd22192     12 QQKRiseSPLLLAAKENDVQAIKKLLKCPSCdLFQRGALGETALHVAALYDNLEAAVVLMEAAPELVNEPMTSdlyqGET 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  966 PIIWAAEHKHIEVIRMLLTRGADVtltdnVSER------LVEEENIC------LHWASFTGSAAIAEVLLNARCDLHAVN 1033
Cdd:cd22192     92 ALHIAVVNQNLNLVRELIARGADV-----VSPRatgtffRPGPKNLIyygehpLSFAACVGNEEIVRLLIEHGADIRAQD 166

                   ....*....
gi 2462492226 1034 YHGDTPLHI 1042
Cdd:cd22192    167 SLGNTVLHI 175
PHA03247 PHA03247
large tegument protein UL36; Provisional
126-382 9.24e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 9.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  126 PRGRGLMRARGRGRAAPPGSRGRGRGGPHRGRGRPrSLLSLPRAQA-SWTPQLSTGLTSPPVPCLPSQGEAPAEMGALLL 204
Cdd:PHA03247  2659 GRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVG-SLTSLADPPPpPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  205 EKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVgdegadtPVGATPLIGDESENLEGDGDLRGGR 284
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR-------PAVASLSESRESLPSPWDPADPPAA 2810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  285 ILLGHATKSFPSSPSkGGSCPSRAKMSMTGAGKSPPSVQSLAMRLLSMPGAQGAAAAGSEPPPATTSPEGQPKVHR-ARK 363
Cdd:PHA03247  2811 VLAPAAALPPAASPA-GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlARP 2889
                          250       260
                   ....*....|....*....|.
gi 2462492226  364 TMSKPGN--GQPPVPEKRPPE 382
Cdd:PHA03247  2890 AVSRSTEsfALPPDQPERPPQ 2910
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
963-991 1.47e-05

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 42.96  E-value: 1.47e-05
                            10        20
                    ....*....|....*....|....*....
gi 2462492226   963 GWTPIIWAAEHKHIEVIRMLLTRGADVTL 991
Cdd:smart00248    2 GRTPLHLAAENGNLEVVKLLLDKGADINA 30
 
Name Accession Description Interval E-value
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1133-1371 0e+00

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 551.55  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1133 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNC 1212
Cdd:cd10533      1 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1213 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNI 1292
Cdd:cd10533     81 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNI 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462492226 1293 SRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEAIALEQ 1371
Cdd:cd10533    161 SRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEAIALEQ 239
SET_EHMT cd10543
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1134-1363 5.52e-171

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase EHMT1, EHMT2 and similar proteins; This family includes EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380941 [Multi-domain]  Cd Length: 231  Bit Score: 508.80  E-value: 5.52e-171
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1134 DYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNCK 1213
Cdd:cd10543      2 DFLYVTENCETSPLNIDRNITSLQTCSCRDDCSSDNCVCGRLSVRCWYDKEGRLLPDFNKLDPPLIFECNRACSCWRNCR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1214 NRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNIS 1293
Cdd:cd10543     82 NRVVQNGIRYRLQLFRTRGMGWGVRALQDIPKGTFVCEYIGELISDSEADSREDDSYLFDLDNKDGETYCIDARRYGNIS 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1294 RFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHS 1363
Cdd:cd10543    162 RFINHLCEPNLIPVRVFVEHQDLRFPRIAFFASRDIKAGEELGFDYGEKFWRIKGKYFTCRCGSPKCKYS 231
SET_EHMT1 cd10535
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1134-1363 3.85e-167

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 1 (EHMT1) and similar proteins; EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, or lysine N-methyltransferase 1D (KMT1D)) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380933 [Multi-domain]  Cd Length: 231  Bit Score: 498.69  E-value: 3.85e-167
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1134 DYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNCK 1213
Cdd:cd10535      2 NYKYVSQNCVTSPMNIDRNITHLQYCVCIDDCSSSNCMCGQLSMRCWYDKDGRLLPEFNMAEPPLIFECNHACSCWRNCR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1214 NRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNIS 1293
Cdd:cd10535     82 NRVVQNGLRARLQLYRTRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREEDSYLFDLDNKDGEVYCIDARFYGNVS 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1294 RFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHS 1363
Cdd:cd10535    162 RFINHHCEPNLVPVRVFMAHQDLRFPRIAFFSTRLIEAGEQLGFDYGERFWDIKGKLFSCRCGSPKCRHS 231
SET_SETDB-like cd10538
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
1133-1340 5.02e-96

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2, and similar proteins; The family includes SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2. SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis. SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin. This family also includes the pre-SET domain, which is found in a number of histone methyltransferases (HMTase), N-terminal to the SET domain. Pre-SET domain is a zinc binding motif which contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilizing SET domains. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380936 [Multi-domain]  Cd Length: 217  Bit Score: 307.38  E-value: 5.02e-96
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1133 EDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSI-RCWYDKDGRLlQEFNkiEPPLIFECNQACSCWRN 1211
Cdd:cd10538      1 PSFTYIKDNIVGKNVQPFSNIIDSVGCKCKDDCLDSKCACAAESDgIFAYTKNGLL-RLNN--SPPPIFECNSKCSCDDD 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1212 CKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDDSYLFDLDN-----KDGE 1280
Cdd:cd10538     78 CKNRVVQRGLQARLQVFRTSKKGWGVRSLEFIPKGSFVCEYVGEVITTSEADRRgkiydkSGGSYLFDLDEfsdsdGDGE 157
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1281 VYCIDARYYGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYG 1340
Cdd:cd10538    158 ELCVDATFCGNVSRFINHSCDPNLFPFNVVIDHDDLRYPRIALFATRDILPGEELTFDYG 217
SET_SUV39H cd10542
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
1134-1361 6.54e-73

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homologs, SUV39H1, SUV39H2 and similar proteins; This family includes SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. Also included are Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (SUV39H homolog) and Neurospora crassa DIM-5, both of which also methylate 'Lys-9' of histone H3.


Pssm-ID: 380940 [Multi-domain]  Cd Length: 245  Bit Score: 243.35  E-value: 6.54e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1134 DYKYIseNCETSTMNIDRNITHLQHCTCVDDC--SSSNClCGQLS-IRCWYDKDGRLlqefnKIEPPL-IFECNQACSCW 1209
Cdd:cd10542      2 NFQYI--NDYIPGDGVKIPEDFLVGCECTEDChnNNPTC-CPAESgVKFAYDKQGRL-----RLPPGTpIYECNSRCKCG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1210 RNCKNRVVQSGIKVRLQLYRTA-KMGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDDSYLFDLD-NKDGEV 1281
Cdd:cd10542     74 PDCPNRVVQRGRKVPLCIFRTSnGRGWGVKTLEDIKKGTFVMEYVGEIITSEEAERRgkiydaNGRTYLFDLDyNDDDCE 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1282 YCIDARYYGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYgDRFWDIKSKYFT--------- 1352
Cdd:cd10542    154 YTVDAAYYGNISHFINHSCDPNLAVYAVWINHLDPRLPRIAFFAKRDIKAGEELTFDY-LMTGTGGSSESTipkpkdvrv 232
                          250
                   ....*....|
gi 2462492226 1353 -CQCGSEKCK 1361
Cdd:cd10542    233 pCLCGSKNCR 242
SET_SETDB1 cd10517
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
1110-1361 1.27e-72

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes.


Pssm-ID: 380915 [Multi-domain]  Cd Length: 288  Bit Score: 244.12  E-value: 1.27e-72
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1110 DVARGYENVPIPCVNGVDGEPcPEDYKYISENCETSTMNIDRNITHLQHCTCVDDCS-SSNCLCGQLSI---RCWYDKD- 1184
Cdd:cd10517      8 DISYGKEGVPIPCVNEIDNSS-PPYVEYSKERIPGKGVNINLDPDFLVGCDCTDGCRdKSKCACQQLTIeatAATPGGQi 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1185 --------GRLLQEFnkiePPLIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGEL 1256
Cdd:cd10517     87 npsagyqyRRLMEKL----PTGVYECNSRCKCDKRCYNRVVQNGLQVRLQVFKTEKKGWGIRCLDDIPKGSFVCIYAGQI 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1257 ISDAEADVRE---DDSYLFDLD------------NKDGEVYC--IDARYYGNISRFINHLCDPNIIPVRVFMLHQDLRFP 1319
Cdd:cd10517    163 LTEDEANEEGlqyGDEYFAELDyievveklkegyESDVEEHCyiIDAKSEGNLGRYLNHSCSPNLFVQNVFVDTHDLRFP 242
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462492226 1320 RIAFFSSRDIRTGEELGFDYGdrfWDIKSKYFT---CQCGSEKCK 1361
Cdd:cd10517    243 WVAFFASRYIRAGTELTWDYN---YEVGSVPGKvlyCYCGSSNCR 284
EHMT_ZBD cd20905
Zinc-binding domain of euchromatic histone lysine methyltransferases EHMT1 and EHTM2; EHMT1 ...
598-727 8.83e-64

Zinc-binding domain of euchromatic histone lysine methyltransferases EHMT1 and EHTM2; EHMT1 (also known as GLP) and EHMT2 (also known as NG36 and G9a) are histone methyltransferases that methylate the K9 position of histone H3, marking genomic regions for transcriptional repression. They may play a role in the G0/G1 cell cycle transition and are associated with promoting various types of cancer. Mutations in EHMT1 are associated with the genetic disorder Kleefstra syndrome. A functional role for the zinc-binding domain has not been established.


Pssm-ID: 411018  Cd Length: 133  Bit Score: 212.64  E-value: 8.83e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  598 FEELPLCSCRMEAPKIDRISERAGHKCMATESVDGELSGC-NAAILKRETMRPSSRVALMVLCETHRARMVKHHCCPGCG 676
Cdd:cd20905      1 STELPLCSCRMESPLYASITELAPVYCQAIDSIDGKLIGCsNLPVSKQELLRPSPRVPFLVLCEDHRARLVKHQCCPGCG 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462492226  677 YFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDAS-EAQEVTIP 727
Cdd:cd20905     81 LFCTQGTFVQCSPDGSIKHLFHRECALLIGGKPYCPHCGEDSPpSAKEVFLP 132
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
1134-1360 3.09e-58

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 201.76  E-value: 3.09e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1134 DYKYISENCETSTMNIDRNITHLQHCTCVDD-CSSSNCLCgqlsIRCW---YDKDGRLLQEFNKIEPPlIFECNQACSCW 1209
Cdd:cd10544      2 DFQYTPENVPGPGADTDPNEITFPGCDCKTSsCEPETCSC----LRKYgpnYDDDGCLLDFDGKYSGP-VFECNSMCKCS 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1210 RNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDDSYLFDLDN--KDGEV 1281
Cdd:cd10544     77 ESCQNRVVQNGLQFKLQVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIGFEEARRRtksqtkGDMNYIIVLREhlSSGKV 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1282 Y--CIDARYYGNISRFINHLCDPN--IIPVRVfmlhqDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFT----- 1352
Cdd:cd10544    157 LetFVDPTYIGNIGRFLNHSCEPNlfMVPVRV-----DSMVPKLALFAARDIVAGEELSFDYSGEFSNSVESVTLarqde 231
                          250
                   ....*....|....
gi 2462492226 1353 ------CQCGSEKC 1360
Cdd:cd10544    232 sksrkpCLCGAENC 245
SET_AtSUVH-like cd10545
SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar ...
1159-1340 1.38e-55

SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar proteins; Arabidopsis thaliana SUVH protein (also termed suppressor of variegation 3-9 homolog protein) is a histone-lysine N-methyltransferase that methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. Some family members contain a post-SET domain which binds a Zn2+ ion. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380943 [Multi-domain]  Cd Length: 232  Bit Score: 193.00  E-value: 1.38e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1159 CTCVDDC--SSSNCLCGQL-SIRCWYDKDGRLlqefnkIEP-PLIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTAKMG 1234
Cdd:cd10545     24 CDCKNRCtdGASDCACVKKnGGEIPYNFNGRL------IRAkPAIYECGPLCKCPPSCYNRVTQKGLRYRLEVFKTAERG 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1235 WGVRALQTIPQGTFICEYVGELISDAEADVR-EDDSYLFDLDNK------DGEV---------------------YCIDA 1286
Cdd:cd10545     98 WGVRSWDSIPAGSFICEYVGELLDTSEADTRsGNDDYLFDIDNRqtnrgwDGGQrldvgmsdgerssaedeesseFTIDA 177
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462492226 1287 RYYGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYG 1340
Cdd:cd10545    178 GSFGNVARFINHSCSPNLFVQCVLYDHNDLRLPRVMLFAADNIPPLQELTYDYG 231
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
806-1072 1.01e-51

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 184.39  E-value: 1.01e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  806 GREALEKALVIQESERRKKLRFHPRQLYLSVKQGELQKVILMLLDNLDPNFQSDQQSkrTPLHAAAQKGSVEICHVLLQA 885
Cdd:COG0666     32 LLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGN--TLLHAAARNGDLEIVKLLLEA 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  886 GANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWT 965
Cdd:COG0666    110 GADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAG-ADVNARDNDGET 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  966 PIIWAAEHKHIEVIRMLLTRGADVTLTDNvserlveEENICLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAAR 1045
Cdd:COG0666    189 PLHLAAENGHLEIVKLLLEAGADVNAKDN-------DGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAA 261
                          250       260
                   ....*....|....*....|....*..
gi 2462492226 1046 ESYHDCVLLFLSRGANPELRNKEGDTA 1072
Cdd:COG0666    262 AGAALIVKLLLLALLLLAAALLDLLTL 288
SET_SETDB cd10541
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), ...
1159-1361 7.38e-50

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), SET domain bifurcated 2 (SETDB2), and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380939 [Multi-domain]  Cd Length: 236  Bit Score: 176.97  E-value: 7.38e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1159 CTCVDDC-SSSNCLCGQLSIR----CWYDKD----GRLLQEFNKIEPPLIFECNQACSCWRN-CKNRVVQSGIKVRLQLY 1228
Cdd:cd10541     18 CDCTDGCrDKSKCACHQLTIQatacTPGGQDnptaGYQYKRLEECLPTGVYECNKLCKCDPNmCQNRLVQHGLQVRLQLF 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1229 RTAKMGWGVRALQTIPQGTFICEYVGELISDAEADvRED----DSYLFDLDNKDGEVYCIDARYYGNISRFINHLCDPNI 1304
Cdd:cd10541     98 KTQNKGWGIRCLDDIAKGTFVCIYAGKILTDDFAD-KEGlemgDEYFANLDHIEESCYIIDAKLEGNLGRYLNHSCSPNL 176
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1305 IPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCK 1361
Cdd:cd10541    177 FVQNVFVDTHDLRFPWVAFFASKRIKAGTELTWDYNYEVGSVEGKELLCCCGSNECR 233
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
841-1106 3.09e-49

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 177.07  E-value: 3.09e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  841 LQKVILMLLDNLDPNFQSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGG 920
Cdd:COG0666     32 LLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGNTLLHAAARNGDLEIVKLLLEAGA 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  921 CVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTDNvserlv 1000
Cdd:COG0666    112 DVNARDKDGETPLHLAAYNGNLEIVKLLLEAG-ADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDN------ 184
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1001 eEENICLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDLTPERS 1080
Cdd:COG0666    185 -DGETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAAAG 263
                          250       260
                   ....*....|....*....|....*.
gi 2462492226 1081 DVWFALQLNRKLRLGVGNRAIRTEKI 1106
Cdd:COG0666    264 AALIVKLLLLALLLLAAALLDLLTLL 289
SET_SUV39H2 cd10532
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
1134-1361 1.76e-48

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 2 (SUV39H2) and similar proteins; SUV39H2 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B (KMT1B), or Su(var)3-9 homolog 2) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380930 [Multi-domain]  Cd Length: 243  Bit Score: 173.15  E-value: 1.76e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1134 DYKYISENCETSTMNIDRNITHlqHCTCVDdCSSSNCLCGQLSIRCWYDKDGRLlqefnKIEPPL-IFECNQACSCWRNC 1212
Cdd:cd10532      2 DFYYINEYKPAPGINLDNEATV--GCDCSD-CFFGKCCPAEAGVLFAYNEHGQL-----KIPPGTpIYECNSRCKCGPDC 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1213 KNRVVQSGIKVRLQLYRTAK-MGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDDSYLFDLDNKDGEvYCID 1285
Cdd:cd10532     74 PNRVVQKGTQYSLCIFRTSNgRGWGVKTLQKIKKNSFVMEYVGEVITSEEAERRgqfydsKGITYLFDLDYESDE-FTVD 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1286 ARYYGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDY-----GDRFWD------IKSKYFT-C 1353
Cdd:cd10532    153 AARYGNVSHFVNHSCDPNLQVFNVFIDNLDTRLPRIALFSTRTIKAGEELTFDYqmkgsGDLSSDsidnspAKKRVRTvC 232

                   ....*...
gi 2462492226 1354 QCGSEKCK 1361
Cdd:cd10532    233 KCGAVTCR 240
SET_SUV39H1 cd10525
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
1134-1361 4.98e-48

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 1 (SUV39H1) and similar proteins; SUV39H1 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A (KMT1A), position-effect variegation 3-9 homolog (SUV39H), or Su(var)3-9 homolog 1) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380923 [Multi-domain]  Cd Length: 255  Bit Score: 172.38  E-value: 4.98e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1134 DYKYISENCETSTMNIDRNITHlqhCTCvDDCSSS---NCLCGQLSIRCWYDKDGRLlqefnKIEPPL-IFECNQACSCW 1209
Cdd:cd10525      2 DFVYINEYKVGEGVTLNQVAVG---CEC-QDCLSQpvgGCCPGASKHRFAYNEQGQV-----KVRPGLpIYECNSRCRCG 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1210 RNCKNRVVQSGIKVRLQLYRTAK-MGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDDSYLFDLDNKDgEVY 1282
Cdd:cd10525     73 PDCPNRVVQKGIQYDLCIFRTDNgRGWGVRTLEKIRKNSFVMEYVGEIITSEEAERRgqiydrQGATYLFDLDYVE-DVY 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1283 CIDARYYGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDY---------------------GD 1341
Cdd:cd10525    152 TVDAAYYGNISHFVNHSCDPNLQVYNVFIDNLDERLPRIALFATRTIRAGEELTFDYnmqvdpvdaestkmdsnfglaGL 231
                          250       260
                   ....*....|....*....|
gi 2462492226 1342 RFWDIKSKYFTCQCGSEKCK 1361
Cdd:cd10525    232 PGSPKKRVRIECKCGVRSCR 251
SET_SUV39H_Clr4-like cd20073
SET domain (including pre-SET and post-SET domains) found in of Schizosaccharomyces pombe H3K9 ...
1152-1361 9.13e-47

SET domain (including pre-SET and post-SET domains) found in of Schizosaccharomyces pombe H3K9 methyltransferase Clr4, and similar proteins; This subfamily contains fission yeast Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (also known as Suv39h), the sole homolog of the mammalian SUV39H1 and SUV39H2 enzymes, that has a critical role in preventing aberrant heterochromatin formation. It is known to di- and tri-methylate Lys-9 of histone H3, a central heterochromatic histone modification, with its specificity profile most similar to that of the human SUV39H2 homolog.


Pssm-ID: 380999 [Multi-domain]  Cd Length: 259  Bit Score: 168.90  E-value: 9.13e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1152 NITHLQHCTcVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFNKIepplIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTA 1231
Cdd:cd20073     27 SCSKLGGCD-LNNPGSCQCLEDSNEKSFAYDEYGRVRANTGSI----IYECNENCDCGINCPNRVVQRGRKLPLEIFKTK 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1232 KMGWGVRALQTIPQGTFICEYVGELISDAEADVRE---DD---SYLFDLDNKDGEV---YCIDARYYGNISRFINHLCDP 1302
Cdd:cd20073    102 HKGWGLRCPRFIKAGTFIGVYLGEVITQSEAEIRGkkyDNvgvTYLFDLDLFEDQVdeyYTVDAQYCGDVTRFINHSCDP 181
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462492226 1303 NIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDR----------------FWDIKSKyFTCQCGSEKCK 1361
Cdd:cd20073    182 NLAIYSVLRDKSDSKIYDLAFFAIKDIPALEELTFDYSGRnnfdqlgfignrsnskYINLKNK-RPCYCGSANCR 255
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
845-1092 2.08e-46

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 168.98  E-value: 2.08e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  845 ILMLLDNLDPNFQSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYS 924
Cdd:COG0666      3 LLLLLLLLLLAALLLLLLLALLLLAAALLLLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  925 KEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTDNVSERLveeen 1004
Cdd:COG0666     83 KDDGGNTLLHAAARNGDLEIVKLLLEAG-ADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTP----- 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1005 icLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDLTPERSDVWF 1084
Cdd:COG0666    157 --LHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLEI 234

                   ....*...
gi 2462492226 1085 ALQLNRKL 1092
Cdd:COG0666    235 VKLLLEAG 242
SET_SUV39H_DIM5-like cd19473
SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; ...
1159-1361 2.19e-46

SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; This subfamily contains Neurospora crassa DIM-5 (also termed H3-K9-HMTase dim-5, or HKMT) which functions as histone-lysine N-methyltransferase that specifically trimethylates histone H3 to form H3K9me3.


Pssm-ID: 380996 [Multi-domain]  Cd Length: 274  Bit Score: 168.26  E-value: 2.19e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1159 CTCVDD--CSSSNCLCGQ-----------LSIRCWY---DKDGRLLQEF-NKIEPplIFECNQACSCWRNCKNRVVQSGI 1221
Cdd:cd19473     26 CECTDDedCMYSGCLCLQdvdpdddrdpgKKKNAYHssgAKKGCLRGHMlNSRLP--IYECHEGCACSDDCPNRVVERGR 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1222 KVRLQLYRTA-KMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDS--------YLFDLDN----------KDGEVY 1282
Cdd:cd19473    104 KVPLQIFRTSdGRGWGVRSTVDIKRGQFVDCYVGEIITPEEAQRRRDAAtiaqrkdvYLFALDKfsdpdsldprLRGDPY 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1283 CIDARYYGNISRFINHLCDPNIipvRVFML---HQDLRFPRIAFFSSRDIRTGEELGFDY--------GDRFWDIKSKYF 1351
Cdd:cd19473    184 EIDGEFMSGPTRFINHSCDPNL---RIFARvgdHADKHIHDLAFFAIKDIPRGTELTFDYvdgvtgldDDAGDEEKEKEM 260
                          250
                   ....*....|.
gi 2462492226 1352 T-CQCGSEKCK 1361
Cdd:cd19473    261 TkCLCGSPKCR 271
SET_SETDB2 cd10523
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) ...
1126-1361 9.96e-44

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) and similar proteins; SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380921 [Multi-domain]  Cd Length: 266  Bit Score: 160.38  E-value: 9.96e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1126 VDGEPCPEDYKYISENCETStmnidrNITHLQHCTCVDDCSS-SNCLCGQLSIR----CWYDKD-GRLLQEFNKIEPPL- 1198
Cdd:cd10523      7 VQLDRNPQDQQQLVDDFDIS------NGAFVDSCDCTDGCIDiLKCACLQLTARafskSESSPSkGGRGYKYKRLQEPIp 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1199 --IFECNQACSCWRN-CKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELIS----------------- 1258
Cdd:cd10523     81 sgLYECNVSCKCNRMlCQNRVVQHGLQVRLQVFKTEKKGWGVRCLDDIDKGTFVCIYAGRVLSrarspteplppklelps 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1259 DAEADVREDDSYLFDLDNKDGEVYCIDARYYGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFD 1338
Cdd:cd10523    161 ENEVEVVTSWLILSKKRKLRENVCFLDASKEGNVGRFLNHSCCPNLFVQNVFVDTHDKNFPWVAFFTNRVVKAGTELTWD 240
                          250       260
                   ....*....|....*....|...
gi 2462492226 1339 YGDRFWDIKSKYFTCQCGSEKCK 1361
Cdd:cd10523    241 YSYDAGTSPEQEIPCLCGVNKCQ 263
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
1223-1345 1.60e-40

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 145.56  E-value: 1.60e-40
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  1223 VRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRE--------DDSYLFDLDNKdgevYCIDARYYGNISR 1294
Cdd:smart00317    1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPkaydtdgaKAFYLFDIDSD----LCIDARRKGNLAR 76
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|.
gi 2462492226  1295 FINHLCDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWD 1345
Cdd:smart00317   77 FINHSCEPNCELLFVEVNGDD----RIVIFALRDIKPGEELTIDYGSDYAN 123
SET_SETD2-like cd10531
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), ...
1225-1361 3.62e-36

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2), ASH1-like protein (ASH1L) and similar proteins; This family includes SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2) and ASH1-like protein (ASH1L), which function as histone-lysine N-methyltransferases. SETD2 specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using demethylated 'Lys-36' (H3K36me2) as substrate. NSD2 shows histone H3 'Lys-27' (H3K27me) methyltransferase activity. ASH1L specifically methylates 'Lys-36' of histone H3 (H3K36me). The family also includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins.


Pssm-ID: 380929  Cd Length: 136  Bit Score: 133.92  E-value: 3.62e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1225 LQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLdnKDGEVycIDARYYGNISRFI 1296
Cdd:cd10531      2 LELFRTEKKGWGVKAKEDIQKGEFIIEYVGEVIDKKEFKERLDeyeelgksNFYILSL--SDDVV--IDATRKGNLSRFI 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226 1297 NHLCDPNIIPVRVFMLHQdlrfPRIAFFSSRDIRTGEELGFDYG-DRFWDIKSKyftCQCGSEKCK 1361
Cdd:cd10531     78 NHSCEPNCETQKWIVNGE----YRIGIFALRDIPAGEELTFDYNfVNYNEAKQV---CLCGAQNCR 136
SET_SETD1-like cd10518
SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), ...
1213-1361 7.16e-34

SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), histone-lysine N-methyltransferases (KMT2A/KMT2B/KMT2C/KMT2D) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A), 1B (SETD1B), as well as histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B), 2C (KMT2C), 2D (KMT2D). These proteins are histone-lysine N-methyltransferases (EC 2.1.1.43) that specifically methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380916  Cd Length: 150  Bit Score: 127.71  E-value: 7.16e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1213 KNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLDNKdgevYCI 1284
Cdd:cd10518      4 RFRQLRSRLKERLRVGKSGIHGWGLFAKRPIAAGEMVIEYVGEVIRPIVADKREKrydeegggGTYMFRIDED----LVI 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1285 DARYYGNISRFINHLCDPN----IIPVRVFMlhqdlrfpRIAFFSSRDIRTGEELGFDYgdRFWDIKSKYFTCQCGSEKC 1360
Cdd:cd10518     80 DATKKGNIARFINHSCDPNcyakIITVDGEK--------HIVIFAKRDIAPGEELTYDY--KFPIEDEEKIPCLCGAPNC 149

                   .
gi 2462492226 1361 K 1361
Cdd:cd10518    150 R 150
PreSET smart00468
N-terminal to some SET domains; A Cys-rich putative Zn2+-binding domain that occurs N-terminal ...
1108-1207 8.36e-33

N-terminal to some SET domains; A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.


Pssm-ID: 128744 [Multi-domain]  Cd Length: 98  Bit Score: 122.52  E-value: 8.36e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  1108 CRDVARGYENVPIPCVNGVDGEPCPEDYKYISENCETSTMNIDRNITHLQHCTCVDDCSSSN-CLCGQLSIRCW-YDKDG 1185
Cdd:smart00468    1 CLDISNGKENVPVPLVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSGDCSSSNkCECARKNGGEFaYELNG 80
                            90       100
                    ....*....|....*....|..
gi 2462492226  1186 RllqeFNKIEPPLIFECNQACS 1207
Cdd:smart00468   81 G----LRLKRKPLIYECNSRCS 98
SET_ASH1L cd19174
SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ...
1225-1361 6.99e-32

SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ASH1L (EC 2.1.1.43; also termed absent small and homeotic disks protein 1 homolog, KMT2H, or lysine N-methyltransferase 2H) acts as histone-lysine N-methyltransferase that specifically methylates 'Lys-36' of histone H3 (H3K36me). It plays important roles in development; heterozygous mutation of ASH1L is associated with severe intellectual disability (ID) and multiple congenital anomaly (MCA).


Pssm-ID: 380951 [Multi-domain]  Cd Length: 141  Bit Score: 121.63  E-value: 6.99e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1225 LQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDNkdgevYC--------IDARYYGNISRFI 1296
Cdd:cd19174      2 LERFRTEDKGWGVRTKEPIKAGQFIIEYVGEVVSEQEFRRRMIEQYHNHSHH-----YClnldsgmvIDGYRMGNEARFV 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462492226 1297 NHLCDPNIIPVRVFMLHQdlrfPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYfTCQCGSEKCK 1361
Cdd:cd19174     77 NHSCDPNCEMQKWSVNGV----YRIGLFALKDIPAGEELTYDYNFHSFNVEKQQ-PCKCGSPNCR 136
SET_EZH cd10519
SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar ...
1224-1341 1.88e-31

SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both, EZH1 and EZH2, can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380917  Cd Length: 117  Bit Score: 119.66  E-value: 1.88e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1224 RLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRE---DD---SYLFDLDNKdgevYCIDARYYGNISRFIN 1297
Cdd:cd10519      2 RLLLGKSDVAGWGLFLKEPIKKDEFIGEYTGELISQDEADRRGkiyDKynsSYLFNLNDQ----FVVDATRKGNKIRFAN 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2462492226 1298 HLCDPNIIPvRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYGD 1341
Cdd:cd10519     78 HSSNPNCYA-KVMMVNGD---HRIGIFAKRDIEAGEELFFDYGY 117
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
1234-1340 1.44e-30

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 116.85  E-value: 1.44e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1234 GWGVRALQTIPQGTFICEYVGE-LISDAEADVRED-----------DSYLFDLDNKDGevYCIDAR--YYGNISRFINHL 1299
Cdd:pfam00856    1 GRGLFATEDIPKGEFIGEYVEVlLITKEEADKRELlyydklelrlwGPYLFTLDEDSE--YCIDARalYYGNWARFINHS 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2462492226 1300 CDPNIIPVRVFMlhqdLRFPRIAFFSSRDIRTGEELGFDYG 1340
Cdd:pfam00856   79 CDPNCEVRVVYV----NGGPRIVIFALRDIKPGEELTIDYG 115
SET_SETD2 cd19172
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and ...
1224-1361 2.57e-30

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and similar proteins; SETD2 (also termed HIF-1, huntingtin yeast partner B, huntingtin-interacting protein 1 (HIP-1), huntingtin-interacting protein B, lysine N-methyltransferase 3A or protein-lysine N-methyltransferase SETD2) acts as histone-lysine N-methyltransferase that specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using demethylated 'Lys-36' (H3K36me2) as substrate. It has been shown that methylation is a posttranslational modification of dynamic microtubules and that SETD2 methylates alpha-tubulin at lysine 40, the same lysine that is marked by acetylation on microtubules. Methylation of microtubules occurs during mitosis and cytokinesis and can be ablated by SETD2 deletion, which causes mitotic spindle and cytokinesis defects, micronuclei, and polyploidy.


Pssm-ID: 380949 [Multi-domain]  Cd Length: 142  Bit Score: 117.30  E-value: 2.57e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1224 RLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAE--------ADVREDDSYLFDLDNKDgevyCIDARYYGNISRF 1295
Cdd:cd19172      3 KVEVFRTEKKGWGLRAAEDLPKGTFVIEYVGEVLDEKEfkrrmkeyAREGNRHYYFMALKSDE----IIDATKKGNLSRF 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1296 INHLCDPNIIpVRVFMLHQDLrfpRIAFFSSRDIRTGEELGFDYG-DRFWDIKSKyftCQCGSEKCK 1361
Cdd:cd19172     79 INHSCEPNCE-TQKWTVNGEL---RVGFFAKRDIPAGEELTFDYQfERYGKEAQK---CYCGSPNCR 138
Pre-SET pfam05033
Pre-SET motif; This protein motif is a zinc binding motif. It contains 9 conserved cysteines ...
1111-1215 1.45e-29

Pre-SET motif; This protein motif is a zinc binding motif. It contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilising SET domains.


Pssm-ID: 461530 [Multi-domain]  Cd Length: 99  Bit Score: 113.67  E-value: 1.45e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1111 VARGYENVPIPCVNGVDGEPCPEDYKYISENCETSTMNIDRnithLQHCTCvDDCSSSNCLCGQLS---IRCWYDKDGRL 1187
Cdd:pfam05033    1 ISKGKENVPIPVVNEVDDEPPPPDFTYITSYIYPKEFLLII----PQGCDC-GDCSSEKCSCAQLNggeFRFPYDKDGLL 75
                           90       100
                   ....*....|....*....|....*...
gi 2462492226 1188 LQEfnkiEPPLIFECNQACSCWRNCKNR 1215
Cdd:pfam05033   76 VPE----SKPPIYECNPLCGCPPSCPNR 99
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
1221-1362 6.60e-29

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 112.75  E-value: 6.60e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1221 IKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDS-----YLFDLDnkDGEVycIDARYYGNISRF 1295
Cdd:COG2940      4 LHPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERREPHKeplhtYLFELD--DDGV--IDGALGGNPARF 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1296 INHLCDPNIIPVRvfmlhqdlRFPRIAFFSSRDIRTGEELGFDYGDRFWDiksKYFTCQCGseKCKH 1362
Cdd:COG2940     80 INHSCDPNCEADE--------EDGRIFIVALRDIAAGEELTYDYGLDYDE---EEYPCRCP--NCRG 133
SET_NSD cd19173
SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, ...
1223-1360 1.24e-25

SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, NSD2, NSD3 and similar proteins; The nuclear receptor-binding SET Domain (NSD) family of histone H3 lysine 36 methyltransferases is comprised of NSD1, NSD2, and NSD3, which are primarily known to be involved in chromatin integrity and gene expression through mono-, di-, or tri-methylating lysine 36 of histone H3 (H3K36), respectively. NSD1 (EC 2.1.1.43; also termed histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B) or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as histone-lysine N-methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3.


Pssm-ID: 380950 [Multi-domain]  Cd Length: 142  Bit Score: 103.93  E-value: 1.24e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1223 VRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDDS--YLFDLDNKdgevYCIDARYYGNISR 1294
Cdd:cd19173      2 PPTEPFKTGDRGWGLRTKRDIKKGDFVIEYVGELIDEEECRRRlkkaheNNITnfYMLTLDKD----RIIDAGPKGNLSR 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1295 FINHLCDPNiIPVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYG-DRFWDIKSKyftCQCGSEKC 1360
Cdd:cd19173     78 FMNHSCQPN-CETQKWTVNGD---TRVGLFAVRDIPAGEELTFNYNlDCLGNEKKV---CRCGAPNC 137
SET_ASHR3-like cd19175
SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 ...
1224-1361 2.88e-25

SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins; This family includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3, also termed protein SET DOMAIN GROUP 4 or protein stamen loss), ASH1 homolog 3 (ASHH3, also termed protein SET DOMAIN GROUP 7) and homolog 4 (ASHH4, also termed protein SET DOMAIN GROUP 24). They all function as histone-lysine N-methyltransferases (EC 2.1.1.43).


Pssm-ID: 380952 [Multi-domain]  Cd Length: 139  Bit Score: 102.88  E-value: 2.88e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1224 RLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR--------EDDSYLFDLDnKDgevYCIDARYYGNISRF 1295
Cdd:cd19175      1 KMKLVKTEKCGWGLVADEDINAGEFIIEYVGEVIDDKTCEERlwdmkhkgEKNFYMCEID-KD---MVIDATFKGNLSRF 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226 1296 INHLCDPNIIpVRVFMLHQDLrfpRIAFFSSRDIRTGEELGFDYgdRFWDIKSKYfTCQCGSEKCK 1361
Cdd:cd19175     77 INHSCDPNCE-LQKWQVDGET---RIGVFAIRDIKKGEELTYDY--QFVQFGADQ-DCHCGSKNCR 135
SET_SET1 cd20072
SET domain (including post-SET domain) found in catalytic component of the Saccharomyces ...
1222-1361 4.44e-25

SET domain (including post-SET domain) found in catalytic component of the Saccharomyces cerevisiae COMPASS complex and similar proteins; The family contains mostly fungal SET domains, including SET1 found in the catalytic component of the Saccharomyces cerevisiae COMPASS (complex of proteins associated with Set1). SET1 is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex. The activity of this catalytic domain is established through forming a complex with a set of core proteins; it is extensively contacted by Cps60 (Bre2), Cps50 (Swd1), and Cps30 (Swd3).


Pssm-ID: 380998  Cd Length: 148  Bit Score: 102.50  E-value: 4.44e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1222 KVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLDnkdgEVYCIDARYYGNIS 1293
Cdd:cd20072     12 KKQLKFARSAIHNWGLYAMENISAKDMVIEYVGEVIRQQVADEREKrylrqgigSSYLFRID----DDTVVDATKKGNIA 87
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462492226 1294 RFINHLCDPNIIpVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYgdRFwDIKSKYFTCQCGSEKCK 1361
Cdd:cd20072     88 RFINHCCDPNCT-AKIIKVEGE---KRIVIYAKRDIAAGEELTYDY--KF-PREEDKIPCLCGAPNCR 148
SET_SETD8 cd10528
SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2. ...
1215-1342 2.79e-24

SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2.1.1.43; also termed N-lysine methyltransferase KMT5A, H4-K20-HMTase KMT5A, lysine N-methyltransferase 5A, lysine-specific methylase 5A, PR/SET domain-containing protein 07, PR-Set7 or PR/SET07) is a nucleosomal histone-lysine N-methyltransferase that specifically monomethylates 'Lys-20' of histone H4 (H4K20me1). It plays a central role in the silencing of euchromatic genes.


Pssm-ID: 380926 [Multi-domain]  Cd Length: 141  Bit Score: 99.96  E-value: 2.79e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1215 RVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRE--------DDSYLFDLDNKdGEVYCIDA 1286
Cdd:cd10528      9 ELILSGKEEGLKVIEIDGKGRGVIATRPFEKGDFVVEYHGDLITITEAKKREalyakdpsTGCYMYYFQYK-GKTYCVDA 87
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462492226 1287 -RYYGNISRFINHLC-DPNIIPVRVFMlhQDLrfPRIAFFSSRDIRTGEELGFDYGDR 1342
Cdd:cd10528     88 tKESGRLGRLINHSKkKPNLKTKLLVI--DGV--PHLILVAKRDIKPGEELLYDYGDR 141
SET_EZH-like cd19168
SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb ...
1234-1343 7.37e-24

SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb repressive complex 2 (PRC2), and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380945  Cd Length: 124  Bit Score: 98.03  E-value: 7.37e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1234 GWGVRALQTIPQGTFICEYVGELISDAEADVRE------DDSYLFDLDNKdgevYCIDARYYGNISRFINHLCDP----N 1303
Cdd:cd19168     13 GLGLFAAEDIKEGEFVIEYTGELISHDEGVRREhrrgdvSYLYLFEEQEG----IWVDAAIYGNLSRYINHATDKvktgN 88
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2462492226 1304 IIPVRVFMLHQdlrfPRIAFFSSRDIRTGEELGFDYGDRF 1343
Cdd:cd19168     89 CMPKIMYVNHE----WRIKFTAIKDIKIGEELFFNYGDNF 124
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
1222-1361 1.51e-23

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


Pssm-ID: 380946  Cd Length: 148  Bit Score: 98.18  E-value: 1.51e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1222 KVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLDnkdgEVYCIDARYYGNIS 1293
Cdd:cd19169     12 KKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQVIRQSVADEREKryeaigigSSYLFRVD----DDTIIDATKCGNLA 87
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462492226 1294 RFINHLCDPN----IIPVRvfmlhqdlRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKskyFTCQCGSEKCK 1361
Cdd:cd19169     88 RFINHSCNPNcyakIITVE--------SQKKIVIYSKRPIAVNEEITYDYKFPIEDEK---IPCLCGAPQCR 148
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
1234-1345 2.05e-23

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 96.64  E-value: 2.05e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1234 GWGVRALQTIPQGTFICEYVGELISDAEADVR----EDDSYLFDLDnkDGEVYcIDARYYGNISRFINHLCDPNIIPVrv 1309
Cdd:cd10522     14 GLGLFAAETIAKGEFVGEYTGEVLDRWEEDRDsvyhYDPLYPFDLN--GDILV-IDAGKKGNLTRFINHSDQPNLELI-- 88
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 2462492226 1310 FMLHQDLrfPRIAFFSSRDIRTGEELGFDYGDRFWD 1345
Cdd:cd10522     89 VRTLKGE--QHIGFVAIRDIKPGEELFISYGPKYWK 122
Ank_2 pfam12796
Ankyrin repeats (3 copies);
900-993 2.30e-22

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 92.49  E-value: 2.30e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  900 LMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLStgQVDVNAQDSGgWTPIIWAAEHKHIEVI 979
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLE--HADVNLKDNG-RTALHYAARSGHLEIV 77
                           90
                   ....*....|....
gi 2462492226  980 RMLLTRGADVTLTD 993
Cdd:pfam12796   78 KLLLEKGADINVKD 91
SET_KMT2A_2B cd19170
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), ...
1215-1361 1.50e-21

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B) and similar proteins; This family includes KMT2A and KMT2B. Both KMT2A (also termed ALL-1 or CXXC7 or MLL or MLL1 or TRX1 or HRX) and KMT2B (also termed MLL4 or TRX2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380947 [Multi-domain]  Cd Length: 152  Bit Score: 92.45  E-value: 1.50e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1215 RVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--DS-----YLFDLDnkdgEVYCIDAR 1287
Cdd:cd19170      6 RHLRKTAKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGEVIRSVLTDKREKyyESkgigcYMFRID----DDEVVDAT 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462492226 1288 YYGNISRFINHLCDPN----IIPVrvfmlhqDLRfPRIAFFSSRDIRTGEELGFDYGDRFWDIKskyFTCQCGSEKCK 1361
Cdd:cd19170     82 MHGNAARFINHSCEPNcysrVVNI-------DGK-KHIVIFALRRILRGEELTYDYKFPIEDVK---IPCTCGSKKCR 148
Ank_2 pfam12796
Ankyrin repeats (3 copies);
867-960 1.54e-20

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 87.48  E-value: 1.54e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  867 LHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCvySKEEDGSTCLHHAAKIGNLEMVS 946
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEHADV--NLKDNGRTALHYAARSGHLEIVK 78
                           90
                   ....*....|....
gi 2462492226  947 LLLSTGqVDVNAQD 960
Cdd:pfam12796   79 LLLEKG-ADINVKD 91
PHA03100 PHA03100
ankyrin repeat protein; Provisional
840-1061 3.72e-20

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 94.73  E-value: 3.72e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  840 ELQKVILMLLDNLDPNFQSdqqsKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNH-----LEVARY 914
Cdd:PHA03100    16 KNIKYIIMEDDLNDYSYKK----PVLPLYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHYLSNIKYnltdvKEIVKL 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  915 MVQRGGCVYSKEEDGSTCLHHAA--KIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHI--EVIRMLLTRGADVT 990
Cdd:PHA03100    92 LLEYGANVNAPDNNGITPLLYAIskKSNSYSIVEYLLDNG-ANVNIKNSDGENLLHLYLESNKIdlKILKLLIDKGVDIN 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  991 LTDNVsERLVE------EENIC----LHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGA 1060
Cdd:PHA03100   171 AKNRV-NYLLSygvpinIKDVYgftpLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNGP 249

                   .
gi 2462492226 1061 N 1061
Cdd:PHA03100   250 S 250
SET_KMT2C_2D cd19171
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), ...
1222-1361 2.76e-19

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), 2D (KMT2D) and similar proteins; This family includes KMT2C and KMT2D. Both, KMT2C (also termed HALR or MLL3) and KMT2D (also termed ALR or MLL2), act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me). They are subunits of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380948 [Multi-domain]  Cd Length: 153  Bit Score: 85.94  E-value: 2.76e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1222 KVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDS-------YLFDLDNKdgevYCIDARYYGNISR 1294
Cdd:cd19171     13 RSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGEIIRNEVANRREKIYesqnrgiYMFRIDND----WVIDATMTGGPAR 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1295 FINHLCDPNIIpVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKyFTCQCGSEKCK 1361
Cdd:cd19171     89 YINHSCNPNCV-AEVVTFDKE---KKIIIISNRRIAKGEELTYDYKFDFEDDQHK-IPCLCGAPNCR 150
SET_NSD2 cd19211
SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) ...
1226-1360 2.01e-18

SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as histone-lysine N-methyltransferase with histone H3 'Lys-36' (H3K36me) methyltransferase activity. NSD2 has been shown to mediate di- and trimethylation of H3K36 and dimethylation of H4K20 in different systems, and has been characterized as a transcriptional repressor interacting with histone deacetylase HDAC1 and histone demethylase LSD1. NSD2 mediates constitutive NF-kappaB signaling for cancer cell proliferation, survival and tumor growth. It is highly overexpressed in several types of human cancers, including small-cell lung cancers, neuroblastoma, carcinomas of stomach and colon, and bladder cancers, and its overexpression tends to be associated with tumor aggressiveness. WHSC1 is frequently deleted in Wolf-Hirschhorn syndrome (WHS).


Pssm-ID: 380988 [Multi-domain]  Cd Length: 142  Bit Score: 83.12  E-value: 2.01e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1226 QLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR-----EDDS---YLFDLDnKDgevYCIDARYYGNISRFIN 1297
Cdd:cd19211      5 KIIKTEGKGWGLIAKRDIKKGEFVNEYVGELIDEEECMARikhahENDIthfYMLTID-KD---RIIDAGPKGNYSRFMN 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462492226 1298 HLCDPNiIPVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYG-DRFWDIKSkyfTCQCGSEKC 1360
Cdd:cd19211     81 HSCQPN-CETQKWTVNGD---TRVGLFAVCDIPAGTELTFNYNlDCLGNEKT---VCRCGAPNC 137
SET_SETD5-like cd10529
SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine ...
1236-1339 5.96e-18

SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. KMT2E (also termed inactive lysine N-methyltransferase 2E or myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. The family also includes Saccharomyces cerevisiae SET domain-containing proteins, SET3 and SET4, and Schizosaccharomyces pombe SET3. Most of these family members contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380927  Cd Length: 127  Bit Score: 81.17  E-value: 5.96e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1236 GVRALQTIPQGTFICEYVGE--LISDAEADVRED---DSYLFDLDNKDGEVYCIDARYYGNISRFINHLCDPNiipVRV- 1309
Cdd:cd10529     18 GLVATEDISPGEPILEYKGEvsLRSEFKEDNGFFkrpSPFVFFYDGFEGLPLCVDARKYGNEARFIRRSCRPN---AELr 94
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462492226 1310 FMLHQDLRFpRIAFFSSRDIRTGEE--LGFDY 1339
Cdd:cd10529     95 HVVVSNGEL-RLFIFALKDIRKGTEitIPFDY 125
SET_EZH2 cd19218
SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43) ...
1220-1339 1.12e-17

SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43), also termed lysine N-methyltransferase 6, or ENX-1, or histone-lysine N-methyltransferase EZH2, is a catalytic subunit of the polycomb repressive complex 2 (PRC2)/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380995  Cd Length: 120  Bit Score: 80.34  E-value: 1.12e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1220 GIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDD------SYLFDLDNKdgevYCIDARYYGNIS 1293
Cdd:cd19218      1 GSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVydkymcSFLFNLNND----FVVDATRKGNKI 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462492226 1294 RFINHLCDPNIIpVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDY 1339
Cdd:cd19218     77 RFANHSVNPNCY-AKVMMVNGD---HRIGIFAKRAIQTGEELFFDY 118
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
1218-1339 2.36e-17

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380994  Cd Length: 136  Bit Score: 80.11  E-value: 2.36e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1218 QSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED------DSYLFDLDNKdgevYCIDARYYGN 1291
Cdd:cd19217      1 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGN 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2462492226 1292 ISRFINHLCDPNIIpVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDY 1339
Cdd:cd19217     77 KIRFANHSVNPNCY-AKVVMVNGD---HRIGIFAKRAIQQGEELFFDY 120
SET_NSD1 cd19210
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
1225-1360 2.70e-17

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and similar proteins; NSD1 (EC 2.1.1.43; also termed Histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B), or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD1 is altered in approximately 10% of head and neck cancer patients with 55% decrease in risk of death in NSD1-mutated versus non-mutated patients; its disruption promotes favorable chemotherapeutic responses linked to hypomethylation.


Pssm-ID: 380987 [Multi-domain]  Cd Length: 142  Bit Score: 79.97  E-value: 2.70e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1225 LQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR-----EDDS---YLFDLDnKDgevYCIDARYYGNISRFI 1296
Cdd:cd19210      4 VEIFRTLGRGWGLRCKTDIKKGEFVNEYVGELIDEEECRARiryaqEHDItnfYMLTLD-KD---RIIDAGPKGNYARFM 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462492226 1297 NHLCDPNiIPVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYgdRFWDIKSKYFTCQCGSEKC 1360
Cdd:cd19210     80 NHCCQPN-CETQKWTVNGD---TRVGLFALCDIKAGTELTFNY--NLECLGNGKTVCKCGAPNC 137
SET_KMT2A cd19206
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) ...
1227-1361 4.42e-17

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) and similar proteins; KMT2A (EC2.1.1.43; also termed lysine N-methyltransferase 2A, ALL-1, CXXC-type zinc finger protein 7 (CXXC7), myeloid/lymphoid or mixed-lineage leukemia (MLL), myeloid/lymphoid or mixed-lineage leukemia protein 1 (MLL1), trithorax-like protein (TRX1), or zinc finger protein HRX) acts as a histone methyltransferase that plays an essential role in early development and hematopoiesis. It is a catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) complex and acetylation of 'Lys-16' of histone H4 (H4K16ac).


Pssm-ID: 380983 [Multi-domain]  Cd Length: 154  Bit Score: 79.68  E-value: 4.42e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1227 LYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRE---DDS----YLFDLDnkDGEVycIDARYYGNISRFINHL 1299
Cdd:cd19206     18 VYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREkyyDSKgigcYMFRID--DSEV--VDATMHGNAARFINHS 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462492226 1300 CDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWDIKSKyFTCQCGSEKCK 1361
Cdd:cd19206     94 CEPNCYSRVINIDGQK----HIVIFAMRKIYRGEELTYDYKFPIEDASNK-LPCNCGAKKCR 150
SET_SETD1A cd19204
SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and ...
1222-1361 1.02e-16

SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and similar proteins; SETD1A (EC2.1.1.43), also termed lysine N-methyltransferase 2F, or Set1/Ash2 histone methyltransferase complex subunit SET1, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Human SET domain containing protein 1A (hSETD1A) expression occurs at a high rate in hepatocellular carcinoma patients and controls tumor metastasis in breast cancer by activating MMP expression.


Pssm-ID: 380981 [Multi-domain]  Cd Length: 153  Bit Score: 78.91  E-value: 1.02e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1222 KVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLDNKDgevyCIDARYYGNIS 1293
Cdd:cd19204     13 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQVVADMREKryvqegigSSYLFRVDHDT----IIDATKCGNLA 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462492226 1294 RFINHLCDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGdrfWDIKSKYFTCQCGSEKCK 1361
Cdd:cd19204     89 RFINHCCTPNCYAKVITIESQK----KIVIYSKQPIGVNEEITYDYK---FPIEDNKIPCLCGTENCR 149
SET_NSD3 cd19212
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
1226-1360 1.05e-16

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 3 (NSD3) and similar proteins; NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. NSD3 is amplified and overexpressed in multiple cancer types, including acute myeloid leukemia (AML), breast, lung, pancreatic and bladder cancers, as well as squamous cell carcinoma of the head and neck (SCCHN). NSD3 contributes to tumorigenesis by interacting with bromodomain-containing protein 4 (BRD4), the bromodomain and extraterminal (BET) protein, which is a potential therapeutic target in acute myeloid leukemia (AML). NSD3 is amplified in primary tumors and cell lines from breast carcinoma, and can promote the cell viability of small-cell lung cancer and pancreatic ductal adenocarcinoma. High NSD3 expression is implicated in poor grade and heavy smoking history in SCCHN. Thus, NSD3 may serve as a potential druggable target for selective cancer therapy.


Pssm-ID: 380989 [Multi-domain]  Cd Length: 142  Bit Score: 78.43  E-value: 1.05e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1226 QLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDN-------KDgevYCIDARYYGNISRFINH 1298
Cdd:cd19212      5 EIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNfymltvtKD---RIIDAGPKGNYSRFMNH 81
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462492226 1299 LCDPNiIPVRVFMLHQDLrfpRIAFFSSRDIRTGEELGFDYG-DRFWDIKSKyftCQCGSEKC 1360
Cdd:cd19212     82 SCNPN-CETQKWTVNGDV---RVGLFALCDIPAGMELTFNYNlDCLGNGRTE---CHCGADNC 137
Ank_2 pfam12796
Ankyrin repeats (3 copies);
933-1066 1.64e-16

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 75.92  E-value: 1.64e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  933 LHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRgADVTLTDNvserlveeeniclhwasf 1012
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENG-ADANLQDKNGRTALHLAAKNGHLEIVKLLLEH-ADVNLKDN------------------ 60
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462492226 1013 tgsaaiaevllnarcdlhavnyhGDTPLHIAARESYHDCVLLFLSRGANPELRN 1066
Cdd:pfam12796   61 -----------------------GRTALHYAARSGHLEIVKLLLEKGADINVKD 91
SET_SETD1B cd19205
SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and ...
1222-1361 2.80e-15

SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and similar proteins; SETD1B (EC2.1.1.43), also termed lysine N-methyltransferase 2G, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Loss of SETD1B occurs in up to half the gastric and colorectal cancers, most commonly via SETD1B mutations, while de novo variants in SETD1B are associated with intellectual disability, epilepsy and autism.


Pssm-ID: 380982 [Multi-domain]  Cd Length: 153  Bit Score: 74.71  E-value: 2.80e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1222 KVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLDNKDgevyCIDARYYGNIS 1293
Cdd:cd19205     13 KKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKryedegigSSYMFRVDHDT----IIDATKCGNFA 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462492226 1294 RFINHLCDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGdrfWDIKSKYFTCQCGSEKCK 1361
Cdd:cd19205     89 RFINHSCNPNCYAKVITVESQK----KIVIYSKQHINVNEEITYDYK---FPIEDVKIPCLCGSENCR 149
SET_KMT2C cd19208
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) ...
1210-1361 1.22e-14

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) and similar proteins; KMT2C (EC2.1.1.43; also termed lysine N-methyltransferase 2C, homologous to ALR protein (HALR) myeloid/lymphoid, or mixed-lineage leukemia protein 3 (MLL3)), acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me) and may be involved in leukemogenesis and developmental disorder. KMT2C is a catalytic subunit of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation. Overexpression of KMT2C is associated with estrogen receptor-positive breast cancer; KMT2C mediates the estrogen dependence of breast cancer through regulation of estrogen receptor alpha (ERalpha) enhancer function. KMT2C is frequently mutated in certain populations with diffuse-type gastric adenocarcinomas (DGA); its loss promotes epithelial-to-mesenchymal transition (EMT) and is associated with worse overall survival.


Pssm-ID: 380985 [Multi-domain]  Cd Length: 154  Bit Score: 72.74  E-value: 1.22e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1210 RNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED-------DSYLFDLDNKdgevY 1282
Cdd:cd19208      2 KSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKlyesqnrGVYMFRIDND----H 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462492226 1283 CIDARYYGNISRFINHLCDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWDIKSKyFTCQCGSEKCK 1361
Cdd:cd19208     78 VIDATLTGGPARYINHSCAPNCVAEVVTFEKGH----KIIISSSRRIQKGEELCYDYKFDFEDDQHK-IPCHCGAVNCR 151
SET_SpSET3-like cd19183
SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET ...
1225-1387 2.15e-14

SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET domain-containing protein 3 (SETD3) and similar proteins; Schizosaccharomyces pombe SETD3 functions as a transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. It is required for both, gene activation and repression.


Pssm-ID: 380960  Cd Length: 173  Bit Score: 72.44  E-value: 2.15e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1225 LQLYRTAKMGW-GVRALQTIPQGTFICEYVGELISDAEADVREDDSY----------LFDldnkDGEVYCIDARYYGNIS 1293
Cdd:cd19183      3 ISSIGLANASRfGLFADRPIPAGDPIQELLGEIGLQSEYIADPENQYqilgapkphvFFH----PQSPLYIDTRRSGSVA 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1294 RFINHLCDPN--IIPVRVfmlhQDLRFPRIAFFSSRDIRTGEELGFDYGdrfWDIkskyftcqcgsekcKHSAEAIALEQ 1371
Cdd:cd19183     79 RFIRRSCRPNaeLVTVAS----DSGSVLKFVLYASRDISPGEEITIGWD---WDN--------------PHPFRRFALGE 137
                          170
                   ....*....|....*.
gi 2462492226 1372 SRLARLDPHPELLPEL 1387
Cdd:cd19183    138 LVPSNLDLEQHLLSFL 153
PHA02876 PHA02876
ankyrin repeat protein; Provisional
836-1072 2.50e-14

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 78.18  E-value: 2.50e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  836 VKQGELQKVILMLLDNLDPNfqSDQQSKRTPLHAAAQKGSVEICHVLLQAGANIN------------AVD---------- 893
Cdd:PHA02876   153 IQQDELLIAEMLLEGGADVN--AKDIYCITPIHYAAERGNAKMVNLLLSYGADVNiialddlsvlecAVDsknidtikai 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  894 -------KQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNL-EMVSLLLSTGqVDVNAQDSGGWT 965
Cdd:PHA02876   231 idnrsniNKNDLSLLKAIRNEDLETSLLLYDAGFSVNSIDDCKNTPLHHASQAPSLsRLVPKLLERG-ADVNAKNIKGET 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  966 PIIWAAEHKH-IEVIRMLLTRGADVTLTDNVSerlveeeNICLHWAS-FTGSAAIAEVLLNARCDLHAVNYHGDTPLHIA 1043
Cdd:PHA02876   310 PLYLMAKNGYdTENIRTLIMLGADVNAADRLY-------ITPLHQAStLDRNKDIVITLLELGANVNARDYCDKTPIHYA 382
                          250       260
                   ....*....|....*....|....*....
gi 2462492226 1044 ARESYHDCVLLFLSRGANPELRNKEGDTA 1072
Cdd:PHA02876   383 AVRNNVVIINTLLDYGADIEALSQKIGTA 411
SET cd08161
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
1224-1340 3.47e-14

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 68.82  E-value: 3.47e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1224 RLQLYRTAKMGWGVRALQTIPQGTFICeyvgelisdaeadvreddsylfdldnkdgevycidaryygnISRFINHLCDPN 1303
Cdd:cd08161      1 EIRPSTIPGAGFGLFATRDIPKGEVIG-----------------------------------------LARFINHSCEPN 39
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462492226 1304 IIPVRVFmlhqDLRFPRIAFFSSRDIRTGEELGFDYG 1340
Cdd:cd08161     40 CEFEEVY----VGGKPRVFIVALRDIKAGEELTVDYG 72
PHA03095 PHA03095
ankyrin-like protein; Provisional
864-1064 3.86e-14

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 76.60  E-value: 3.86e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  864 RTPLHA-----AAQKGSVEIchvLLQAGANINAVDKQQRTPLMEAVVNNH--LEVARYMVQRGGCVYSKEEDGSTCLHHA 936
Cdd:PHA03095   118 RTPLHVylsgfNINPKVIRL---LLRKGADVNALDLYGMTPLAVLLKSRNanVELLRLLIDAGADVYAVDDRFRSLLHHH 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  937 A---KIGNlEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEH---KHIeVIRMLLTRGADVTLTDNVSErlveeenICLHWA 1010
Cdd:PHA03095   195 LqsfKPRA-RIVRELIRAG-CDPAATDMLGNTPLHSMATGsscKRS-LVLPLLIAGISINARNRYGQ-------TPLHYA 264
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462492226 1011 SFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPEL 1064
Cdd:PHA03095   265 AVFNNPRACRRLIALGADINAVSSDGNTPLSLMVRNNNGRAVRAALAKNPSAET 318
PHA03095 PHA03095
ankyrin-like protein; Provisional
829-1072 4.33e-14

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 76.60  E-value: 4.33e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  829 PRQLYLSVKQGELQKVILMLLDN-LDPNFQSdqQSKRTPLHAAAQKGSVE-ICHVLLQAGANINAVDKQQRTPLmeavvn 906
Cdd:PHA03095    50 PLHLYLHYSSEKVKDIVRLLLEAgADVNAPE--RCGFTPLHLYLYNATTLdVIKLLIKAGADVNAKDKVGRTPL------ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  907 nhlevarymvqrggcvyskeedgSTCLhhAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIiwAAEHKH----IEVIRML 982
Cdd:PHA03095   122 -----------------------HVYL--SGFNINPKVIRLLLRKG-ADVNALDLYGMTPL--AVLLKSrnanVELLRLL 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  983 LTRGADVTLTDNVSERLVEEeniclHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLL--FLSRGA 1060
Cdd:PHA03095   174 IDAGADVYAVDDRFRSLLHH-----HLQSFKPRARIVRELIRAGCDPAATDMLGNTPLHSMATGSSCKRSLVlpLLIAGI 248
                          250
                   ....*....|..
gi 2462492226 1061 NPELRNKEGDTA 1072
Cdd:PHA03095   249 SINARNRYGQTP 260
Ank_2 pfam12796
Ankyrin repeats (3 copies);
832-919 4.62e-14

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 68.99  E-value: 4.62e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  832 LYLSVKQGELQKVILMLLDNLDPNFQSdqQSKRTPLHAAAQKGSVEICHVLLQaGANINAVDkQQRTPLMEAVVNNHLEV 911
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQD--KNGRTALHLAAKNGHLEIVKLLLE-HADVNLKD-NGRTALHYAARSGHLEI 76

                   ....*...
gi 2462492226  912 ARYMVQRG 919
Cdd:pfam12796   77 VKLLLEKG 84
SET_KMT2B cd19207
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) ...
1227-1361 5.01e-14

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) and similar proteins; KMT2B (EC2.1.1.43; also termed lysine N-methyltransferase 2B, myeloid/lymphoid or mixed-lineage leukemia protein 4 (MLL2/MLL4), trithorax homolog 2 (TRX2), or WW domain-binding protein 7 (WBP-7)), acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is required during the transcriptionally active period of oocyte growth for the establishment and/or maintenance of bulk H3K4 trimethylation (H3K4me3), global transcriptional silencing that precedes resumption of meiosis, oocyte survival and normal zygotic genome activation.


Pssm-ID: 380984 [Multi-domain]  Cd Length: 154  Bit Score: 71.21  E-value: 5.01e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1227 LYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED-------DSYLFDLDNKDgevyCIDARYYGNISRFINHL 1299
Cdd:cd19207     18 VYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKfydskgiGCYMFRIDDFD----VVDATMHGNAARFINHS 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462492226 1300 CDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWDIKSKyFTCQCGSEKCK 1361
Cdd:cd19207     94 CEPNCYSRVIHVEGQK----HIVIFALRKIYRGEELTYDYKFPIEDASNK-LPCNCGAKRCR 150
SET_KMT2D cd19209
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) ...
1210-1361 5.70e-14

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) and similar proteins; KMT2D (EC2.1.1.43; also termed lysine N-methyltransferase 2D, ALL1-related protein (ALR), or myeloid/lymphoid or mixed-lineage leukemia protein 2 (MLL2)), acts as histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is a coactivator for estrogen receptor by being recruited by ESR1, thereby activating transcription. KMT2D is a subunit of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380986 [Multi-domain]  Cd Length: 155  Bit Score: 70.88  E-value: 5.70e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1210 RNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED-------DSYLFDLDNKdgevY 1282
Cdd:cd19209      3 KSSQYRRLKTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKiyeeqnrGIYMFRINNE----H 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462492226 1283 CIDARYYGNISRFINHLCDPNIIPVRVFMLHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWDIKSKyFTCQCGSEKCK 1361
Cdd:cd19209     79 VIDATLTGGPARYINHSCAPNCVAEVVTFDKED----KIIIISSRRIPKGEELTYDYQFDFEDDQHK-IPCHCGAWNCR 152
PHA02874 PHA02874
ankyrin repeat protein; Provisional
865-1106 5.78e-14

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 75.77  E-value: 5.78e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  865 TPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGgcvyskeEDGST----CLhhaakig 940
Cdd:PHA02874    37 TPLIDAIRSGDAKIVELFIKHGADINHINTKIPHPLLTAIKIGAHDIIKLLIDNG-------VDTSIlpipCI------- 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  941 NLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTDnvserlvEEENICLHWASFTGSAAIAE 1020
Cdd:PHA02874   103 EKDMIKTILDCG-IDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIED-------DNGCYPIHIAIKHNFFDIIK 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1021 VLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGdtawdLTPERSdvwfALQLNRK-LRLGVGNR 1099
Cdd:PHA02874   175 LLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIMNKCKNG-----FTPLHN----AIIHNRSaIELLINNA 245

                   ....*..
gi 2462492226 1100 AIRTEKI 1106
Cdd:PHA02874   246 SINDQDI 252
PLN03192 PLN03192
Voltage-dependent potassium channel; Provisional
864-990 1.41e-13

Voltage-dependent potassium channel; Provisional


Pssm-ID: 215625 [Multi-domain]  Cd Length: 823  Bit Score: 75.67  E-value: 1.41e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  864 RTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCvySKEEDGSTCLHHAAKIGNLE 943
Cdd:PLN03192   559 RTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIFRILYHFASI--SDPHAAGDLLCTAAKRNDLT 636
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2462492226  944 MVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVT 990
Cdd:PLN03192   637 AMKELLKQG-LNVDSEDHQGATALQVAMAEDHVDMVRLLIMNGADVD 682
PHA02874 PHA02874
ankyrin repeat protein; Provisional
832-1001 2.19e-13

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 74.23  E-value: 2.19e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  832 LYLSVKQGELQKVILMLLDNLDPNFQSDQQSkrTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEV 911
Cdd:PHA02874   128 LHYAIKKGDLESIKMLFEYGADVNIEDDNGC--YPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYAC 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  912 ARYMVQRGGCVYSKEEDGSTCLHHAAkIGNLEMVSLLLSTGQvdVNAQDSGGWTPIIWAAEHK-HIEVIRMLLTRGADVT 990
Cdd:PHA02874   206 IKLLIDHGNHIMNKCKNGFTPLHNAI-IHNRSAIELLINNAS--INDQDIDGSTPLHHAINPPcDIDIIDILLYHKADIS 282
                          170
                   ....*....|.
gi 2462492226  991 LTDNVSERLVE 1001
Cdd:PHA02874   283 IKDNKGENPID 293
PHA02878 PHA02878
ankyrin repeat protein; Provisional
819-1069 2.94e-13

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 73.76  E-value: 2.94e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  819 SERRKKLRFHPRQLYLSvkqgelqKVILmlLDNLDPNFQSDQQSKRTPLHAAAQKgsVEICHVLLQAGANINAVDKQQ-R 897
Cdd:PHA02878   101 TLVAIKDAFNNRNVEIF-------KIIL--TNRYKNIQTIDLVYIDKKSKDDIIE--AEITKLLLSYGADINMKDRHKgN 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  898 TPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQvDVNAQDSGGWTPIIWAAEH-KHI 976
Cdd:PHA02878   170 TALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGA-STDARDKCGNTPLHISVGYcKDY 248
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  977 EVIRMLLTRGADVTLTDNVseRLVEEENICLHwasftgSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESY-------- 1048
Cdd:PHA02878   249 DILKLLLEHGVDVNAKSYI--LGLTALHSSIK------SERKLKLLLEYGADINSLNSYKLTPLSSAVKQYLcinigril 320
                          250       260
                   ....*....|....*....|...
gi 2462492226 1049 --HDCVLLFLsrgaNPELRNKEG 1069
Cdd:PHA02878   321 isNICLLKRI----KPDIKNSEG 339
PHA02876 PHA02876
ankyrin repeat protein; Provisional
863-1061 1.27e-12

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 72.40  E-value: 1.27e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  863 KRTPLHAAAQKGSV-EICHVLLQAGANINAVDKQQRTPLMEAVVNNH-LEVARYMVQRGGCVYSKEEDGSTCLHHAAKIG 940
Cdd:PHA02876   273 KNTPLHHASQAPSLsRLVPKLLERGADVNAKNIKGETPLYLMAKNGYdTENIRTLIMLGADVNAADRLYITPLHQASTLD 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  941 -NLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVtltdnvsERLVEEENICLHWASF-TGSAAI 1018
Cdd:PHA02876   353 rNKDIVITLLELG-ANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADI-------EALSQKIGTALHFALCgTNPYMS 424
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 2462492226 1019 AEVLLNARCDLHAVNYHGDTPLHIAARESYH-DCVLLFLSRGAN 1061
Cdd:PHA02876   425 VKTLIDRGANVNSKNKDLSTPLHYACKKNCKlDVIEMLLDNGAD 468
Ank_4 pfam13637
Ankyrin repeats (many copies);
929-983 1.35e-11

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 60.75  E-value: 1.35e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462492226  929 GSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLL 983
Cdd:pfam13637    1 ELTALHAAAASGHLELLRLLLEKG-ADINAVDGNGETALHFAASNGNVEVLKLLL 54
Ank_4 pfam13637
Ankyrin repeats (many copies);
864-916 2.05e-11

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 60.37  E-value: 2.05e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462492226  864 RTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMV 916
Cdd:pfam13637    2 LTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
PHA03100 PHA03100
ankyrin repeat protein; Provisional
843-989 2.16e-10

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 64.69  E-value: 2.16e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  843 KVILMLLDN-LDPNFQSDQQSkrTPLHAAAQK--GSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNH--LEVARYMVQ 917
Cdd:PHA03100    87 EIVKLLLEYgANVNAPDNNGI--TPLLYAISKksNSYSIVEYLLDNGANVNIKNSDGENLLHLYLESNKidLKILKLLID 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  918 RG---------------GC-VYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRM 981
Cdd:PHA03100   165 KGvdinaknrvnyllsyGVpINIKDVYGFTPLHYAVYNNNPEFVKYLLDLG-ANPNLVNKYGDTPLHIAILNNNKEIFKL 243

                   ....*...
gi 2462492226  982 LLTRGADV 989
Cdd:PHA03100   244 LLNNGPSI 251
PHA02876 PHA02876
ankyrin repeat protein; Provisional
843-989 2.39e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 65.08  E-value: 2.39e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  843 KVILMLLDNLDPNFQSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAV--VNNHLEVaRYMVQRGG 920
Cdd:PHA02876   355 KDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIGTALHFALcgTNPYMSV-KTLIDRGA 433
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  921 CVYSKEEDGSTCLHHAAKIG-NLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIevIRMLLTRGADV 989
Cdd:PHA02876   434 NVNSKNKDLSTPLHYACKKNcKLDVIEMLLDNG-ADVNAINIQNQYPLLIALEYHGI--VNILLHYGAEL 500
PHA02875 PHA02875
ankyrin repeat protein; Provisional
831-1023 2.54e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165206 [Multi-domain]  Cd Length: 413  Bit Score: 64.24  E-value: 2.54e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  831 QLYLSVKQGELQKVILMLLDNldpNFQSDQQSKR--TPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNH 908
Cdd:PHA02875    71 ELHDAVEEGDVKAVEELLDLG---KFADDVFYKDgmTPLHLATILKKLDIMKLLIARGADPDIPNTDKFSPLHLAVMMGD 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  909 LEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGAD 988
Cdd:PHA02875   148 IKGIELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVRLFIKRGAD 227
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2462492226  989 ---VTLTDNVSERLVEE-ENICLHWASFTGSAAIAEVLL 1023
Cdd:PHA02875   228 cniMFMIEGEECTILDMiCNMCTNLESEAIDALIADIAI 266
Ank_4 pfam13637
Ankyrin repeats (many copies);
896-949 5.33e-10

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 56.13  E-value: 5.33e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462492226  896 QRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLL 949
Cdd:pfam13637    1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
PHA02875 PHA02875
ankyrin repeat protein; Provisional
864-1072 5.68e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165206 [Multi-domain]  Cd Length: 413  Bit Score: 63.09  E-value: 5.68e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  864 RTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLE 943
Cdd:PHA02875     3 QVALCDAILFGELDIARRLLDIGINPNFEIYDGISPIKLAMKFRDSEAIKLLMKHGAIPDVKYPDIESELHDAVEEGDVK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  944 MVSLLLSTGQV--DVNAQDsgGWTPIIWAAEHKHIEVIRMLLTRGAD--VTLTDNVSErlveeenicLHWASFTGSAAIA 1019
Cdd:PHA02875    83 AVEELLDLGKFadDVFYKD--GMTPLHLATILKKLDIMKLLIARGADpdIPNTDKFSP---------LHLAVMMGDIKGI 151
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462492226 1020 EVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTA 1072
Cdd:PHA02875   152 ELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVA 204
PHA02874 PHA02874
ankyrin repeat protein; Provisional
877-1074 9.92e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 62.67  E-value: 9.92e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  877 EICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQVdV 956
Cdd:PHA02874   105 DMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAY-A 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  957 NAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTltdNVSERLVEEenicLHWASFTGSAAIAEVLLNARCDLHAVNyhG 1036
Cdd:PHA02874   184 NVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIM---NKCKNGFTP----LHNAIIHNRSAIELLINNASINDQDID--G 254
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2462492226 1037 DTPLHIAARESYH-DCVLLFLSRGANPELRNKEGDTAWD 1074
Cdd:PHA02874   255 STPLHHAINPPCDiDIIDILLYHKADISIKDNKGENPID 293
PHA02876 PHA02876
ankyrin repeat protein; Provisional
877-1061 1.10e-09

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 63.16  E-value: 1.10e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  877 EIC-HVLLQA--GANI--NAVDK--QQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLL 949
Cdd:PHA02876   119 EACiHILKEAisGNDIhyDKINEsiEYMKLIKERIQQDELLIAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLL 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  950 STGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGA-----DVTLTDNVSERLVEE-----------------ENICL 1007
Cdd:PHA02876   199 SYG-ADVNIIALDDLSVLECAVDSKNIDTIKAIIDNRSninknDLSLLKAIRNEDLETslllydagfsvnsiddcKNTPL 277
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226 1008 HWASFTGS-AAIAEVLLNARCDLHAVNYHGDTPLHIAARESYH-DCVLLFLSRGAN 1061
Cdd:PHA02876   278 HHASQAPSlSRLVPKLLERGADVNAKNIKGETPLYLMAKNGYDtENIRTLIMLGAD 333
PHA02798 PHA02798
ankyrin-like protein; Provisional
875-994 2.72e-09

ankyrin-like protein; Provisional


Pssm-ID: 222931 [Multi-domain]  Cd Length: 489  Bit Score: 61.39  E-value: 2.72e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  875 SVEICHVLLQAGANINAVDKQQRTPLMEAVVN----NH-LEVARYMVQRGGCVYSKEEDGST---CLHHAAKIGNLEMVS 946
Cdd:PHA02798    50 STDIVKLFINLGANVNGLDNEYSTPLCTILSNikdyKHmLDIVKILIENGADINKKNSDGETplyCLLSNGYINNLEILL 129
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462492226  947 LLLSTGqVDVNAQDSGGWTPI---IWAAEHKHIEVIRMLLTRGADVTLTDN 994
Cdd:PHA02798   130 FMIENG-ADTTLLDKDGFTMLqvyLQSNHHIDIEIIKLLLEKGVDINTHNN 179
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
1232-1360 8.19e-09

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing protein, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SYMD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 55.08  E-value: 8.19e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1232 KMGWGVRALQTIPQGtficeyvgELISDAEADVREDDSYLFDLDNKDGEVYCIdaryYGNISRFiNHLCDPNiipVRVFM 1311
Cdd:cd20071      8 SKGRGLVATRDIEPG--------ELILVEKPLVSVPSNSFSLTDGLNEIGVGL----FPLASLL-NHSCDPN---AVVVF 71
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226 1312 LHQDlrfpRIAFFSSRDIRTGEELGFDYGDRFWD--------IKSKYFTCQCgsEKC 1360
Cdd:cd20071     72 DGNG----TLRVRALRDIKAGEELTISYIDPLLPrterrrelLEKYGFTCSC--PRC 122
PHA03100 PHA03100
ankyrin repeat protein; Provisional
828-952 9.03e-09

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 59.29  E-value: 9.03e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  828 HPRQLYLSVKQGELqKVILMLLDNlDPNFQSDQQSKRTPLHAAAQ--KGSVEICHVLLQAGANINA-------------- 891
Cdd:PHA03100   108 TPLLYAISKKSNSY-SIVEYLLDN-GANVNIKNSDGENLLHLYLEsnKIDLKILKLLIDKGVDINAknrvnyllsygvpi 185
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462492226  892 --VDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTG 952
Cdd:PHA03100   186 niKDVYGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNG 248
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
396-504 1.01e-08

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 60.01  E-value: 1.01e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  396 GKVTSDLAKRRKLNSGG--GLSEELGSARRSGEVTLTKGDPGSLEEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEalt 473
Cdd:TIGR00927  789 GEMKGDEGAEGKVEHEGetEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEE--- 865
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2462492226  474 eqlsEEEEEEEEEEEEEEEEEEEEEEEEDEE 504
Cdd:TIGR00927  866 ----EEEEEEEEEEEEEEEEEEEEEEEENEE 892
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
407-505 1.09e-08

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 60.01  E-value: 1.09e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  407 KLNSGGGLSEELGSARRSGEVTLTKGDPGSLEEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEALTEQLSEEEEEEEEE 486
Cdd:TIGR00927  792 KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEE 871
                           90
                   ....*....|....*....
gi 2462492226  487 EEEEEEEEEEEEEEEDEES 505
Cdd:TIGR00927  872 EEEEEEEEEEEEEEEEEEN 890
Ank_2 pfam12796
Ankyrin repeats (3 copies);
812-893 1.38e-08

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 53.58  E-value: 1.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  812 KALVIQESERRKKLRFHPRQLYLSVKQGELQKVILmLLDNLDPNFQSDQqskRTPLHAAAQKGSVEICHVLLQAGANINA 891
Cdd:pfam12796   14 KLLLENGADANLQDKNGRTALHLAAKNGHLEIVKL-LLEHADVNLKDNG---RTALHYAARSGHLEIVKLLLEKGADINV 89

                   ..
gi 2462492226  892 VD 893
Cdd:pfam12796   90 KD 91
trp TIGR00870
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ...
888-1093 2.91e-08

transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273311 [Multi-domain]  Cd Length: 743  Bit Score: 58.55  E-value: 2.91e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  888 NINAVDKQQRTPLMEAVV-NNHLEVARYMVQRGGCVYSkeedGSTCLHHAAK--IGNLEMVSLLLSTGQVD------VNA 958
Cdd:TIGR00870   44 NINCPDRLGRSALFVAAIeNENLELTELLLNLSCRGAV----GDTLLHAISLeyVDAVEAILLHLLAAFRKsgplelAND 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  959 QDSG----GWTPIIWAAEHKHIEVIRMLLTRGADVTLTDNVSERLVEEENICLHW-------ASFTGSAAIAEVLLNARC 1027
Cdd:TIGR00870  120 QYTSeftpGITALHLAAHRQNYEIVKLLLERGASVPARACGDFFVKSQGVDSFYHgesplnaAACLGSPSIVALLSEDPA 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1028 DLHAVNYHGDTPLHIAARESY---------HDCVLLFLSRGANP-------ELRNKEGDTAWDL-TPERSDVWFALQLNR 1090
Cdd:TIGR00870  200 DILTADSLGNTLLHLLVMENEfkaeyeelsCQMYNFALSLLDKLrdskeleVILNHQGLTPLKLaAKEGRIVLFRLKLAI 279

                   ...
gi 2462492226 1091 KLR 1093
Cdd:TIGR00870  280 KYK 282
PHA03100 PHA03100
ankyrin repeat protein; Provisional
905-1061 3.33e-08

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 57.75  E-value: 3.33e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  905 VNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHI-----EVI 979
Cdd:PHA03100    11 RIIKVKNIKYIIMEDDLNDYSYKKPVLPLYLAKEARNIDVVKILLDNG-ADINSSTKNNSTPLHYLSNIKYNltdvkEIV 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  980 RMLLTRGADVTLTDNVSerlveeeNICLHWASFT--GSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHD--CVLLF 1055
Cdd:PHA03100    90 KLLLEYGANVNAPDNNG-------ITPLLYAISKksNSYSIVEYLLDNGANVNIKNSDGENLLHLYLESNKIDlkILKLL 162

                   ....*.
gi 2462492226 1056 LSRGAN 1061
Cdd:PHA03100   163 IDKGVD 168
PLN03192 PLN03192
Voltage-dependent potassium channel; Provisional
1013-1074 3.37e-08

Voltage-dependent potassium channel; Provisional


Pssm-ID: 215625 [Multi-domain]  Cd Length: 823  Bit Score: 58.34  E-value: 3.37e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462492226 1013 TGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTA-WD 1074
Cdd:PLN03192   535 TGNAALLEELLKAKLDPDIGDSKGRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTAlWN 597
TRPV5-6 cd22192
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ...
894-1042 3.60e-08

Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.


Pssm-ID: 411976 [Multi-domain]  Cd Length: 609  Bit Score: 58.10  E-value: 3.60e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  894 KQQR---TPLMEAVVNNHLEVARYMVQRGGC-VYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQVDVNAQDSG----GWT 965
Cdd:cd22192     12 QQKRiseSPLLLAAKENDVQAIKKLLKCPSCdLFQRGALGETALHVAALYDNLEAAVVLMEAAPELVNEPMTSdlyqGET 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  966 PIIWAAEHKHIEVIRMLLTRGADVtltdnVSER------LVEEENIC------LHWASFTGSAAIAEVLLNARCDLHAVN 1033
Cdd:cd22192     92 ALHIAVVNQNLNLVRELIARGADV-----VSPRatgtffRPGPKNLIyygehpLSFAACVGNEEIVRLLIEHGADIRAQD 166

                   ....*....
gi 2462492226 1034 YHGDTPLHI 1042
Cdd:cd22192    167 SLGNTVLHI 175
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1007-1072 4.03e-08

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 52.04  E-value: 4.03e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226 1007 LHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRgANPELRNkEGDTA 1072
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEH-ADVNLKD-NGRTA 64
Ank_5 pfam13857
Ankyrin repeats (many copies);
948-994 4.87e-08

Ankyrin repeats (many copies);


Pssm-ID: 433530 [Multi-domain]  Cd Length: 56  Bit Score: 50.81  E-value: 4.87e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2462492226  948 LLSTGQVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTDN 994
Cdd:pfam13857    1 LLEHGPIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDE 47
SET_ATXR5_6-like cd10539
SET domain found in fungal protein lysine methyltransferase SET5 and similar protein; The ...
1232-1339 5.10e-08

SET domain found in fungal protein lysine methyltransferase SET5 and similar protein; The family includes Arabidopsis thaliana ATXR5 and ATXR6. Both ATXR5 (also termed protein SET DOMAIN GROUP 15, or TRX-related protein 5) and ATXR6 (also termed protein SET DOMAIN GROUP 34, or TRX-related protein 6) function as histone methyltransferase that specifically monomethylates 'Lys-37' of histone H3 (H3K27me1). They are required for chromatin structure and gene silencing.


Pssm-ID: 380937  Cd Length: 138  Bit Score: 53.18  E-value: 5.10e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1232 KMGWGVRALQTIPQGTFICEYVGEL--ISDAEADvrEDDSYLFDLDNKDGE---VYCIDARyyGNISRFI----NHLCD- 1301
Cdd:cd10539     13 REGFTVEADGFIKDLTIIAEYTGDVdyIRNREFD--DNDSIMTLLLAGDPSkslVICPDKR--GNIARFIsginNHTKDg 88
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2462492226 1302 ---PNIIPVRVFMLHQdlrfPRIAFFSSRDIRTGEELGFDY 1339
Cdd:cd10539     89 kkkQNCKCVRYSINGE----ARVLLVATRDIAKGERLYYDY 125
PHA02878 PHA02878
ankyrin repeat protein; Provisional
866-1071 8.48e-08

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 56.43  E-value: 8.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  866 PLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQrggcVYSKEEDGST--CLHHAAKIGNLE 943
Cdd:PHA02878    40 PLHQAVEARNLDVVKSLLTRGHNVNQPDHRDLTPLHIICKEPNKLGMKEMIR----SINKCSVFYTlvAIKDAFNNRNVE 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  944 MVSLLLSTG-----QVDVNAQDSGGWTPIIWAaehkhiEVIRMLLTRGADVTLTDNvserlvEEENICLHWASFTGSAAI 1018
Cdd:PHA02878   116 IFKIILTNRykniqTIDLVYIDKKSKDDIIEA------EITKLLLSYGADINMKDR------HKGNTALHYATENKDQRL 183
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462492226 1019 AEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDT 1071
Cdd:PHA02878   184 TELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNT 236
PHA02874 PHA02874
ankyrin repeat protein; Provisional
918-1071 4.12e-07

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 54.20  E-value: 4.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  918 RGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVT------L 991
Cdd:PHA02874    24 KGNCINISVDETTTPLIDAIRSGDAKIVELFIKHG-ADINHINTKIPHPLLTAIKIGAHDIIKLLIDNGVDTSilpipcI 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  992 TDNVSERLVE----------EENICLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGAN 1061
Cdd:PHA02874   103 EKDMIKTILDcgidvnikdaELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAY 182
                          170
                   ....*....|
gi 2462492226 1062 PELRNKEGDT 1071
Cdd:PHA02874   183 ANVKDNNGES 192
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
864-894 7.54e-07

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 459634 [Multi-domain]  Cd Length: 34  Bit Score: 46.90  E-value: 7.54e-07
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2462492226  864 RTPLHAAA-QKGSVEICHVLLQAGANINAVDK 894
Cdd:pfam00023    3 NTPLHLAAgRRGNLEIVKLLLSKGADVNARDK 34
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
875-958 9.92e-07

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 53.36  E-value: 9.92e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  875 SVEICH-----------VLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLE 943
Cdd:PTZ00322    83 TVELCQlaasgdavgarILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFRE 162
                           90
                   ....*....|....*
gi 2462492226  944 MVSLLLSTGQVDVNA 958
Cdd:PTZ00322   163 VVQLLSRHSQCHFEL 177
trp TIGR00870
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ...
856-1050 1.40e-06

transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273311 [Multi-domain]  Cd Length: 743  Bit Score: 52.78  E-value: 1.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  856 FQSDQqskrTPLHAAAQKGSVEICHVLLQAGANINAvdkqqrtplmeavvnnhlevarymvqRGGCVYSKEEDGSTCLHH 935
Cdd:TIGR00870  125 FTPGI----TALHLAAHRQNYEIVKLLLERGASVPA--------------------------RACGDFFVKSQGVDSFYH 174
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  936 -------AAKIGNLEMVSLLLSTGQvDVNAQDSGGWTPiiwaaehKHIEVIrmlltrgadvtltdnVSERLVEEENICLH 1008
Cdd:TIGR00870  175 gesplnaAACLGSPSIVALLSEDPA-DILTADSLGNTL-------LHLLVM---------------ENEFKAEYEELSCQ 231
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 2462492226 1009 WASFtgsaaiAEVLLNARCDL----HAVNYHGDTPLHIAARESYHD 1050
Cdd:TIGR00870  232 MYNF------ALSLLDKLRDSkeleVILNHQGLTPLKLAAKEGRIV 271
Ank_5 pfam13857
Ankyrin repeats (many copies);
1022-1075 1.50e-06

Ankyrin repeats (many copies);


Pssm-ID: 433530 [Multi-domain]  Cd Length: 56  Bit Score: 46.57  E-value: 1.50e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462492226 1022 LLNAR-CDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDL 1075
Cdd:pfam13857    1 LLEHGpIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDL 55
TRPV5-6 cd22192
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ...
860-1046 2.74e-06

Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.


Pssm-ID: 411976 [Multi-domain]  Cd Length: 609  Bit Score: 51.94  E-value: 2.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  860 QQSKR---TPLHAAAQKGSVEICHVLLQAganiNAVDKQQRTPLME-----AVVNNHLEVARYMVQrggCV--------Y 923
Cdd:cd22192     11 LQQKRiseSPLLLAAKENDVQAIKKLLKC----PSCDLFQRGALGEtalhvAALYDNLEAAVVLME---AApelvnepmT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  924 SKEEDGSTCLHHAAKIGNLEMVSLLLSTGQVDVNAQDSG-------------GWTPIIWAAEHKHIEVIRMLLTRGADVT 990
Cdd:cd22192     84 SDLYQGETALHIAVVNQNLNLVRELIARGADVVSPRATGtffrpgpknliyyGEHPLSFAACVGNEEIVRLLIEHGADIR 163
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226  991 LTDNVSerlveeeNICLH----WASFTGSAAIAEVLLN--ARCDLHAV----NYHGDTPLHIAARE 1046
Cdd:cd22192    164 AQDSLG-------NTVLHilvlQPNKTFACQMYDLILSydKEDDLQPLdlvpNNQGLTPFKLAAKE 222
Ank_4 pfam13637
Ankyrin repeats (many copies);
963-1023 6.54e-06

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 44.57  E-value: 6.54e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462492226  963 GWTPIIWAAEHKHIEVIRMLLTRGADVTLTDnvserlvEEENICLHWASFTGSAAIAEVLL 1023
Cdd:pfam13637    1 ELTALHAAAASGHLELLRLLLEKGADINAVD-------GNGETALHFAASNGNVEVLKLLL 54
PHA02875 PHA02875
ankyrin repeat protein; Provisional
936-1082 7.53e-06

ankyrin repeat protein; Provisional


Pssm-ID: 165206 [Multi-domain]  Cd Length: 413  Bit Score: 49.99  E-value: 7.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  936 AAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGAdvtlTDNVSERLVEEEnicLHWASFTGS 1015
Cdd:PHA02875     9 AILFGELDIARRLLDIG-INPNFEIYDGISPIKLAMKFRDSEAIKLLMKHGA----IPDVKYPDIESE---LHDAVEEGD 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462492226 1016 AAIAEVLLNARCDLHAVNYH-GDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDLTPERSDV 1082
Cdd:PHA02875    81 VKAVEELLDLGKFADDVFYKdGMTPLHLATILKKLDIMKLLIARGADPDIPNTDKFSPLHLAVMMGDI 148
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1035-1067 8.41e-06

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 459634 [Multi-domain]  Cd Length: 34  Bit Score: 43.82  E-value: 8.41e-06
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2462492226 1035 HGDTPLHIAA-RESYHDCVLLFLSRGANPELRNK 1067
Cdd:pfam00023    1 DGNTPLHLAAgRRGNLEIVKLLLSKGADVNARDK 34
PHA03247 PHA03247
large tegument protein UL36; Provisional
126-382 9.24e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 9.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  126 PRGRGLMRARGRGRAAPPGSRGRGRGGPHRGRGRPrSLLSLPRAQA-SWTPQLSTGLTSPPVPCLPSQGEAPAEMGALLL 204
Cdd:PHA03247  2659 GRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVG-SLTSLADPPPpPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  205 EKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVgdegadtPVGATPLIGDESENLEGDGDLRGGR 284
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR-------PAVASLSESRESLPSPWDPADPPAA 2810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  285 ILLGHATKSFPSSPSkGGSCPSRAKMSMTGAGKSPPSVQSLAMRLLSMPGAQGAAAAGSEPPPATTSPEGQPKVHR-ARK 363
Cdd:PHA03247  2811 VLAPAAALPPAASPA-GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlARP 2889
                          250       260
                   ....*....|....*....|.
gi 2462492226  364 TMSKPGN--GQPPVPEKRPPE 382
Cdd:PHA03247  2890 AVSRSTEsfALPPDQPERPPQ 2910
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
1013-1078 1.06e-05

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 49.90  E-value: 1.06e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226 1013 TGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDLTPE 1078
Cdd:PTZ00322    92 SGDAVGARILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEE 157
Ank_5 pfam13857
Ankyrin repeats (many copies);
915-970 1.25e-05

Ankyrin repeats (many copies);


Pssm-ID: 433530 [Multi-domain]  Cd Length: 56  Bit Score: 43.87  E-value: 1.25e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462492226  915 MVQRGGC-VYSKEEDGSTCLHHAAKIGNLEMVSLLLsTGQVDVNAQDSGGWTPIIWA 970
Cdd:pfam13857    1 LLEHGPIdLNRLDGEGYTPLHVAAKYGALEIVRVLL-AYGVDLNLKDEEGLTALDLA 56
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
963-991 1.47e-05

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 42.96  E-value: 1.47e-05
                            10        20
                    ....*....|....*....|....*....
gi 2462492226   963 GWTPIIWAAEHKHIEVIRMLLTRGADVTL 991
Cdd:smart00248    2 GRTPLHLAAENGNLEVVKLLLDKGADINA 30
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
403-510 1.70e-05

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 49.61  E-value: 1.70e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  403 AKRRKLNSGGGLSEELGSARRSGEVTL-----TKGDPGSL-----EEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEAL 472
Cdd:TIGR00927  759 GDRKETEHEGETEAEGKEDEDEGEIQAgedgeMKGDEGAEgkvehEGETEAGEKDEHEGQSETQADDTEVKDETGEQELN 838
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462492226  473 TEQLSEEEEEEE-------------EEEEEEEEEEEEEEEEEDEESGNQSD 510
Cdd:TIGR00927  839 AENQGEAKQDEKgvdggggsdggdsEEEEEEEEEEEEEEEEEEEEEEEEEE 889
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
846-929 2.11e-05

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 49.13  E-value: 2.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  846 LMLLDNLDPNfqSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSK 925
Cdd:PTZ00322   100 ILLTGGADPN--CRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSRHSQCHFEL 177

                   ....
gi 2462492226  926 EEDG 929
Cdd:PTZ00322   178 GANA 181
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
963-994 2.16e-05

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 459634 [Multi-domain]  Cd Length: 34  Bit Score: 42.66  E-value: 2.16e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2462492226  963 GWTPIIWAAEH-KHIEVIRMLLTRGADVTLTDN 994
Cdd:pfam00023    2 GNTPLHLAAGRrGNLEIVKLLLSKGADVNARDK 34
SET_SpSet7-like cd10540
SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces ...
1224-1345 2.65e-05

SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces pombe Set7 is a novel histone-lysine N-methyltransferase. The family also includes a viral histone H3 lysine 27 methyltransferase from Paramecium bursaria Chlorella virus 1 (PBCV-1).


Pssm-ID: 380938  Cd Length: 112  Bit Score: 44.55  E-value: 2.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1224 RLQLYRTAKMGWGVRALQTIPQGTFIcEYVGELISDAEADVREDDSYLFDLdnkdgeVYCIDARYY----GNISRFiNHL 1299
Cdd:cd10540      1 RLEVKPSTLKGRGVFATRPIKKGEVI-EEAPVIVLPKEEYQHLCKTVLDHY------VFSWGDGCLalalGYGSMF-NHS 72
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462492226 1300 CDPNIIPVRVFMLHqdlrfpRIAFFSSRDIRTGEELGFDYGDRFWD 1345
Cdd:cd10540     73 YTPNAEYEIDFENQ------TIVFYALRDIEAGEELTINYGDDLWD 112
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
864-891 2.80e-05

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 42.19  E-value: 2.80e-05
                            10        20
                    ....*....|....*....|....*...
gi 2462492226   864 RTPLHAAAQKGSVEICHVLLQAGANINA 891
Cdd:smart00248    3 RTPLHLAAENGNLEVVKLLLDKGADINA 30
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
928-960 2.81e-05

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 459634 [Multi-domain]  Cd Length: 34  Bit Score: 42.28  E-value: 2.81e-05
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2462492226  928 DGSTCLHHAA-KIGNLEMVSLLLSTGqVDVNAQD 960
Cdd:pfam00023    1 DGNTPLHLAAgRRGNLEIVKLLLSKG-ADVNARD 33
PHA02798 PHA02798
ankyrin-like protein; Provisional
845-957 3.00e-05

ankyrin-like protein; Provisional


Pssm-ID: 222931 [Multi-domain]  Cd Length: 489  Bit Score: 48.29  E-value: 3.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  845 ILMLLDNLDPNFQSDQQSKRTPL-----HAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVN---NHLEVARYMV 916
Cdd:PHA02798    53 IVKLFINLGANVNGLDNEYSTPLctilsNIKDYKHMLDIVKILIENGADINKKNSDGETPLYCLLSNgyiNNLEILLFMI 132
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2462492226  917 QRGGCVYSKEEDGSTCLHHAAKIGN---LEMVSLLLSTGqVDVN 957
Cdd:PHA02798   133 ENGADTTLLDKDGFTMLQVYLQSNHhidIEIIKLLLEKG-VDIN 175
TRPV5-6 cd22192
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ...
832-967 3.42e-05

Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.


Pssm-ID: 411976 [Multi-domain]  Cd Length: 609  Bit Score: 48.47  E-value: 3.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  832 LYLSVKQGELQKVILMLLDNLDPNFQSDQQSKrTPLHAAAQKGSVEICHVLLQAGAN-INAVDK----QQRTPLMEAVVN 906
Cdd:cd22192     21 LLLAAKENDVQAIKKLLKCPSCDLFQRGALGE-TALHVAALYDNLEAAVVLMEAAPElVNEPMTsdlyQGETALHIAVVN 99
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462492226  907 NHLEVARYMVQRGGCVYSKEEDGS------TCLHH--------AAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPI 967
Cdd:cd22192    100 QNLNLVRELIARGADVVSPRATGTffrpgpKNLIYygehplsfAACVGNEEIVRLLIEHG-ADIRAQDSLGNTVL 173
Ank_4 pfam13637
Ankyrin repeats (many copies);
1006-1056 3.91e-05

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 42.26  E-value: 3.91e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462492226 1006 CLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFL 1056
Cdd:pfam13637    4 ALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
1035-1064 1.32e-04

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 40.26  E-value: 1.32e-04
                            10        20        30
                    ....*....|....*....|....*....|
gi 2462492226  1035 HGDTPLHIAARESYHDCVLLFLSRGANPEL 1064
Cdd:smart00248    1 DGRTPLHLAAENGNLEVVKLLLDKGADINA 30
Ank_5 pfam13857
Ankyrin repeats (many copies);
848-900 1.38e-04

Ankyrin repeats (many copies);


Pssm-ID: 433530 [Multi-domain]  Cd Length: 56  Bit Score: 40.79  E-value: 1.38e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462492226  848 LLDNLDPNFQSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPL 900
Cdd:pfam13857    1 LLEHGPIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTAL 53
PHA02859 PHA02859
ankyrin repeat protein; Provisional
875-967 1.43e-04

ankyrin repeat protein; Provisional


Pssm-ID: 165195 [Multi-domain]  Cd Length: 209  Bit Score: 44.81  E-value: 1.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  875 SVEICHVLLQAGANINAVDKQQRTPLMEAV--VNNHLEVARYMVQRGGCVYSKEEDGSTCLH-----HAAKignlEMVSL 947
Cdd:PHA02859   102 EPEILKILIDSGSSITEEDEDGKNLLHMYMcnFNVRINVIKLLIDSGVSFLNKDFDNNNILYsyilfHSDK----KIFDF 177
                           90       100
                   ....*....|....*....|
gi 2462492226  948 LLSTGqVDVNAQDSGGWTPI 967
Cdd:PHA02859   178 LTSLG-IDINETNKSGYNCY 196
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
928-958 1.53e-04

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 40.26  E-value: 1.53e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 2462492226   928 DGSTCLHHAAKIGNLEMVSLLLSTGqVDVNA 958
Cdd:smart00248    1 DGRTPLHLAAENGNLEVVKLLLDKG-ADINA 30
PHA03100 PHA03100
ankyrin repeat protein; Provisional
865-927 1.64e-04

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 45.81  E-value: 1.64e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462492226  865 TPLHAAAQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEE 927
Cdd:PHA03100   194 TPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNGPSIKTIIE 256
PR-SET_PRDM7_9 cd19193
PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar ...
1234-1343 1.68e-04

PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar proteins; PRDM7 (also termed PR domain-containing protein 7) is a primate-specific histone methyltransferase that is the result of a recent gene duplication of PRDM9. It selectively catalyzes the trimethylation of H3 lysine 4 (H3K4me3). PRDM9 (also termed PR domain-containing protein 9) is a histone methyltransferase that specifically trimethylates 'Lys-4' of histone H3 (H3K4me3) during meiotic prophase and is essential for proper meiotic progression. It also efficiently mono-, di-, and trimethylates H3K36. Aberrant PRDM9 expression is assciated with with genome instability in cancer.


Pssm-ID: 380970 [Multi-domain]  Cd Length: 129  Bit Score: 42.99  E-value: 1.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1234 GWGVRALQTIPQGTFICEYVGELISDAEAdvrEDDSYLFDLDNKDGEVYCIDAR--YYGNISRFIN---HLCDPNIIpvr 1308
Cdd:cd19193     19 GLGVWAEAPIPKGMVFGPYEGEIVEDEEA---ADSGYSWQIYKGGKLSHYIDAKdeSKSNWMRYVNcarNEEEQNLV--- 92
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 2462492226 1309 VFMLHQDlrfprIAFFSSRDIRTGEELGFDYGDRF 1343
Cdd:cd19193     93 AFQYRGK-----IYYRTCKDIAPGTELLVWYGDEY 122
SET_KMT2E cd19182
SET domain found in inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar ...
1237-1340 3.61e-04

SET domain found in inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; KMT2E (also termed inactive lysine N-methyltransferase 2E, myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) plays a key role in hematopoiesis, spermatogenesis and cell cycle progression. It associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. Lack of key residues in the SET domain as well as the presence of an unusually large loop in the SET-I subdomain preclude the interaction of MLL5 SET with its cofactor and substrate thus making MLL5 devoid of any in vitro methyltransferase activity on full-length histones and histone H3 peptide.


Pssm-ID: 380959  Cd Length: 129  Bit Score: 41.80  E-value: 3.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1237 VRALQTIPQGTFICEYVGELISDAEAdvrEDDSYLFD--------LDNKDGEVYCIDARYYGNISRFINHLCDPNIIPVR 1308
Cdd:cd19182     21 LKAAKDLPPDTLIIEYRGKFMLREQF---EANGYFFKrpypfvlfYSKFHGLEMCVDARTFGNEARFIRRSCTPNAEVRH 97
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 2462492226 1309 VF---MLHqdlrfprIAFFSSRDIRTGEEL----GFDYG 1340
Cdd:cd19182     98 VIedgTIH-------LYIYSIRSIPKGTEItiafDFDYG 129
Ank_5 pfam13857
Ankyrin repeats (many copies);
882-936 4.49e-04

Ankyrin repeats (many copies);


Pssm-ID: 433530 [Multi-domain]  Cd Length: 56  Bit Score: 39.64  E-value: 4.49e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462492226  882 LLQAG-ANINAVDKQQRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHA 936
Cdd:pfam13857    1 LLEHGpIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDLA 56
Ank_3 pfam13606
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
864-891 6.68e-04

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.


Pssm-ID: 463933 [Multi-domain]  Cd Length: 30  Bit Score: 38.39  E-value: 6.68e-04
                           10        20
                   ....*....|....*....|....*...
gi 2462492226  864 RTPLHAAAQKGSVEICHVLLQAGANINA 891
Cdd:pfam13606    3 NTPLHLAARNGRLEIVKLLLENGADINA 30
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
933-1004 7.93e-04

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 43.73  E-value: 7.93e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462492226  933 LHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTD---NVSERLVEEEN 1004
Cdd:PTZ00322    86 LCQLAASGDAVGARILLTGG-ADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDkdgKTPLELAEENG 159
PR-SET_PRDM6 cd19191
PR-SET domain found in PR domain zinc finger protein 6 (PRDM6) and similar proteins; PRDM6 ...
1234-1343 8.17e-04

PR-SET domain found in PR domain zinc finger protein 6 (PRDM6) and similar proteins; PRDM6 (also termed PR domain-containing protein 6) is a putative histone-lysine N-methyltransferase that acts as a transcriptional repressor of smooth muscle gene expression. It may specifically methylate 'Lys-20' of histone H4 when associated with other proteins and in vitro.


Pssm-ID: 380968  Cd Length: 128  Bit Score: 40.92  E-value: 8.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1234 GWGVRALQTIPQGTFICEYVGELIS-DAEADVREDDSYLFDLDNKDGEV-YCIDAR--YYGNISRFIN---HLCDPNIIP 1306
Cdd:cd19191     16 GYGICAAQRIPQGTWIGPFEGVLVSpEKQIGAVRNTQHLWEIYDQEGTLqHFIDGGdpSKSSWMRYIRcarHCGEQNLTV 95
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462492226 1307 VRvfmlHQDLRFPRIAffssRDIRTGEELGFDYGDRF 1343
Cdd:cd19191     96 VQ----YRGCIFYRAC----RDIPRGTELLVWYDDSY 124
SET_SMYD4 cd10536
SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing ...
1295-1355 9.09e-04

SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing protein 4 (SMYD4) and similar proteins; SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. In zebrafish, SMYD4 is ubiquitously expressed in early embryos and becomes enriched in the developing heart; mutants show a strong defect in cardiomyocyte proliferation, which lead to a severe cardiac malformation.


Pssm-ID: 380934 [Multi-domain]  Cd Length: 218  Bit Score: 42.29  E-value: 9.09e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462492226 1295 FINHLCDPNIIpvRVFMLHQdlrfprIAFFSSRDIRTGEELGFDYG------DRFW---DIKSKY-FTCQC 1355
Cdd:cd10536    153 LLNHSCDPNTI--RSFYGNT------IVVRATRPIKKGEEITICYGphfsrmKRSErqrLLKEQYfFDCSC 215
Ank_3 pfam13606
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
963-989 9.24e-04

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.


Pssm-ID: 463933 [Multi-domain]  Cd Length: 30  Bit Score: 38.01  E-value: 9.24e-04
                           10        20
                   ....*....|....*....|....*..
gi 2462492226  963 GWTPIIWAAEHKHIEVIRMLLTRGADV 989
Cdd:pfam13606    2 GNTPLHLAARNGRLEIVKLLLENGADI 28
PR-SET_PRDM14 cd19198
PR-SET domain found in PR domain zinc finger protein 14 (PRDM14) and similar proteins; PRDM14 ...
1225-1343 1.25e-03

PR-SET domain found in PR domain zinc finger protein 14 (PRDM14) and similar proteins; PRDM14 (also termed PR domain-containing protein 14) acts as a transcription factor that has both positive and negative roles on transcription. It acts on regulating epigenetic modifications in the cells, playing a key role in the regulation of cell pluripotency, epigenetic reprogramming, differentiation and development. Aberrant PRDM14 expression is associated with tumorigenesis, cell migration and cell chemotherapeutic drugs resistance.


Pssm-ID: 380975  Cd Length: 133  Bit Score: 40.46  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1225 LQLYRTAKMG---WGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLdNKDGEV-YCIDAR-YYGNISRFINhl 1299
Cdd:cd19198      7 LRVLQTSFGGtphYGVFCKKTIPKGTRFGPFRGRVVNTSEIKTYDDNSFMWEI-FEDGKLsHFIDGRgSTGNWMSYVN-- 83
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2462492226 1300 C-----DPNIIPVRvfmlHQDlrfpRIAFFSSRDIRTGEELGFDYGDRF 1343
Cdd:cd19198     84 CaryaeEQNLIAIQ----CQG----QIFYESCKEILQGQELLVWYGDCY 124
PHA02989 PHA02989
ankyrin repeat protein; Provisional
843-1102 1.25e-03

ankyrin repeat protein; Provisional


Pssm-ID: 222954 [Multi-domain]  Cd Length: 494  Bit Score: 43.19  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  843 KVILMLLDN-LDPNFQSDQQskrTPLHAA------AQKGSVEICHVLLQAGANINAVDKQQRTPLMEAVVN---NHLEVA 912
Cdd:PHA02989    51 KIVKLLIDNgADVNYKGYIE---TPLCAVlrnreiTSNKIKKIVKLLLKFGADINLKTFNGVSPIVCFIYNsniNNCDML 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  913 RYMVQRGGCVYS-KEEDGSTCLHHAAK--IGNLEMVSLLLSTGQVDVNAQDSGGWTPIIWAAEHK----HIEVIRMLLTR 985
Cdd:PHA02989   128 RFLLSKGINVNDvKNSRGYNLLHMYLEsfSVKKDVIKILLSFGVNLFEKTSLYGLTPMNIYLRNDidviSIKVIKYLIKK 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  986 GADVTLTDNVSERLVE---EENICLHWASFTgsaaiaevLLN---ARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRG 1059
Cdd:PHA02989   208 GVNIETNNNGSESVLEsflDNNKILSKKEFK--------VLNfilKYIKINKKDKKGFNPLLISAKVDNYEAFNYLLKLG 279
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462492226 1060 ANPELRNKEGDTAWDLTPERSDvwfALQLNRKLRLGVGNRAIR 1102
Cdd:PHA02989   280 DDIYNVSKDGDTVLTYAIKHGN---IDMLNRILQLKPGKYLIK 319
SET_SETD5 cd19181
SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and ...
1217-1303 2.04e-03

SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. SETD5 loss-of-function mutations are a likely cause of a familial syndromic intellectual disability with variable phenotypic expression.


Pssm-ID: 380958  Cd Length: 150  Bit Score: 40.38  E-value: 2.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226 1217 VQSGIKVRLQLYRTAkmgwgVRALQTIPQGTFICEYVGELISDAEADV-----REDDSYLFDLDNKDGEVYCIDARYYGN 1291
Cdd:cd19181      6 LQLGRVTRVQKHRKI-----LRAARDLALDTLIIEYRGKVMLRQQFEVnghffKRPYPFVLFYSKFNGVEMCVDARTFGN 80
                           90
                   ....*....|..
gi 2462492226 1292 ISRFINHLCDPN 1303
Cdd:cd19181     81 DARFIRRSCTPN 92
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
912-983 6.27e-03

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 41.04  E-value: 6.27e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462492226  912 ARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSGGWTPIIWAAEHKHIEVIRMLL 983
Cdd:PTZ00322    98 ARILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFG-ADPTLLDKDGKTPLELAEENGFREVVQLLS 168
TRPV cd21882
Transient Receptor Potential channel, Vanilloid subfamily (TRPV); The vanilloid TRP subfamily ...
897-1046 6.36e-03

Transient Receptor Potential channel, Vanilloid subfamily (TRPV); The vanilloid TRP subfamily (TRPV), named after the vanilloid receptor 1 (TRPV1), consists of six members: four thermo-sensing channels (TRPV1, TRPV2, TRPV3, and TRPV4) and two Ca2+ selective channels (TRPV5 and TRPV6). The calcium-selective channels TRPV5 and TRPV6 can be heterotetramers and are important for general Ca2+ homeostasis. All four channels within the TRPV1-4 group show temperature-invoked currents when expressed in heterologous cell systems, ranging from activation at ~25C for TRPV4 to ~52C for TRPV2. The structure of TRPV shows the typical topology features of all Transient Receptor Potential (TRP) ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains. The TRP family consists of membrane proteins that function as ion channels that communicate between the cell and its environment, by a vast array of physical or chemical stimuli, including radiation (in the form of temperature, infrared ,or light) and pressure (osmotic or mechanical). TRP channels are formed by a tetrameric complex of channel subunits. Based on sequence identity, the mammalian TRP channel family is classified into six subfamilies, with significant sequence similarity within the transmembrane domains, but very low similarity in their N- and C-terminal cytoplasmic regions. The six subfamilies are named based on their first member: TRPC (canonical), TRPV (vanilloid), TRPM (melastatin), TRPA (ankyrin), TRPML (mucolipin), and TRPP (polycystic).


Pssm-ID: 411975 [Multi-domain]  Cd Length: 600  Bit Score: 41.02  E-value: 6.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  897 RTPLMEAVVNNH----------LEVARYMVQRGGCVYSKEED----GSTCLHHAAKIGNLEMVSLLLSTGqVDVNAQDSG 962
Cdd:cd21882     27 KTCLHKAALNLNdgvneaimllLEAAPDSGNPKELVNAPCTDefyqGQTALHIAIENRNLNLVRLLVENG-ADVSARATG 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  963 -------------GWTPIIWAAEHKHIEVIRMLLTRGADV-------TLTDNVSERLVEEENICLHWASFTGSAAIAEVL 1022
Cdd:cd21882    106 rffrkspgnlfyfGELPLSLAACTNQEEIVRLLLENGAQPaaleaqdSLGNTVLHALVLQADNTPENSAFVCQMYNLLLS 185
                          170       180
                   ....*....|....*....|....*....
gi 2462492226 1023 LNARCD----LHAV-NYHGDTPLHIAARE 1046
Cdd:cd21882    186 YGAHLDptqqLEEIpNHQGLTPLKLAAVE 214
Ank_3 pfam13606
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
928-958 7.01e-03

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.


Pssm-ID: 463933 [Multi-domain]  Cd Length: 30  Bit Score: 35.31  E-value: 7.01e-03
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2462492226  928 DGSTCLHHAAKIGNLEMVSLLLSTGqVDVNA 958
Cdd:pfam13606    1 DGNTPLHLAARNGRLEIVKLLLENG-ADINA 30
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
979-1057 8.02e-03

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 40.65  E-value: 8.02e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462492226  979 IRMLLTRGADVTLTDnvserlvEEENICLHWASFTGSAAIAEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLS 1057
Cdd:PTZ00322    98 ARILLTGGADPNCRD-------YDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSR 169
PHA02917 PHA02917
ankyrin-like protein; Provisional
926-1005 8.28e-03

ankyrin-like protein; Provisional


Pssm-ID: 165231 [Multi-domain]  Cd Length: 661  Bit Score: 40.75  E-value: 8.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462492226  926 EEDGSTCLHHAAKIGNLEMVSLLLSTGQvDVNAQDSGGWTPI-IWAAEHKHIEVIRMLLTRGADVtltDNVSERLVEEEN 1004
Cdd:PHA02917   449 DKRGETLLHKAVRYNKQSLVSLLLESGS-DVNIRSNNGYTCIaIAINESRNIELLKMLLCHKPTL---DCVIDSLREISN 524

                   .
gi 2462492226 1005 I 1005
Cdd:PHA02917   525 I 525
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH