NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|38202250|ref|NP_877437|]
View 

formylglycine-generating enzyme isoform 1 precursor [Homo sapiens]

Protein Classification

formylglycine-generating enzyme family protein( domain architecture ID 10510263)

formylglycine-generating enzyme family protein similar to human sulfatase-modifying factor 1 (SUMF1), which oxidizes a cysteine residue in the substrate sulfatase, to an active site 3-oxoalanine residue, also called C(alpha)-formylglycine

CATH:  3.90.1580.10
Gene Ontology:  GO:0046872
SCOP:  4000450

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
FGE-sulfatase pfam03781
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for ...
91-367 1.03e-90

Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for post-translational sulfatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulfatase deficiency (MSD). The protein product of the SUMF1 gene is FGE, formylglycine (FGly),-generating enzyme, which is a sulfatase. Sulfatases are enzymes essential for degradation and remodelling of sulfate esters, and formylglycine (FGly), the key catalytic in the active site, is unique to sulfatases. FGE is localized to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesized sulfatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilized by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulfide bond is present in the active site of FGE. An oxidized cysteine residue, possibly cysteine sulfenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulfatases. In Mycobacteria and Treponema denticola this enzyme functions as an iron(II)-dependent oxidoreductase.


:

Pssm-ID: 397722 [Multi-domain]  Cd Length: 259  Bit Score: 273.22  E-value: 1.03e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    91 MVPIPAGVFTMGTDDPqIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAekfgdsfvfegmlseqvktni 170
Cdd:pfam03781   5 MVLIPGGSFEMGSAER-TGNDNEAPAHDVTVRPFAIDKYPVTNAQYAAFVEATGYTTEV--------------------- 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   171 qqavaAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGK------RLPTEAEWEYSCRGGLHNRLF 244
Cdd:pfam03781  63 -----YPQWWAEVEGANWRHPSGGLSDIDDGADHPVTGVSWYDAVAYARWLGKrtgngyRLPTEAEWEYAARGGSKGRRY 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   245 PWGNKLQPkgqhYANIWQG-EFPVTNTGEDGFQG-TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETL-NPKGP 321
Cdd:pfam03781 138 PWGDELYP----AGNIWQGaDFPNEHAGADSFNGrTSPVGSFPPNALGLYDMAGNVWEWTSDWYKPHYSFAPYDeLSRDN 213
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 38202250   322 PSGKDRVKKGGSYMCHrSYCYRYRCAAR-SQNTPDSSASNLGFRCAA 367
Cdd:pfam03781 214 FGGGYRVVRGGSWACS-VYPSRLRPAFRgNCQTPGTRADDVGFRLVR 259
 
Name Accession Description Interval E-value
FGE-sulfatase pfam03781
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for ...
91-367 1.03e-90

Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for post-translational sulfatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulfatase deficiency (MSD). The protein product of the SUMF1 gene is FGE, formylglycine (FGly),-generating enzyme, which is a sulfatase. Sulfatases are enzymes essential for degradation and remodelling of sulfate esters, and formylglycine (FGly), the key catalytic in the active site, is unique to sulfatases. FGE is localized to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesized sulfatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilized by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulfide bond is present in the active site of FGE. An oxidized cysteine residue, possibly cysteine sulfenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulfatases. In Mycobacteria and Treponema denticola this enzyme functions as an iron(II)-dependent oxidoreductase.


Pssm-ID: 397722 [Multi-domain]  Cd Length: 259  Bit Score: 273.22  E-value: 1.03e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    91 MVPIPAGVFTMGTDDPqIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAekfgdsfvfegmlseqvktni 170
Cdd:pfam03781   5 MVLIPGGSFEMGSAER-TGNDNEAPAHDVTVRPFAIDKYPVTNAQYAAFVEATGYTTEV--------------------- 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   171 qqavaAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGK------RLPTEAEWEYSCRGGLHNRLF 244
Cdd:pfam03781  63 -----YPQWWAEVEGANWRHPSGGLSDIDDGADHPVTGVSWYDAVAYARWLGKrtgngyRLPTEAEWEYAARGGSKGRRY 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   245 PWGNKLQPkgqhYANIWQG-EFPVTNTGEDGFQG-TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETL-NPKGP 321
Cdd:pfam03781 138 PWGDELYP----AGNIWQGaDFPNEHAGADSFNGrTSPVGSFPPNALGLYDMAGNVWEWTSDWYKPHYSFAPYDeLSRDN 213
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 38202250   322 PSGKDRVKKGGSYMCHrSYCYRYRCAAR-SQNTPDSSASNLGFRCAA 367
Cdd:pfam03781 214 FGGGYRVVRGGSWACS-VYPSRLRPAFRgNCQTPGTRADDVGFRLVR 259
YfmG COG1262
Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domain ...
91-368 6.19e-82

Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domain [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440874 [Multi-domain]  Cd Length: 238  Bit Score: 249.92  E-value: 6.19e-82
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250  91 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVnstgylteaekfgdsfvfegmlseqvktni 170
Cdd:COG1262  13 MVLIPGGTFLMGSPEGEGAFDNERPRHRVTVSPFYIDKYEVTNAEYRAFV------------------------------ 62
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250 171 qqavaaapWWLPvkgANWRH-PEGPDSTilhRPDHPVLHVSWNDAVAYCTWAGK------RLPTEAEWEYSCRGGlHNRL 243
Cdd:COG1262  63 --------GWTL---ADGRNnPLYSDFG---GPDHPVVHVSWYDAQAYCRWLGKktgkgyRLPTEAEWEYAARGG-DGRP 127
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250 244 FPWGNKLQPkgqHYANIWqgefpvtntGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPS 323
Cdd:COG1262 128 YPWGDDLPP---ELANYA---------GNDGRGSTAPVGSFPPNPFGLYDMAGNVWEWTADWYDPPYPGAPADGPVGPEN 195
                       250       260       270       280
                ....*....|....*....|....*....|....*....|....*
gi 38202250 324 GKDRVKKGGSYMCHRSYCyryRCAARSQNTPDSSASNLGFRCAAD 368
Cdd:COG1262 196 GGQRVLRGGSWATPPDHL---RSAYRNFFPPDARWQFVGFRLARD 237
GldK_short TIGR03529
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to ...
79-366 2.28e-40

gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. This model represents a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture than that found in Flavobacterium johnsoniae and related species (represented by (TIGR03525). Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.


Pssm-ID: 274632 [Multi-domain]  Cd Length: 344  Bit Score: 145.84  E-value: 2.28e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    79 PVPGER---QLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNS--TGYLTEAEKFG 153
Cdd:TIGR03529  39 GVPKREswqQNVPVGMVVIPAGTFHMGQADEDVPATQINLNKQITISEFFMDKTEVTNNKYRQFLEVvlEGQLATGTPLP 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   154 DSFVFEGMLSEqvkTNIQQAVAAAPWWLPVKGANWRHPEgpdstilhRPDHPVLHVSWNDAVAYCTWAGK---------- 223
Cdd:TIGR03529 119 PEYDMEELYPD---TTVWSTSFSHHMGDPLMEYYFDHPA--------FDNYPVVGVDWNAAKQFCEWRTYhmnayrnees 187
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   224 -------RLPTEAEWEYSCRGGLHNRLFPWGNKL--QPKGQHYANIWQGEfpvTNTGEDGFQGTAPVDAFPPNGYGLYNI 294
Cdd:TIGR03529 188 qydmprfRLPSEAEWEYAARGGRDMAKYPWGGPYlrNKRGCMLANFKPGR---GNYYDDGFPYTAPVAVYFPNDFGLYDM 264
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 38202250   295 VGNAWEWTSD-WWTVHHSVEETLNPK-GPPSGKDRVKKGGSYmchRSYCYRYRCAARSQNTPDSSASNLGFRCA 366
Cdd:TIGR03529 265 AGNVAEWVLDaYAATSVPIVWDLNPVyEDPNEVRKIIRGGSW---KDIAYYLETGTRTFEYEDVSQAHIGFRTV 335
SenA NF041186
selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a ...
76-250 6.29e-24

selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a sulfur-containing derivative of a hercynine, which derives from histidine by trimethylation of its amino group. SenA, a homolog of ergothioneine biosynthesis protein EgtB, completes selenoneine biosynthesis in a pathway that also requires a SelD protein to prepare selenophosphate, EgtD to prepare hercynine, and the selenosugar-producing glycosyltransferase SenB (TIGR04348). SenA catalyzes selenium-carbon bond formation between hercynine and selenosugar to produce selenoneine. Other enzymes that create carbon-selenium bonds include SelA, involved in selenocysteine biosynthesis, SelU, involved in selenouridine biosynthesis, and SbtM, a radical SAM enzyme involved in post-translational modification of the RiPP precursor SbtA (see PMID:34730336).


Pssm-ID: 469092 [Multi-domain]  Cd Length: 384  Bit Score: 101.42  E-value: 6.29e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   76 APGPVPGERQLAHSKMVPIPAGVFTMGTD-DPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAekfgd 154
Cdd:NF041186 155 APPLAAAPAPAPAAGDLAVPGGTFRLGSDpGPGFAFDNEKWAHPVEVAPFEIDAAPVTNAEFLAFVEAGGYRDPR----- 229
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250  155 sfvfegMLSEQVKTNIQQAVAAAP-WWLPVKGANWRHpegpdsTILHR-----PDHPVLHVSWNDAVAYCTWAGKRLPTE 228
Cdd:NF041186 230 ------LWSAAGWAWLAAQGLAAPrYWRRGADGAWQE------RRFGRwqpldPDAPVVHVSAHEAEAYCRWAGRRLPTE 297
                        170       180
                 ....*....|....*....|..
gi 38202250  229 AEWEYSCRGglhNRLFPWGNKL 250
Cdd:NF041186 298 AEWEYAAAG---APGFPWGDSV 316
 
Name Accession Description Interval E-value
FGE-sulfatase pfam03781
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for ...
91-367 1.03e-90

Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for post-translational sulfatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulfatase deficiency (MSD). The protein product of the SUMF1 gene is FGE, formylglycine (FGly),-generating enzyme, which is a sulfatase. Sulfatases are enzymes essential for degradation and remodelling of sulfate esters, and formylglycine (FGly), the key catalytic in the active site, is unique to sulfatases. FGE is localized to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesized sulfatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilized by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulfide bond is present in the active site of FGE. An oxidized cysteine residue, possibly cysteine sulfenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulfatases. In Mycobacteria and Treponema denticola this enzyme functions as an iron(II)-dependent oxidoreductase.


Pssm-ID: 397722 [Multi-domain]  Cd Length: 259  Bit Score: 273.22  E-value: 1.03e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    91 MVPIPAGVFTMGTDDPqIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAekfgdsfvfegmlseqvktni 170
Cdd:pfam03781   5 MVLIPGGSFEMGSAER-TGNDNEAPAHDVTVRPFAIDKYPVTNAQYAAFVEATGYTTEV--------------------- 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   171 qqavaAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGK------RLPTEAEWEYSCRGGLHNRLF 244
Cdd:pfam03781  63 -----YPQWWAEVEGANWRHPSGGLSDIDDGADHPVTGVSWYDAVAYARWLGKrtgngyRLPTEAEWEYAARGGSKGRRY 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   245 PWGNKLQPkgqhYANIWQG-EFPVTNTGEDGFQG-TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETL-NPKGP 321
Cdd:pfam03781 138 PWGDELYP----AGNIWQGaDFPNEHAGADSFNGrTSPVGSFPPNALGLYDMAGNVWEWTSDWYKPHYSFAPYDeLSRDN 213
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 38202250   322 PSGKDRVKKGGSYMCHrSYCYRYRCAAR-SQNTPDSSASNLGFRCAA 367
Cdd:pfam03781 214 FGGGYRVVRGGSWACS-VYPSRLRPAFRgNCQTPGTRADDVGFRLVR 259
YfmG COG1262
Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domain ...
91-368 6.19e-82

Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domain [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440874 [Multi-domain]  Cd Length: 238  Bit Score: 249.92  E-value: 6.19e-82
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250  91 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVnstgylteaekfgdsfvfegmlseqvktni 170
Cdd:COG1262  13 MVLIPGGTFLMGSPEGEGAFDNERPRHRVTVSPFYIDKYEVTNAEYRAFV------------------------------ 62
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250 171 qqavaaapWWLPvkgANWRH-PEGPDSTilhRPDHPVLHVSWNDAVAYCTWAGK------RLPTEAEWEYSCRGGlHNRL 243
Cdd:COG1262  63 --------GWTL---ADGRNnPLYSDFG---GPDHPVVHVSWYDAQAYCRWLGKktgkgyRLPTEAEWEYAARGG-DGRP 127
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250 244 FPWGNKLQPkgqHYANIWqgefpvtntGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPS 323
Cdd:COG1262 128 YPWGDDLPP---ELANYA---------GNDGRGSTAPVGSFPPNPFGLYDMAGNVWEWTADWYDPPYPGAPADGPVGPEN 195
                       250       260       270       280
                ....*....|....*....|....*....|....*....|....*
gi 38202250 324 GKDRVKKGGSYMCHRSYCyryRCAARSQNTPDSSASNLGFRCAAD 368
Cdd:COG1262 196 GGQRVLRGGSWATPPDHL---RSAYRNFFPPDARWQFVGFRLARD 237
GldK_short TIGR03529
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to ...
79-366 2.28e-40

gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. This model represents a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture than that found in Flavobacterium johnsoniae and related species (represented by (TIGR03525). Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.


Pssm-ID: 274632 [Multi-domain]  Cd Length: 344  Bit Score: 145.84  E-value: 2.28e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    79 PVPGER---QLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNS--TGYLTEAEKFG 153
Cdd:TIGR03529  39 GVPKREswqQNVPVGMVVIPAGTFHMGQADEDVPATQINLNKQITISEFFMDKTEVTNNKYRQFLEVvlEGQLATGTPLP 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   154 DSFVFEGMLSEqvkTNIQQAVAAAPWWLPVKGANWRHPEgpdstilhRPDHPVLHVSWNDAVAYCTWAGK---------- 223
Cdd:TIGR03529 119 PEYDMEELYPD---TTVWSTSFSHHMGDPLMEYYFDHPA--------FDNYPVVGVDWNAAKQFCEWRTYhmnayrnees 187
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   224 -------RLPTEAEWEYSCRGGLHNRLFPWGNKL--QPKGQHYANIWQGEfpvTNTGEDGFQGTAPVDAFPPNGYGLYNI 294
Cdd:TIGR03529 188 qydmprfRLPSEAEWEYAARGGRDMAKYPWGGPYlrNKRGCMLANFKPGR---GNYYDDGFPYTAPVAVYFPNDFGLYDM 264
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 38202250   295 VGNAWEWTSD-WWTVHHSVEETLNPK-GPPSGKDRVKKGGSYmchRSYCYRYRCAARSQNTPDSSASNLGFRCA 366
Cdd:TIGR03529 265 AGNVAEWVLDaYAATSVPIVWDLNPVyEDPNEVRKIIRGGSW---KDIAYYLETGTRTFEYEDVSQAHIGFRTV 335
egtB_TIGR03440 TIGR03440
ergothioneine biosynthesis protein EgtB; Members of this family include EgtB, and enzyme of ...
76-307 5.79e-32

ergothioneine biosynthesis protein EgtB; Members of this family include EgtB, and enzyme of the ergothioneine biosynthesis, as found in numerous Actinobacteria. Characterized homologs to this family include a formylglycine-generating enzyme that serves as a maturase for an aerobic sulfatase (cf. the radical SAM enzymes that serve as anaerobic sulfatase maturases). [Biosynthesis of cofactors, prosthetic groups, and carriers, Glutathione and analogs]


Pssm-ID: 274581 [Multi-domain]  Cd Length: 406  Bit Score: 124.36  E-value: 5.79e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    76 APGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEaekfgds 155
Cdd:TIGR03440 155 YQPPVPAPASAPPLRWVAFPGGEFEIGSDADGFAFDNERPRHRVLVPPFEIDARPVTNGEYLEFIEDGGYRRP------- 227
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   156 fvfEGMLSEQVKTNIQQAVAAAPWWLPVKGANWRHPEGPDSTIlhRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSC 235
Cdd:TIGR03440 228 ---ELWLSDGWAWVQAEGWQAPLYWRRDDGTWWVFTLGGLRPL--DPDAPVCHVSYYEADAYARWAGARLPTEAEWEKAA 302
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 38202250   236 RGGLHnrlfpwgnklqpkgqhyaniwQGEFPVTNTGedgfqgtAPVDAFPPNGYGLYNIVGNAWEWTSDWWT 307
Cdd:TIGR03440 303 RWGDA---------------------PPNFAEANLG-------APVGAYPAGAQGLGQLFGDVWEWTASPYE 346
GldK TIGR03525
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to ...
91-374 4.94e-27

gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.


Pssm-ID: 274629  Cd Length: 450  Bit Score: 111.11  E-value: 4.94e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250    91 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVN-------STGYLTEAEKFGD--------- 154
Cdd:TIGR03525  41 MVLVPGGSFIMGKSDEDIAGVMNAPTKTVTVRSFYMDETEITNSEYRQFVEwvrdsivRTKLAELADLAGIgpgdgggsi 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   155 -SFVFEGMLSEQV---------------------------KTNIQQAVAAAP----------WWLP------------VK 184
Cdd:TIGR03525 121 qDYAFKDAESDNAtpyqkymydnyyslgetddyagrklnkKTELIWDTSEYPdeyyvevmdsLYLPedesynglrtfdVT 200
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   185 GANWRHP----------EG--------------PDSTI------------LHRP--------DHPVLHVSWNDAVAYCTW 220
Cdd:TIGR03525 201 KLKYRYSwmdidaaarsKGsrkdfikteevqvyPDTTVwikdfnysynepMHNDyfwhqaydDYPVVGVTWKQARAFCNW 280
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   221 AGK-----------------RLPTEAEWEYSCRGGLHNRLFPWGN--KLQPKGQHYANI--WQGEFpvtntGEDGFQGTA 279
Cdd:TIGR03525 281 RTKykndfrkkkgpanvntfRLPTEAEWEYAARGGLEGATYPWGGpyTKNDRGCFMANFkpVRGDY-----AADEALYTV 355
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   280 PVDAFPPNGYGLYNIVGNAWEWT-SDWWTVHHSVEETLNPK-GPPSGKDRVKKGGSYmchRSYCYRYRCAARSQNTPDSS 357
Cdd:TIGR03525 356 EAKSYEPNDYGLYNMAGNVSEWTnSSYDPSSYEYMSTMNPNvNDSENTRKVVRGGSW---KDVAYFLQVSTRDYEYADSA 432
                         410
                  ....*....|....*..
gi 38202250   358 ASNLGFRCAADRLPTMD 374
Cdd:TIGR03525 433 RSYIGFRTVQDYLGTAV 449
SenA NF041186
selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a ...
76-250 6.29e-24

selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a sulfur-containing derivative of a hercynine, which derives from histidine by trimethylation of its amino group. SenA, a homolog of ergothioneine biosynthesis protein EgtB, completes selenoneine biosynthesis in a pathway that also requires a SelD protein to prepare selenophosphate, EgtD to prepare hercynine, and the selenosugar-producing glycosyltransferase SenB (TIGR04348). SenA catalyzes selenium-carbon bond formation between hercynine and selenosugar to produce selenoneine. Other enzymes that create carbon-selenium bonds include SelA, involved in selenocysteine biosynthesis, SelU, involved in selenouridine biosynthesis, and SbtM, a radical SAM enzyme involved in post-translational modification of the RiPP precursor SbtA (see PMID:34730336).


Pssm-ID: 469092 [Multi-domain]  Cd Length: 384  Bit Score: 101.42  E-value: 6.29e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250   76 APGPVPGERQLAHSKMVPIPAGVFTMGTD-DPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAekfgd 154
Cdd:NF041186 155 APPLAAAPAPAPAAGDLAVPGGTFRLGSDpGPGFAFDNEKWAHPVEVAPFEIDAAPVTNAEFLAFVEAGGYRDPR----- 229
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38202250  155 sfvfegMLSEQVKTNIQQAVAAAP-WWLPVKGANWRHpegpdsTILHR-----PDHPVLHVSWNDAVAYCTWAGKRLPTE 228
Cdd:NF041186 230 ------LWSAAGWAWLAAQGLAAPrYWRRGADGAWQE------RRFGRwqpldPDAPVVHVSAHEAEAYCRWAGRRLPTE 297
                        170       180
                 ....*....|....*....|..
gi 38202250  229 AEWEYSCRGglhNRLFPWGNKL 250
Cdd:NF041186 298 AEWEYAAAG---APGFPWGDSV 316
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH