Conserved domains on  [gi|530387589|ref|XP_005273469|]

heparan-alpha-glucosaminide N-acetyltransferase isoform X3 [Homo sapiens]

Protein Classification

COG4299 superfamily protein (domain architecture ID 1903386)

List of domain hits

Name                  Accession  Description                                                                   Interval  E-value
COG4299 super family  cl42651    Predicted acyltransferase, DUF1624 domain [General function prediction only]  237-487   1.10e-54


The actual alignment was detected with superfamily member COG4299:

Pssm-ID: 443440 [Multi-domain]  Cd Length: 370  Bit Score: 187.68  E-value: 1.10e-54
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 237 PPRLRSVDTFRGIALILMVFVNYGGG---KYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 313
Cdd:COG4299    2 SKRLLSLDVLRGLTIALMILVNNPGSwshVYAPLLHAEWHGFTPTDLVFPFFLFIVGVAMPFSLSKRLAKGAPKSALYRK 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 314 IAWRSFLLICIGIIIvnpnYCLG--PLSWDKVRIPGVLQRLGVTYFVVAVLELLFakpvpehcasersclslrditSSWP 391
Cdd:COG4299   82 ILKRSLILFLLGLFL----NWFPffLKDFSEIRIPGVLQRIALAYLFAALLYLYL---------------------SRKT 136
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 392 QWLLILV-LEGLWLGLTFlLPVPGCPTGYLGPGGigdfgkypNctggAAGYIDRLLLGDDHLYQhpssavlyHTEVAYDP 470
Cdd:COG4299  137 QLIIAAGlLLGYWLLLAF-VPVPGFGAGPLSPEG--------N----LAAYIDRLLLGKGHLYK--------GEGKTFDP 195
                        250
                 ....*....|....*..
gi 530387589 471 EGILGTINSIVMAFLGV 487
Cdd:COG4299  196 EGLLSTLPAIVTVLLGY 212
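
The alignment rows above can be compared column by column. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues between a query row and a Cdd row. It assumes that dashes are gaps and that lowercase letters mark unaligned residues, which is how the layout above appears to use them; the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query, subject):
    """Return (identical, aligned) column counts for two equal-length alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Assumption: '-' is a gap and lowercase residues are unaligned, so skip both.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Last block of the COG4299 alignment (query residues 471-487, Cdd positions 196-212).
query_row = "EGILGTINSIVMAFLGV"
cdd_row = "EGLLSTLPAIVTVLLGY"

identical, aligned = aligned_identity(query_row, cdd_row)
print(f"{identical}/{aligned} aligned columns identical ({100 * identical / aligned:.1f}%)")
```

Note that the Cdd row is a representative sequence for a position-specific scoring matrix, so a per-column identity count is only a rough indicator and does not reproduce the bit score reported above.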
 
Name                  Accession  Description                                                                   Interval  E-value
COG4299               COG4299    Predicted acyltransferase, DUF1624 domain [General function prediction only]  237-487   1.10e-54


Pssm-ID: 443440 [Multi-domain]  Cd Length: 370  Bit Score: 187.68  E-value: 1.10e-54
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 237 PPRLRSVDTFRGIALILMVFVNYGGG---KYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 313
Cdd:COG4299    2 SKRLLSLDVLRGLTIALMILVNNPGSwshVYAPLLHAEWHGFTPTDLVFPFFLFIVGVAMPFSLSKRLAKGAPKSALYRK 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 314 IAWRSFLLICIGIIIvnpnYCLG--PLSWDKVRIPGVLQRLGVTYFVVAVLELLFakpvpehcasersclslrditSSWP 391
Cdd:COG4299   82 ILKRSLILFLLGLFL----NWFPffLKDFSEIRIPGVLQRIALAYLFAALLYLYL---------------------SRKT 136
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 392 QWLLILV-LEGLWLGLTFlLPVPGCPTGYLGPGGigdfgkypNctggAAGYIDRLLLGDDHLYQhpssavlyHTEVAYDP 470
Cdd:COG4299  137 QLIIAAGlLLGYWLLLAF-VPVPGFGAGPLSPEG--------N----LAAYIDRLLLGKGHLYK--------GEGKTFDP 195
                        250
                 ....*....|....*..
gi 530387589 471 EGILGTINSIVMAFLGV 487
Cdd:COG4299  196 EGLLSTLPAIVTVLLGY 212
COG3503               COG3503    Uncharacterized membrane protein, DUF1624 family [Function unknown]           237-340   9.41e-06


Pssm-ID: 442726  Cd Length: 273  Bit Score: 47.14  E-value: 9.41e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589 237 PPRLRSVDTFRGIALILMV------FVNYGGGKYWYFKHASWNGLtVADLVFPWFVFIMGSSIFLSmtsiLQRGCSKFRL 310
Cdd:COG3503    1 TSRLASIDALRGLAMVLMAldhvrdDLHFFGLVPTDLATTPPWRW-FTHLCAPLFLFLAGVSLYLA----HSRGIRWRAL 75
                         90       100       110
                 ....*....|....*....|....*....|
gi 530387589 311 LGKIAWRSFLLICIGIIIVNPNYCLGPLSW 340
Cdd:COG3503   76 SRFLLKRGLWLILLALLITLFTWLFFPDSF 105
DUF5009               pfam16401  Domain of unknown function (DUF5009)                                          239-328   3.48e-05

Domain of unknown function (DUF5009); This small family of proteins is functionally uncharacterized. This family is mainly found in various Bacteroides species. The members in this family are around 470 residues in length.


Pssm-ID: 293010  Cd Length: 260  Bit Score: 45.56  E-value: 3.48e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589  239 RLRSVDTFRGIALILMVF---VNYGGGKYWYFkHAS-----------WNGLTVADLVFPWFVFIMGSSIFLSMTSILQRG 304
Cdd:pfam16401   1 RALSLDALRGYAIILMVLsgsIAFSILPGWMY-HAQtpppghifnpeIPGITWVDLVFPFFLFAMGAAIPLALGKKAEKG 79
                          90       100
                  ....*....|....*....|....
gi 530387589  305 CSKFRLLGKIAWRSFLLICIGIII 328
Cdd:pfam16401  80 SSKLLLLYDAIKRFVLLTFFALFT 103
HGSNAT_cat            pfam07786  Heparan-alpha-glucosaminide N-acetyltransferase, catalytic                    239-433   1.30e-03

Heparan-alpha-glucosaminide N-acetyltransferase, catalytic; This entry includes the catalytic domain of HGSNAT (Heparan-alpha-glucosaminide N-acetyltransferase). It contains the conserved histidine in the active site (His269), thought to hold the acetyl group during the transfer across the membrane and required for its enzymatic activity. HGSNAT transfers an acetyl group from cytoplasmically derived acetyl-CoA to terminal N-glucosamine residues of heparan sulfate within the lysosomes.


Pssm-ID: 377915  Cd Length: 222  Bit Score: 40.28  E-value: 1.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589  239 RLRSVDTFRGIALILMVF--------------VNYGGGKYWYFKHAswngltVADLvfpwFVFIMGSSIFLSmtsilqrg 304
Cdd:pfam07786   1 RYWEIDALRGIALILMIIfhflwdleffgyldVDLTSGFWVYFARL------IASL----FLFIAGISLVLA-------- 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387589  305 CSKFRLLGKIAWRSFLLICIGIIIVNPNYCLGPLSWdkVRIpGVLQRLGvtyfVVAVLELLFAKpvpehcasersclslr 384
Cdd:pfam07786  63 HGRGLRWRKFLKRGLKIFAAALLITAATYIAFPDSF--IYF-GILHFIG----LASLLGLLFLR---------------- 119
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 530387589  385 ditssWPQWLLILVLeGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPN 433
Cdd:pfam07786 120 -----LPKWLLLLGA-LLFLALGLFLRSPTFDTPLLLWLGLSPLPFRTL 162
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
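
As an illustration of how the E-value threshold relates to the hit list, the sketch below re-applies a 0.01 cutoff to the four specific hits reported above and derives the length of each interval. The values are copied from this page; the helper names `interval_length` and `passing_hits`, and the choice of `<=` for the comparison, are assumptions made for the example rather than a description of how CD-Search applies the cutoff internally.

```python
# Hits copied from the domain-hit list: (name, accession, interval, E-value).
HITS = [
    ("COG4299", "COG4299", "237-487", 1.10e-54),
    ("COG3503", "COG3503", "237-340", 9.41e-06),
    ("DUF5009", "pfam16401", "239-328", 3.48e-05),
    ("HGSNAT_cat", "pfam07786", "239-433", 1.30e-03),
]

E_VALUE_THRESHOLD = 0.01  # taken from the preset options listed above


def interval_length(interval):
    """Number of query residues in an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


def passing_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Keep hits whose E-value does not exceed the threshold (assumed inclusive)."""
    return [hit for hit in hits if hit[3] <= threshold]


for name, accession, interval, e_value in passing_hits(HITS):
    length = interval_length(interval)
    print(f"{name:<12}{accession:<11}{interval:<9}{length:>4} aa  E={e_value:.2e}")
```

All four hits fall under the 0.01 threshold, which is consistent with their appearing in the list; a stricter cutoff passed as `threshold` would drop the weaker ones.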
