U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from Nucleotide

    • Showing Current items.

    HSPG2 heparan sulfate proteoglycan 2 [ Homo sapiens (human) ]

    Gene ID: 3339, updated on 3-Nov-2024

    Summary

    Official Symbol
    HSPG2provided by HGNC
    Official Full Name
    heparan sulfate proteoglycan 2provided by HGNC
    Primary source
    HGNC:HGNC:5273
    See related
    Ensembl:ENSG00000142798 MIM:142461; AllianceGenome:HGNC:5273
    Gene type
    protein coding
    RefSeq status
    REVIEWED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Also known as
    PLC; SJA; SJS; HSPG; SJS1; PRCAN
    Summary
    This gene encodes the perlecan protein, which consists of a core protein to which three long chains of glycosaminoglycans (heparan sulfate or chondroitin sulfate) are attached. The perlecan protein is a large multidomain proteoglycan that binds to and cross-links many extracellular matrix components and cell-surface molecules. It has been shown that this protein interacts with laminin, prolargin, collagen type IV, FGFBP1, FBLN2, FGF7 and transthyretin, etc., and it plays essential roles in multiple biological activities. Perlecan is a key component of the vascular extracellular matrix, where it helps to maintain the endothelial barrier function. It is a potent inhibitor of smooth muscle cell proliferation and is thus thought to help maintain vascular homeostasis. It can also promote growth factor (e.g., FGF2) activity and thus stimulate endothelial growth and re-generation. It is a major component of basement membranes, where it is involved in the stabilization of other molecules as well as being involved with glomerular permeability to macromolecules and cell adhesion. Mutations in this gene cause Schwartz-Jampel syndrome type 1, Silverman-Handmaker type of dyssegmental dysplasia, and tardive dyskinesia. Alternative splicing of this gene results in multiple transcript variants. [provided by RefSeq, May 2014]
    Expression
    Broad expression in fat (RPKM 67.9), gall bladder (RPKM 21.5) and 21 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See HSPG2 in Genome Data Viewer
    Location:
    1p36.12
    Exon count:
    103
    Annotation release Status Assembly Chr Location
    RS_2024_08 current GRCh38.p14 (GCF_000001405.40) 1 NC_000001.11 (21822244..21937310, complement)
    RS_2024_08 current T2T-CHM13v2.0 (GCF_009914755.1) 1 NC_060925.1 (21646052..21761123, complement)
    RS_2024_09 previous assembly GRCh37.p13 (GCF_000001405.25) 1 NC_000001.10 (22148737..22263803, complement)

    Chromosome 1 - NC_000001.11Genomic Context describing neighboring genes Neighboring gene ReSE screen-validated silencer GRCh37_chr1:21928741-21928970 Neighboring gene Sharpr-MPRA regulatory region 12165 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:21937517-21938264 Neighboring gene ReSE screen-validated silencer GRCh37_chr1:21939576-21939868 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:21945135-21945976 Neighboring gene RAP1 GTPase activating protein Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:21958265-21959233 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:21960338-21961086 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:21961087-21961835 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:21966135-21966964 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:21977615-21978365 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 384 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 385 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 386 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:21992551-21993051 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 387 Neighboring gene MED14-independent group 3 enhancer GRCh37_chr1:21996570-21997769 Neighboring gene ubiquitin specific peptidase 48 Neighboring gene uncharacterized LOC124904736 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 388 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 336 Neighboring gene H3K27ac hESC enhancer GRCh37_chr1:22099067-22099566 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 337 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22108688-22109459 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22109460-22110230 Neighboring gene ReSE screen-validated silencer GRCh37_chr1:22112153-22112327 Neighboring gene ReSE screen-validated silencer GRCh37_chr1:22137244-22137394 Neighboring gene Sharpr-MPRA regulatory region 6146 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 390 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22148116-22148982 Neighboring gene low density lipoprotein receptor class A domain containing 2 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22168222-22168722 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22170194-22170694 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22174882-22175382 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22175383-22175883 Neighboring gene CDK7 strongly-dependent group 2 enhancer GRCh37_chr1:22178409-22179608 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22191053-22191858 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22191859-22192664 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 341 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22235910-22236626 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22236627-22237342 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22237343-22238058 Neighboring gene ReSE screen-validated silencer GRCh37_chr1:22241885-22242040 Neighboring gene Sharpr-MPRA regulatory region 4422 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22246290-22246830 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22247372-22247912 Neighboring gene ReSE screen-validated silencer GRCh37_chr1:22248108-22248275 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22248994-22249533 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22254196-22254752 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22254753-22255307 Neighboring gene ribosomal protein L21 pseudogene 29 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22258036-22258798 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22262253-22263094 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 391 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22271072-22271789 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:22288389-22289052 Neighboring gene ReSE screen-validated silencer GRCh37_chr1:22290537-22290804 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22312621-22313122 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:22313123-22313622 Neighboring gene RNA, U6 small nuclear 1022, pseudogene Neighboring gene RNA, 7SL, cytoplasmic 386, pseudogene Neighboring gene chymotrypsin like elastase 3B

    Genomic regions, transcripts, and products

    Expression

    • Project title: HPA RNA-seq normal tissues HPA RNA-seq normal tissues
    • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
    • BioProject: PRJEB4337
    • Publication: PMID 24309898
    • Analysis date: Wed Apr 4 07:08:55 2018

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    HIV-1 interactions

    Protein interactions

    Protein Gene Interaction Pubs
    Envelope surface glycoprotein gp120 env HIV-1 SF33 Env (gp120) binds to HSPG2 in polarized infant tonsil cells as shown through immunoprecipitation PubMed
    env HSPG interacts strongly with positively charged-V3 loop of X4-tropic gp120, but weakly with less positively charged-V3 loop of R5-tropic gp120 on epithelial cells PubMed
    Tat tat Perlecan mediates Tat uptake and is required for HIV-1 LTR-directed transactivation in the human colon carcinoma cell line, WiDr. PubMed

    Go to the HIV-1, Human Interaction Database

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by GOA

    Function Evidence Code Pubs
    enables amyloid-beta binding IC
    Inferred by Curator
    more info
    PubMed 
    enables calcium ion binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables collagen V binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables extracellular matrix structural constituent conferring compression resistance RCA
    inferred from Reviewed Computational Analysis
    more info
    PubMed 
    enables integrin binding TAS
    Traceable Author Statement
    more info
    PubMed 
    enables low-density lipoprotein particle receptor binding TAS
    Traceable Author Statement
    more info
    PubMed 
    enables protein binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 
    Process Evidence Code Pubs
    involved_in angiogenesis IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in animal organ regeneration IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in brain development TAS
    Traceable Author Statement
    more info
    PubMed 
    involved_in cell differentiation TAS
    Traceable Author Statement
    more info
    PubMed 
    involved_in circulatory system development TAS
    Traceable Author Statement
    more info
    PubMed 
    involved_in embryo implantation IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in inflammatory response TAS
    Traceable Author Statement
    more info
    PubMed 
    involved_in lipid metabolic process TAS
    Traceable Author Statement
    more info
    PubMed 
    involved_in negative regulation of amyloid fibril formation IGI
    Inferred from Genetic Interaction
    more info
    PubMed 
    involved_in negative regulation of angiogenesis TAS
    Traceable Author Statement
    more info
    PubMed 
    involved_in negative regulation of cell adhesion IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in negative regulation of cell population proliferation IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in positive regulation of endothelial cell proliferation IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in receptor-mediated endocytosis ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    involved_in response to hypoxia IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in response to xenobiotic stimulus IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in smoothened signaling pathway IEA
    Inferred from Electronic Annotation
    more info
     

    General protein information

    Preferred Names
    basement membrane-specific heparan sulfate proteoglycan core protein
    Names
    endorepellin (domain V region)
    perlecan proteoglycan

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    Genomic

    1. NG_016740.1 RefSeqGene

      Range
      4948..120014
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. NM_001291860.2NP_001278789.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform a precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (a).
      Source sequence(s)
      AA450342, AL590103, AL590556, BC033152, BE273742, M85289, X62515
      Conserved Domains (18) summary
      cd00096
      Location:31313200
      Ig; Immunoglobulin domain
      cd05743
      Location:421498
      Ig_Perlecan_D2_like; Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2
      cd05754
      Location:17721857
      Ig3_Perlecan_like; Third immunoglobulin (Ig)-like domain found in Perlecan and similar proteins
      smart00408
      Location:19712033
      IGc2; Immunoglobulin C-2 Type
      smart00200
      Location:80194
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
      smart00281
      Location:9861113
      LamB; Laminin B domain
      smart00409
      Location:414490
      IG; Immunoglobulin
      smart00410
      Location:20582135
      IG_like; Immunoglobulin like
      cd00054
      Location:38533882
      EGF_CA; Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the ...
      cd00055
      Location:11591208
      EGF_Lam; Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in ...
      cd00110
      Location:42044363
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...
      cd00112
      Location:285319
      LDLa; Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about ...
      pfam07679
      Location:35823659
      I-set; Immunoglobulin I-set domain
      pfam00008
      Location:41094139
      EGF; EGF-like domain
      pfam00047
      Location:30273107
      ig; Immunoglobulin domain
      pfam00053
      Location:12761323
      Laminin_EGF; Laminin EGF domain
      pfam13895
      Location:29303010
      Ig_2; Immunoglobulin domain
      cl11960
      Location:35973652
      Ig; Immunoglobulin domain
    2. NM_005529.7NP_005520.4  basement membrane-specific heparan sulfate proteoglycan core protein isoform b precursor

      See identical proteins and their annotated locations for NP_005520.4

      Status: REVIEWED

      Description
      Transcript Variant: This variant (2) uses an alternate in-frame splice site in the 5' coding region, compared to variant 1, resulting in an isoform (b) that is 1 aa shorter than isoform a.
      Source sequence(s)
      AA450342, AL590103, AL590556, BC033152, BE273742, M85289, X62515
      Consensus CDS
      CCDS30625.1
      UniProtKB/Swiss-Prot
      P98160, Q16287, Q5SZI3, Q9H3V5
      Related
      ENSP00000363827.3, ENST00000374695.8
      Conserved Domains (18) summary
      cd00096
      Location:31303199
      Ig; Immunoglobulin domain
      cd05743
      Location:421498
      Ig_Perlecan_D2_like; Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2
      cd05754
      Location:17711856
      Ig3_Perlecan_like; Third immunoglobulin (Ig)-like domain found in Perlecan and similar proteins
      smart00408
      Location:19702032
      IGc2; Immunoglobulin C-2 Type
      smart00200
      Location:80194
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
      smart00281
      Location:9851112
      LamB; Laminin B domain
      smart00409
      Location:414490
      IG; Immunoglobulin
      smart00410
      Location:20572134
      IG_like; Immunoglobulin like
      cd00054
      Location:38523881
      EGF_CA; Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the ...
      cd00055
      Location:11581207
      EGF_Lam; Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in ...
      cd00110
      Location:42034362
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...
      cd00112
      Location:285319
      LDLa; Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about ...
      pfam07679
      Location:35813658
      I-set; Immunoglobulin I-set domain
      pfam00008
      Location:41084138
      EGF; EGF-like domain
      pfam00047
      Location:30263106
      ig; Immunoglobulin domain
      pfam00053
      Location:12751322
      Laminin_EGF; Laminin EGF domain
      pfam13895
      Location:29293009
      Ig_2; Immunoglobulin domain
      cl11960
      Location:35963651
      Ig; Immunoglobulin domain

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2024_08

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000001.11 Reference GRCh38.p14 Primary Assembly

      Range
      21822244..21937310 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_047419090.1XP_047275046.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X3

    2. XM_047419080.1XP_047275036.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X2

    3. XM_011541318.3XP_011539620.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X1

      Conserved Domains (18) summary
      cd00096
      Location:33133382
      Ig; Immunoglobulin domain
      cd05743
      Location:438515
      Ig_Perlecan_D2_like; Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2
      cd05754
      Location:19542039
      Ig3_Perlecan_like; Third immunoglobulin (Ig)-like domain found in Perlecan and similar proteins
      smart00408
      Location:21532215
      IGc2; Immunoglobulin C-2 Type
      smart00200
      Location:80193
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
      smart00281
      Location:10031130
      LamB; Laminin B domain
      smart00409
      Location:431507
      IG; Immunoglobulin
      smart00410
      Location:22402317
      IG_like; Immunoglobulin like
      cd00054
      Location:40354064
      EGF_CA; Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the ...
      cd00055
      Location:11761225
      EGF_Lam; Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in ...
      cd00110
      Location:43864545
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...
      cd00112
      Location:302336
      LDLa; Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about ...
      pfam07679
      Location:37643841
      I-set; Immunoglobulin I-set domain
      pfam00008
      Location:42914321
      EGF; EGF-like domain
      pfam00047
      Location:32093289
      ig; Immunoglobulin domain
      pfam00053
      Location:12931340
      Laminin_EGF; Laminin EGF domain
      pfam13895
      Location:31123192
      Ig_2; Immunoglobulin domain
      cl11960
      Location:37793834
      Ig; Immunoglobulin domain
    4. XM_017001122.1XP_016856611.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X7

    5. XM_017001121.1XP_016856610.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X6

    6. XM_047419091.1XP_047275047.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X5

    7. XM_017001120.1XP_016856609.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X4

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060925.1 Alternate T2T-CHM13v2.0

      Range
      21646052..21761123 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_054336237.1XP_054192212.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X3

    2. XM_054336236.1XP_054192211.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X2

    3. XM_054336235.1XP_054192210.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X1

    4. XM_054336241.1XP_054192216.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X7

    5. XM_054336240.1XP_054192215.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X6

    6. XM_054336239.1XP_054192214.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X5

    7. XM_054336238.1XP_054192213.1  basement membrane-specific heparan sulfate proteoglycan core protein isoform X4