U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

CENPA centromere protein A [ Homo sapiens (human) ]

Gene ID: 1058, updated on 2-Nov-2024

Summary

Official Symbol
CENPAprovided by HGNC
Official Full Name
centromere protein Aprovided by HGNC
Primary source
HGNC:HGNC:1851
See related
Ensembl:ENSG00000115163 MIM:117139; AllianceGenome:HGNC:1851
Gene type
protein coding
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
CenH3; CENP-A
Summary
Centromeres are the differentiated chromosomal domains that specify the mitotic behavior of chromosomes. This gene encodes a centromere protein which contains a histone H3 related histone fold domain that is required for targeting to the centromere. Centromere protein A is proposed to be a component of a modified nucleosome or nucleosome-like structure in which it replaces 1 or both copies of conventional histone H3 in the (H3-H4)2 tetrameric core of the nucleosome particle. The protein is a replication-independent histone that is a member of the histone H3 family. Alternative splicing results in multiple transcript variants encoding distinct isoforms. [provided by RefSeq, Nov 2015]
Expression
Broad expression in lymph node (RPKM 4.5), appendix (RPKM 2.5) and 14 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See CENPA in Genome Data Viewer
Location:
2p23.3
Exon count:
5
Annotation release Status Assembly Chr Location
RS_2024_08 current GRCh38.p14 (GCF_000001405.40) 2 NC_000002.12 (26786056..26794589)
RS_2024_08 current T2T-CHM13v2.0 (GCF_009914755.1) 2 NC_060926.1 (26827645..26836182)
RS_2024_09 previous assembly GRCh37.p13 (GCF_000001405.25) 2 NC_000002.11 (27008924..27017457)

Chromosome 2 - NC_000002.12Genomic Context describing neighboring genes Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:26919925-26920426 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26929035-26929932 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:26932169-26933090 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:26933091-26934010 Neighboring gene potassium two pore domain channel subfamily K member 3 Neighboring gene P300/CBP strongly-dependent group 1 enhancer GRCh37_chr2:26946480-26947679 Neighboring gene H3K27ac hESC enhancer GRCh37_chr2:26960802-26961302 Neighboring gene H3K27ac hESC enhancer GRCh37_chr2:26961303-26961803 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:26965319-26965824 Neighboring gene OCT4-NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26972913-26973660 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26975905-26976652 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26976653-26977400 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26977401-26978148 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26981141-26981888 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:26981889-26982634 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 15476 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 11276 Neighboring gene solute carrier family 35 member F6 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:27007325-27008094 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 15477 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 11277 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 15478 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 15479 Neighboring gene MPRA-validated peak3629 silencer Neighboring gene CDKN2A interacting protein N-terminal like pseudogene 2 Neighboring gene dihydropyrimidinase like 5 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 15480 Neighboring gene NANOG-H3K4me1 hESC enhancer GRCh37_chr2:27135238-27135875 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:27136064-27137020 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:27157221-27157720 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:27172437-27172937

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

GeneRIFs: Gene References Into Functions

What's a GeneRIF?

HIV-1 interactions

Protein interactions

Protein Gene Interaction Pubs
Vpr vpr HIV-1 Vpr and p300/HAT co-localizes with CENP-A and CENP-H proteins in the centromere and arm region of chromosome PubMed

Go to the HIV-1, Human Interaction Database

Pathways from PubChem

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Markers

Gene Ontology Provided by GOA

Function Evidence Code Pubs
enables DNA binding IEA
Inferred from Electronic Annotation
more info
 
enables chromatin binding TAS
Traceable Author Statement
more info
PubMed 
enables protein binding IPI
Inferred from Physical Interaction
more info
PubMed 
enables protein heterodimerization activity IEA
Inferred from Electronic Annotation
more info
 
enables structural constituent of chromatin IEA
Inferred from Electronic Annotation
more info
 
Component Evidence Code Pubs
part_of CENP-A containing nucleosome IDA
Inferred from Direct Assay
more info
PubMed 
part_of CENP-A containing nucleosome IPI
Inferred from Physical Interaction
more info
PubMed 
located_in chromosome, centromeric region IDA
Inferred from Direct Assay
more info
PubMed 
located_in condensed chromosome, centromeric region IDA
Inferred from Direct Assay
more info
PubMed 
located_in cytosol TAS
Traceable Author Statement
more info
 
located_in nucleoplasm IDA
Inferred from Direct Assay
more info
 
located_in nucleoplasm TAS
Traceable Author Statement
more info
 
part_of nucleosome IDA
Inferred from Direct Assay
more info
PubMed 
located_in nucleus HDA PubMed 
is_active_in nucleus IBA
Inferred from Biological aspect of Ancestor
more info
 
located_in nucleus IDA
Inferred from Direct Assay
more info
PubMed 
located_in pericentric heterochromatin IEA
Inferred from Electronic Annotation
more info
 

General protein information

Preferred Names
histone H3-like centromeric protein A
Names
centromere autoantigen A
centromere protein A, 17kDa
centromere-specific histone

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001042426.2NP_001035891.1  histone H3-like centromeric protein A isoform b

    See identical proteins and their annotated locations for NP_001035891.1

    Status: REVIEWED

    Description
    Transcript Variant: This variant (2) lacks an in-frame exon in the coding region, compared to variant 1. The resulting protein (isoform b) is shorter than isoform a.
    Source sequence(s)
    AC011740, BC000881
    Consensus CDS
    CCDS42662.1
    UniProtKB/Swiss-Prot
    P49450
    Related
    ENSP00000233505.8, ENST00000233505.12
    Conserved Domains (1) summary
    cl23735
    Location:34111
    H4; Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in ...
  2. NM_001809.4NP_001800.1  histone H3-like centromeric protein A isoform a

    See identical proteins and their annotated locations for NP_001800.1

    Status: REVIEWED

    Description
    Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (a).
    Source sequence(s)
    AC011740
    Consensus CDS
    CCDS1729.1
    UniProtKB/Swiss-Prot
    D6W544, P49450, Q53T74, Q9BVW2
    Related
    ENSP00000336868.4, ENST00000335756.9
    Conserved Domains (1) summary
    smart00428
    Location:34137
    H3; Histone H3

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2024_08

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000002.12 Reference GRCh38.p14 Primary Assembly

    Range
    26786056..26794589
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060926.1 Alternate T2T-CHM13v2.0

    Range
    26827645..26836182
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)