U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Sp4 trans-acting transcription factor 4 [ Mus musculus (house mouse) ]

Gene ID: 20688, updated on 9-Dec-2024

Summary

Official Symbol
Sp4provided by MGI
Official Full Name
trans-acting transcription factor 4provided by MGI
Primary source
MGI:MGI:107595
See related
Ensembl:ENSMUSG00000025323 AllianceGenome:MGI:107595
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
HF-1b; HF1-b; 5730497N03Rik
Summary
Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific; RNA polymerase II cis-regulatory region sequence-specific DNA binding activity; and identical protein binding activity. Acts upstream of or within regulation of heart contraction. Predicted to be located in cytosol and nucleoplasm. Is expressed in several structures, including alimentary system; embryo mesenchyme; genitourinary system; nervous system; and sensory organ. Used to study schizophrenia. Human ortholog(s) of this gene implicated in congenital heart disease and dilated cardiomyopathy. Orthologous to human SP4 (Sp4 transcription factor). [provided by Alliance of Genome Resources, Dec 2024]
Expression
Broad expression in whole brain E14.5 (RPKM 2.8), CNS E14 (RPKM 2.6) and 23 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Sp4 in Genome Data Viewer
Location:
12 F2; 12 63.48 cM
Exon count:
6
Annotation release Status Assembly Chr Location
RS_2024_02 current GRCm39 (GCF_000001635.27) 12 NC_000078.7 (118195421..118265211, complement)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 12 NC_000078.6 (118231686..118301440, complement)

Chromosome 12 - NC_000078.7Genomic Context describing neighboring genes Neighboring gene 60S acidic ribosomal protein P2 pseudogene Neighboring gene STARR-seq mESC enhancer starr_33427 Neighboring gene cell division cycle associated 7 like Neighboring gene dynein, axonemal, heavy chain 11 Neighboring gene STARR-seq mESC enhancer starr_33429 Neighboring gene STARR-positive B cell enhancer mm9_chr12:119539914-119540215 Neighboring gene STARR-positive B cell enhancer mm9_chr12:119540362-119540663 Neighboring gene CapStarr-seq enhancer MGSCv37_chr12:119611733-119611916 Neighboring gene STARR-seq mESC enhancer starr_33430 Neighboring gene CapStarr-seq enhancer MGSCv37_chr12:119615690-119615886 Neighboring gene Riken cDNA D230030E09 gene Neighboring gene predicted gene, 25577

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Bibliography

GeneRIFs: Gene References Into Functions

What's a GeneRIF?

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics  (MGI)
  • Endonuclease-mediated (1) 
  • Targeted (6)  1 citation

General gene information

Gene Ontology Provided by MGI

Function Evidence Code Pubs
enables DNA binding ISO
Inferred from Sequence Orthology
more info
 
enables DNA binding ISS
Inferred from Sequence or Structural Similarity
more info
PubMed 
enables DNA-binding transcription factor activity ISO
Inferred from Sequence Orthology
more info
 
enables DNA-binding transcription factor activity, RNA polymerase II-specific IBA
Inferred from Biological aspect of Ancestor
more info
 
enables RNA polymerase II cis-regulatory region sequence-specific DNA binding IBA
Inferred from Biological aspect of Ancestor
more info
 
enables identical protein binding ISO
Inferred from Sequence Orthology
more info
 
enables metal ion binding IEA
Inferred from Electronic Annotation
more info
 
enables sequence-specific DNA binding ISO
Inferred from Sequence Orthology
more info
 
Component Evidence Code Pubs
located_in cytosol ISO
Inferred from Sequence Orthology
more info
 
located_in nucleoplasm ISO
Inferred from Sequence Orthology
more info
 
located_in nucleus IEA
Inferred from Electronic Annotation
more info
 

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001166385.2NP_001159857.2  transcription factor Sp4 isoform 2

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) uses an alternate in-frame splice site in the 3' coding region, compared to variant 1. This difference results in a shorter isoform (2) compared to isoform 1.
    Source sequence(s)
    AC163032, AC163034, BC076630, BY070406
    Consensus CDS
    CCDS88413.1
    UniProtKB/TrEMBL
    K4DI62, Q6DFV2
    Related
    ENSMUSP00000026367.11, ENSMUST00000026367.11
    Conserved Domains (4) summary
    sd00017
    Location:645667
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam00096
    Location:703725
    zf-C2H2; Zinc finger, C2H2 type
    cd22536
    Location:30644
    SP4_N; N-terminal domain of transcription factor Specificity Protein (SP) 4
    pfam13465
    Location:689712
    zf-H2C2_2; Zinc-finger double domain
  2. NM_009239.4NP_033265.4  transcription factor Sp4 isoform 1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (1).
    Source sequence(s)
    AC163032, AC163034, BC076630, BY070406
    Consensus CDS
    CCDS36579.1
    UniProtKB/Swiss-Prot
    E9QNI3, Q62445
    UniProtKB/TrEMBL
    A0A1Y7VJR5
    Related
    ENSMUSP00000152603.2, ENSMUST00000222314.2
    Conserved Domains (4) summary
    COG5048
    Location:660731
    COG5048; FOG: Zn-finger [General function prediction only]
    sd00017
    Location:647669
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam00096
    Location:705727
    zf-C2H2; Zinc finger, C2H2 type
    pfam13465
    Location:691714
    zf-H2C2_2; Zinc-finger double domain

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000078.7 Reference GRCm39 C57BL/6J

    Range
    118195421..118265211 complement
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_017315007.3XP_017170496.1  transcription factor Sp4 isoform X1

    UniProtKB/Swiss-Prot
    E9QNI3, Q62445
    Conserved Domains (3) summary
    sd00017
    Location:632654
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam00096
    Location:690712
    zf-C2H2; Zinc finger, C2H2 type
    pfam13465
    Location:676699
    zf-H2C2_2; Zinc-finger double domain
  2. XM_036157278.1XP_036013171.1  transcription factor Sp4 isoform X2

    UniProtKB/Swiss-Prot
    E9QNI3, Q62445
    Conserved Domains (4) summary
    sd00017
    Location:588610
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam00096
    Location:646668
    zf-C2H2; Zinc finger, C2H2 type
    pfam13465
    Location:632655
    zf-H2C2_2; Zinc-finger double domain
    cd22536
    Location:30587
    SP4_N; N-terminal domain of transcription factor Specificity Protein (SP) 4
  3. XM_036157279.1XP_036013172.1  transcription factor Sp4 isoform X3

    Conserved Domains (5) summary
    COG5048
    Location:349420
    COG5048; FOG: Zn-finger [General function prediction only]
    sd00017
    Location:336358
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam00096
    Location:334358
    zf-C2H2; Zinc finger, C2H2 type
    pfam13465
    Location:380403
    zf-H2C2_2; Zinc-finger double domain
    cl41773
    Location:1335
    SP1-4_N; N-terminal domain of transcription factor Specificity Proteins (SP) 1-4