U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Gm20939 predicted gene, 20939 [ Mus musculus (house mouse) ]

Gene ID: 100044193, updated on 9-Aug-2024

Summary

Official Symbol
Gm20939provided by MGI
Official Full Name
predicted gene, 20939provided by MGI
Primary source
MGI:MGI:5434295
See related
AllianceGenome:MGI:5434295
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Expression
Low expression observed in reference dataset See more
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Gm20939 in Genome Data Viewer
Location:
17; 17 E5
Exon count:
4
Annotation release Status Assembly Chr Location
RS_2024_02 current GRCm39 (GCF_000001635.27) 17 NC_000083.7 (95172329..95187353)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 17 NC_000083.6 (94864901..94879925)

Chromosome 17 - NC_000083.7Genomic Context describing neighboring genes Neighboring gene predicted gene 1976 Neighboring gene S-adenosylmethionine decarboxylase, pseudogene 7 Neighboring gene STARR-positive B cell enhancer mm9_chr17:95233967-95234268 Neighboring gene STARR-positive B cell enhancer mm9_chr17:95235804-95236104 Neighboring gene STARR-seq mESC enhancer starr_43645 Neighboring gene signal recognition particle 19 kDa protein pseudogene Neighboring gene predicted gene, 23102

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

General protein information

Preferred Names
uncharacterized protein LOC100044193

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001379445.1NP_001366374.1  uncharacterized protein LOC100044193

    Status: VALIDATED

    Source sequence(s)
    AC144860, AC225448
    Conserved Domains (3) summary
    COG5048
    Location:128534
    COG5048; FOG: Zn-finger [General function prediction only]
    sd00017
    Location:553573
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam01352
    Location:444
    KRAB; KRAB box

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000083.7 Reference GRCm39 C57BL/6J

    Range
    95172329..95187353
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

Suppressed Reference Sequence(s)

The following Reference Sequences have been suppressed. Explain

  1. NM_001024731.2: Suppressed sequence

    Description
    NM_001024731.2: This RefSeq was removed because currently there is insufficient support for the transcript and the protein. There is a tandem repeat within the coding sequence and this transcript differ in repeat number relative to the reference genome.