CCDS Home FTP Process Releases & Statistics
Collaborators EBI HGNC MGI NCBI
Contact Us email CCDS
Genome Displays Related Resources Gene HomoloGene MANE RefSeq
|
|
Report for CCDS23871.2 (current version)
CCDS |
Status |
Species |
Chrom. |
Gene |
CCDS Release |
NCBI Annotation Release |
Ensembl Annotation Release |
Links |
23871.2 |
Reviewed, update pending |
Mus musculus |
10 |
Chst3 |
23 |
108 |
98 |
|
Public Note for CCDS 23871.1 |
The coding region has been updated to extend the N-terminus to one that is more supported by the available transcript data. This N-terminal extension is only conserved in a few species, including mouse, rat, rhesus and marmoset. The updated start codon has a weak Kozak signal, as does the much better conserved downstream start codon. It is possible that leaky scanning by ribosomes may allow the downstream start codon to be used at least some of the time, which would result in a 6 aa length difference at the N-terminus. There is no experimental evidence indicating which start codon is preferentially used in vivo. |
Public since: CCDS release 2, NCBI annotation release 36.1, Ensembl annotation release 39
Review status: Reviewed (by CCDS collaboration)
Attributes |
CDS uses downstream AUG |
Sequence IDs included in CCDS 23871.2
Original |
Current |
Source |
Nucleotide ID |
Protein ID |
Status in CCDS |
Seq. Status |
Links |
---|
|
|
EBI |
ENSMUST00000135158.8 |
ENSMUSP00000126281.1 |
Accepted |
alive |
|
|
|
NCBI |
NM_016803.3 |
NP_058083.2 |
Updated |
not alive |
|
|
|
NCBI |
NM_016803.4 |
NP_058083.3 |
Pending |
alive |
|
RefSeq |
Length |
Related UniProtKB/SwissProt |
Length |
Identity |
Gaps |
Mismatches |
NP_058083.3 |
472 |
O88199 |
472 |
100% |
0 |
0 |
Chromosomal Locations for CCDS 23871.2
Assembly GRCm38.p6 (GCF_000001635.26)
CCDS Sequence Data |
---|
Blue highlighting indicates alternating exons. | Red highlighting indicates amino acids encoded across a splice junction. | | Mouse over the nucleotide or protein sequence below and click on the highlighted codon or residue to select the pair. |
Nucleotide Sequence (1437 nt): ATGGCGCCCCCTCTCCCCATGGAGAAAGGACTCGCTTTGCCTCAGGATTTCCGGGACCTTGTACACAGCC TAAAGATTCGAGGCAGATACGTCTTGTTCCTGGCATTTGTGGTCATAGTTTTTATCTTCATTGAAAAGGA AAATAAAATCATATCCAGGGTCTCCGACAAGCTGAAGCAGATCCCTCATTTTGTGGCAGATGCCAACAGC ACTGACCCAGCCCTGCTCTTATCGGAGAATGCATCTCTCTTGTCCCTGAGCGAGTTGGATTCCACCTTTT CCCATCTGCGGAGCCGCCTGCACAACCTGAGCCTGCAGCTGGGCGTGGAGCCAGCAATGGAGAGCCAGGA GGCTGGGGCAGAGAAGCCATCCCAGCAGGCTGGAGCAGGGACCCGGCGCCACGTGCTTCTCATGGCCACC ACCCGCACGGGTTCCTCGTTCGTGGGCGAGTTCTTCAACCAGCAGGGCAATATCTTCTACCTCTTCGAGC CACTGTGGCACATCGAGCGCACCGTGTTCTTCCAGCAGCGAGGCGCCAGCGCGGCTGGTTCAGCCTTGGT CTACCGTGATGTCCTCAAGCAGTTGTTGCTATGCGACCTGTATGTGCTGGAGCCCTTCATCAGCCCTCCG CCCGAGGACCACTTGACTCAGTTCCTGTTCCGCCGGGGATCCAGCCGTTCACTCTGCGAGGATCCGGTGT GCACACCCTTCGTCAAGAAGGTCTTTGAGAAGTACCACTGCAGGAACCGTCGCTGCGGGCCACTCAACGT GACCTTGGCGGGCGAGGCCTGCCGCCGCAAGGACCACGTGGCCCTCAAGGCTGTGCGCATCCGTCAGCTG GAGTTCCTGCAGCCGCTAGTTGAGGACCCGAGGTTGGATCTACGAGTCATTCAGCTGGTGCGCGACCCCC GGGCCGTGCTGGCTTCACGCATAGTGGCCTTTGCGGGCAAGTATGAGAACTGGAAGAAGTGGCTGTCCGA GGGGCAGGACCAGCTGAGCGAGGATGAGGTGCAGCGATTGCGGGGCAACTGTGAGAGCATCCGCCTGTCT GCAGAGCTGGGCTTGCGGCAGCCAGCCTGGCTGCGCGGTCGTTACATGCTGGTGCGCTATGAGGATGTGG CACGCAGGCCACTGCAGAAGGCCCGAGAGATGTACAGCTTTGCGGGCATCCCCTTGACCCCGCAGGTGGA GGACTGGATCCAGAAGAACACGCAGGCGACACGCGACAGCAGCGATGTCTACTCCACTCAGAAAAACTCT TCTGAGCAGTTTGAGAAGTGGCGCTTCAGCATGCCTTTCAAGCTGGCACAGGTGGTACAGGCTGCCTGTG GCCCGACCATGCACCTCTTTGGCTACAAGTTGGCCAGGGATGCCGCCTCACTCACCAACCGCTCCATCAG CCTGCTGGAGGAGCGGGGCACCTTCTGGGTCACGTAG
Translation (478 aa): MAPPLPMEKGLALPQDFRDLVHSLKIRGRYVLFLAFVVIVFIFIEKENKIISRVSDKLKQIPHFVADANS TDPALLLSENASLLSLSELDSTFSHLRSRLHNLSLQLGVEPAMESQEAGAEKPSQQAGAGTRRHVLLMAT TRTGSSFVGEFFNQQGNIFYLFEPLWHIERTVFFQQRGASAAGSALVYRDVLKQLLLCDLYVLEPFISPP PEDHLTQFLFRRGSSRSLCEDPVCTPFVKKVFEKYHCRNRRCGPLNVTLAGEACRRKDHVALKAVRIRQL EFLQPLVEDPRLDLRVIQLVRDPRAVLASRIVAFAGKYENWKKWLSEGQDQLSEDEVQRLRGNCESIRLS AELGLRQPAWLRGRYMLVRYEDVARRPLQKAREMYSFAGIPLTPQVEDWIQKNTQATRDSSDVYSTQKNS SEQFEKWRFSMPFKLAQVVQAACGPTMHLFGYKLARDAASLTNRSISLLEERGTFWVT
|