Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q3U6N9 (CH033_MOUSE)

Last modified July 22, 2008. Version 21. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    UPF0488 protein C8orf33 homolog
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMus

Protein attributes

Sequence length222 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Sequence similarities

Belongs to the UPF0488 family.

Ontologies

Keywords

   Coding sequence diversityAlternative splicing

Gene Ontology (GO)

None. [Check GOA]

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q3U6N9-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q3U6N9-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-62: Missing.
     129-129: Missing.
Notes: No experimental confirmation available.
Isoform 3 (identifier: Q3U6N9-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-62: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical view

Molecule processing

Chain1 – 222222UPF0488 protein C8orf33 homolog

Regions

Compositional bias64 – 696Poly-Lys

Natural variations

Alternative sequence1 – 6262Missing in isoform 2 and isoform 3.
Alternative sequence1291Missing in isoform 2.

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified October 11, 2005. Version 1.
Checksum: 53D7B2DE0876A60C

FASTA22224,465
        10         20         30         40         50         60 
MAEPGRPARE APAASSRKTH RAPRRPRPSR SASGASEPPL RSSVQPACDS AAGTHPVGNT 

        70         80         90        100        110        120 
VAMKQKKKKT PNRVSGTNGS EKPSEKPAPD EAPPSAEAQA EQLARELAWC VEQLELGLKT 

       130        140        150        160        170        180 
QRPTPKQKEQ AVGAIRTLRS EKTPLPRKRQ LMRSLFGDYR AQMDAEWREA LRALKAATHS 

       190        200        210        220 
AQVQLVSEAT RKKSGRVCRP RPAERAKTTP DLTSEEFRFN FF 

« Hide

Isoform 2 [UniParc].

Checksum: C914FAABC27D338A
Show »

15917,999
Isoform 3 [UniParc].

Checksum: 67B00E6D6E5DB8C4
Show »

16018,128

References

[1]"The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. expand/collapse author list , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed: 16141072] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1; 2 AND 3).
Strain: C57BL/6J.
Tissue: Bone marrow, Embryo and Head.
[2]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
Strain: Czech II.
Tissue: Mammary tumor.

Cross-references

Sequence databases

AK004154 mRNA. Translation: BAB23195.1.
AK141496 mRNA. Translation: BAE24703.1.
AK153059 mRNA. Translation: BAE31685.1.
AK160695 mRNA. Translation: BAE35959.1.
BC038324 mRNA. Translation: AAH38324.1.
BC091734 mRNA. Translation: AAH91734.1.
RefSeqNP_473440.1.
UniGeneMm.322968
Mm.440229

3D structure databases

ModBaseSearch...

Genome annotation databases

EnsemblENSMUSG00000063236. Mus musculus. [Contig view]
GeneID117171.
KEGGmmu:117171.

Organism-specific databases

MGIMGI:2152337. 1110038F14Rik.

Phylogenomic databases

HOGENOMQ3U6N9.
HOVERGENQ3U6N9.

Gene expression databases

ArrayExpressQ3U6N9.
CleanExMM_1110038F14RIK.

Family and domain databases

ProDomQ3U6N9.
[Graphical view] [Entries sharing at least one domain]
BLOCKSSearch...

Other Resources

ProtoNetSearch...

Entry information

Entry nameCH033_MOUSE
AccessionPrimary (citable) accession number: Q3U6N9
Secondary accession number(s): Q3URI5, Q9D0Z5
Entry history
Integrated into UniProtKB/Swiss-Prot: October 2, 2007
Last sequence update: October 11, 2005
Last modified: July 22, 2008
This is version 21 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

UniProtKB secondary accession numbers

Index of UniProtKB secondary accession numbers

SIMILARITY comments

Index of protein domains and families

Uncharacterized protein families (UPF)

List of uncharacterized protein family (UPF) entries

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents