Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q5DTI6 (CB067_MOUSE)

Last modified July 22, 2008. Version 24. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Uncharacterized protein C2orf67 homolog
Gene names
Name: Kiaa4189
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMus

Protein attributes

Sequence length991 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

Ontologies

Keywords

   Coding sequence diversityAlternative splicing

Gene Ontology (GO)

None. [Check GOA]

Alternative products

This entry describes 5 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q5DTI6-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q5DTI6-2)

The sequence of this isoform differs from the canonical sequence as follows:
     412-417: GIVILE → VLLVFC
     418-991: Missing.
Notes: No experimental confirmation available.
Isoform 3 (identifier: Q5DTI6-3)

The sequence of this isoform differs from the canonical sequence as follows:
     677-720: Missing.
Notes: No experimental confirmation available.
Isoform 4 (identifier: Q5DTI6-4)

The sequence of this isoform differs from the canonical sequence as follows:
     365-368: NSST → YAKR
     369-991: Missing.
Notes: No experimental confirmation available.
Isoform 5 (identifier: Q5DTI6-5)

The sequence of this isoform differs from the canonical sequence as follows:
     412-446: Missing.
Notes: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical view

Molecule processing

Chain1 – 991991Uncharacterized protein C2orf67 homolog

Natural variations

Alternative sequence365 – 3684NSST → YAKR in isoform 4.
Alternative sequence369 – 991623Missing in isoform 4.
Alternative sequence412 – 44635Missing in isoform 5.
Alternative sequence412 – 4176GIVILE → VLLVFC in isoform 2.
Alternative sequence418 – 991574Missing in isoform 2.
Alternative sequence677 – 72044Missing in isoform 3.

Experimental info

Sequence conflict1621T → A in BAC38212. Ref.2
Sequence conflict2041S → T in BAC38212. Ref.2
Sequence conflict2191E → Q in BAC38212. Ref.2
Sequence conflict3491S → P in BAE42157. Ref.2
Sequence conflict3531V → L in BAC38212 and BAC38675. Ref.2
Sequence conflict8941A → P in BAC38675. Ref.2
Sequence conflict9471A → T in BAC38675. Ref.2

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified February 26, 2008. Version 2.
Checksum: D0A36B7662258A11

FASTA991112,486
        10         20         30         40         50         60 
MTPALKEATT KGICFSSLPN TMESDKMLCM ESPRTVDEKL KGGDTFSQML GFPTPEPTLN 

        70         80         90        100        110        120 
TNFVNLKHFA SPQASKHFQT VLLMSSNSTL NKYNENYNQK KVMESNCSKL KNVLCNGSSI 

       130        140        150        160        170        180 
QLSKICPSHS ENEFIKKELS DTTSQCMKDI QIVLDSNLTK DTNVDRLHLQ NCKWYQKNAL 

       190        200        210        220        230        240 
LDKFTDTKIK KGLLQCTQKK IGPSHSDVPT SSSAAEKEEE VNARLLHCVS KQKILLSQAR 

       250        260        270        280        290        300 
RTQKHLQMLL AKHVVKHYGQ QMKFSMKHQL PTMKIFHEPT TVLSNSLLEH TEIKPEVNIL 

       310        320        330        340        350        360 
ASENKFWDDT NNGFSQCTAA EIQRFALSAT GLLSHVEEGL DSDATDSSSD DEVDEYTIRK 

       370        380        390        400        410        420 
NVAVNSSTEW KWLVDRAQVG SRWTWLQAQI SELEYKIQQL TDIHRQIRAS KGIVILEECQ 

       430        440        450        460        470        480 
LPKDILKKQI QFSNQAVSLN TSVNSQVPQR SEEPLPEHDF EMSPSSPTLL LRNIEKQSAQ 

       490        500        510        520        530        540 
LTEIINSLIA PLNLSPTSSP LSSKSCSHKC LANGISRSAS ENLDELSSSS SWLLNQKHSK 

       550        560        570        580        590        600 
KRRKDRTRLK SPSLAIMSTA ARTRPLQSFH KRKLYRLSPT FYWTPETLPS KEAFLSSTQT 

       610        620        630        640        650        660 
PYTGSPFSWD NWEQSSRSHL LREQVSKLDS SFHPVLSLPS EIPLHLHFET LFKKTDMKGE 

       670        680        690        700        710        720 
LAENQFVGDC LISPPPVQGT SSLNQWRNGY SPICKPQIRS QPSVQLLQGR KKRHLSETAL 

       730        740        750        760        770        780 
AGERTRFEEF AFQRSEPGSH CNFTAVSNAN VTSRTQNPSS QNTSRRRLRS ESSYDIDNIV 

       790        800        810        820        830        840 
IPMSLVAPAK LEKLQYKEIL TPRWRKVVLQ PLDEHNLNKE EIEDLSDDVF SLRHRKYEER 

       850        860        870        880        890        900 
EQARWSLWEQ SKWHRRNNRA YSKNVEGQDL VLKEHSSELG SAQQGTAESP FELAAESHSL 

       910        920        930        940        950        960 
CAQDSLSLND GQEDKSLRWE RRAFPLKDED TAALLCQDER KDQTGGASTA FHDEVFCSTT 

       970        980        990 
PESGHPPKMQ LDGMEEYKSF GIGVTNVKRN R 

« Hide

Isoform 2 [UniParc].

Checksum: B77E85CCF3D9E0C0
Show »

41747,281
Isoform 3 [UniParc].

Checksum: 25257E3095C2EE5A
Show »

947107,512
Isoform 4 [UniParc].

Checksum: 11DFF4456E9ED2E6
Show »

36841,457
Isoform 5 [UniParc].

Checksum: AC118E0D59B65A9B
Show »

956108,617

References

[1]"Prediction of the coding sequences of mouse homologues of KIAA gene. The complete nucleotide sequences of mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
Okazaki N., Kikuno R.F., Ohara R., Inamoto S., Nagase T., Ohara O., Koga H.
Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
Tissue: Fetal brain.
[2]"The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. expand/collapse author list , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed: 16141072] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 4), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 311-991 (ISOFORM 5).
Strain: C57BL/6J and NOD.
Tissue: Head.

Cross-references

Sequence databases

AK220534 mRNA. Translation: BAD90312.1. Different initiation.
AK081413 mRNA. Translation: BAC38212.1.
AK082897 mRNA. Translation: BAC38675.1. Different initiation.
AK170981 mRNA. Translation: BAE42157.1.
RefSeqNP_001116210.1.
NP_808313.3.
UniGeneMm.383276
Mm.441295
Mm.461139

3D structure databases

ModBaseSearch...

Genome annotation databases

EnsemblENSMUSG00000026004. Mus musculus. [Contig view]
GeneID68691.
KEGGmmu:68691.

Organism-specific databases

MGIMGI:1915941. 1110028C15Rik.
RougeSearch...

Phylogenomic databases

HOGENOMQ5DTI6.
HOVERGENQ5DTI6.

Gene expression databases

CleanExMM_1110028C15RIK.

Family and domain databases

ProDomQ5DTI6.
[Graphical view] [Entries sharing at least one domain]
BLOCKSSearch...

Other Resources

SOURCESearch...
ProtoNetSearch...

Entry information

Entry nameCB067_MOUSE
AccessionPrimary (citable) accession number: Q5DTI6
Secondary accession number(s): Q3TC00, Q8BNM1, Q8C4R5
Entry history
Integrated into UniProtKB/Swiss-Prot: February 26, 2008
Last sequence update: February 26, 2008
Last modified: July 22, 2008
This is version 24 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

UniProtKB secondary accession numbers

Index of UniProtKB secondary accession numbers

Names and origin · Protein attributes · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents