Reviewed, UniProtKB/Swiss-Prot Q5NCX5 (K1787_MOUSE)

Last modified July 22, 2008. Version 20.

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein names
    Recommended name: NHR domain-containing protein KIAA1787
Gene names
    Name: Kiaa1787
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea; Muridae; Murinae; Mus

Protein attributes

Sequence length: 1563 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is not processed.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Sequence similarities

Contains 6 NHR (neuralized homology repeat) domains.

Sequence caution

The sequence CAI35150.1 differs from that shown. Reason: Erroneous gene model prediction.

Ontologies

Keywords

   Coding sequence diversity: Alternative splicing
   Domain: Repeat

Gene Ontology (GO)

None.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q5NCX5-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q5NCX5-2)

The sequence of this isoform differs from the canonical sequence as follows:
     245-266: Missing.
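
As a minimal sketch of how the "Missing" range above could be applied, the following Python removes the 1-based inclusive positions 245-266 from a canonical sequence. The function name `apply_missing` and the toy sequence are illustrative assumptions, not part of the entry; only the range and the 1563-residue length come from it.

```python
def apply_missing(canonical: str, start: int, end: int) -> str:
    """Return the sequence with 1-based inclusive positions start..end removed."""
    if not (1 <= start <= end <= len(canonical)):
        raise ValueError("range must lie within the sequence")
    # Convert 1-based inclusive coordinates to Python's 0-based slices.
    return canonical[: start - 1] + canonical[end:]


# Toy stand-in for the canonical sequence (hypothetical content, real length).
toy = "A" * 244 + "X" * 22 + "C" * (1563 - 266)
isoform2_like = apply_missing(toy, 245, 266)
print(len(toy), len(isoform2_like))
```

Removing 22 residues from a 1563-residue sequence leaves 1541 residues, which matches the length listed for isoform 2 in the Sequences section.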

Sequence annotation (Features)

Feature key           Position(s)   Length  Description

Molecule processing

Chain                 1 – 1563      1563    NHR domain-containing protein KIAA1787

Regions

Domain                43 – 209      167     NHR 1
Domain                319 – 486     168     NHR 2
Domain                522 – 688     167     NHR 3
Domain                718 – 886     169     NHR 4
Domain                915 – 1087    173     NHR 5
Domain                1132 – 1295   164     NHR 6

Natural variations

Alternative sequence  245 – 266     22      Missing in isoform 2.

Experimental info

Sequence conflict     1247          1       L → M in BAE33965. Ref.2
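
The lengths in the feature table follow from the inclusive positions (end minus start plus one). A small Python check of that relation is sketched below; the tuple layout and the helper name `feature_length` are assumptions made for illustration, while the numbers are transcribed from the table.

```python
# (feature, start, end, stated length), transcribed from the feature table.
features = [
    ("Chain", 1, 1563, 1563),
    ("Domain NHR 1", 43, 209, 167),
    ("Domain NHR 2", 319, 486, 168),
    ("Domain NHR 3", 522, 688, 167),
    ("Domain NHR 4", 718, 886, 169),
    ("Domain NHR 5", 915, 1087, 173),
    ("Domain NHR 6", 1132, 1295, 164),
    ("Alternative sequence", 245, 266, 22),
    ("Sequence conflict", 1247, 1247, 1),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


# Collect any rows whose stated length disagrees with the coordinates.
mismatches = [
    name for name, start, end, stated in features
    if feature_length(start, end) != stated
]
print(mismatches)
```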

Sequences

Isoform 1 [UniParc].

Last modified February 1, 2005. Version 1.
Checksum: 65F86EC95429B6C0

Length: 1,563 AA. Mass: 167,637 Da.
        10         20         30         40         50         60 
MAAGSGGSGG SGAGPGPGPG PGGGGGPGSS GPGLGSGGGL GGGGELHPRT GRLVSLSACG 

        70         80         90        100        110        120 
RTARRQQPGQ EFNHGLVLSR EPLRDGRVFT VRIDRKVNSW SGSIEIGVTA LDPSVLDFPS 

       130        140        150        160        170        180 
SATGLKGGSW VVSGCSVLRD GRSVLEEYGQ DLDQLVEGDR VGVERTATGE LRLWVNGRDC 

       190        200        210        220        230        240 
GVAATGLPAR VWAVVDLYGK CTQITVLPSE PGFSPPTPVP TPPLEPLAPP EDSALLEQGT 

       250        260        270        280        290        300 
SVDEAFMVSP AQARPETFPN SLDSHNDFAS MELSEVVSNA ILSAYNGGLL NVSLSSPPAG 

       310        320        330        340        350        360 
DGLASSGPAT SPILTSNDAL LFHEKCGTLI KLSNNNKTAE RRRPLDEFNN GVVMTNRPLR 

       370        380        390        400        410        420 
DNEMFEIRID KLVDKWSGSI EIGVTTHNPN SLEYPATMTN LQSGTIMMSG CGILTNGKGT 

       430        440        450        460        470        480 
RREYCEFSLD ELQEGDHIGL TRKSNSALHF FINGIDQGVA TPLTPPVVYG VVDLYGMAVK 

       490        500        510        520        530        540 
VTIVHNNNHS DRLRRNNAIL RALSPEGALR RAAPAAQAEP ERLLFHPNCG QKAAITHEGR 

       550        560        570        580        590        600 
TALRPHATDD FNHGVVLSSR ALRDGEVFQV RIDKMVDKWA GSIEIGVTTH NPAYLQLPST 

       610        620        630        640        650        660 
MTNLRSGTWM MTGNGVMHNG TTILDEYGHN LDRLKAGDTV GVVRREDGTL HFFVNGMTQG 

       670        680        690        700        710        720 
PAAWNVPPGV YAVVDLYGQA AQATIVDDVE VPPVSEPLPE GNNQMSPSSP SSAAGGSDLR 

       730        740        750        760        770        780 
FHQLHGSNAV ITNGGRTALR HNCRSEFNDA IVISNRALRD GELFEIVIQK MVDRWSGSIE 

       790        800        810        820        830        840 
AGVTAIRPED LEFPNTMTDI DYDTWMLSGT AIMQDGNTMR NNYGCDLDAL GTGARIGMMR 

       850        860        870        880        890        900 
TAKGDLHYFI NGQDQGAACS GLPPGKEVYA VVDLYGQCVQ VSITNATGPM DNSLATSNTA 

       910        920        930        940        950        960 
TEKSFPLHSP VAGVAHRFHS MCGKNVTLEE DGTRAVRVAG YAHGLVFSTK ELKAEEVFEV 

       970        980        990       1000       1010       1020 
KVEELDEKWA GSLRLGLTTL APEDMGPGAG SGPGLPPSLP ELRTKTTWMV SSCEVRRDGH 

      1030       1040       1050       1060       1070       1080 
LQRMNYGRNL ERLGVGSRVG IRRCADDTMH ILVDGEDMGP AAAGIAKNVW AVLDLYGPVR 

      1090       1100       1110       1120       1130       1140 
SVAIVSSTRL EEPEGTQPPS PSSDTGSEVE EDDEVEEQGL RGQNQVGIVP TALEFLENHG 

      1150       1160       1170       1180       1190       1200 
KNILLSNGNR TATRVASYNQ GIVVISQPLV PHMLVQVRID FLNRQWTSSL VLGVITCPPE 

      1210       1220       1230       1240       1250       1260 
RLNFPASACA LKRAAWLLRG RGVFHNGLKI CEKFGPNLDT CPEGTILGLR LDSSGGLHLH 

      1270       1280       1290       1300       1310       1320 
INGVDQGVAV PDVPQPCHAL VDLYGQCEQV TIVSPDPGTA SGKIAGTQGD MEKADMVDGI 

      1330       1340       1350       1360       1370       1380 
KESVCWGPPP AASPLKSCEY HALCSRFQEL LLLPEDYFMP PPKRSLCYCE SCRKLRGDEA 

      1390       1400       1410       1420       1430       1440 
HRRRGEPPRE YALPFGWCRF NLRVNPHLEA GTLTKKWHMA YHGSSVAVVR RVLDRGELGA 

      1450       1460       1470       1480       1490       1500 
GTTSILSCRP LKGEPGVGFE EPGENCAPPR EEQPPPVLLS PSLQYAGAEM LASKVQFRDP 

      1510       1520       1530       1540       1550       1560 
KSQRTHQAQV AFQVCVRPGS YTPGPPSAAL RELPDQHFSP SELEWVTKEK GATLLYALLV 


RVE 
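
The display above interleaves position rulers with residue rows in groups of ten. As a hedged sketch, the following Python joins such a block into one plain sequence string; the function name `parse_sequence_block` is an assumption, and the sample is only the first two rows of the isoform 1 display.

```python
def parse_sequence_block(block: str) -> str:
    """Join residue rows into one string, skipping ruler and blank lines."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers; blank lines have no tokens.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the isoform 1 display, copied from the entry.
block = """
        10         20         30         40         50         60
MAAGSGGSGG SGAGPGPGPG PGGGGGPGSS GPGLGSGGGL GGGGELHPRT GRLVSLSACG

        70         80         90        100        110        120
RTARRQQPGQ EFNHGLVLSR EPLRDGRVFT VRIDRKVNSW SGSIEIGVTA LDPSVLDFPS
"""
sequence = parse_sequence_block(block)
print(len(sequence))
```

Applied to the full display, the same function should yield a string whose length can be compared against the 1,563 AA stated for isoform 1.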

Isoform 2 [UniParc].

Checksum: 08AA3BAF649E8C72

Length: 1,541 AA. Mass: 165,239 Da.

References

[1] The mouse genome sequencing consortium
Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[2] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed: 16141072]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 19-1563 (ISOFORM 1).
Strain: NOD.
Tissue: Spleen.
[3] "Prediction of the coding sequences of mouse homologues of KIAA gene: IV. The complete nucleotide sequences of 500 mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S., Saga Y., Seino S., Nishimura M., Kaisho T., Hoshino K., Kitamura H., Nagase T., Ohara O., Koga H.
DNA Res. 11:205-218(2004) [PubMed: 15368895]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 142-1563 (ISOFORM 2).
Tissue: Embryo.
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 936-1563.
Tissue: Eye.

Cross-references

Sequence databases

AL596185 Genomic DNA. Translation: CAI35149.1.
AL596185 Genomic DNA. Translation: CAI35150.1. Sequence problems.
AK157107 mRNA. Translation: BAE33965.1.
AK173264 mRNA. Translation: BAD32542.1.
BC023037 mRNA. Translation: AAH23037.1.
RefSeq: NP_001013432.1.
UniGene: Mm.268347.

Genome annotation databases

Ensembl: ENSMUSG00000047284. Mus musculus.
GeneID: 216860.
KEGG: mmu:216860.

Organism-specific databases

MGI: MGI:1921092. 0610025P10Rik.

Phylogenomic databases

HOGENOM: Q5NCX5.
HOVERGEN: Q5NCX5.

Gene expression databases

ArrayExpress: Q5NCX5.
CleanEx: MM_0610025P10RIK.

Family and domain databases

InterPro: IPR006573. Neu_Z.
Pfam: PF07177. Neuralized. 6 hits.
SMART: SM00588. NEUZ. 6 hits.
PROSITE: PS51065. NHR. 6 hits.
ProDom: Q5NCX5.

Entry information

Entry name: K1787_MOUSE
Accession
    Primary (citable) accession number: Q5NCX5
    Secondary accession number(s): Q3U090, Q5NCX4, Q69ZA2, Q8R1V5
Entry history
    Integrated into UniProtKB/Swiss-Prot: August 21, 2007
    Last sequence update: February 1, 2005
    Last modified: July 22, 2008
    This is version 20 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

UniProtKB secondary accession numbers

Index of UniProtKB secondary accession numbers

SIMILARITY comments

Index of protein domains and families
