Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q5FYB0 (ARSJ_HUMAN)

Last modified September 2, 2008. Version 35. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Arylsulfatase J
      Short name=ASJ
    EC=3.1.6.-
Gene names
Name: ARSJ
ORF Names: UNQ372/PRO708
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length599 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Cofactor

Binds 1 calcium ion per subunit By similarity.

Subcellular location

SecretedPotential.

Post-translational modification

The conversion to 3-oxoalanine (also known as C-formylglycine, FGly), of a serine or cysteine residue in prokaryotes and of a cysteine residue in eukaryotes, is critical for catalytic activity By similarity.

Sequence similarities

Belongs to the sulfatase family.

Sequence caution

The sequence AAQ89010.1 differs from that shown. Reason: Frameshift at position 425.

Ontologies

Keywords

   Cellular componentSecreted
   DomainSignal
   LigandCalcium
Metal-binding
   Molecular functionHydrolase
   PTMGlycoprotein

Gene Ontology (GO)

   Molecular functionarylsulfatase activity Ref.1

Traceable author statement. Source: HGNC

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical view

Molecule processing

Signal peptide1 – 4949 Potential
Chain50 – 599550Arylsulfatase J

Sites

Active site1781 By similarity
Metal binding841Calcium By similarity
Metal binding851Calcium By similarity
Metal binding1221Calcium; via 3-oxoalanine By similarity
Metal binding3271Calcium By similarity
Metal binding3281Calcium By similarity

Amino acid modifications

Modified residue12213-oxoalanine (Cys) By similarity
Glycosylation1571N-linked (GlcNAc...) Potential
Glycosylation3061N-linked (GlcNAc...) Potential
Glycosylation3181N-linked (GlcNAc...) Potential
Glycosylation4311N-linked (GlcNAc...) Potential
Glycosylation4971N-linked (GlcNAc...) Potential
Glycosylation5271N-linked (GlcNAc...) Potential

Experimental info

Sequence conflict1821Y → N in AAQ89010. Ref.2
Sequence conflict2631I → T in AAQ89010. Ref.2
Sequence conflict4951L → P in AAH89445. Ref.3
Sequence conflict5761S → K in AAH89445. Ref.3
Sequence conflict590 – 59910STCHSGVTCG → KPANLAR Ref.2

Sequences

Sequence LengthMass (Da)Tools
Q5FYB0-1 [UniParc].

Last modified March 1, 2005. Version 1.
Checksum: 1548898E95C43A7A

FASTA59967,235
        10         20         30         40         50         60 
MAPRGCAGHP PPPSPQACVC PGKMLAMGAL AGFWILCLLT YGYLSWGQAL EEEEEGALLA 

        70         80         90        100        110        120 
QAGEKLEPST TSTSQPHLIF ILADDQGFRD VGYHGSEIKT PTLDKLAAEG VKLENYYVQP 

       130        140        150        160        170        180 
ICTPSRSQFI TGKYQIHTGL QHSIIRPTQP NCLPLDNATL PQKLKEVGYS THMVGKWHLG 

       190        200        210        220        230        240 
FYRKECMPTR RGFDTFFGSL LGSGDYYTHY KCDSPGMCGY DLYENDNAAW DYDNGIYSTQ 

       250        260        270        280        290        300 
MYTQRVQQIL ASHNPTKPIF LYIAYQAVHS PLQAPGRYFE HYRSIININR RRYAAMLSCL 

       310        320        330        340        350        360 
DEAINNVTLA LKTYGFYNNS IIIYSSDNGG QPTAGGSNWP LRGSKGTYWE GGIRAVGFVH 

       370        380        390        400        410        420 
SPLLKNKGTV CKELVHITDW YPTLISLAEG QIDEDIQLDG YDIWETISEG LRSPRVDILH 

       430        440        450        460        470        480 
NIDPIYTKAK NGSWAAGYGI WNTAIQSAIR VQHWKLLTGN PGYSDWVPPQ SFSNLGPNRW 

       490        500        510        520        530        540 
HNERITLSTG KSVWLFNITA DPYERVDLSN RYPGIVKKLL RRLSQFNKTA VPVRYPPKDP 

       550        560        570        580        590 
RSNPRLNGGV WGPWYKEETK KKKPSKNQAE KKQKKSKKKK KKQQKAVSGS TCHSGVTCG 

« Hide

References

« Hide 'large scale' references
[1]"Sulfatases and sulfatase modifying factors: an exclusive and promiscuous relationship."
Sardiello M., Annunziata I., Roma G., Ballabio A.
Hum. Mol. Genet. 14:3203-3217(2005) [PubMed: 16174644] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2]"The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment."
Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J., Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P., Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E. expand/collapse author list , Heldens S., Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., Lewis L., Liao D., Mark M.R., Robbie E., Sanchez C., Schoenfeld J., Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A., Vandlen R.L., Watanabe C., Wieand D., Woods K., Xie M.-H., Yansura D.G., Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A.D., Wood W.I., Godowski P.J., Gray A.M.
Genome Res. 13:2265-2270(2003) [PubMed: 12975309] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-578.
Tissue: Chondrosarcoma.

Cross-references

Sequence databases

AY875938 mRNA. Translation: AAW66666.1.
AY358647 mRNA. Translation: AAQ89010.1. Frameshift.
BC089445 mRNA. Translation: AAH89445.1.
RefSeqNP_078866.3.
UniGeneHs.22895

3D structure databases

ModBaseSearch...

Genome annotation databases

EnsemblENSG00000180801. Homo sapiens. [Contig view]
GeneID79642.
KEGGhsa:79642.

Organism-specific databases

HGNCHGNC:26286. ARSJ.
MIM610010. gene.
GenAtlasSearch...
GeneCardsSearch...

Phylogenomic databases

HOVERGENQ5FYB0.

Gene expression databases

ArrayExpressQ5FYB0.
CleanExHS_ARSJ.

Family and domain databases

InterProIPR017849. Alkaline_Pase-like_a/b/a.
IPR000917. Sulphatase.
[Graphical view]
Gene3DG3DSA:3.40.720.10. Alk_phosphtse. 1 hit.
PfamPF00884. Sulfatase. 1 hit.
[Graphical view]
PROSITEPS00523. SULFATASE_1. 1 hit.
PS00149. SULFATASE_2. 1 hit.
[Graphical view]
ProDomQ5FYB0.
[Graphical view] [Entries sharing at least one domain]
BLOCKSSearch...

Other Resources

SOURCESearch...
ProtoNetSearch...

Entry information

Entry nameARSJ_HUMAN
AccessionPrimary (citable) accession number: Q5FYB0
Secondary accession number(s): Q5FWE4, Q6UWT9
Entry history
Integrated into UniProtKB/Swiss-Prot: October 11, 2005
Last sequence update: March 1, 2005
Last modified: September 2, 2008
This is version 35 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

Human chromosome 4

Human chromosome 4: entries, gene names and cross-references to MIM

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

UniProtKB secondary accession numbers

Index of UniProtKB secondary accession numbers

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents