Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot P00790 (PEPA_HUMAN)

Last modified November 25, 2008. Version 110. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (7) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Pepsin A
    EC=3.4.23.1
Gene names
Name: PGA3
AND
Name: PGA4
AND
Name: PGA5
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length388 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Shows particularly broad specificity; although bonds involving phenylalanine and leucine are preferred, many others are also cleaved to some extent.

Catalytic activity

Preferential cleavage: hydrophobic, preferably aromatic, residues in P1 and P1' positions. Cleaves 1-Phe-|-Val-2, 4-Gln-|-His-5, 13-Glu-|-Ala-14, 14-Ala-|-Leu-15, 15-Leu-|-Tyr-16, 16-Tyr-|-Leu-17, 23-Gly-|-Phe-24, 24-Phe-|-Phe-25 and 25-Phe-|-Tyr-26 bonds in the B chain of insulin.

Subcellular location

Secreted.

Sequence similarities

Belongs to the peptidase A1 family.

Ontologies

Keywords

   Biological processDigestion
   Cellular componentSecreted
   Coding sequence diversityPolymorphism
   DomainSignal
   Molecular functionAspartyl protease
Hydrolase
Protease
   PTMPhosphoprotein
Zymogen
   Technical term3D-structure
Direct protein sequencing

Gene Ontology (GO)

   Biological processdigestion

Non-traceable author statement. Source: UniProtKB

proteolysis

Non-traceable author statement. Source: UniProtKB

   Cellular componentextracellular region

Inferred from electronic annotation. Source: UniProtKB-KW

   Molecular functionaspartic-type endopeptidase activity Ref.1

Traceable author statement. Source: UniProtKB

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1515
Propeptide16 – 6247Activation peptide
PRO_0000026013
Chain63 – 388326Pepsin A
PRO_0000026014

Sites

Active site941
Active site2771

Amino acid modifications

Modified residue1301Phosphoserine Potential
Disulfide bond107 ↔ 112
Disulfide bond268 ↔ 272
Disulfide bond311 ↔ 344

Natural variations

Natural variant281L → F in isozyme 5.
VAR_006481
Natural variant581E → K in isozyme 3A, isozyme 4 and isozyme 5.
VAR_006482
Natural variant921V → L in isozyme 4 and isozyme 5.
VAR_006483
Natural variant2221Q → K
VAR_006484
Natural variant2651A → T
VAR_006485
Natural variant3531L → V: dbSNP rs17595.
VAR_006486
Natural variant3761D → E
VAR_006487

Experimental info

Sequence conflict2911A → T in BAF84553. Ref.3
Sequence conflict371 – 3722YF → FY AA sequence Ref.8

Secondary structure

........................................................................ 388
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P00790-1 [UniParc].

Last modified July 21, 1986. Version 1.
Checksum: C9CB89BA08F4D78B

FASTA38841,977
        10         20         30         40         50         60 
MKWLLLLGLV ALSECIMYKV PLIRKKSLRR TLSERGLLKD FLKKHNLNPA RKYFPQWEAP 

        70         80         90        100        110        120 
TLVDEQPLEN YLDMEYFGTI GIGTPAQDFT VVFDTGSSNL WVPSVYCSSL ACTNHNRFNP 

       130        140        150        160        170        180 
EDSSTYQSTS ETVSITYGTG SMTGILGYDT VQVGGISDTN QIFGLSETEP GSFLYYAPFD 

       190        200        210        220        230        240 
GILGLAYPSI SSSGATPVFD NIWNQGLVSQ DLFSVYLSAD DQSGSVVIFG GIDSSYYTGS 

       250        260        270        280        290        300 
LNWVPVTVEG YWQITVDSIT MNGEAIACAE GCQAIVDTGT SLLTGPTSPI ANIQSDIGAS 

       310        320        330        340        350        360 
ENSDGDMVVS CSAISSLPDI VFTINGVQYP VPPSAYILQS EGSCISGFQG MNLPTESGEL 

       370        380 
WILGDVFIRQ YFTVFDRANN QVGLAPVA 

« Hide

References

« Hide 'large scale' references
[1]"Primary structure of human pepsinogen gene."
Sogawa K., Fujii-Kuriyama Y., Mizukami Y., Ichihara Y., Takahashi K.
J. Biol. Chem. 258:5306-5311(1983) [PubMed: 6300126] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[2]"Nucleotide sequence comparison of five human pepsinogen A (PGA) genes: evolution of the PGA multigene family."
Evers M.P.J., Zelle B., Bebelman J.-P., van Beusechem V., Kraakman L., Hoffer M.J.V., Pronk J.C., Mager W.H., Planta R.J., Eriksson A.W., Frants R.R.
Genomics 4:232-239(1989) [PubMed: 2714789] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOZYME 5).
Tissue: Placenta.
[3]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Skeletal muscle.
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANTS LYS-222; THR-265 AND VAL-353.
Tissue: Colon and Kidney.
[5]"Isolation of human, swine, and rat prepepsinogens and calf preprochymosin, and determination of the primary structures of their NH2-terminal signal sequences."
Ichihara Y., Sogawa K., Takahashi K.
J. Biochem. 98:483-492(1985) [PubMed: 2415509] [Abstract]
Cited for: PARTIAL PROTEIN SEQUENCE OF 1-28.
[6]"A comparative study on the NH2-terminal amino acid sequences and some other properties of six isozymic forms of human pepsinogens and pepsins."
Athauda S.B.P., Tanji M., Kageyama T., Takahashi K.
J. Biochem. 106:920-927(1989) [PubMed: 2515193] [Abstract]
Cited for: PROTEIN SEQUENCE OF 16-100 (ISOZYMES 2; 3; 3A; 4 AND 5).
[7]"Activation of human pepsinogens."
Foltmann B.
FEBS Lett. 241:69-72(1988) [PubMed: 3197840] [Abstract]
Cited for: PROTEIN SEQUENCE OF 16-68 (ISOZYMES 3 AND 5).
[8]"Carboxyl-terminal sequence of human gastricsin and pepsin."
Huang W.-Y., Tang J.
J. Biol. Chem. 245:2189-2193(1970) [PubMed: 4909888] [Abstract]
Cited for: PROTEIN SEQUENCE OF 362-388.
[9]"Crystal structure of human pepsin and its complex with pepstatin."
Fujinaga M., Chernaia M.M., Tarasova N.I., Mosimann S.C., James M.N.G.
Protein Sci. 4:960-972(1995) [PubMed: 7663352] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.2 ANGSTROMS) OF 63-388.
+Additional computationally mapped references.

Cross-references

Sequence databases

J00287 expand/collapse EMBL AC list , J00279, J00280, J00281, J00282, J00283, J00284, J00285, J00286 Genomic DNA. Translation: AAA98529.1.
M26032 expand/collapse EMBL AC list , M26025, M26026, M26027, M26028, M26029, M26030, M26031 Genomic