Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot P03315 (POLS_SFV)

Last modified September 2, 2008. Version 94. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Structural polyprotein
Alternative name(s):
    p130
Cleaved into the following 6 chains:
    1- Recommended name:
            Capsid protein
              EC=3.4.21.-
        Alternative name(s):
            Coat protein
              Short name=C
    2- Recommended name:
            p62
        Alternative name(s):
            E3/E2
    3- Recommended name:
            E3 protein
        Alternative name(s):
            Spike glycoprotein E3
    4- Recommended name:
            E2 envelope glycoprotein
        Alternative name(s):
            Spike glycoprotein E2
    5- Recommended name:
            6K protein
    6- Recommended name:
            E1 envelope glycoprotein
        Alternative name(s):
            Spike glycoprotein E1
OrganismSemliki forest virus (SFV)
Taxonomic identifier11033 [NCBI]
Taxonomic lineageVirusesssRNA positive-strand viruses, no DNA stageTogaviridaeAlphavirusSFV complex
Virus hostAedes [TaxID: 7158]
Culex tritaeniorhynchus [TaxID: 7178]
Atelerix albiventris (Middle-African hedgehog) [TaxID: 9368]
Homo sapiens (Human) [TaxID: 9606]
Rhipicephalus [TaxID: 34630]
Quelea [TaxID: 158617]
Halcyon [TaxID: 170865]

Protein attributes

Sequence length1253 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Capsid protein possesses a protease activity that results in its autocatalytic cleavage from the nascent structural protein. Following its self-cleavage, the capsid protein transiently associates with ribosomes, and within several minutes the protein binds to viral RNA and rapidly assembles into icosaedric core particles. The resulting nucleocapsid eventually associates with the cytoplasmic domain of E2 at the cell membrane, leading to budding and formation of mature virions. New virions attach to target cells, and after endocytosis their membrane fuses with the target cell membrane. This leads to the release of the nucleocapsid into the cytoplasm, followed by an uncoating event necessary for the genomic RNA to become accessible. The uncoating might be triggered by the interaction of capsid proteins with ribosomes. Binding of ribosomes would release the genomic RNA since the same region is genomic RNA-binding and ribosome-binding.

E3 protein's function is unknown.

E2 is responsible for viral attachment to target host cell, by binding to the cell receptor. Synthesized as a p62 precursor which is processed by furin at the cell membrane just before virion budding, giving rise to E2-E1 heterodimer. The p62-E1 heterodimer is stable, whereas E2-E1 is unstable and dissociate at low pH. p62 is processed at the last step, presumably to avoid E1 fusion activation before its final export to cell surface. E2 C-terminus contains a transitory transmembrane that would be disrupted by palmitoylation, resulting in reorientation of the C-terminal tail from lumenal to cytoplasmic side. This step is critical since E2 C-terminus is involved in budding by interacting with capsid proteins. This release of E2 C-terminus in cytoplasm occurs lately in protein export, and precludes premature assembly of particles at the endoplasmic reticulum membrane.

6K is a constitutive membrane protein involved in virus glycoprotein processing, membrane permeabilization, and the budding of viral particles. Present in low amount in virions, about 3% compared to viral glycoproteins. Because of its lipophilic properties, the 6K protein is postulated to influence the selection of lipids that interact with the transmembrane domains of the glycoproteins, which, in turn, affects the deformability of the bilayer required for the extreme curvature that occurs as budding proceeds.

E1 is a class II viral fusion protein. Fusion activity is inactive as long as E1 is bound to E2 in mature virion. After virus attachment to target cell and endocytosis, acidification of the endosome would induce dissociation of E1/E2 heterodimer and concomitant trimerization of the E1 subunits. This E1 trimer is fusion active, and promotes release of viral nucleocapsid in cytoplasm after cell and viral membrane fusion. Efficient fusion requires the presence of cholesterol and sphingolipid in the target membrane. Fusion is optimal at levels of about 1 molecule of cholesterol per 2 molecules of phospholipids, and is specific for sterols containing a 3-beta-hydroxyl group.

Subunit structure

p62 and E1 form a heterodimer shortly after synthesis. Processing of p62 into E2 and E3 results in a heterodimer of E2 and E1. Spike at virion surface are constituted of three E2-E1 heterodimers. After target cell attachment and endocytosis, E1 change conformation to form homotrimers.

Subcellular location

Capsid protein: VirionBy similarity. CytoplasmBy similarity.

p62: Virion membrane; Single-pass type I membrane proteinBy similarity. Cell membrane; Single-pass type I membrane proteinBy similarity.

E2 envelope glycoprotein: Virion membrane; Single-pass type I membrane proteinBy similarity. Cell membrane; Single-pass type I membrane proteinBy similarity.

E1 envelope glycoprotein: Virion membrane; Single-pass type I membrane proteinBy similarity. Cell membrane; Single-pass type I membrane proteinBy similarity.

6K protein: Cell membrane; Multi-pass membrane proteinBy similarity. Virion membrane; Multi-pass membrane proteinBy similarity.

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins. Capsid protein is auto-cleaved during polyprotein translation, unmasking p62 signal peptide. The remaining polyprotein is then targeted to the endoplasmic reticulum, where host signal peptidase cleaves it into p62, 6K and E1 proteins. p62 is further processed to mature E3 and E2 by host furin in trans-Golgi vesicle. Protein processing process takes about 30 minutes at physiologic temperatures. The folding of the p62/6K/E1 precursor requires the formation of intrachain disulfide bonds and has been shown to involve a transient covalent interaction between the nascent and newly synthesized heterodimer and the host-cell chaperones, P4HB/PDI and PDIA3/ERp57. The folding pathway also includes non covalent interaction with human CANX/calnexin and CALR/calreticulin.

Envelope E1, E2 and E3 proteins are N-glycosylated.

E2 is palmitoylated via thioester bonds. These palmitoylations may induce disruption of the C-terminus transmembrane. This would result in the reorientation of E2 c-terminus from lumenal to cytoplasmic side. 6K protein is also palmitoylated with about four covalently bound fatty acids per molecule. E1 is stearoylated.

Miscellaneous

The mature virion nucleocapsid consists of 240 copies of the capsid protein. 80 spike trimers of E1 and E2 are present at the surface of mature virion. They project about 100 Angstroms from the outer surface and are located at the local and strict three fold axis of the icosaedral lattice. The glycoproteins splay out to form a protein shell or skirt covering most of the outer surface of the membrane bilayer.

Structural polyprotein is translated from a subgenomic RNA synthesized during togavirus replication.

Sequence similarities

Contains 1 peptidase S3 domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical view

Molecule processing

Chain1 – 267267Capsid protein
Chain268 – 755488p62
Chain268 – 33366E3 protein
Signal peptide268 – 28215Not cleaved Potential
Chain334 – 755422E2 envelope glycoprotein
Chain756 – 815606K protein
Chain816 – 1253438E1 envelope glycoprotein

Regions

Topological domain268 – 701434Extracellular Potential
Transmembrane702 – 72221 Potential
Topological domain723 – 75533Cytoplasmic Potential
Topological domain756 – 77015Extracellular Potential
Transmembrane771 – 79121 Potential
Topological domain7921Cytoplasmic Potential
Transmembrane793 – 81321 Potential
Topological domain814 – 1230417Extracellular Potential
Transmembrane1231 – 125121 Potential
Topological domain1252 – 12532Cytoplasmic Potential
Domain112 – 267156Peptidase S3
Region1 – 113113Intrinsically disordered, in contact with genomic RNA in nucleocapsid Potential
Region94 – 10613Ribosome-binding
Region728 – 74821Transient transmembrane before p62-6K protein processing Potential
Region899 – 91618E1 fusion peptide loop

Sites

Active site1451Charge relay system By similarity
Active site1511Charge relay system By similarity
Active site2191Charge relay system By similarity
Site267 – 2682Cleavage; by capsid protein
Site333 – 3342Cleavage; by host furin
Site755 – 7562Cleavage; by host signal peptidase
Site815 – 8162Cleavage; by host signal peptidase

Amino acid modifications

Lipidation7181S-palmitoyl cysteine; by host Potential
Lipidation7281S-palmitoyl cysteine; by host By similarity
Lipidation7481S-palmitoyl cysteine; by host By similarity
Lipidation7491S-palmitoyl cysteine; by host By similarity
Lipidation12481S-stearoyl cysteine; by host
Glycosylation2801N-linked (GlcNAc...) Potential
Glycosylation3271N-linked (GlcNAc...) Potential
Glycosylation5331N-linked (GlcNAc...) Potential
Glycosylation5951N-linked (GlcNAc...) Potential
Glycosylation9561N-linked (GlcNAc...)
Disulfide bond119 ↔ 134
Disulfide bond864 ↔ 929
Disulfide bond877 ↔ 909
Disulfide bond878 ↔ 911
Disulfide bond883 ↔ 893
Disulfide bond1074 ↔ 1086
Disulfide bond1116 ↔ 1191
Disulfide bond1121 ↔ 1195
Disulfide bond1143 ↔ 1185

Experimental info

Mutagenesis2671W → A or R: Complete loss of cleavage by capsid protease
Mutagenesis3301R → S: Complete loss of p62 precursor processing
Mutagenesis3331R → F: Complete loss of p62 precursor processing
Mutagenesis7551A → F: Complete loss of p62 precursor-6K cleavage
Mutagenesis8151A → F: Complete loss of 6K protein-E1 envelope glycoprotein cleavage
Mutagenesis8591L → F: E1 fusion is less cholesterol and sphingolipid dependent
Mutagenesis8901D → A: Shifts the pH threshold for fusion to a more acidic range
Mutagenesis8941K → Q: No effect on E1 fusion activity
Mutagenesis8981G → A: Shifts the pH threshold for fusion to a more acidic range
Mutagenesis8981G → D: No effect on E1 fusion activity
Mutagenesis9011P → D: Retention of E1 protein in endoplasmic reticulum
Mutagenesis9031M → L: No effect on E1 fusion activity
Mutagenesis9061G → A: Shifts the pH threshold for fusion to a more acidic range
Mutagenesis9061G → D: Complete loss of E1 fusion activity
Mutagenesis9061G → P: Retention of E1 protein in endoplasmic reticulum
Mutagenesis9931V → A: E1 fusion is less cholesterol and sphingolipid dependent

Secondary structure

.............................. 1253
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P03315-1 [UniParc].

Last modified July 21, 1986. Version 1.
Checksum: 2A73228D08B82AC5

FASTA1,253138,017
        10         20         30         40         50         60 
MNYIPTQTFY GRRWRPRPAA RPWPLQATPV APVVPDFQAQ QMQQLISAVN ALTMRQNAIA 

        70         80         90        100        110        120 
PARPPKPKKK KTTKPKPKTQ PKKINGKTQQ QKKKDKQADK KKKKPGKRER MCMKIENDCI 

       130        140        150        160        170        180 
FEVKHEGKVT GYACLVGDKV MKPAHVKGVI DNADLAKLAF KKSSKYDLEC AQIPVHMRSD 

       190        200        210        220        230        240 
ASKYTHEKPE GHYNWHHGAV QYSGGRFTIP TGAGKPGDSG RPIFDNKGRV VAIVLGGANE 

       250        260        270        280        290        300 
GSRTALSVVT WNKDMVTRVT PEGSEEWSAP LITAMCVLAN ATFPCFQPPC VPCCYENNAE 

       310        320        330        340        350        360 
ATLRMLEDNV DRPGYYDLLQ AALTCRNGTR HRRSVSQHFN VYKATRPYIA YCADCGAGHS 

       370        380        390        400        410        420 
CHSPVAIEAV RSEATDGMLK IQFSAQIGID KSDNHDYTKI RYADGHAIEN AVRSSLKVAT 

       430        440        450        460        470        480 
SGDCFVHGTM GHFILAKCPP GEFLQVSIQD TRNAVRACRI QYHHDPQPVG REKFTIRPHY 

       490        500        510        520        530        540 
GKEIPCTTYQ QTTAETVEEI DMHMPPDTPD RTLLSQQSGN VKITVGGKKV KYNCTCGTGN 

       550        560        570        580        590        600 
VGTTNSDMTI NTCLIEQCHV SVTDHKKWQF NSPFVPRADE PARKGKVHIP FPLDNITCRV 

       610        620