Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot P13470 (GTFC_STRMU)

Last modified November 4, 2008. Version 84. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Glucosyltransferase-SI
      Short name=GTF-SI
    EC=2.4.1.5
Alternative name(s):
    Dextransucrase
    Sucrose 6-glucosyltransferase
Gene names
Name: gtfC
Ordered Locus Names: SMU_1005
OrganismStreptococcus mutans [Complete proteome] [HAMAP]
Taxonomic identifier1309 [NCBI]
Taxonomic lineageBacteriaFirmicutesLactobacillalesStreptococcaceaeStreptococcus

Protein attributes

Sequence length1455 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology.

General annotation (Comments)

Function

Production of extracellular glucans, that are thought to play a key role in the development of the dental plaque because of their ability to adhere to smooth surfaces and mediate the aggregation of bacterial cells and food debris.

Catalytic activity

Sucrose + ((1->6)-alpha-D-glucosyl)(n) = D-fructose + ((1->6)-alpha-D-glucosyl)(n+1).

Subcellular location

Secreted.

Miscellaneous

GTF-I synthesizes water-insoluble glucans (alpha 1,3-linked glucose and some 1,6 linkages), GTF-S synthesizes water-soluble glucans (alpha 1,6-glucose). GTF-SI synthesizes both forms of glucans.

Sequence similarities

Belongs to the glycosyl hydrolase 70 family.

Contains 11 cell wall-binding repeats.

Ontologies

Keywords

   Cellular componentSecreted
   DiseaseDental caries
   DomainRepeat
Signal
   Molecular functionGlycosyltransferase
Transferase
   Technical termComplete proteome

Gene Ontology (GO)

   Cellular componentextracellular region

Inferred from electronic annotation. Source: UniProtKB-KW

   Molecular functiondextransucrase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3434
Chain35 – 14551421Glucosyltransferase-SI
PRO_0000021386

Regions

Repeat184 – 20320Cell wall-binding 1
Repeat205 – 22420Cell wall-binding 2
Repeat1116 – 113520Cell wall-binding 3
Repeat1136 – 115520Cell wall-binding 4
Repeat1219 – 123921Cell wall-binding 5
Repeat1242 – 126221Cell wall-binding 6
Repeat1263 – 128220Cell wall-binding 7
Repeat1307 – 132721Cell wall-binding 8
Repeat1328 – 134720Cell wall-binding 9
Repeat1372 – 139221Cell wall-binding 10
Repeat1393 – 141220Cell wall-binding 11
Region250 – 1050801Catalytic; approximate

Natural variations

Natural variant211V → I in strain: GS-5.
Natural variant811P → L in strain: MT4239.
Natural variant1061D → V in strain: GS-5.
Natural variant1161S → A in strain: GS-5 and MT4467.
Natural variant1261A → T in strain: GS-5.
Natural variant150 – 1512SR → PK in strain: GS-5, MT4239 and MT4467.
Natural variant2561A → V in strain: GS-5 and MT4467.
Natural variant4251R → N in strain: MT4251.
Natural variant5191Y → D in strain: MT4245 and MT4251.
Natural variant5381R → K in strain: MT4245 and MT4251.
Natural variant5451Y → F in strain: MT4245 and MT4251.
Natural variant5971N → D in strain: MT4245, MT4251, MT4467 and MT8148.
Natural variant6001R → K in strain: MT4245, MT4251, MT4467 and MT8148.
Natural variant6011A → T in strain: GS-5.
Natural variant6141M → T in strain: GS-5.
Natural variant7271T → I in strain: MT8148.
Natural variant7341A → V in strain: MT8148.
Natural variant9641L → F in strain: MT4239.
Natural variant11131N → Y in strain: MT4239.
Natural variant11181A → T in strain: MT4239.
Natural variant12041I → V in strain: GS-5, MT4239, MT4467 and MT8148.
Natural variant12081V → I in strain: MT8148.
Natural variant1292 – 12943DGH → NGY in strain: GS-5, MT4467 and sMT8148.
Natural variant1305 – 136965Missing in strain: MT4245.
Natural variant13261I → V in strain: MT8148.
Natural variant13311T → A in strain: GS-5, MT4239, MT4467 and MT8148.
Natural variant13771R → K in strain: MT8148.
Natural variant13981V → I in strain: MT8148.
Natural variant14241D → N in strain: MT4239.
Natural variant14391V → I in strain: MT4239 and MT8148.
Natural variant14441S → P in strain: MT8148.

Experimental info

Sequence conflict1337 – 1455119QRLYF…NFFRF → HASILSLMVFRLRESSLQSV KVVSNTMILIPEMKFVIVM in AAA88592 and BAA26102. Ref.1

Sequences

Sequence LengthMass (Da)Tools
P13470-1 [UniParc].

Last modified November 28, 2002. Version 2.
Checksum: 3CB455A99A4FEC86

FASTA1,455162,966
        10         20         30         40         50         60 
MEKKVRFKLR KVKKRWVTVS VASAVVTLTS LSGSLVKADS TDDRQQAVTE SQASLVTTSE 

        70         80         90        100        110        120 
AAKETLTATD TSTATSATSQ PTATVTDNVS TTNQSTNTTA NTANFDVKPT TTSEQSKTDN 

       130        140        150        160        170        180 
SDKIIATSKA VNRLTATGKF VPANNNTAHS RTVTDKIVPI KPKIGKLKQP SSLSQDDIAA 

       190        200        210        220        230        240 
LGNVKNIRKV NGKYYYYKED GTLQKNYALN INGKTFFFDE TGALSNNTLP SKKGNITNND 

       250        260        270        280        290        300 
NTNSFAQYNQ VYSTDAANFE HVDHYLTAES WYRPKYILKD GKTWTQSTEK DFRPLLMTWW 

       310        320        330        340        350        360 
PDQETQRQYV NYMNAQLGIH QTYNTATSPL QLNLAAQTIQ TKIEEKITAE KNTNWLRQTI 

       370        380        390        400        410        420 
SAFVKTQSAW NSDSEKPFDD HLQKGALLYS NNSKLTSQAN SNYRILNRTP TNQTGKKDPR 

       430        440        450        460        470        480 
YTADRTIGGY EFLLANDVDN SNPVVQAEQL NWLHFLMNFG NIYANDPDAN FDSIRVDAVD 

       490        500        510        520        530        540 
NVDADLLQIA GDYLKAAKGI HKNDKAANDH LSILEAWSYN DTPYLHDDGD NMINMDNRLR 

       550        560        570        580        590        600 
LSLLYSLAKP LNQRSGMNPL ITNSLVNRTD DNAETAAVPS YSFIRAHDSE VQDLIRNIIR 

       610        620        630        640        650        660 
AEINPNVVGY SFTMEEIKKA FEIYNKDLLA TEKKYTHYNT ALSYALLLTN KSSVPRVYYG 

       670        680        690        700        710        720 
DMFTDDGQYM AHKTINYEAI ETLLKARIKY VSGGQAMRNQ QVGNSEIITS VRYGKGALKA 

       730        740        750        760        770        780 
TDTGDRTTRT SGVAVIEGNN PSLRLKASDR VVVNMGAAHK NQAYRPLLLT TDNGIKAYHS 

       790        800        810        820        830        840 
DQEAAGLVRY TNDRGELIFT AADIKGYANP QVSGYLGVWV PVGAAADQDV RVAASTAPST 

       850        860        870        880        890        900 
DGKSVHQNAA LDSRVMFEGF SNFQAFATKK EEYTNVVIAK NVDKFAEWGV TDFEMAPQYV 

       910        920        930        940        950        960 
SSTDGSFLDS VIQNGYAFTD RYDLGISKPN KYGTADDLVK AIKALHSKGI KVMADWVPDQ 

       970        980        990       1000       1010       1020 
MYALPEKEVV TATRVDKYGT PVAGSQIKNT LYVVDGKSSG KDQQAKYGGA FLEELQAKYP 

      1030       1040       1050       1060       1070       1080 
ELFARKQIST GVPMDPSVKI KQWSAKYFNG TNILGRGAGY VLKDQATNTY FSLVSDNTFL 

      1090       1100       1110       1120       1130       1140 
PKSLVNPNHG TSSSVTGLVF DGKGYVYYST SGNQAKNAFI SLGNNWYYFD NNGYMVTGAQ 

      1150       1160       1170       1180       1190       1200 
SINGANYYFL SNGIQLRNAI YDNGNKVLSY YGNDGRRYEN GYYLFGQQWR YFQNGIMAVG 

      1210       1220       1230       1240       1250       1260 
LTRIHGAVQY FDASGFQAKG QFITTADGKL RYFDRDSGNQ ISNRFVRNSK GEWFLFDHNG 

      1270       1280       1290       1300       1310       1320 
VAVTGTVTFN GQRLYFKPNG VQAKGEFIRD ADGHLRYYDP NSGNEVRNRF VRNSKGEWFL 

      1330       1340       1350       1360       1370       1380 
FDHNGIAVTG TRVVNGQRLY FKSNGVQAKG ELITERKGRI KYYDPNSGNE VRNRYVRTSS 

      1390       1400       1410       1420       1430       1440 
GNWYYFGNDG YALIGWHVVE GRRVYFDENG VYRYASHDQR NHWDYDYRRD FGRGSSSAVR 

      1450 
FRHSRNGFFD NFFRF 

« Hide

References

« Hide 'large scale' references
[1]"Sequence analysis of the gtfC gene from Streptococcus mutans GS-5."
Ueda S., Shiroza T., Kuramitsu H.K.
Gene 69:101-109(1988) [PubMed: 2976010] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: GS-5.
[2]"Molecular analyses of glucosyltransferase genes among strains of Streptococcus mutans."
Fujiwara T., Terao Y., Hoshino T., Kawabata S., Ooshima T., Sobue S., Kimura S., Hamada S.
FEMS Microbiol. Lett. 161:331-336(1998) [PubMed: 9570124] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: MT4239 / Serotype c, MT4245 / Serotype e, MT4251 / Serotype f, MT4467 / Serotype e and MT8148 / Serotype c.
[3]"Genome sequence of Streptococcus mutans UA159, a cariogenic dental pathogen."
Ajdic D.J., McShan W.M., McLaughlin R.E., Savic G., Chang J., Carson M.B., Primeaux C., Tian R., Kenton S., Jia H.G., Lin S.P., Qian Y., Li S., Zhu H., Najar F.Z., Lai H., White J., Roe B.A., Ferretti J.J.
Proc. Natl. Acad. Sci. U.S.A. 99:14434-14439(2002) [PubMed: 12397186] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 700610 / UA159 / Serotype c.
[4]"Sequence analysis of the gtfB gene from Streptococcus mutans."
Shiroza T., Ueda S., Kuramitsu H.K.
J. Bacteriol. 169:4263-4270(1987) [PubMed: 3040685] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-349.
Strain: GS-5.

Cross-references

Sequence databases

M22054 Genomic DNA. Translation: AAA88592.1.
D88652 Genomic DNA. Translation: BAA26102.1.
D88655 Genomic DNA. Translation: BAA26106.1.
D88658 Genomic DNA. Translation: BAA26110.1.
D88661 Genomic DNA. Translation: BAA26114.1.
D89978 Genomic DNA. Translation: BAA26120.1.
AE014133 Genomic DNA. Translation: AAN58706.1.
M17361 Genomic DNA. Translation: AAA88589.1.
PIRJT0345.
RefSeqNP_721400.1.

3D structure databases

HSSPHSSP built from PDB template 1H8G based on UniProtKB P06653.
ModBaseSearch...

Genome annotation databases

GeneID1028343.
GenomeReviewsGene locus SMU_1005 in contig AE014133_GR.
KEGGsmu:SMU.1005.
NMPDRfig|210007.1.peg.913.

Organism-specific databases

CMRSearch...

Enzyme and pathway databases

BioCycSMUT210007:SMU_1005-MON.

Family and domain databases

InterProIPR002479. Cell_wall_bd_put.
IPR003318. Glyco_hydro70cat.
IPR013781. Glyco_hydro_sub_cat.
[Graphical view]
Gene3DG3DSA:3.20.20.80. Glyco_hydro_cat. 2 hits.
PfamPF01473. CW_binding_1. 2 hits.
PF02324. Glyco_hydro_70. 1 hit.
[Graphical view]
PROSITEPS51170. CW. 11 hits.
[Graphical view]
BLOCKSSearch...
ProtoNetSearch...

Entry information

Entry nameGTFC_STRMU
AccessionPrimary (citable) accession number: P13470
Secondary accession number(s): O69382 expand/collapse secondary AC list , O69385, O69388, O69391, O69397, P05427
Entry history
Integrated into UniProtKB/Swiss-Prot: January 1, 1990
Last sequence update: November 28, 2002
Last modified: November 4, 2008
This is version 84 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHAMAP (High-quality Automated and Manual Annotation of microbial Proteomes)

Relevant documents

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

UniProtKB secondary accession numbers

Index of UniProtKB secondary accession numbers

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents