FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring

(Whole issue published online first) Online publication date: 2007-02-12
A glycosylphosphatidylinositol (GPI) anchor is a common but complex C-terminal post-translational modification of extracellular proteins in eukaryotes. Here we investigate the problem of correctly annotating GPI-anchored proteins for the growing number of sequences in public databases. We developed a computational system, called FragAnchor, based on the tandem use of a neural network (NN) and a hidden Markov model (HMM). Firstly, NN selects potential GPI-anchored proteins in a dataset, then HMM parses these potential GPI signals and refines the prediction by qualitative scoring. FragAnchor correctly predicted 91% of all the GPI-anchored proteins annotated in the Swiss-Prot database. In a large-scale analysis of 29 eukaryote proteomes, FragAnchor predicted that the percentage of highly probable GPI-anchored proteins is between 0.21% and 2.01%. The distinctive feature of FragAnchor, compared with other systems, is that it targets only the C-terminus of a protein, making it less sensitive to the background noise found in databases and possible incomplete protein sequences. Moreover, FragAnchor can be used to predict GPI-anchored proteins in all eukaryotes. Finally, by using qualitative scoring, the predictions combine both sensitivity and information content. The predictor is publicly available at http://navet.ics.hawaii.edu/~fraganchor/NNHMM/NNHMM.html.
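The tandem arrangement described above (a first-stage filter on the C-terminus, followed by a second-stage scorer that assigns a qualitative class) can be illustrated with a minimal Python sketch. This is not the FragAnchor implementation: the window length, the score thresholds, the class names other than "highly probable", and the function names `c_terminal_window`, `qualitative_label`, and `predict` are all assumptions made for illustration, and the NN and HMM are represented only as caller-supplied functions.

```python
from typing import Callable, List, Optional, Tuple

# Assumed window size; the abstract only says the C-terminus is targeted.
C_TERM_LENGTH = 50


def c_terminal_window(sequence: str, length: int = C_TERM_LENGTH) -> str:
    """Return the last `length` residues (the whole sequence if it is shorter)."""
    return sequence[-length:]


def qualitative_label(hmm_score: float) -> str:
    """Map a numeric second-stage score to a qualitative class.

    The thresholds and the two lower class names are hypothetical.
    """
    if hmm_score >= 2.0:
        return "highly probable"
    if hmm_score >= 0.0:
        return "probable"
    return "weakly probable"


def predict(
    sequences: List[str],
    nn_accepts: Callable[[str], bool],   # stand-in for the trained NN filter
    hmm_score: Callable[[str], float],   # stand-in for the trained HMM scorer
) -> List[Tuple[str, Optional[str]]]:
    """Run the two stages in tandem; None marks sequences rejected by stage one."""
    results: List[Tuple[str, Optional[str]]] = []
    for seq in sequences:
        window = c_terminal_window(seq)
        if not nn_accepts(window):
            results.append((seq, None))
            continue
        results.append((seq, qualitative_label(hmm_score(window))))
    return results
```

Because only the C-terminal window is passed to either stage, a sequence whose N-terminal region is missing or noisy would be handled the same way as a complete one, which is the property the abstract highlights.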