Improving the protein fold recognition accuracy of a reduced state-space hidden Markov model

Lampros, C.; Papaloukas, C.; Exarchos, K.; Fotiadis, D. I.; Tsalikakis, D.

Improving the protein fold recognition accuracy of a reduced state-space hidden Markov model

dc.contributor.author	Lampros, C.	en
dc.contributor.author	Papaloukas, C.	en
dc.contributor.author	Exarchos, K.	en
dc.contributor.author	Fotiadis, D. I.	en
dc.contributor.author	Tsalikakis, D.	en
dc.date.accessioned	2015-11-24T17:34:51Z
dc.date.available	2015-11-24T17:34:51Z
dc.identifier.issn	0010-4825	-
dc.identifier.uri	https://olympias.lib.uoi.gr/jspui/handle/123456789/14043
dc.rights	Default Licence	-
dc.subject	fold recognition	en
dc.subject	hidden markov models	en
dc.subject	protein classification	en
dc.subject	secondary structure prediction	en
dc.subject	secondary structure alphabet	en
dc.subject	support vector machines	en
dc.subject	sequence	en
dc.subject	superfamily	en
dc.title	Improving the protein fold recognition accuracy of a reduced state-space hidden Markov model	en
heal.abstract	Fold recognition is a challenging field strongly associated with protein function determination, which is crucial for biologists and the pharmaceutical industry. Hidden Markov models (HMMs) have been widely used for this purpose. In this paper we demonstrate how the fold recognition performance of a recently introduced HMM with a reduced state-space topology can be improved. Our method employs an efficient architecture and a low complexity training algorithm based on likelihood maximization. The fold recognition performance of the model is further improved in two steps. In the first step we use a smaller model architecture based on the {E,H,L} alphabet instead of the DSSP secondary structure alphabet. In the second step secondary structure information (predicted or true) is additionally used in scoring the test set sequences. The Protein Data Bank and the annotation of the SCOP database are used for the training and evaluation of the proposed methodology. The results show that the fold recognition accuracy is substantially improved in both steps. Specifically, it is increased by 2.9% in the first step to 22%. In the second step it further increases and reaches up to 30% when predicted secondary structure information is additionally used and it increases even more and reaches up to 34.7% when we use the true secondary structure. The major advantage of the proposed improvements is that the fold recognition performance is substantially increased while the size of the model and the computational complexity of scoring are decreased. (C) 2009 Elsevier Ltd. All rights reserved.	en
heal.access	campus	-
heal.fullTextAvailability	TRUE	-
heal.identifier.primary	DOI 10.1016/j.compbiomed.2009.07.007	-
heal.identifier.secondary	<Go to ISI>://000270375400007	-
heal.identifier.secondary	http://ac.els-cdn.com/S0010482509001310/1-s2.0-S0010482509001310-main.pdf?_tid=c008fe9f6eadc8eefbfce41231e692f4&acdnat=1339758257_f9d041a19426f95229a7ccecf246cc86	-
heal.journalName	Comput Biol Med	en
heal.journalType	peer reviewed	-
heal.language	en	-
heal.publicationDate	2009	-
heal.publisher	Elsevier	en
heal.recordProvider	Πανεπιστήμιο Ιωαννίνων. Σχολή Θετικών Επιστημών. Τμήμα Μηχανικών Επιστήμης Υλικών	el
heal.type	journalArticle	-
heal.type.el	Άρθρο Περιοδικού	el
heal.type.en	Journal article	en

Αρχεία

Φάκελος/Πακέτο αδειών

Προβολή: 1 - 1 of 1

Ονομα:: license.txt
Μέγεθος:: 1.74 KB
Μορφότυπο:: Item-specific license agreed upon to submission
Περιγραφή:

Κατεβάστε

Συλλογές

Άρθρα σε επιστημονικά περιοδικά ( Ανοικτά)