Extraction of consensus protein patterns in regions containing non-proline cis peptide bonds and their functional assessment
Loading...
Date
Authors
Exarchos, K. P.
Exarchos, T. P.
Rigas, G.
Papaloukas, C.
Fotiadis, D. I.
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Type
Type of the conference item
Journal type
peer reviewed
Educational material type
Conference Name
Journal name
BMC Bioinformatics
Book name
Book series
Book edition
Alternative title / Subtitle
Description
Background: In peptides and proteins, only a small percentile of peptide bonds adopts the cis configuration. Especially in the case of amide peptide bonds, the amount of cis conformations is quite limited thus hampering systematic studies, until recently. However, lately the emerging population of databases with more 3D structures of proteins has produced a considerable number of sequences containing non-proline cis formations (cis-nonPro). Results: In our work, we extract regular expression-type patterns that are descriptive of regions surrounding the cis-nonPro formations. For this purpose, three types of pattern discovery are performed: i) exact pattern discovery, ii) pattern discovery using a chemical equivalency set, and iii) pattern discovery using a structural equivalency set. Afterwards, using each pattern as predicate, we search the Eukaryotic Linear Motif (ELM) resource to identify potential functional implications of regions with cis-nonPro peptide bonds. The patterns extracted from each type of pattern discovery are further employed, in order to formulate a pattern-based classifier, which is used to discriminate between cis-nonPro and trans-nonPro formations. Conclusions: In terms of functional implications, we observe a significant association of cis-nonPro peptide bonds towards ligand/binding functionalities. As for the pattern-based classification scheme, the highest results were obtained using the structural equivalency set, which yielded 70% accuracy, 77% sensitivity and 63% specificity.
Description
Keywords
secondary structure information, support vector machines, amino-acid-sequence, cis/trans isomerization, prediction, conformation, resource, annotation, discovery, residues
Subject classification
Citation
Link
<Go to ISI>://000290764200001
http://www.biomedcentral.com/1471-2105/12/142
http://www.biomedcentral.com/1471-2105/12/142
Language
en
Publishing department/division
Advisor name
Examining committee
General Description / Additional Comments
Institution and School/Department of submitter
Πανεπιστήμιο Ιωαννίνων. Σχολή Επιστημών και Τεχνολογιών. Τμήμα Βιολογικών Εφαρμογών και Τεχνολογιών