Extraction of consensus protein patterns in regions containing non-proline cis peptide bonds and their functional assessment

Loading...
Thumbnail Image

Date

Authors

Exarchos, K. P.
Exarchos, T. P.
Rigas, G.
Papaloukas, C.
Fotiadis, D. I.

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Type of the conference item

Journal type

peer reviewed

Educational material type

Conference Name

Journal name

BMC Bioinformatics

Book name

Book series

Book edition

Alternative title / Subtitle

Description

Background: In peptides and proteins, only a small percentile of peptide bonds adopts the cis configuration. Especially in the case of amide peptide bonds, the amount of cis conformations is quite limited thus hampering systematic studies, until recently. However, lately the emerging population of databases with more 3D structures of proteins has produced a considerable number of sequences containing non-proline cis formations (cis-nonPro). Results: In our work, we extract regular expression-type patterns that are descriptive of regions surrounding the cis-nonPro formations. For this purpose, three types of pattern discovery are performed: i) exact pattern discovery, ii) pattern discovery using a chemical equivalency set, and iii) pattern discovery using a structural equivalency set. Afterwards, using each pattern as predicate, we search the Eukaryotic Linear Motif (ELM) resource to identify potential functional implications of regions with cis-nonPro peptide bonds. The patterns extracted from each type of pattern discovery are further employed, in order to formulate a pattern-based classifier, which is used to discriminate between cis-nonPro and trans-nonPro formations. Conclusions: In terms of functional implications, we observe a significant association of cis-nonPro peptide bonds towards ligand/binding functionalities. As for the pattern-based classification scheme, the highest results were obtained using the structural equivalency set, which yielded 70% accuracy, 77% sensitivity and 63% specificity.

Description

Keywords

secondary structure information, support vector machines, amino-acid-sequence, cis/trans isomerization, prediction, conformation, resource, annotation, discovery, residues

Subject classification

Citation

Link

<Go to ISI>://000290764200001
http://www.biomedcentral.com/1471-2105/12/142

Language

en

Publishing department/division

Advisor name

Examining committee

General Description / Additional Comments

Institution and School/Department of submitter

Πανεπιστήμιο Ιωαννίνων. Σχολή Επιστημών και Τεχνολογιών. Τμήμα Βιολογικών Εφαρμογών και Τεχνολογιών

Table of contents

Sponsor

Bibliographic citation

Name(s) of contributor(s)

Number of Pages

Course details

Endorsement

Review

Supplemented By

Referenced By