Piecewise linear approximation of protein structures using the principle of minimum message length

Simple and concise representations of protein-folding patterns provide powerful abstractions for visualizations, comparisons, classifications, searching and aligning structural data. Structures are often abstracted by replacing standard secondary structural features—that is, helices and strands of s...

Full description

Bibliographic Details
Main Authors: Konagurthu, Arun S., Allison, Lloyd, Stuckey, Peter J., Lesk, Arthur M.
Format: Online
Language:English
Published: Oxford University Press 2011
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117365/
Description
Summary:Simple and concise representations of protein-folding patterns provide powerful abstractions for visualizations, comparisons, classifications, searching and aligning structural data. Structures are often abstracted by replacing standard secondary structural features—that is, helices and strands of sheet—by vectors or linear segments. Relying solely on standard secondary structure may result in a significant loss of structural information. Further, traditional methods of simplification crucially depend on the consistency and accuracy of external methods to assign secondary structures to protein coordinate data. Although many methods exist automatically to identify secondary structure, the impreciseness of definitions, along with errors and inconsistencies in experimental structure data, drastically limit their applicability to generate reliable simplified representations, especially for structural comparison.