Design of sequences with good folding properties in coarse-grained protein models
Summary, in English
BACKGROUND: Designing amino acid sequences that are stable in a given target structure amounts to maximizing a conditional probability. A straightforward approach to accomplishing this is a nested Monte Carlo where the conformation space is explored over and over again for different fixed sequences; this requires excessive computational demand. Several approximate attempts to remedy this situation, based on energy minimization for fixed structure or high-T expansions, have been proposed. These methods are fast but often not accurate, as folding occurs at low T.
RESULTS: We have developed a multisequence Monte Carlo procedure where both sequence and conformational space are simultaneously probed with efficient prescriptions for pruning sequence space. The method is explored on hydrophobic/polar models. First we discuss short lattice chains in order to compare with exact data and with other methods. The method is then successfully applied to lattice chains with up to 50 monomers and to off-lattice 20mers.
CONCLUSIONS: The multisequence Monte Carlo method offers a new approach to sequence design in coarse-grained models. It is much more efficient than previous Monte Carlo methods, and is, as it stands, applicable to a fairly wide range of two-letter models.
- Computational Biology and Biological Physics
- Energy Metabolism
- Monte Carlo Method
- Protein Conformation
- Protein Folding
- ISSN: 0969-2126