Foldit Wiki
Advertisement
Koga&Koga

The Koga & Koga paper as seen in PubMed.

In 2012, the Baker group published a paper "Principles for designing ideal protein structures” in Nature. This article is usually referred to as Koga & Koga in Foldit, after the lead authors. It's the basis of the ideal loops condition found in most recent Foldit design puzzles, and the patterns found the blueprint tool.

If you're comfortable with the rather dry language of academic biology, you can read the complete article on PubMed.

Koga & Koga describes promising secondary structure patterns for designing new proteins. The patterns are based partly on known structures of naturally occurring proteins, and partly on large-scale simulations of artificial proteins by Rosetta@home and its network of volunteers.

At the lowest level, all of these patterns involve a section of sheet or helix connected to another sheet or helix by a short section of loop. These low-level patterns involve segments (residues) that are immediately adjacent. (In natural proteins, sheets are often bonded to other sheets from "distant" parts of the protein, and sometimes loops meander for many segments, but Koga & Koga didn't address these cases in the low-level patterns.)

Koga & Koga also looks at higher-level patterns, based on complete natural proteins, that use these low-level patterns.

The page attempts to summarize the low-level Koga & Koga patterns. If you can use these patterns when designing proteins, there's a much better chance of your design being interesting from a scientific standpoint. The patterns may help in clearing those ideal loops conditions and achieving a higher score.

See Koga & Koga technical supplement for an explanation of left and right "chirality" and parallel or anti-parallel alignment. These terms are the key to the rules Koga & Koga presents for low-level patterns.

The use of these patterns in Foldit is also explained in a video by Susume.

Koga & Koga has low-level patterns for sheet - loop - sheet, sheet - loop - helix, and helix - loop - sheet.

Koga and Koga didn't include a low-level helix - loop - helix pattern. Fortunately, the Foldit blueprint tool does include samples for helix - loop - helix, along with the other Koga & Koga patterns.

Each of these patterns is discussed in more detail below. The goal is to identify the key features of each pattern and describe how to create the pattern by "hand folding" in Foldit. You can also select similar patterns using the blueprint tool. Hand folding usually involves using cutpoints and the move tool to get the helixes, sheets, and loops aligned; the blueprint shapes eliminate that step.

With these patterns, it generally doesn't matter which amino acids are used at the start. One exception: hairpin loops in Foldit are much easier to construct if you make them glycine to start. The mutate tool can always fine-tune the amino acids later on.

The segment order of the patterns does matter. Helix-loop-sheet is not the same as sheet-loop-helix from a folding standpoint. The lower-numbered segments are always listed first in the patterns.

sheet - loop - sheet[]

Several of the patterns involve two adjacent sheets connected by a short section of loop. This is called an "anti-parallel" arrangment, and its very common in natural proteins.

The key here is the number of segments in the loop, which determines whether the second sheet goes to the left or right of the first sheet. Koga & Koga talks about "chirality" or handedness, and uses L and R, but has a precise mathematical definition. Foldit doesn't give us the numbers we'd need (the X-Y-Z coordinates of each atom), so vector algebra is out.

Fortunately, there's an easy method for determining "left" and "right", as discussed below for the sheet - 2 segment loop - sheet pattern.

sheet - 2 segment loop - sheet[]

E-L2 E

Sheet connected to sheet by two loop segments.

This pattern is seen in many natural proteins. Here's a strategy to make this pattern manually in Foldit.

  1. Arrange the first sheet (the one with lower segment numbers) so that the sidechain of the last segment of the sheet points into the screen.
  2. Then the second sheet should be to the LEFT of the first sheet.
  3. When constructing this hairpin turn, life is much easier if you mutate the two loop segments to glycine. Also, put a cutpoint in the middle of the loop and wiggle it to get a reasonable shape.

To see the sidechains, turn on "show sidechains (all)" in the Foldit view options. You'll also want to turn on "show bonds (sheet)" to show the blue-and-white spirals that indicate the sheets are properly aligned.

In this example, sheet 1 has segments 55 to 59. Segments 60 and 61 constitute the loop. The second sheet has segments 62 to 66. Note that the sidechain of segment 59 (arginine) points into the page, indicating that the second sheet goes to the left. Also note that one of the loop segments (61) shows no sidechain, which means it's a glycine.

In the case of this particular secondary structure sequence, the preference for going left might be considered an absolute rule rather than a guideline. In thousands of simulated cases where this pattern occurred, and many thousands more in naturally occurring proteins, it looks from the Koga & Koga paper as if there was not a single case of the second sheet being to the right when the sheets are joined by a 2-segment loop.

This an example of "L chirality" in Koga & Koga terms. It could be called a "left-handed" turn.

sheet - 3 segment loop - sheet[]

This pattern is very similar to the previous case. The second sheet has a strong preference to go to to the left.

  1. Arrange the first sheet (the one with lower segment numbers) so that the sidechain of the last segment of the sheet points into the screen.
  2. Then the second sheet should, as previously, be to the LEFT of the first sheet.
  3. The loop here isn't quite as strained as in the 2-segment loop case above so glycines in the loop aren't a necessity during construction. Mutate may still end up putting them there though.

This preference is followed about 85% of the time (based on eyeballing the histograms in the paper), both in naturally occurring proteins and in designed ones.

This pattern occurs much less frequently in natural proteins than does the 2-segment loop case (maybe 10-15% as common).

This case also has L chirality.

sheet - 4 segment loop - sheet[]

It's 50/50 whether the second sheet goes right or left in both natural and artificial proteins, so you don't have to worry about it unduly. (With a four-segment loop, either L or R chirality is possible.)

This pattern occurs about twice as frequently in natural proteins as the 3-segment loop case, but is still relatively uncommon relative to the 2-segment loop case.

sheet - 5 segment loop - sheet[]

E-5L-E

Sheet connected to sheet by five loop segments.

  1. Arrange the first sheet (the one with lower segment numbers) so that the sidechain of the last segment of the sheet points into the screen.
  1. Then the second sheet should be to the RIGHT of the first sheet.

In this example, sheet 1 has segments 4 to 8. Segments 9 to 13 make up the loop, and the second sheet has segments 14 to 18. Note that the sidechain of segment 8 (serine: barely visible) points into the page, indicating that the second sheet goes to the right.

This pattern is followed about 70% of the time in designed proteins and about 95% of the time in naturally occurring ones.

It's also much more common in naturally occurring proteins than the 3 and 4 loop cases but still not as frequent as the 2-loop case.

This is the one case which favor R or right chirality when connecting two sheets.

sheet - loop - helix[]

There are two preferred orientations for this setup. In both of them the helix is offset diagonally from the sheet: in one case it is in front of the sheet and slants to the right. In the other case, the helix goes behind the sheet and slants to the left.

For cases involving sheets and helixes, Koga & Koga talks about parallel and anti-parallel alignments. The meaning is similar to parallel and anti-parallel when applied to to sheets, but once again there's a precise mathematical definition. Once again, there's also a way to skip the calculations, and come to the same result by inspecting the protein in Foldit.

sheet - 2 segment loop - helix[]

E-2L-H

Sheet connected to helix by two loop segments.

Here, the preference is for the helix to go behind the sheet and slant to the left as shown below.

To avoid visual clutter, only the last sidechain (segment 9) of the sheet is shown: pointing into the page as usual to provide a defined orientation. The loop (segments 10 to 11) and the start of the helix (segment 12) are also shown.

When the loop is two segments in length, this orientation is favoured over the one below by about 10-1. Koga & Koga refers to this a parallel (P) alignment.

It's actually quite hard in Foldit to achieve this geometry without the helix and sheet getting too close: furthermore, the distinction between a loop and a helix isn't all that clear.


sheet - 3 segment loop - helix[]

E 3L-H

Sheet connected to helix by three loop segments.

Here, the preference is for the helix to go in front of the sheet and slant to the right as shown below. This considered anti-parallel (A) alignment in Koga & Koga.

To avoid visual clutter, only the last sidechain (segment 9) of the sheet is shown: pointing into the page as usual to provide a defined orientation. The loop (segments 10 to 12) and the start of the helix (segment 12) are also shown.

This orientation is favoured over the two-loop arrangement in both natural and designed proteins but in neither case is the preference overwhelming: it's about 2-1 in artificial proteins and 1.5 to 1 in naturally occurring ones.


helix - loop - sheet[]

H-L-E

Helix connected to sheet.

Irrespective of the size of the loop (bit peculiar that), the orientation shown here is preferred. Again the helix is at an angle to the sheet, the first segment of which points into the plane. In Koga & Koga, this is parallel (P) alignment, and it wins out over anti-parallel by at least 2 to 1 when looking at natural proteins.

Although the helix - loop - sheet image looks similar to the one for sheet - 3 segment loop - helix, notice that the segment order is different. The different order is why one is parallel and the other is anti-parallel.

Advertisement