Peptide Modeling with BioLuminate

Tutorial Created with Software Release: 2025-2
Topics: Biologics Drug Discovery, Machine Learning, Peptide Design
Products Used: BioLuminate, Glide

Tutorial files

29 MB

This tutorial is written for use with a 3-button mouse with a scroll wheel.
Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed

 

Tip: You can hover over a glossary term to display its definition. You can click on an image to expand it in the page.
Abstract:

 

In this tutorial, you will learn how to use various tools in the BioLuminate program to perform Peptide Modeling related tasks. You will learn how to dock peptides to a protein receptor, identify possible binding hotspots, perform lead optimization, and predict binding affinity with QSAR.

 

Tutorial Content
  1. Creating Projects and Importing Structures

  1. Docking Peptides to a Receptor

  1. Identifying Binding Hot Spots using MM-GBSA Residue Scanning

  1. Optimizing Peptide Structure using Affinity Maturation

  1. Building Quantitative Sequence-Activity Relationships (QSAR) to Predict Binding Affinity

  1. Conclusions and References

  1. Glossary of Terms

1. Creating Projects and Importing Structures

At the start of the session, change the file path to your chosen Working Directorythe location where files are saved in Maestro to make file navigation easier. Each session in Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is created, the project is automatically saved each time a change is made.

Structures can be imported from the PDB directly, or from your Working Directorythe location where files are saved using File > Import Structures, and are added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion are located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data. The Toggle Table is another way to interact with structures and can be accessed via Window > Toggle Table.

1.1 Load pre-installed files into the working directory

  1. Double-click the BioLuminate icon.

 

Figure 1-1. Change Working Directory option.

  1. Go to File > Change Working Directory.
  2. Find your directory, and click Choose.
  3. Pre-generated input and results files are included for running jobs or examining output. Download the zip file here: https://www.schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/peptidemodeling_bl.zip
  4. After downloading the zip file, unzip the contents in your Working Directorythe location where files are saved for ease of access throughout the tutorial.  

Figure 1-2. Opening the project.

  1. Go to File > Open Project and choose peptidemodelingtutorial.prjzip.
  2. Click Open.
    • Structures are shown in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.

 

Note: Please see the Glossary of Terms for the distinction between includedthe entry is represented in the Workspace, the circle in the In column is blue and selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries.

 

  1. In the Save scratch project warning box, click OK.

1.2 Save project in the working directory

Figure 1-3. Saving the project via the Save Project panel.

  1. Go to File > Save Project As.
  2. Change the File name to peptidemodeling_new.
  3. Click Save.
    • The project is now named peptidemodeling_new.prj.    

2. Docking Peptides to a Receptor

In this section, you will perform cognate liganda ligand that is bound to its protein target docking of a peptide that binds cyclophilin A (PDB ID 1AWR) using the SP-PEP protocol. This protocol was developed specifically for docking peptides; it performs optimally uncharged, linear peptides of less than 10 residues. To save time, structures in this tutorial have already been prepared using the Protein Preparation Workflow. This is an important step that should be done prior to performing a residue scan. Please see the Introduction to Structure Preparation and Visualization and Best Practices for Protein Preparation for more information on the protein preparation process in general as well as the Protein Preparation Workflow.

2.1 Dock a cognate peptide to a receptor

Figure 2-1. Including 1AWR_complex_prep entry and selecting 1AWR-random entry.

For this retrospective validation, we would want to start with a randomized input conformation of the peptide such that we can show that the docking algorithm is able to generate the crystallographic pose.

  1. Includethe entry is represented in the Workspace, the circle in the In column is blue 1AWR_complex_prep in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
    • A receptor structure is loaded into the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
  2. Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries 1AWR-random in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
  3. Double-click Presets.
    • The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed is rerendered in the default Custom Preset.

Figure 2-2. Peptide Docking in Biologics Tasks.

 

  1. Go to Tasks > Biologics > Peptide Docking.

Figure 2-3. The Peptide Docking Panel.

  1. In the Binding Site section, check the Pick box for Centroid option.
    • A banner appears prompting you to pick a ligand to define the region.

Figure 2-4. Receptor bounding boxes.

  1. Select any atom of the ligand in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
    • The ligand is now highlighted with a purple box and green box around it.
    • The ligand will be excluded from the grid generation.

 

Note: The purple bounding box defines the region that the docked molecule(s) can occupy to satisfy the initial stages of docking. The green box defines where the midpoint of any docked molecule must reside to satisfy the initial stages of docking.

Figure 2-5. Adding selected sequences from the Project Table.

  1. For Add Sequences from, choose Project Table.
    • The selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries 1AWR-random is displayed in the Peptide table.
  2. Change the Job name to 1AWR_pep_MMGBSA.
  3. Click Run.
    • This job will take ~1 hour to run on a single CPU.
    • To save time, you will look at the pregenerated results.

2.2     Analyze the docking results

Figure 2-6. Selecting the 1AWR_pep_MMGBSA group for viewing poses.

 

 

  1. Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the group 1AWR_pep_MMGBSA in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
  2. Right-click on the group header and select View Poses.
    • The Pose Viewer panel opens.
    • The first structure in the group is fixed in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed, the second is includedthe entry is represented in the Workspace, the circle in the In column is blue.

 

Figure 2-7. Pose Viewer Set Up.

  1. Ensure Display per-residue interactions is disabled.
  2. Click Set Up Poses.
  3. Step through the results using the right and left arrow keys.

 

Note: Turn off AUTO-fit on the main toolbar to prevent the Workspace from zooming to fit the selection when viewing ligands.

 

Figure 2-8. Overlay of crystallographic ligand (green) with the 18th docked pose (cyan).

  1. Double-click the In circle to fix 1AWR_chainG in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
  2. Compare poses to the cognate liganda ligand that is bound to its protein target using the right and left arrow keys.
  3. Close the Pose Viewer panel.

3. Identifying Binding Hot Spots Using MM-GBSA Residue Scanning

In this section, you will use MM-GBSA Residue Scanning to reproduce experimental results for a Bim-derived peptide bound to the Mcl-1 protein. To save time, structures in this tutorial have already been prepared using the Protein Preparation Workflow. This is an important step that should be done prior to performing a residue scan.

3.1 Set up a MM-GBSA Residue Scanning Calculation

Figure 3-1. The MM-GBSA Residue Scanning Calculations option in Biologics Tasks.

  1. Go to Workspace > Clear Workspace.
  2. Includethe entry is represented in the Workspace, the circle in the In column is blue 2NL9_prepped in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.

 

Note: Type Z to fit the structure to the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed, if needed.

 

  1. Go to Tasks > Biologics > MM-GBSA Residue Scanning Calculations.

Figure 3-2. Calculating for Stability and Affinity of Chain A binding to Chain B.

  1. For Import structure from, choose Workspace.
  2. Click Import.
    • The Residues table gets populated.
  3. For Calculation type, choose Stability and Affinity.
  4. For Binding partners, uncheck the B checkbox in the Chains menu.
    • Chain A is binding to Chain B.

Figure 3-3. Mutating selected residues to ALA.

  1. For Set mutations for, choose Selected residues.

 

To evaluate the residue at the peptide-protein interface, you will use alanine scanning to predict the impact on both affinity and stability that would arise when an amino acid on the interface has its side chain mutated to an alanine side chain.

 

  1. In the table, use Ctrl+Click (Cmd+Click) to select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the following residues of chain B: 54-58, 60, 62-70, 72-73.
  2. Next to ALA, click Apply.
    • Selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries mutations are set to ALA.

 

Note: It has been documented that there is a bias toward arginine and methionine in residue scanning outputs. It is recommended to keep this in mind when designing your residue scanning experiments, and to look critically at results that suggest beneficial methionine or arginine mutations.

Figure 3-4. Opening the Nonstandard Residues panel.

  1. Go to the Options tab.
  2. Click Define Nonstandard Residues.
    • The Nonstandard Residues panel opens.

 

Figure 3-5. Managing the Nonstandard Residues.

Note: The Nonstandard Residue panel is loaded with all standard amino acids and a selection of 48 nonstandard amino acids.

 

  1. Find DGL in Row 35 and check Mutate to checkbox.
    • Mutating to DGL is now an option in the MM-GBSA Residue Scanning panel.

Figure 3-6. Selecting residues to mutate to GLU and DGL.

  1. Go to the Residues tab.

 

To explore the alanines to the peptide-protein interface, you are going to mutate them to the much-bulkier GLU and DGL side chains.

 

  1. Mutate residues 59, 71, and 74 of chain B to GLU and DGL.
  2. Change the Job name to mcl1_bim_alaninescan.
  3. Click Run.
    • These calculations take ~30 seconds per mutation.
    • To save time, you will look at the pregenerated results.

3.2 Analyze the MM-GBSA residue scanning results

Figure 3-7. Loading the MM-GBSA Residue Scanning results.

  1. Go to Tasks > Biologics > MM-GBSA Residue Scanning Results.
    • The MM-GBSA Residue Scanning Viewer panel opens.
  2. Next to Load results from, choose File, and click Browse.
  3. Select mcl1_bim_alaninescan-out.maegz.
  4. Click Open.
    • The Mutations table is populated with the predicted changes in affinity and stability.

Notes:

  1. The properties in the MM-GBSA Residue Scanning Viewer are frequently most helpful in assessing the potential impact of a mutation are Δ Affinity and Δ Stability (solvated).
  2. As these values are the relative change in free energy (in arbitrary energy units, see the last note in this gray box), negative Δ Stability values correspond to favorable mutations.
  3. MM-GBSA Residue Scanning is helpful for predicting mutations as significantly improving binding (Δ Affinity < -3 energy units), neutral (-3 energy units < Δ Affinity < +3 energy units), or reducing binding affinity significantly (Δ Affinity > +3 energy units). These scores are best used as criteria for prioritizing mutants for testing.
  4. As noted by Beard et al. in this paper, the changes in binding affinity calculated with MM-GBSA are not directly equivalent to those measured experimentally. MM-GBSA reports energies in an arbitrary energy unit. To compare MM-GBSA energies with experimental values, you have to obtain a scaling factor by calibrating the predicted values to a corresponding set of experimental values.

Figure 3-8. Adjusting the graph results.

  1. In the Graph representation, select the area of the graph to cover Δ Affinity of 15 by clicking and dragging in the graph area.
  2. Check Sync selection with mutations table.
    • Filtered results are now displayed in bold and selected in the Mutations table.

Figure 3-9. Adjusting the Workspace Display.

  1. In Mutations, click Workspace Display and check Original residue in gray.
  2. Select Residue B:62.
    • The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed is zoomed to residue 62.
    • The mutation is green, while the original LEU residue is gray.

Figure 3-10. The LEU62 (gray) mutation to alanine (blue).

Note: In this example, alanine mutations with large, positive values for Δ Affinity are interpreted as being potential binding hot spots.

4. Optimizing Peptide Structure Using Affinity Maturation

In this section, you will use affinity maturation to predict combinations of mutations for which the binding and/or stability of a peptide-protein interaction is optimized. As compared to Residue Scanning, where mutations are performed serially, Affinity Maturation employs a Monte Carlo search algorithm to explore mutations in a combinatorial fashion.

4.1     Set up an affinity maturation calculation

Figure 4-1. Include 1BXL_minimized in the Workspace.

  1. Includethe entry is represented in the Workspace, the circle in the In column is blue 1BXL_minimized in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
  2. Go to Tasks > Biologics > MM-GBSA Residue Scanning Calculations.

Figure 4-2. Calculating for Stability and Affinity of Chain A binding to Chain B.

  1. For Import structure from, choose Workspace.
  2. Click Import.
    • The table is populated.
  3. For Calculation type, choose Stability and Affinity.
  4. For Binding partners, uncheck the B checkbox in the Chains menu.
    • Chain A is binding to Chain B.

Figure 4-3. Selecting residues to mutate to NonPolar (Hydrophobic) residues.

  1. For Set mutations for, choose Selected residues.
  2. In the table, use Ctrl+Click (Cmd+Click) to select the following residues of chain B: 574, 578, 581, and 585.
  3. Click the second menu for Set mutations for (following “to”), and check NonPolar (Hydrophobic).
  4. Click Apply.
    • Selected mutations are set to nonpolar residues.

Figure 4-4. Options tab of the MM-GBSA Residue Scanning panel.

  1. Click the Options tab.
  2. Check Perform affinity maturation (protein design).
    • The Combinations section appears.
  3. For Property to optimize, choose Affinity.
  4. Change Job name to bclxl_bak_affinitymaturation.
  5. Click Run.
    • This job will take ~4 hours to run on a single CPU.
    • To save time, you can use the pregenerated results.

4.2 Analyze affinity maturation results

Figure 4-5. Viewing the affinity maturation results.

  1. Go to Tasks > Biologics > MM-GBSA Residue Scanning Results.
    • The MM-GBSA Residue Scanning Viewer panel opens.
  2. Next to Load results from, choose File.
  3. Click Browse and locate the bclxl_bak_affinitymaturation-out.maegz file.
  4. Click Open.
    • Results are displayed in the table and graph.
    • Each row of data corresponds to a combination of mutations.

Figure 4-6. The resulting LOGO plot.

  1. Click Create LOGO plot at the bottom of the panel.
    • A sequence LOGO plot opens.
    • A .png image of the LOGO plot is saved to your Working Directorythe location where files are saved.
  2. Click OK.

 

Note: Given the known bias toward methionine, these affinity maturation results should be examined more carefully.

5. Building Quantitative Sequence-Activity Relationships (QSAR) to Predict Binding Affinity

Peptide QSAR can be used to build mathematical models for the prediction of a property, such as affinity, based on the amino acid sequence alone. Models are built by transforming a sequence into peptide descriptors, which are used as independent variables for K partial-least squares model building. To build an accurate model, it must be (a) trained on a sizable set of data for which the activity is known and (b) validated on a large Test set, i.e. sequences with known activity that were not used to train the model. In this section, you will use a dataset published by Tian et al to build a model of the MHC binding affinity of antigenic peptides.

5.1 Build a peptide QSAR model

Figure 5-1. Peptide QSAR in Biologics Tasks.

  1. Go to Tasks > Biologics >  Peptide QSAR.
    • The Peptide QSAR panel opens in the Setup tab.

Figure 5-2. Loading peptide sequences.

  1. Click Load Sequences and Observables.
    • The Peptide QSAR - Load Sequences and Observables panel opens.
  2. For Load sequences from, choose CSV file.
  3. Click Browse.
  4. Choose TIAN.csv and click Open.
  5. Click OK.
    • A dataset of 153 peptide sequences is loaded into the Peptide QSAR table.

Figure 5-3. Setting Peptide descriptor type and Maximum number of PLS factors.

  1. For Peptide descriptor type, choose dpps.
  2. Click Advanced Options for PLS.
  3. Set the Maximum number of PLS factors to 10.
  4. Click OK and then click Build.
    • When the model is built, the Results tab opens.

 

Note: The more PLS factors used to build the model, the more likely the model is to be over-fitted. As a suggestion, set the maximum number to be no more than the number of sequences divided by ten.

5.2 Analyze QSAR results

Figure 5-4. Results for the peptide QSAR model.

Note: While the Pearson correlation coefficient (R^2) for the Training set is satisfactory, the corresponding statistic for the Test set (Q^2) is significantly worse.

 

  1. Next to PLS factors, click the up and down arrows.
    • The statistics don’t change much with altering the number of PLS factors.
    • This suggests that a smaller number of factors could be used to build a comparably predictive model.
  2. Click Plot.
    • Peptide QSAR Scatter Plot panel opens showing Observed vs. Predicted activity values for the test and training sets.

Figure 5-5. Scatter Plot of Observed vs. Predicted activity values for the test and training sets.

Note: While the statistics could be improved, a trend is observable in the data, with only a few large outliers. However, to increase the robustness of the model, more data is necessary. Try decreasing the percentage of the dataset allocated to Test to 25% and you will observe that while R^2 increases significantly, Q^2 suffers, suggesting that the model is overfit.

 

  1. Close the Peptide-QSAR Scatter Plot.

Figure 5-6. Changing to 25% of the dataset allocated for the test set.

  1. Go to the Setup tab.
  2. Next to Use all rows in the table. Randomly select 25% for the test set.
  3. Click Build.

Figure 5-7. Results of overfit model.

  1. Go to the Results tab.

 

Note: With a 25% test set, while R^2 has increased slightly, Q^2 is much lower. This suggests that the model is overfit.

6. Conclusion and References

In this tutorial, you performed peptide-docking, identified potential binding hot spots from the residue scanning results. You also used affinity maturation to predict combinations of mutations for which binding of a peptide-protein interaction is optimized. Finally, you built a peptide QSAR model to predict binding affinity.

7. Glossary of Terms

cognate ligand - a ligand that is bound to its protein target

Entries - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion

included - the entry is represented in the Workspace, the circle in the In column is blue

incorporated - once a job is finished, output files from the working directory are added to the project and shown in the Entries and Project Table

Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data

Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project

selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries

Working Directory - the location where files are saved

Workspace - the 3D display area in the center of the main window, where molecular structures are displayed