Covalent Docking for Virtual Screening and Pose Prediction
Tutorial Created with Software Release: 2025-3
Topics: Hit Discovery , Hit-to-Lead & Lead Optimization , Ligand Preparations and Library Design , Virtual Screening
Products Used: Glide , Prime
|
47.7 MB |
This tutorial is written for use with a 3-button mouse with a scroll wheel.
Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Abstract:
In this tutorial, you will create and validate a covalent docking model for human cathepsin L (hCatL), a cysteine protease forming a covalent thioimidate adduct with an activated nitrile group. You will learn how to enumerate a provided ligand scaffold into the S2 pocket to create a diverse ligand set for docking investigations. This system, and enumerated ligands will then be used to perform covalent docking virtual screening using a predefined chemistry pattern available in the interface.
Tutorial Content
1. What is Covalent Docking?
Distinct from the conventional design of small molecules, covalent inhibition works through forming an explicit bond to the target which can be either reversible or irreversible. Concepts such as how long the bond forms, the percentage occupancy in the active site, and specificity or off-target binding are a few of the many important factors to be considered in the design of these molecules.
Well known examples of covalent inhibitors include aspirin (marketed over 100 years ago), penicillin (discovered in 1928), and omeprazole (a blockbuster proton pump inhibitor). While they contribute to a wide range of therapeutic areas such as infectious diseases, cancer treatment, and gastrointestinal disorders, they are particularly useful for long-term therapies such as those used in CNS disorders, e.g. Parkinson's disease, due to the very nature of covalent bond formation. Covalent inhibitors make up a respectable 30% of marketed drugs to date (see this publication).
Sotorasib is an example of a covalently bound inhibitor that has recently met FDA approval. It is a once-daily oral treatment for people living with non-small cell lung cancer (NSCLC) whose tumor is KRAS(G12C) positive. The mutant cysteine of KRAS(G12C) resides adjacent to a pocket (P2) that is present in the inactive GDP-bound form of KRAS. Activating mutations in KRAS prevent the association of GTPase-activating proteins, thus stabilizing effector binding and enhancing KRAS signaling. Sotorasib inhibits this process by forming a covalent bond with the cysteine mutant of KRAS(G12C) and is able to shrink and/or suppress tumor growth in patients.
Figure 1-1: Sotorasib forming a covalent bond with the cysteine mutant of KRAS(G12C) (PDB ID: 6OIM).
2. Creating Projects and Importing Structures
At the start of the session, change the file path to your chosen Working Directorythe location that files are saved in Maestro to make file navigation easier. Each session in Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is created, the project is automatically saved each time a change is made.
Structures can be imported from the PDB directly, or from your Working Directorythe location that files are saved using File > Import Structures, and are added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion is located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data.
- Double click the Maestro icon to start Maestro.
- (No icon? See Starting Maestro)
- Go to File > Change Working Directory.
- Find your directory, and click Choose.
- Pre-generated input and results files are included for running jobs or examining output. Download the zip file here: https://www.schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/covdock.zip
- After downloading the zip file, unzip the contents in your Working Directorythe location that files are saved for ease of access throughout the tutorial.
- Go to File > Open Project.
- Select CovDock_tutorial.prjzip and click Open.
- The structures are added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- 5MQY is includedthe entry is represented in the Workspace, the circle in the In column is blue in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- In the Save scratch project dialog box, click OK.
Note: For the difference between includedthe entry is represented in the Workspace, the circle in the In column is blue and selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries , please refer to the Glossary of Terms.
- Go to File > Save Project as
- Change the File name to CovDock_tutorial.
- Click Save.
- The project is now named
CovDock_tutorial.prj.
- The project is now named
3. Protein Preparation and Ligand Enumeration
Structure files obtained from the PDB, vendors, and other sources often lack necessary information for performing modeling-related tasks. Typically, these files are missing hydrogens, partial charges, side chains, and/or whole loop regions. In order to make these structures suitable for modeling tasks, we use the Protein Preparation Workflow to resolve these issues. Similarly, ligand files can be sourced from numerous places, such as vendors or databases, often in the form of 1D or 2D structures with unstandardized chemistry. In this tutorial, a premade R-group library has been provided for enumeration of a template scaffold in the S2 binding pocket. This site was chosen because SAR knowledge in the S2 pocket is very limited for this scaffold and it is amenable to diverse chemical modifications. If you would like to learn more, the structures from this tutorial, as well as the enumeration investigation of the S2 pocket originated from the publication: “Prospective Evaluation of Free Energy Calculations for the Prioritization of Cathepsin L Inhibitors” (J. Med. Chem. 2017, 60(6): 2485-2497). After enumeration, LigPrep can convert ligand files to 3D structures, with the chemistry properly standardized and extrapolated, ready for use in virtual screening. Ligands to be used for covalent docking must first be prepared using LigPrep.
In this tutorial, 5MQY protein structure has already been prepared and included in the tutorial files in order to save time. During preparation, crystallographic artifacts were removed, "Cap termini" and "Fill in missing loops (Using Prime)" were checked under Preprocess. Default settings were used for all other steps. The ligand structures have also been prepared using LigPrep. These preparation steps are a necessary part of covalent docking. Please see the Introduction to Structure Preparation and Visualization tutorial for instructions on using the Protein Preparation Workflow and LigPrep.
- Includethe entry is represented in the Workspace, the circle in the In column is blue and select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries 5MQY – prepared.
The 2D Ligand Interaction diagram will help you visualize the covalent thioimidate adduct with the catalytic CYS 25 residue.
- Go to Tasks > Structure Analysis > Ligand Interaction Diagram.
Note: In this tutorial, you will focus on the S2 pocket for enumeration.
- Close the Ligand Interaction Diagram.
- Double-click Presets.
- The default Custom Preset is applied to the structure.
- The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed zooms to the ligand.
- Turn-on the Interactions Toggle (bottom right corner).
- The ligand-receptor interactions are shown as colored dashed lines in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- Rotate the structure and try visualizing the interaction between CYS 25 residue and the ligand.
Differentiating the protein from the ligand in covalent complexes can be challenging. Sometimes, it may be necessary to break bonds and/or rename the species prior to docking. For this tutorial, renaming was not required as an independent ligand file was used. To rename the ligand, go to Build > Other Edits > Change Atom Properties > Residue Name.
- Right click 5MQY – prepared.
- Choose Split > Into Ligands, Water, Other.
- A new group with three entries is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- Ctrl+Click (Cmd+Click) to includethe entry is represented in the Workspace, the circle in the In column is blue 5MQY – prepared_ligand and 5MQY – prepared_ligand_modified in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
Note: 5MQY – prepared_ligand_modified is the ligand isolated from the 5MQY crystal structure. It has been prepared using the LigPrep panel at pH 7.4. The alkene has been modified to an alkyne to prepare for covalent docking using the appropriate reaction. This is done to prepare the ligand for pose prediction docking. This crystal ligand will be used to test the model. This step is not required for Covalent Docking receptor grid generation.
See Preparing Structures for Covalent Docking for more information.
- Click the Show Workspace Configuration panel icon (plus symbol at the bottom right corner).
- Choose Tile.
- The structures are tiled in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- Go to Tasks > Structure Alignment > Ligand Alignment.
- The Ligand Alignment panel opens.
- For Align ligands from, choose Workspace (2 included entries).
- For Reference, choose User-specified.
- Choose the reference as 3:5MQY – prepared_ligand.
- Check the box for Overwrite existing entries.
- Change the Job name to align_ligands_5MQY.
- Click Run.
- This job takes a few seconds.
- Close the Ligand Alignment panel.
- In the Workspace configuration panel, click Tile by to un-tile the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
4. Validating the Covalent Docking Protocol Using the Cocrystal Ligand
This section demonstrates how to set up a covalent docking job in virtual screening and pose prediction modes. The system chosen is a crystal structure of human cathepsin L (hCatL), a cysteine protease complexed with an inhibitor, (PDB ID: 5MQY) and this inhibitor is self-docked to the receptor by means of a covalent thioimidate adduct bond formation with the activated nitrile group.
Virtual Screening mode provides a fast and accurate way to predict poses for covalent protein-ligand complexes, while Pose Prediction mode is a more thorough method of performing covalent docking calculations. In this tutorial, Pose Prediction mode will be used to compare the docking model’s result with the docked ligand in the original crystal structure.
4.1 Covalent docking in pose prediction mode
You will now set up the Pose Prediction docking job using the Covalent Docking panel.
- Includethe entry is represented in the Workspace, the circle in the In column is blue 5MQY – prepared in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries 5MQY_prepared_ligand_modified in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Note: Any ligand could be selected here, but this pose prediction calculation is intended to test the model and compare the results with that of the initial crystal structure.
- Go to Tasks > Browse > Glide > Covalent Docking.
- The Covalent Docking panel opens in the Receptor tab.
- Under Use ligands from, choose Project Table (1 selected entry).
- Under Reactive Residue, check the box for Pick and click on the CYS 25 residue in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- The field for the reactive residue is updated in the panel to read A:25.
- Under Centroid of Workspace ligand check the box for Pick and select the ligand in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- The field is updated with A:301.
Note: The Centroid of Workspace option uses the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed ligand to define the centroid of the active site for docking.
Figure 4-3. Receptor grid generation view in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
Note: Your Workspacethe 3D display area in the center of the main window, where molecular structures are displayed should look like this. The CYS 25 residue is highlighted. The ribbons are hidden ribbons to declutter the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- Go to the Reaction Type tab.
- For Reaction Type, choose Nucleophilic Addition to a Triple Bond.
Note: For Reaction types not available in this option list, see the custom reaction repository here: CovDock.
Note: This is an example of a similar ligand complex forming the same covalent thioimidate adduct with the Cys 25 residue. This will be the template backbone structure later used for virtual screening, altering the substituent in the S2 pocket.
- Go to the Docking tab.
- For the Docking Mode select Pose Prediction (Thorough).
- Set Output poses per ligand reaction site to 1.
- Change Job name to 5MQY_pose_prediction.
- Click Run.
- This job takes ~20 minutes.
- To save time, you may import the pre-generated results by going to File > Import Structures and select
5MQY_pose_prediction-out.maegzfrom the 5MQY_pose_prediction folder in your Working Directorythe location that files are saved.
- Close the Covalent Docking panel.
4.2 Analyze the results from pose prediction covalent docking
Figure 4-7. A view of the CYS 25 crystal structure pose (gray) and the best Prime predicted CovDock pose (green), included in the Workspace. The covalent bond to CYS 25 is shown in the orange box.
- Ctrl+Click (Cmd+Click) to includethe entry is represented in the Workspace, the circle in the In column is blue 5MQY – prepared and 5MQY – prepared_ligand_modified from the 5MQY_pose_prediction-out group.
Note: If you had chosen to generate more poses, they would all be present in this group and ranked according to their Prime Energies.
- Double-click Presets.
- Confirm the covalent bond present in your pose result aligns well with the prepared original cognate ligand.
- Click the Change table settings icon (three vertical dots) and choose Show property.
- The Show Properties in Table dialog box opens.
- Search and select cdock affinity and score docking score from the list.
- Click OK.
- The selected properties are added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion table.
- You may need to click and drag the edge of the table towards the right to see the added properties.
Note: See here for more information about the properties associated with Covalent Docking outputs.
5. Covalently Docking an Enumerated Series into the Protein
5.1 Enumerate the S2 pocket
You will now set up the Virtual Screening docking job using the Covalent Docking panel.
- Go to File > Import Structures.
- Navigate to and select
Template_scaffold.maein your Working Directorythe location that files are saved. - Includethe entry is represented in the Workspace, the circle in the In column is blue and select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries Template_Scaffold in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the carbon adjacent to the nitrogen in the S2 pocket.
- Go to Tasks > Browse > Enumeration and Ideation > R-Group Enumeration.
- The Custom R-Group Enumeration panel opens.
This tutorial uses a pre-generated ligand library, however, you can import any fragment library from a file or generate one using the R-Group Creator panel. Additionally, Maestro includes several pre-packaged libraries. For more information about ligand enumeration, please refer to the Enumeration Tools for Library Design tutorial.
- For R-Group Library, choose File.
- A dialog box opens to select a collection fragment file to use.
- Navigate to and select
CovDock_Tutorial_Ligands_for_S2_Pocket.maefrom the S2_pocket_enumeration folder in your Working Directorythe location that files are saved. - Click Open.
- The R-Group Library column gets updated.
- Change the Job name to S2_pocket_enumeration.
- Click Run.
- This job takes a few seconds.
- A new group S2_pocket_enumeration with 35 ligands is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- Close the Custom R-Group Enumeration panel.
- Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the S2_pocket_enumeration group by clicking on the group heading.
- Go to Tasks > Browse > Ligand Preparation and Library Design > LigPrep (3D Conversion).
- The LigPrep panel opens.
- For Use structures from, choose Project Table (35 selected).
- Under Stereoisomers, for Computation, choose Determine chiralities from 3D structure.
- Change Job name to ligprep_enumeration.
- Click Run.
- The job takes ~1 minute.
- A banner appears and a new group ligprep_enumeration-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- Close the LigPrep panel.
5.2 Covalently dock the enumerated series
- Includethe entry is represented in the Workspace, the circle in the In column is blue 5MQY – prepared in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
- Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the ligprep_enumeration-out group by clicking on the group heading.
- Go to Tasks > Browse > Glide > Covalent Docking.
- The Covalent Docking panel opens.
- For Use ligands from, choose Project Table (37 selected entries).
Note: All selections in the Receptor and Reaction Type tabs will remain the same as those set in the Pose Prediction section. Confirm these settings are the same as in Section 4.1.
- In the Docking tab, for Docking Mode select Virtual Screening (Fast).
- Change the Job name to 5MQY_Virtual_Screening_S2_Pocket.
- Click Run.
- This job takes ~1.5 hours to complete.
- A banner appears and a new group 5MQY_Virtual_Screening_S2_Pocket-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Note: To save time, you can use the pre-generated results. To import the pre-generated results, go to File > Import Structures and choose 5MQY_Virtual_Screening_S2_Pocket-out.maegz from the 5MQY_Virtual_Screening_S2_pocket_enumeration folder in your Working Directorythe location that files are saved.
- Close the Covalent Docking panel.
5.3 Analyze the results from virtual screening covalent docking
- Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the 5MQY_Virtual_Screening_S2_Pocket-out group by clicking on the group heading.
- 38 docked poses were found, and can be viewed here.
- Go to Tasks > Glide > Pose Viewer.
- The Pose Viewer panel opens.
- Click Set Up Poses.
- The first entry in the 5MQY_Virtual_Screening_S2_Pocket-out group is fixed in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
-
Use the left and right arrow keys to step through the poses and compare them.
- For each pose, a covalent bond successfully formed to CYS 25.
- Notice the covalent bond attachment point for the new scaffold is slightly different from that of the crystal structure.
- Close the Pose Viewer panel.
Optional: For comparison, you can fix the 5MQY – prepared entry (crystal structure) in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed when looking at the poses of this new scaffold enumeration. Choose the visualization that is most helpful to you. For example, we have colored the new scaffold enumeration poses in pink, while the 5MQY crystal structure ligand is colored in gray.
Note: For information about virtual screening analysis please see the Structure-Based Virtual Screening using Glide tutorial.
6. Conclusions and References
In this tutorial, you learned about covalent docking modeling preparation and analysis. You further learned how to set up and analyze a Pose Prediction covalent docking job, as well as a Virtual Screening docking job with enumerated substituents using the Covalent Docking application. You also were able to visualize and analyze the docked output of both jobs.
For further reading:
- Covalent Docking User Manual
- Publications and Website link
- Virtual Screening Toledo Warshaviak, D.; et al. A Structure-Based Virtual Screening Approach for Discovery of Covalently Bound Ligands, 2014, J. Chem. Inf. Model., 54(7):1941-50
- Pose Prediction and Scoring Zhu, K.; et al. Docking Covalent Inhibitors: A Parameter Free Approach to Pose Prediction and Scoring, 2014, J. Chem. Inf. Model., 54(7):1932-40
- Case Study Bernd Kuhn.; et al. Prospective Evaluation of Free Energy Calculations for the Prioritization of Cathepsin L Inhibitors, 2017, J. Med. Chem., 60(6): 2485-2497
7. Glossary of Terms
Entries - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
included - the entry is represented in the Workspace, the circle in the In column is blue
incorporated - once a job is finished, output files from the Working directory are added to the project and shown in the Entries and Project Table
Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data
Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project
selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries
Working Directory - the location that files are saved
Workspace - the 3D display area in the center of the main window, where molecular structures are displayed