BACE1 Inhibitor Design Using Free Energy Perturbation

Tutorial Created with Software Release: 2024-3

Topics: Free Energy Perturbation (FEP), Hit-to-Lead & Lead Optimization, Ligand Preparations and Library Design, Small Molecule Drug Discovery

Products Used: FEP+

Tutorial files

97 MB

This tutorial is written for use with a 3-button mouse with a scroll wheel.

Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed

Tip: You can hover over a glossary term to display its definition. You can click on an image to expand it in the page.

Abstract:

This tutorial shows you how to prepare, run, and analyze a free energy perturbation (FEP) simulation for a series of BACE1 inhibitors using FEP+. FEP is a computational method that computes an estimate of the free energy difference to perturb from one compound to another, in a given environment. In this tutorial, we will be computing relative binding affinities between a set of ligands.

Tutorial Content

Introduction

Creating Projects and Importing Structures

Preparing Protein and Ligand Files

Setting up FEP+ Jobs

Assessing the Quality of the Map and Identifying Potential Issues

Extracting SAR Insights from the FEP+ Results

Conclusion and References

Glossary of Terms

1. Introduction

As a consequence of increasing compute power, rigorous approaches to calculate protein–ligand binding affinities, such as free-energy perturbation (FEP) methods, are becoming an essential part of the computer-aided drug design landscape. FEP is a computational method that computes an estimate of the free energy difference to perturb from one compound to another, in a given environment. This framework can be applied to many different problems in computational chemistry, including binding affinity, stability, solubility, or selectivity predictions. See the links here or in the Further Reading section at the end of this tutorial for a high-level overview of the concepts behind FEP calculations, the value FEP predictions bring to a project, and the different applications/variations of the method.

In this tutorial, you will compute relative binding affinities between a set of small molecule ligands for BACE1 using relative-binding FEP. You will use ligand poses obtained using the FEP+ Pose Builder to set up the calculation. You will then look through the technical validation of the FEP+ results to identify potential issues with the map and finally glean some insights into the SAR of the system.

2. Creating Projects and Importing Structures

Figure 2-1. Importing the input data for this tutorial.

Open Maestro and create a new project named FEP_BACE1.prj for this tutorial.
- Don’t know how? See First steps in Maestro.
Download the tutorial zip file including input files and reference outputs here: https://www.schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/fep_bace1.zip
After downloading the zip file, unzip the contents in your Working Directorythe location that files are saved for ease of access throughout the tutorial.
Go to File > Import Structures.
Find and choose FEP_BACE1_tutorial.maegz from the tutorial files.

3. Preparing Protein and Ligand Files

Structure files obtained from the PDB, vendors, and other sources often lack necessary information for performing modeling-related tasks. Typically, these files are missing hydrogens, partial charges, side chains, and/or whole loop regions. In order to make these structures suitable for modeling tasks, we use the Protein Preparation Workflow to resolve issues. Similarly, ligand files can be sourced from numerous places, such as vendors or databases, often in the form of 1D or 2D structures with unstandardized chemistry. LigPrep can convert ligand files to 3D structures, with the chemistry properly standardized and extrapolated, ready for use in virtual screening.

In this tutorial, the protein has already been prepared. However, these preparation steps are a necessary part of a virtual screen and must be done before any FEP+ calculations. Please see the Introduction to Structure Preparation and Visualization tutorial for instructions on using the Protein Preparation Workflow and Preparing Protein and Ligand Structures for FEP+ for tips on structure preparation for FEP+. If you want to run FEP+ on a membrane protein, see the Membrane-Bound FEP+ with A2A tutorial for guidance on working with membrane systems.

After preparation, the ligands need to be aligned to the reference ligand(s) as closely as possible. This step is key for obtaining accurate results from an FEP+ simulation. There is no universally optimal protocol for determining the best starting poses for FEP+, as differences in ligand chemistry and pocket geometry between systems bring distinct challenges. The Pose Generation for FEP+ documentation page contains a comprehensive overview of available alternative methods for ligand alignment.

We recommend starting by using the new FEP+ Pose Builder (Beta), as it has outperformed other methods on benchmarks, and incorporates many of the best practices shown in this tutorial that would otherwise have to be done manually. See the Ligand Binding Pose Generation for FEP+ tutorial for a hands-on tutorial for obtaining the ligand alignments used in this tutorial using the FEP+ Pose Builder.

Further information can be found in the FEP+ User Manual, where you can find guidance on how to evaluate your project for suitability for FEP+ (Preparing for FEP+ Jobs) and a condensed checklist for FEP+ calculation setup and analysis including tips (FEP+ Checklist).

In this section, you will review the receptor structure and ligand alignments which you will use in the following sections of this tutorial.

While the receptor for this tutorial has already been prepared, we want to highlight that the choice of receptor structure can affect the results of the FEP+ calculation, as the conformation of the protein, in particular small-scale rearrangements in the binding pocket, can require improved sampling.

Figure 3-1. Tyr132 in the binding pocket.

We have provided two structures of the BACE1 protein to highlight the impact of receptor choice.

Ctrl-click (Cmd-click) to includethe entry is represented in the Workspace, the circle in the In column is blue 4DJV_4b_prepared and 4DJX_17h_prepared
Type L
- Tyr132 has a different orientation between 4DJV and 4DJX
- The binding pocket is more open with 4DJV
- The phenyl group of the 4DJV ligand clashes with the 4DJX Tyr132 sidechain

If you observe similar binding site flexibility when preparing to apply FEP+ for your own system, you should add the respective residues to the set of Hot Atoms to include them in the enhanced sampling region.

In this tutorial, it is sufficient to simply use the structure with the more open binding pocket (4DJV) for the receptor in the FEP+ calculation to make sure that ligands with larger R-groups off the guanidine fit more easily during the 5 ns simulation.

Figure 3-2. Inspect ligand alignments.

Next, let’s look at the alignments for the set of ligands used for this tutorial.

Double-click the In circle to fix the 4DJV_4b_prepared receptor in the workspace.
Select the FepPoseBuilder_BACE1_subset group in the Entries.
Use the left and right arrow keys to step through the ligands and inspect their poses.

Note: You can either use the poses provided with the tutorial files for this tutorial or your own results from the Ligand Binding Pose Generation for FEP+ tutorial. Not all ligands from that data set are used in this tutorial to keep the computational costs of the FEP+ job reasonably low.

These ligand alignments closely match the reference ligand from this structure. It is particularly important to align atoms which are part of the non-perturbed core as closely as possible. See the Ligand Binding Pose Generation for FEP+ tutorial and the Pose Generation for FEP+ documentation page for more details.

Figure 3-3. Duplicate 4DJV_4b_prepared into an Existing Group.

Finally, you’ll need to create a copy of the receptor structure minus the ligand.

In the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion, right-click 4DJV_4b_prepared
Choose Duplicate > In Place
- A duplicated entry is added to this group
Double-click the new entry and rename it 4DJV_FEP
Includethe entry is represented in the Workspace, the circle in the In column is blue 4DJV_FEP

Figure 3-4. Select and delete the ligand from 4DJV_FEP.

Now, select and delete the ligand from the structure.

Type L
- The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed is zoomed to the ligand
Double-click a ligand atom
- The whole ligand is selected
Right-click and choose Delete Atoms

4. Setting up FEP+ Jobs

First, you will generate a perturbation map, connecting ligands that are most similar. You will bias the map to favor connections to two known ligands, the ligand from the receptor structure and a ligand with a moderate binding affinity. After generating the map, you will inspect the similarity scores of the edges and adjust edges as needed. Finally, you can either launch the job on your compute hardware or use the pre-generated results for further analysis.

Figure 4-1. FEP+ in Tasks.

Find FEP+ in the Tasks and open it.

Figure 4-2. Import structures from the Project Table.

Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries both the 4DJV_FEP entry and the FepPoseBuilder_BACE1_subset group.
For Import structures or perturbation map from, choose Project Table (14 selected entries)
Click Import
- Health checks for the inputs are performed (see below)
When the import is finished, click Relative

When you load structures into the FEP+ panel, health checks are automatically run in the background for both the receptor and the ligand. If no issues are found, you will see a green checkmark or OK.

If there are issues, you can hover over the yellow warning signs to see more information.

Figure 4-3. The protein reliability results for this receptor.

Optional: For the receptor, clicking on the warning will open the Protein Reliability Report showing various quality metrics.

Click the warning sign for the receptor
Click the Peptide Planarity heading to see the issues.
Click the rows to see where the offending residues are located.

The only issues with this receptor are a few minor deviations from peptide planarity outside the binding site, which you can safely ignore.

Figure 4-4. A glance at the Force Field builder

Optional: If you’re working with your own set of ligands, some torsions may be incompletely parametrized in the OPLS force field, causing the yellow warning sign.

In that case, you should run Force Field Builder on the ligands (either manually or as part of the FEP+ job) before proceeding with the calculation.

None of the ligands used in this tutorial should be missing torsion parameters.

In order to run Relative Binding FEP+, experimental binding affinity data for at least one ligand is required. In this tutorial, we have experimental data for all ligands in this series and we can use this to assess the accuracy of the FEP+ method.

Figure 4-5. Add Affinity Data.

In the FEP+ panel, click Affinity and choose Experimental Data
- The Choose Affinity Property panel opens

Figure 4-6. Choose dG. exp (user) as the Affinity Property.

Choose dG.exp (user)
Next to Affinity units, choose ΔG (kcal/mol)
Click OK

Note: When working on your own ligand set, you can either add the reference values as properties to the ligand entries, import them from a CSV file using this panel, or add them manually in the FEP+ panel by double-clicking the cell in the Exp. Affinity column for each ligand.

Next, you have to generate the FEP+ Map, i.e. define between which ligand pairs perturbations should be performed. The automatic map generation supports two topology types which prioritize either accuracy (Optimal topology) or throughput (Star topology) and are useful in different project stages:

The optimal map is constructed by determining the maximum common substructure between the ligands, considering their 3D alignment, and calculating their similarity. Connections are made between structures with high similarity.

The star topology is helpful when you want to maximize throughput (e.g. for triaging ideas from enumeration) by only calculating the minimum number of edges to connect all ligands to a single reference ligand. This topology does not allow for applying the cycle closure correction, so frequently double star maps (with two central ligands) are used as a compromise between throughput and accuracy.

Figure 4-7. Examples of different map topologies. For the star and double star topologies, the biased nodes are placed in the center.

For obtaining robust and accurate FEP+ models outside a high-throughput context, we recommend starting from the optimal topology and, if necessary, adding or adjusting edges to obtain good connectivity with sufficient ligand similarity.

When similarity is insufficient (<0.05) to directly draw an edge between two ligands in your data set, you may want to introduce intermediate virtual ligands to make convergence easier.

Figure 4-8. Generate Map.

We will choose bias ligands based on the protein structure used and known affinity: CAT-4b is the cognate ligand for the receptor structure we are using, and CAT-17h has a binding affinity in the middle of the range of known binders. Biasing to the best known binder may introduce a selection bias, distorting the statistics.

Click the Map tab
In the Bias Column, choose CAT-4b and CAT-17h
Click Generate Map
- The Map Options panel opens

Figure 4-9. Map topology options.

We will start from an optimal map and add further edges to allow more leeway in identifying and excluding problematic ligands and perturbations.

Next to Map topology type, choose Optimal
Click Generate Map
- This job will take less than one minute

Note: Zoom and translate the map with the same mouse actions as the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. Click and drag to move the ligands in the map.

Figure 4-10. Show similarity scores and view the atom properties of the ligands.

You should make sure that edges are only drawn between sufficiently similar ligands.

Click Display perturbation properties
Choose Similarity scores
- Scores are shown along the map edges
- Darker edges indicate higher similarity

Note: You can also view the FEP+ protocol (default, charge perturbation, core-hopping, or fragment linking) for each edge. The appropriate protocol is chosen automatically when the edge is drawn. See here for more information.

If a similarity score is below 0.05, consider adding an intermediate compound or deleting the edge by right-clicking and choosing delete.

Due to differences in operating system and software version, the generated map you obtained may look slightly different from what is shown in the following screenshots. This should not substantially affect your results.

Figure 4-11. Add edges to the map.

You can now manually add more edges to the map.

Click Add new connection (on the right-hand side below the map)
Click CAT-13f and CAT-13o
- An edge is added
- The similarity score is shown
Repeat steps 29 and 30 to ensure edges exist between the following ligands:
- CAT-4i and CAT-4d,
- CAT-13i and CAT-17e,
- CAT-13d and CAT-13b,
- CAT-17f and CAT-17e,
- CAT-4c and CAT-4d.
Optional: Click the Auto-layout button (next to Add new connection) to make the map more legible.

Note: The “_1” and “_2” suffixes for some ligands denote which of the available Epik states of that ligand was kept by the filtering stage of the FEP+ Pose Builder. As we are only using one state per ligand, we will ignore these suffixes for the rest of this tutorial when referring to specific ligands.

If you wish to exactly reproduce the map used to generate the reference output files, you can use this image to manually draw the appropriate connections:

Figure 4-12. The map used to generate the output files in this tutorial.

Figure 4-13. View the hot atoms of an edge in Atom Properties.

You can select an individual edge to see additional information about the perturbation.

Click on an edge
- Atom Properties of the two compounds are shown
Choose Hot atoms (complex leg)
- The REST regions of the ligands are highlighted.

Note: Click an empty area in the map to clear edge selection

Figure 4-14. View the FEP+ map info.

Optional: Click on Info in the bottom right to get a quick look at the properties of information in the map (receptor health, ligand health, similarity score, etc.). We will ignore the warning about missing the FMPdb since this map has not run.

Figure 4-15. Rearrange the map layout and change the Job name.

You can now name your job and specify which computational resources to use.

Change the Job name to fep_BACE1_4DJV
Click Run Settings (cog)

Figure 4-16. Set hosts (top) and write the FEP+ job (bottom).

Choose your CPU Host and GPU Host

Ensure Maximum simultaneous subjobs is set to 0. This removes the limit on the number of subjobs, so they are all submitted to the subjob host queue. If you do not have license checking enabled, set the number of subjobs to ensure that you do not exhaust your licenses.

You can now either run the job or write out the job files to manually transfer them to a remote HPC.

Optional: Click Run.
- This job requires several hours and significant GPU resources to run.
- Instead of running the job yourself, we recommend continuing with the results in the tutorial files.

5. Assessing the quality of the map and identifying potential issues

The first stage of analyzing the results of an FEP+ map is to assess the quality and reliability of the simulation on a technical level. This will include checking for various accuracy, consistency, and convergence metrics on the reference data set and potential troubleshooting.

Figure 5-1. FEP+ in Tasks.

Find and open FEP+ in Tasks.

Figure 5-2. Import perturbation map.

For Import structures or perturbation map from, choose File
Click Browse
Choose fep_BACE1_4DJV_out.fmp and click Open
When the import is finished, click Next
- The tables and map are populated

Please note that calculations may have been performed with an earlier version of the software and the results may not be exactly the same as those you produced if you ran the job yourself. Especially for methods with a statistical component, such as MD simulations, an exact reproduction of a trajectory and its properties is only possible if the same random seed, software and hardware configuration are used. With sufficient sampling, all properties of the ensemble should be reproducible, while the properties of individual frames and the exact course of a trajectory may differ.

Figure 5-3. Rearrange the data.

For a first overview of the results, we recommend checking the range of predicted binding affinity errors.

Right-click the Pred. Binding(∆G) column header and choose Sort by Error
Left-click Pred. Binding (∆G)
- The table is reordered with the highest error values (in kcal/mol) at the top.

The error values for the predicted ∆G are values are the error bars of the FEP+ calculation, not the error of the prediction w.r.t. the experimental reference. Predicted errors of <0.5 kcal/mol are good, values >1 kcal/mol indicate that there are some inconsistencies in the map. In this case, the ligand-wise predicted ∆G errors all are well below 0.5 kcal/mol.

Click Plot
- The Correlation Plot (FEP+) panel opens

This scatter plot is the best way to get an intuition for how the model performs at a glance. The dark gray error band highlights ligands with error less than 1 kcal/mol from the correlation line. The light gray error band highlights ligands with error between 1-2 kcal/mol.

Figure 5-4. Correlation plot of FEP+ results.

Any outliers in this plot are ligands you should keep in mind while performing further analysis steps.

Hover over the worst outlier data point
- A tooltip with ligand information appears
Optional: Click Save As to save a .png of the Correlation Plot to your Working Directorythe location that files are saved.

The main metric you should check from the list on the right hand side of the plot is the two RMSE values. The edgewise RMSE is the RMS error calculated on the ∆∆G values associated with all directly connected edges in the graph. The pairwise RMSE is the RMS error for the ∆∆G of all ligand pairs in the map (regardless of connections).

You should only proceed to using an FEP+ model prospectively if the RMSE is lower than 1.3 kcal/mol on your validation data set (and there are no signs of convergence problems, see below).

Click the Question mark at the bottom right corner of the panel or see here for a detailed description of the other calculated metrics.

Figure 5-5. Analyze edges.

In our case, the edgewise RMSE is at 1.16, but the pairwise RMSE is at 1.55, so we’ll want to investigate further. You can now go back to the map to check on the performance of the individual edges for potential convergence issues.

Close the correlation plot.
Switch to the Map tab in the FEP+ panel.
Click Display perturbation properties
Choose Predicted ddG (corrected) and Experimental ddG.

Note: You can zoom in and out of the map with your scroll wheel. Depending on the zoom level, ligands will either appear as structures or numbers.

You can now see how the ∆G of binding for each ligand breaks down to edge-wise ∆∆Gs between pairs of ligands. The difference between the raw and corrected predicted ∆∆Gs is the application of the cycle closure correction. The Hysteresis tab allows you to dig deeper into which cycles of the graph might be causing issues.

Figure 5-6. Examine the hysteresis.

Click the Hysteresis tab
Click the Hysteresis column header twice
- The table is reordered with the bad hysteresis cycles at the top
Click through the cycles
- The selected cycle is highlighted in the map.

The hysteresis value for each cycle is colored according to how well the energy along the cycle has converged, with the threshold depending on the length N of the loop. Less than 0.5√N is good (green), between 0.5√N and 0.8√N is acceptable (yellow), and greater than 0.8√N is poor (red).

The main causes of bad hysteresis values are:

Bad edges: ligand similarity was too low or the simulation was too short.
Misbehaving ligands: issues with ligand alignment, bad protonation states, or binding mode switches.

We’ll dig deeper to find out what is causing the bad hysteresis loops in our case.

Figure 5-7. Optional: Display bad perturbations.

First, we can show perturbations which are involved in cycles with bad hysteresis in the map.

Click Display perturbation properties and choose Bad perturbations.
- Edges with poor convergence are colored and labeled.

Note: See Identifying Unconverged Edges in FEP+ Jobs for details on when an edge is labeled a bad perturbation.

Figure 5-8. Analyze the edge.

You can analyze individual edges to see what might be causing poor convergence.

Right-click the bad perturbation edge connecting ligands 12 (CAT-13o_2) and 9 (CAT-13f_1).
Choose Analyze.
- The Analysis: CAT-13o_2 – CAT-13f_1 panel opens.

Figure 5-9. Edge analysis summary

The initial summary shows you the two ligands and allows you to highlight e.g. the perturbed atoms, the common core, the REST region, the atom-wise RMSF, and the atom mapping between the two ligands. This can help you understand which part of the ligand might be causing issues.

Note: You can generate a PDF report of the analyses available in this panel from the button in the lower right corner. The plots in the report contain additional notes explaining what is shown and providing guidelines for interpretation. You can generate the report yourself or see the FEP_SID_report_cat13o_2-cat13f_1.pdf file in the tutorial files for an example.

Figure 5-10. Analysis of the torsional sampling.

You can now check whether the torsions in the ligands were well-sampled during the simulation, as this can hamper convergence.

In the Analysis panel, click the Ligand Details tab.
Click the Rotatable Bonds tab.
Hover over the torsion sampling graphs to highlight the atoms involved in the corresponding torsion.
- The sampling of the green and purple torsions for the CAT-13f ligand seems to be limited in both the solvent (hollow bars) and complex (filled bars) legs.

Note: The gray bars indicate the initial angle of the rotatable bond in the input structure. The strain in kcal/mol is reported below the plot for the complex and solvent legs. See more details here.

For free torsions with a low barrier height, you should see all minima be sampled during the simulation. If some torsions are not adequately sampled, you should consider adding more atoms to the REST region by using a custom core, extending the simulation time, or enhancing the sampling of dihedrals. Asymmetrically substituted ring systems or peptide bonds are common cases which can profit from increasing the sampling.

Next, you can check the overall convergence for this edge. The total free energy differences between the two ligands (∆G in kcal/mol) in solvent and complex legs are plotted as a function of time — forward, reverse, and sliding window. The tables report the associated bootstrap and analytical error estimates from the corresponding simulation legs.

The forward and reverse simulation time plots should approach a constant value, while the sliding time plot should show an overall flat trend (but may fluctuate).

Figure 5-11. Analysis of the convergence.

Click the Convergence tab
- This perturbation may benefit from a longer simulation time in both the solvent and complex legs, as both the Reverse and Sliding Time do not appear stable.

Note: The y-axis of the convergence plots will adjust to fit the scale. Pay attention to the y-axis values, as good convergence may appear poor if the scale is very small and vice versa.

You can also check whether the different lambda windows (corresponding to alchemical intermediates between the two ligands) are sampled by each replica. Ideally each replica will sample all lambda windows uniformly; however non-uniform sampling is sufficient in most instances.

Figure 5-12. Analysis of the REST Sampling.

You can check the sampling for both the solvent and complex legs.

Click the REST Sampling tab
Click Complex.
- While the sampling is mostly uniform for the solvent leg, in the complex it is not uniform.
Close the Analysis panel

Individual bad perturbations, caused by low ligand similarity, bad alignments, or insufficient sampling, can affect the results for ligands not involved in the perturbation via the cycle closure correction. Similarly, individual ligands can also be problematic for the same reasons.

You should try to understand where these issues are coming from before using the map prospectively, and then may wish to remove problematic edges and nodes from the map.

Figure 5-13. Delete the edge.

Let’s delete both bad perturbations from the map.

In the FEP+ panel, once again right-click the edge connecting ligands 9 (CAT-13f_1) and 12 (CAT-13o_2).
Choose Delete.
Also delete the edge connecting ligands 10 (CAT-13d) and 11 (CAT-13b_1).
Optional: Check the Hysteresis table again.
- The two perturbations you removed were responsible for all of the bad hysteresis cycles.

Note: You can undo the removal of an edge or a node using the controls below the map.

Figure 5-14. The Correlation Plot is improved.

Deleting bad perturbations often reduces the error bars on the predicted ∆Gs.

Click Plot
- The error bars of the data points are reduced.
Close the Correlation Plot

Note that there may be other convergence issues that do not manifest as bad hysteresis — for example, if all perturbations in a given cycle fail to sample the same sterically hindered torsion, those errors will cancel out and not be visible as bad hysteresis for the cycle.

The ligand CAT-13o, which was involved in the bad perturbation you just analyzed, seems to be the most significant outlier in the map. In a real project, you would investigate to identify the cause of the issues before using the map prospectively. See Investigating Large Outliers in FEP Simulations and Troubleshooting Common Issues for more detailed discussions of identifying and resolving issues in FEP+ maps.

Here, we will provide some general guidance for additional troubleshooting.

Figure 5-15. Reorder by predicted FEP.

The Analysis tab of the FEP+ panel provides an overview of the relevant quality metrics for the individual edges.

Click the Analysis tab
Click the FEP column twice
- The table is reordered to show the predicted Bennett Free energy difference (ΔΔG) in descending order.
- Errors for predictions are also shown.

Note: If the error for an edge in the FEP column is >0.3 kcal/mol, this is a sign that there is still an inconsistency in the map. Please see Troubleshooting Common Issues for more information.

Figure 5-16. Hover over descriptions to see more information.

The energy convergence, ligand RMSD, REST exchange, and cycle closure correction convergence columns provide another possibility to identify issues for individual edges.

Hover over the Good, Fair, and Bad descriptions to get more information about the individual metrics.

Note: The image on the left shows a representative example from a different FEP+ job.

In particular in cases where you observe bad ligand RMSD or convergence issues, it can be valuable to inspect the trajectories for the individual edges.

Keep an eye out for binding mode switches of individual ligands, or inconsistent hydration patterns in the binding pocket between different simulations.

6. Extracting SAR insights from the FEP+ results

Once the more technical analysis steps have been performed, you can use the results of the FEP+ simulation to gain insights about the SAR and inspiration for new design ideas. The following section highlights some of the relevant functionality for this purpose.

By checking how similarity to a given ligand correlates to potency, you can identify activity cliffs — small changes in structure that drastically affect binding affinity and provide insights into the SAR.

Figure 6-1. Activity Cliffs tab of the FEP+ panel showing positive correlation.

First, we’ll see how this looks for a ligand where there’s a positive correlation between similarity and potency.

In the FEP+ panel, switch to the Activity Cliffs tab.
Set the reference ligand to CAT-17f
- The data points cluster towards the lower right – the more similar a ligand is to this reference, the more potent it is.
Click on the data point for CAT-24 (see figure)
- The chosen ligand is highlighted in the table.
- The small perturbation from the nitrile to the alkyne leads to a large increase in affinity.

Figure 6-2. Activity Cliffs tab of the FEP+ panel showing negative correlation.

You can compare this to a ligand where the correlation is inverted (and less-pronounced).

Set the reference ligand to CAT-4d.
- Now, the ligands that are most similar to the reference do not show favorable potency changes.
Click on the data point for CAT-4c (see figure)
- Moving the substituent on the last phenyl ring to the para position makes the binding affinity worse by ~2 kcal/mol.

You can also use the trajectories from the FEP+ simulation to help you understand what causes these large activity differences.

Figure 6-3. Display the Complex Trajectory.

Let’s check the complex leg trajectory of the edge connecting CAT-4c and CAT-4d.

In the Complex Trajectory row for the CAT-4c <--> CAT-d edge, click 5.0 ns
- A new group with entries for each ligand are added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion

Note: If there were no edge between this pair of ligands in the map, you could also use the trajectories from other edges containing either CAT-4c or CAT-4d.

Figure 6-4. Load and play the trajectory.

Click the T associated with the CAT-4d entry and choose Load Trajectory.
Click Play in the Trajectory Player to play the trajectory.
Repeat the previous two steps with the CAT-4c trajectory.

See the Introduction to MD Trajectory Analysis with Desmond tutorial for an overview of the analysis tools available in the trajectory player.

Comparing the two trajectories, you can see that there is a small pocket which is efficiently filled by the longer and differently positioned CAT-4d substituent in most frames of the simulation. In the CAT-4c trajectory, the p-methoxy substituent is unable to reach into this pocket, which is then filled by water molecules. You could run a WaterMap simulation to confirm that displacing these water molecules is energetically favorable, and use this insight when generating further ideas.

Figure 6-5. The para substituent on the CAT-4c ligand (left) fails to fill a side pocket which the bulkier substituent in the CAT-4d ligand (right) occupies.

7. Conclusion and References

In this tutorial, you set up an FEP+ calculation for a series of BACE1 ligands using the 4DJV receptor structure and a set of ligands prepared and aligned using the FEP+ Pose Builder. You generated the FEP+ map and biased the connectivity towards two ligands with mid-range experimental binding affinities. You added edges to the map to further connect similar ligands, taking care to make sure each ligand had more than one edge and was connected to a biased ligand.

Using pre-generated results, you checked for correlation between predicted and experimental binding affinities and reviewed the accuracy and convergence of the map. Finally, you checked for insights into the SAR by visualizing activity cliffs and looking into the complex leg trajectories for a pair of ligands.

Click to Expand

For further learning:

More on system preparation in general and for FEP+ specifically:
More on FEP+:
- Free energy calculations for drug design with FEP+ online certification course
- Membrane-Bound FEP+ with A2A tutorial
- FEP+ Best Practices

Click to Expand

For further reading:

Pose Generation for FEP+ documentation page
Ligand series and 4DJV structure used in this tutorial originally published by Cumming et al., Bioorganic & Medicinal Chemistry Letters 2012 22 (7), 2444-2449, DOI: 10.1016/j.bmcl.2012.02.013.

8. Glossary of Terms

Entry List - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion

included - the entry is represented in the Workspace, the circle in the In column is blue

Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data

Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project

selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries

Working Directory - the location that files are saved

Workspace - the 3D display area in the center of the main window, where molecular structures are displayed