Enumeration Tools for Library Design

Tutorial Created with Software Release: 2025-3

Topics: Hit-to-Lead & Lead Optimization, Ligand Preparations and Library Design, Small Molecule Drug Discovery

Tutorial files

50 MB

This tutorial is written for use with a 3-button mouse with a scroll wheel.

Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed

Tip: You can hover over a glossary term to display its definition. You can click on an image to expand it in the page.

Abstract:

In this tutorial, you will learn how to use various enumeration tools in Maestro to design libraries for the lead optimization stage of a Cdk2 inhibitor drug discovery project. In addition to building libraries, you will learn some workflows for library curation and enrichment.

For a comprehensive overview of our current best practice recommendations for designing quality ligand libraries, see the Library Design learning path.

Tutorial Content

Creating Projects and Importing Structures

Enumerating with an R-group Library

Enumerating with Bioisosteres

Enumerating with Pathfinder and Reaction-Based Enumeration

Creating and Managing a Custom R-Group Library

Conclusion and References

Glossary of Terms

1. Creating Projects and Importing Structures

At the start of the session, change the file path to your chosen Working Directorythe location that files are saved in Maestro to make file navigation easier. Each session in Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is created, the project is automatically saved each time a change is made.

Structures can be built in Maestro or can be imported using File > Import Structures (or drag-and-dropped), and are added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion is located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data.

Double-click the Maestro icon.
- (No icon? See Starting Maestro)

Figure 1-1. Change Working Directory option.

Go to File > Change Working Directory.
Find your directory, and click Choose.
Pre-generated input and results files are included for running jobs or examining output. Download the zip file here: https://www.schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/library_design.zip
After downloading the zip file, unzip the contents in your Working Directory for ease of access throughout the tutorial.

Note: The Results folder in the tutorial files contains all the job input/output files.

Figure 1-2. The Open Project panel.

Go to File > Open Project.
Select the enumeration_tutorial.prjzip file.
Click Open.
- Structures are added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
In the Save scratch project dialog box, click OK.

Figure 1-3. Save Project panel.

Go to File > Save Project As.
Change the File name to enumeration_Cdk2.
Click Save.
- The project is now named enumeration_Cdk2.prj.

2. Enumerating with an R-Group Library

The R-Group Enumeration panel can be used to generate synthetically tractable analogs of a hit molecule from a library of R-group fragments. There is an R-group library packaged into the software and additional libraries can be added from file. Up to 10 R-groups can be enumerated at a time, allowing for the generation of a combinatorial library of analogs.

Coxon et al. have demonstrated through extensive SAR analysis the importance of substitutions at the purine 6-position for Cdk2 inhibition (the 6-substituent is needed to occupy a lipophilic pocket), thus we will set that position as our enumeration connection point.

After expanding the chemical space, we would like to pass it through various filtering and scoring methods. This allows for dataset enrichment for ranking by more sophisticated computational methods, such as MM-GBSA and FEP+. In this section, we will dock ligands using Glide SP Docking, and filter based on Glide gscore, Lipinski’s Rule of 5, and polar surface area (docking and filtering criteria should be selected based on the specific requirements of the system/project).

2.1 Perform custom R-Group enumeration

Figure 2-1. Selecting the connection point for enumeration.

Confirm the Hit is includedthe entry is represented in the Workspace, the circle in the In column is blue in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed and your selection scope is set to Atoms.
Select H21 in the purine core.

Note: For the difference between includedthe entry is represented in the Workspace, the circle in the In column is blue and selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries , please refer to the Glossary of Terms.

Go to Tasks > Browse > Enumeration and Ideation > R-Group Enumeration.
- The Custom R-Group Enumeration panel opens.

Figure 2-2. Running the R-Group Enumeration job.

Note: In this tutorial, you will use the Diverse R-groups library that is packaged along with Maestro. However, any fragment library can be loaded from file or created using the R-Group Creator panel.

For R-Group Library, choose Diverse R-groups (43).
Change the Job name to R-group_enumeration.
Click Run.
- This job takes a few seconds.
- A new group R-group_enumeration is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the Custom R-Group Enumeration panel.

Figure 2-3. Opening the LigPrep panel.

Select the R-group_enumeration group by clicking on the group heading.
Go to Tasks > Browse > Ligand Preparation and Library Design > LigPrep (3D Conversion).
- The LigPrep panel opens.

Figure 2-4. Running the LigPrep job with the default settings.

For Use structures from, choose Project Table (selected entries).
Change Job name to ligprep_enumeration.
Click Run.
- This job takes ~ 1 minute.
- A banner appears and a new group ligprep_enumeration-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the LigPrep panel.

2.2 Dock the prepared enumerated ligands

Figure 2-5. Opening the Ligand Docking panel.

Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the ligprep_enumeration-out group by clicking on the group heading.
Go to Tasks > Browse > Receptor-Based Virtual Screening > Ligand Docking.
- The Ligand Docking panel opens in the Ligands tab.

Figure 2-6. Running the docking job.

For Receptor grid, choose From file.
Next to File name, click Browse and choose glide-grid_enumeration.zip from your Working Directorythe location that files are saved.
For Use ligands from, select Project Table ( 104 selected).
Change the Job name to glide-dock_enumeration.
Click Run.
- This job takes ~3 minutes.
- A banner appears and a new group glide-dock_enumeration_pv is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the Ligand Docking panel.

Figure 2-7. The Ligand-Based ADME/Tox Prediction in Tasks.

Select the glide-dock_enumeration_pv group by clicking on the group heading.
Go to Tasks > Browse > ADME and Molecular Properties > Ligand-Based ADME/Tox Prediction.
- The QikProp panel opens.

Figure 2-8. The QikProp panel.

For Use structures from, choose Project Table (selected entries).
Check the box for Fast mode.
Change the job name to ADME-Tox_prediction_enumeration.
Click Run.
- The job takes a few seconds.
- A banner appears and a new group ADME-Tox_prediction_enumeration-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the QikProp panel.

Figure 2-9. Opening the Ligand Filtering panel.

Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the ADME-Tox_prediction_enumeration-out group by clicking on the group heading.
Go to Tasks > Browse > Ligand Preparation and Library Design > Ligand Filtering.
- The Ligand filtering panel opens.

Figure 2-10. The Ligand Filtering panel.

For Use structures from, choose Project Table (105 selected entries).
In the Properties tab, under Available properties, select glide gscore (Impact).
For Property, set up the filter criterion r_i_glide_gscore < -8.5.
Click Add.
- The filtering criterion is added.
Repeat steps 16-18 for RuleOfFive (QikProp) and PSA (QikProp) and set i_qp_RuleOfFive < 1 and PSA < 140 respectively.
Change the Job name to ligfilter_enumeration.

Note: Ligand filtering allows for a more reasonable number of compounds for later visual inspection.

Click the Job settings cog.
- The Jobs Settings dialog box opens.

Glide scores cannot be used for rank-ordering, nor do they correlate with binding affinity. Glide score filtering is used here for dataset enrichment, and the cutoff of -8.5 was chosen based on the distribution of docking scores.

Figure 2-11. Running the Ligand Filtering job.

Under Output, for Incorporate, choose Append new entries as a new group.
Click Run.
- This job takes a few seconds.
- A banner appears and a new group ligfilter_enumeration-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- Only 13 ligands remain following the ligand filtering.
Close the Ligand Filtering panel.

3. Enumerating with Bioisosteres

Bioisosteres are compounds or groups that have strong chemical and physical similarity, while also having similar biological activity. They can be used in lead optimization to improve potency, enhance selectivity, improve ADME-Tox properties, or to acquire novel IP.

The Bioisostere Replacement panel allows for 479, mostly non-classical (i.e. functional-group centric) bioisostere transformations. Bioisostere replacements are performed sequentially, resulting in every output having only one functional group replacement (a more combinatorial approach is available through the command line). The key groups for bioisostere transformation are acids, esters, t-butyls, carbonyls, amides, and phenyls, though others are included as well. The packaged bioisosteres are encoded as Reaction SMARTS, and can be easily added to or modified through the command line. Additionally, it is possible within the panel to set certain regions of a molecule as immutable, thus limiting the moieties with matching bioisosteres to be replaced.

In this section, the input structures will be one of the outputs of the final filtering of the R-group Enumeration section. Due to aqueous solubility concerns of phenyl-containing molecules, you will be performing a bioisostere replacement to add in a group with more 3D character to break the planar symmetry. The resulting ligands will be docked via Glide SP Docking and filtered by Glide gscore values. Note that docking and filtering criteria should be selected based on the specific requirements of the system/project.

3.1 Perform bioisostere replacement

Figure 3-1. Opening the Bioisostere Replacement panel.

In the ligfilter_enumeration-out group in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion, includethe entry is represented in the Workspace, the circle in the In column is blue Hit, from pyridine 2.
Go to Tasks > Browse > Enumeration and Ideation > Bioisostere Replacement.
- The Bioisostere Replacement panel opens.

Figure 3-2. Defining immutable region and running the bioisostere replacement job.

In the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed, shift-click to select the entire molecule except for the sulfonamide group.
For Use structures from, choose Workspace.
For Define immutable region from, select SMARTS and click Get from Workspace Selection.
Change the Job name to bioisostere_replacement.
Click Run.
1. This job takes a few seconds.
2. A banner appears and a new group Hit, from pyridine 2 with 18 entries is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the Bioisostere Replacement panel.

3.2 Dock structures from bioisostere replacement

Figure 3-3. Selecting Bioisostere Replacement outputs and opening the LigPrep panel.

Expand and select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the Hit, from pyridine 2 (18) group by clicking on the group heading.
Go to Tasks > Browse > Ligand Preparation and Library Design > LigPrep (3D Conversion).
1. The LigPrep panel opens.

Figure 3-4. Running the LigPrep job for Bioisostere Replacement outputs.

For Use structures from, choose Project Table (18 selected).
Change Job name to ligprep_bioisostere_replacement.
Click Run.
1. This job will take ~ 1 minute.
2. A banner appears and a new group ligprep_bioisostere_replacement-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the LigPrep panel.

Figure 3-5. Selecting the LigPrep outputs and opening the Ligand Docking panel.

Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the ligprep_bioisostere_replacement-out group by clicking on the group heading.
Go to Tasks > Browse > Receptor-Based Virtual Screening > Ligand Docking.
- The Ligand Docking panel opens in the Ligands tab.

Figure 3-6. Running the docking job.

For Receptor grid, glide-grid_enumeration.zip should already be selected from the previous docking setup.
For Use ligands from, choose Project Table (53 selected).
Change Job name to glide-dock_bioisostere_replacement.
Click Run.
- This job will take ~2 minutes.
- A banner appears and a new group glide-dock_bioisostere_replacement_pv is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the Ligand Docking panel.

3.3 Analyze the docking results

Figure 3-7. Selecting the docking outputs and opening the Ligand Filtering panel..

Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the glide-dock_bioisostere_replacement_pv group by clicking on the group heading.
Go to Tasks > Browse > Ligand Preparation and Library Design > Ligand Filtering.
- The Ligand filtering panel opens.

Figure 3-8. Running the Ligand Filtering job.

For Use structures from, choose Project Table (selected entries)
- All previous selections from the panel should still be populated.
Leaving the property r_i_glide gscore < -8.5, delete i_qp_RuleOfFive < 1 and PSA < 140 by selecting them and clicking Delete one at a time.
Change the Job name to ligfilter_bioisostere_replacement.
Click Run.
- This job takes a few seconds.
- A new group ligfilter_bioisostere_replacement-out is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- Only 21 ligands remain following the ligand filtering.

Figure 3-9. Setting the Workspace to step through the poses.

Go to File > Import Structures.
Navigate to proteinprep_5NEV folder in your Working Directorythe location that files are saved and select proteinprep_5NEV-out.maegz.
- A new entry 5NEV – prepared is added to Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and includedthe entry is represented in the Workspace, the circle in the In column is blue in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
Double-click the In circle of 5NEV – prepared.
- The structure is fixed in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed.
Right-click the ligfilter_bioisostere_replacement-out group header and choose Include First Entry (Top Group).

Figure 3-10. Visualizing the poses.

Step through the results using the left and right arrow keys.

Note: Choose a visualization that is most helpful to you. For example, in our case, presets have been applied, interactions are toggled on, labels are hidden, and all the ligands are rendered in ball-and-stick representation. For comparing the ligand poses, it can be helpful to adjust the Auto-fit settings to your preference – if Auto-fit and Ligand are toggled on, the view will zoom to fit each ligand as you switch to its Entry. Turning auto-fit off will keep the view constant as you step through the poses.

Once you are done inspecting, right-click the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed and choose Clear Workspace.

4. Enumerating with Pathfinder and Reaction-Based Enumeration

The Reaction-Based Enumeration panel is a synthetically aware enumeration tool that can be used for core hopping, large-scale combinatorial enumeration and targeted R-group enumeration.

The workflow begins with the selection of a retrosynthetic path. Pathfinder can automatically generate possible routes to the desired targets, using reactions from a reaction file. While Pathfinder is not a retrosynthesizer, it can be used to generate a set of synthetically reasonable pathways. Pathfinder notes the reactions used in each pathway, as well as the associated reactant classes. There are currently 83 reactant classes and 127 reactions, including a tailored set of 25 functional group transformations, bundled into a Maestro installation that can be edited or added to if desired. In addition to automatic path generation through Pathfinder, reaction routes can be defined manually using the sketcher.

By default, the enumeration is performed pseudo-randomly, not systematically, but a common random seed is used in the enumeration job to ensure consistent results between runs. Sampling of combinations is performed until a specified maximum number of products is reached, or the number of samples exceeds 100 times the specified number of maximum products.

In this section, you will use the Reaction-Based Enumeration tool in three different ways to generate new ideas for potential Cdk2 inhibitors. You will start with the same input molecule, and vary settings to perform a large scale enumeration (with product filtering), core hopping, and R-group enumeration.

4.1 Perform large-scale enumeration

Figure 4-1. The Reaction-Based Enumeration option in Enumeration and Ideation in Tasks.

Expand CDK2 Inhibitors group in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Includethe entry is represented in the Workspace, the circle in the In column is blue and select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries CHEMBL296468 from the group.
Go to Tasks > Browse > Ligand Enumeration and Ideation > Reaction-Based Enumeration.
- The Reaction-Based Enumeration panel opens.

Figure 4-2. The Reaction-Based Enumeration panel.

For Use compound from, choose Project Table (selected entry).
Click Load.
- The hit structure is added to the sketcher.
Set Max depth to 2.

Note: Depth is the longest sequence of steps leading from the target to a starting material. For a linear synthesis, the depth is equal to the number of steps in a reaction, but in a convergent synthesis it can be less.

Click Display Breakable Bonds.
- Breakable bonds are highlighted.
Click Generate Pathways button.
- The Review Pathways step is displayed.

Figure 4-3. Adding a filter to the reaction pathways in the Filter Pathways dialog box.

Click Filter Options button.
- The Filter Pathways dialog box opens.
Check the box for Reaction Name.
In the text box, type thioether.
Click OK.
- A filter is added to the reaction pathways to only show those that include a thioether.

Note: Pathway filtering is helpful for narrowing down the pathfinder results to a more manageable number of pathways.

Figure 4-4. Choosing the reaction pathway for large-scale enumeration.

Choose Path 17: thioether-2, amide-coupling-2.
- The reaction names, as well as the reactants and products, appear under Pathway.
Click Define Reactants > button.
- The Define Reactants step is displayed.

Figure 4-5. Choosing the filter properties.

Click SMARTS Filter button.
- The Filter by SMARTS dialog box opens.
From the Read SMARTS Filter button menu, choose From Predefined Filters > REOS.
Click Save.
- Filter is set for all products.

Note: You can use the SMARTS Filter to remove any undesired moieties from the enumeration output. You can also use the Property Filter to remove compounds that fall in undesired property-space.

If the maximum number of products was set to 0 (i.e. no maximum), a total of products could be generated. However, when a maximum is set, a random seed is used to generate up to the maximum number of products. A consistent seed is used by default to enable reproducibility of enumeration results.

Figure 4-6. Running the large-scale enumeration job.

Click Enumerate in the Reaction-Based Enumeration panel.
- The Enumerate - Job Settings dialog box opens.
Change the Job name to enumerate_large_scale.
Click Run.
- This job takes a few seconds.
- A new group enumerate_large_scale is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- 10,000 compounds are generated.

Note: Certain reactions will generate multiple products if there are multiple potential products from the given set of reactants. Uncheck Allow multiple products per reaction step in the Reaction-Based Enumeration panel if you would prefer only a single output from each step.

4.2 Perform core-hopping enumeration

Figure 4-7. Choosing the pathway for core-hopping enumeration.

In the Reaction-Based Enumeration panel, click Back twice to return to the Choose Reaction step.

Note: This clears the Pathway filter added in the previous example.

Click Generate Pathways.
Choose Path 21: suzuki-2, amide_coupling-2.
Click Define Reactants > button.

Figure 4-8. Choosing the reactants for the core-hopping enumeration.

Note: To determine which reactants correspond to which portion of the input molecule, hover over the reaction until a tooltip pops up.

For Reactant 1 and Reactant 3, choose Original Reactant (1).
- This will ensure that only the core is varied in each output.
Click Enumerate.
- The Enumerate - Job Settings dialog box opens.

Depending on the functional groups required for the enumerated core and the reaction depth, it is potentially helpful to generate a custom library of cores that contain the particular functional groups needed for a given pathway (this is not necessary for this section).

Figure 4-9. Running the core-hopping enumeration job.

Change the Job name to enumerate_core-hopping.
Click Run.
- The job takes a few seconds.
- A new group enumerate_core-hopping is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- 180 core hopped compounds are generated.

Note: It is normal to get lower than the desired number of products during a core-hopping enumeration since multiple reactive handles are required.

4.3 Enumerate R-groups

Figure 4-10. Choosing the pathway for R-group enumeration.

In the Reaction-Based Enumeration panel, click Back to return to the Review Pathway page.
Choose Path 33: amide_coupling-1, thioether-2.
Click Define Reactants > button.

Figure 4-11. Choosing the reactants for R-group enumeration.

For Reactant 2, choose Original Reactant (1).
- This will ensure that only the R-groups are varied in each of the outputs.

Note: Both of the non-core groups of the input molecule will be enumerated combinatorially.

Click Enumerate.
- The Enumerate - Job Settings dialog box opens.

Figure 4-12. Running the R-group enumeration job.

Change the Job name to enumerate_R-group.
Click Run.
- The job takes 4-5 minutes.
- A new group enumerate_R-group is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
- The enumeration results in 10,000 unique compounds.

4.4 Create reaction manually for enumeration (optional)

Figure 4-13. Sketching the reaction.

For manual reaction creation in Reaction-Based Enumeration, all R-groups must be drawn explicitly.

In the Reactions-Based Enumeration panel, click Back twice to return to the Choose Reaction page.
- You may need to click OK in the Schrödinger 2D Sketcher - Welcome dialog box.
For Get reaction from, choose New Sketch.
Sketch a Schiff base condensation reaction.
Select the green check mark to clean up.
- The reaction will appear linear, further modification may be needed.
Click Define Reactants.

Figure 4-14. Choosing the reactants for enumeration.

For Reactant 1, select Reactant Library and choose amines-prim and click OK.
For Reactant 2, select Reactant Library and choose aldehydes and click OK.

Figure 4-15. The Creating Library from SMARTS dialog box.

Note: To generate new reactant libraries from the Project Table or a file, select Reactant Library and click +New to open the Create Library from SMARTS dialog box.

Note: The Create Reactant Library from SMARTS panel can also be accessed by going to Tasks > Browse > Enumeration and Ideation > Create Reactant Library.

Figure 4-16. Choosing filter properties.

Click the filter icon next to Reactant 1.
Choose Property filter.
In the Reactants Tab, check the box for TPSA.
For TPSA, set the bounds to 0 and 60.00.
Click Save.

Figure 4-17. Running the enumeration job.

Click Enumerate.
Rename the job enumerate_custom_reaction.
Click Run.
- The job takes 10-15 minutes.
- A new group with 10,000 imine-containing compounds is added to the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Close the Reaction-Based Enumeration panel.

Note: Enumeration jobs with a reactant filter are expected to take longer, as the reactants will need to be pre-filtered before the enumeration can proceed.

5. Creating and Managing Custom R-Group Library

In addition to using the 13 pre-packaged R-groups libraries, you can generate or import your own custom R-group libraries. R-groups can be added using a sketcher or they can be defined from a Maximum Common Substructure analysis of a set of ligands. These libraries can then be used in the Custom R-Group Enumeration panel, as shown in Section 2.

5.1 Add R-groups from a set of ligands

Figure 5-1. The Create R-Group Library option in Enumeration and Ideation in Tasks.

Expand and select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the Ligands group in the Entriesa simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion.
Go to Tasks > Browse > Enumeration and Ideation > Create R-Group Library.
- The R-Group Creator panel opens.

Figure 5-2. Generating R-groups from a set of congeneric ligands.

In the Create R Groups tab, choose Analyze R groups from ligands.
For Analyze structures from, choose Project Table (selected entries).
For Core definition from, choose Maximum Common Substructure.

Click Analyze.
- The R-groups for the set of ligands is defined.

Note: The Maximum Common Substructure criteria can be set by clicking Settings. Click View Core to see how the core has been defined by the Maximum Common Substructure.

Figure 5-3. Adding R groups to a new library.

Click Select All.
For Add selected R groups to library, choose New.
- The New R-group Library dialog box opens.
For Library name, type Library_Design.
Click OK.

Note: New libraries can also be created in the Manage R Groups tab.

Click Add.
- The R-groups are added to a new library titled Library_Design.

5.2 Sketch custom R-groups

Figure 5-4. Sketching R-group.

Click the Sketch R Groups method.
Sketch a thiol.
Select the attachment point tool and click on the carbon in your structure.
- The R-group attachment point is now defined.
Name the Fragment thiol.

Figure 5-5. Adding R-group to the library.

For Add R group to library choose Library_Design and click Add.
- The thiol group is added to the Library_Design R-group library.

5.3 Manage R-group library

Figure 5-6. The Manage R Groups tab in the R-Group Creator panel.

Go to the Manage R Groups tab.
Deselect All.
For R-Group Libraries, select Library_Design.

Note: In the Manage R Groups tab, you can import R-groups to a library (Import), export an R-group library (Export), create a new R-group library (New), or edit an R-group (More Actions > Edit).

6. Conclusion and References

In this tutorial, you used R-group Enumeration and Bioisostere Replacement to explore SAR and address potential ADME issues. Then, working off the same input structure, you used Reaction-Based Enumeration with Pathfinder to perform three different enumerations that can be helpful during the lead optimization stage of a drug discovery project.

Click to Expand

For further learning:

Click to Expand

For further reading:

Coxon et al. J. Med. Chem. (2017) 60(5):1746-1767
Reaction-based Enumeration, Active Learning, and Free Energy Calculations to Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin Dependent Kinase 2 Inhibitors
Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17 - 2012 paper from the University of Bern in which the authors defined “unknown chemical space”.
The next level in chemical space navigation: going far beyond enumerable compound libraries (open access) - 2019 paper for Taros Chemicals and BioSolveIT that describes some recent advances in chemical space search applications.

7. Glossary of Terms

cognate ligand - a ligand that is bound to its protein target

Entries - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion

included - the entry is represented in the Workspace, the circle in the In column is blue

incorporated - once a job is finished, output files from the Working Directory are added to the project and shown in the Entries and Project Table

Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data

Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project

Selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entries (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries

Working Directory - the location that files are saved

Workspace - the 3D display area in the center of the main window, where molecular structures are displayed