R-group Enumeration with the Materials Science Suite
Tutorial Created with Software Release: 2024-2
Topics: Catalysis & Reactivity , Energy Capture & Storage , Organic Electronics , Thin Film Processing
Products Used: MS Maestro
|
2 KB |
This tutorial is written for use with a 3-button mouse with a scroll wheel.
Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Abstract:
In this tutorial, we will learn how to manage R-group libraries and apply them for R-group enumeration over a core structure.
Tutorial Content
1. Introduction to R-group Enumeration
Schrödinger’s Materials Science Suite can assist in the exploration of new materials through the use of R-group enumeration. This technique allows the creation of virtual libraries containing materials of interest from molecular scaffolds. Extensive compound libraries can then be sorted and screened for properties of interest, which in turn can guide synthetic targets. The discovery and development of organic optoelectronic materials are of great interest, particularly with respect to organic light-emitting diodes (OLEDs) and organic photovoltaics (OPVs). One of the challenges facing the field is to explore the expansive design space. R-group enumeration can be used to expedite the search of desired materials in the design space. Further reading on how Schrödinger’s Materials Science Suite can be impactful in designing organic electronic materials can be found here.
In this tutorial, we will first construct a small library of R-groups to be used in enumeration. Then, we will apply the custom library and a built-in library to R-group enumeration on a prototypical organic compound core, 2-phenylpyridine as shown in the figure below. An analysis of the enumerated 2-phenylpyridine library is performed in the Optoelectronics tutorial, in which the 165 structure library is screened for compounds with key optoelectronic properties.
Here we introduce the potential use of the R-group enumeration tool in the field of organic electronics, however, this is not the only field that this panel is applicable to. Any project that depends on structure innovation and structure libraries can use the R-group enumeration tool to generate a molecular database.
2. Creating Projects and Importing Structures
At the start of the session, change the file path to your chosen Working Directorythe location where files are saved in MS Maestro to make file navigation easier. Each session in MS Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A MS Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is saved, the project is automatically saved each time a change is made.
Structures can be built in MS Maestro or can be imported using File > Import Structures (or drag-and-dropped), and are added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion is located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data.
- Double-click the Materials Science icon
- (No icon? See Starting Maestro)
- Go to File > Change Working Directory
- Find your directory, and click Choose
- Pre-generated files are included for running jobs or examining output. Download the zip file here: schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/builders_rgroup_enumeration.zip
- After downloading the zip file, unzip the contents in your Working Directorythe location where files are saved for ease of access throughout the tutorial
- Go to File > Save Project As
- Change the File name to r_group_enum, click Save
- The project is now named
r_group_enum.prj
- The project is now named
3. Creating a Small R-group Library
In this tutorial, we will manage and apply R-group libraries to enumerate over a 2-phenylpyridine core structure. In this section, we will prepare a small library of R-groups using the 2D Sketcher built-in to the R-group Creator panel. The small library will then be used for an R-group enumeration job in Section 4.
First, we need to build our small R-group library for enumeration. The R-Group Creator panel is used to create, edit, and manage R-group libraries
- Go to Tasks > Materials > Enumeration > R-Group Creator
- The R-Group Creator panel opens on the 2D sketcher
The 2D sketcher functions like many standard 2D molecular drawing tools. For a complete overview of using the sketcher panel, see the 2D Sketcher Panel documentation
Note: If you will frequently use this panel (or any other), remember that you can click the star in the menu and it will be added as a favorite to your toolbar.
- Click on the New R+ group tool on the bottom left of the panel
- Click on one of the H atoms and drag it anywhere on the sketcher
- In Add R group to library, select New from the drop-down menu
- Type tutorial_lib as the new R-Group Library name
- Click OK
- Change the Fragment name to R-H
- Click Add to add the created fragment to the R-group library
To continue building out our R-group library, tutorial_lib, we will draw a methyl, R-CH3 group:
- Click on C from the sketcher toolbar
- Click on the H in the current structure to replace the H with a CH3 group (designated by just the line and end-point)
- Change the Fragment name to R-CH3
- Click Add to add the structure to tutorial_lib
- Continue to make three more fragments with R-Cl, R-t-Bu and R-Ph groups. See the Figure for guidance. Name and add each of the three R-groups to tutorial_lib. Below are some tips on drawing the three fragments:
Note: To clear the sketcher, click Reset Sketcher. You can also continue on with the current structure to create another, related R-group
- To draw R-t-Bu (tert-butyl), draw R-CH3 and at the C atom, draw a line in any direction. Draw two more lines from the same atom in different directions
- Use the eraser tool to clear your sketches
- Switch to the Manage R Groups tab and unselect All (3087) and then select the library we just created, tutorial_lib (5)
- The tutorial_lib library should have five R-groups as shown in the Figure
- The R-groups listed under R-Group Libraries are the default included libraries, but you may see different options depending on your version of the software and previous use of this panel
Note: Several R-group libraries are available by default. You can also import, export or delete from the list of R-group libraries. For a more comprehensive overview of the R-Group Creator panel, see the help documentation
- Close the R-Group Creator panel
4. Running R-group Enumeration
Here, we will run the R-group enumeration panel on 2-phenylpyridine using the library created in Section 3.
- From the main menu, go to File > Import Structures
- Navigate to where you downloaded the tutorial files and choose the
phenylpyridine.maefile - Click Open
- A new structure titled 2-phenylpyridine is selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries in the entry lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and includedthe entry is represented in the Workspace, the circle in the In column is blue in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Note: Please refer to the Glossary of Terms for the difference between includedthe entry is represented in the Workspace, the circle in the In column is blue and selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries
- Go to Tasks > Materials > Enumeration > Custom R-Group
- The Custom R-Group Enumeration panel opens with a warning message
Note: For a more comprehensive overview of the Custom R-Group Enumeration panel, see the help documentation
- Click OK on the warning message
- In this tutorial we will select attachment atoms in the following steps, you can however select attachment atoms before launching the panel to avoid the warning message
To run the Custom R-Group Enumeration, we first need to select the attachment points on 2-phenylpyridine which we wish to enumerate over
- In the workspacethe 3D display area in the center of the main window, where molecular structures are displayed, select the indicated atoms, as shown in the Figure, using Ctrl+Click (Cmd+Click on Mac) for multiple atom selection
- Make sure to select atoms in the order indicated in the Figure because these attachment positions will correlate to the R-group substituted
- This panel is interactive with the workspacethe 3D display area in the center of the main window, where molecular structures are displayed, meaning we work with structures in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed while the panel is open
- In the Custom R-Group Enumeration panel, click Update From Selection
- The panel now reads in the attachment atoms selected and assigns them to an arbitrary R-group number
- The panel table shows the attachment Position number, the number, and the R-Group library to be used for substitution
- Labels on the structure in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed display the R-group number as well
Suppose we want to enumerate over our core structure using both our custom library and a default library. For the purposes of this example, we will allow homo-substitution, i.e. the same R group at each attachment position, for Position 1 & 2 and Position 3 & 4. As each fragment is selected from the library, it is added to all equivalent R-groups, by default
- For Position 1 and Position 2, assign R1 as the R-Group
- This results in homo-substitution on both positions
- For the R-Group Library of Position 1, choose tutorial_lib (5)
- The R-Group Library changes for Position 2 as well
- For Position 3 and Position 4, assign R2 as the R-Group, and choose Solubilizing_R-groups (33) as the R-Group Library
- Ensure that your panel matches that which is shown in the Figure
Note: The Define R-groups section allows for extremely customizable R-group enumeration. For more information about the options presented here, see the help documentation
We want all four attachment positions to be substituted with our R-group libraries. To ensure that all four positions are being substituted simultaneously, rather than one, two, or three at a time, we must specify substitution across all positions
- In the Custom R-group Enumeration panel, select Allow partial substitution
- From the drop down menu, select across all positions and set to Use: 4 of 4
Note: The total number of enumerated structures is the raw combination of enumeration points (displayed in the panel as Library size) minus the number of duplicates that are removed due to symmetry, which results in unique output structures
- For the Job name, input phenylpyridine_enum
- Adjust the job settings (
) as needed
- This job requires a CPU host. The job can be completed in less than a minute on a CPU host
- Click Run
- Once the job is successfully finished, a new phenylpyridine_enum-core-out1 group, with 165 structures, is selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries in the entry lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and the first structure is includedthe entry is represented in the Workspace, the circle in the In column is blue in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed
- Close the Custom R-Group Enumeration panel
- In general, always close the Custom R-Group Enumeration panel after use. This panel is interactive with the workspacethe 3D display area in the center of the main window, where molecular structures are displayed and leaving it open can therefore be problematic
5. Enumeration Post-processing
We can quickly prepare the enumerated structures for further calculations, such as quantum mechanical geometry optimizations and optoelectronics calculations, by using force field minimization. The molecule will still not be optimized by quantum mechanics, but it will improve the starting geometry for further calculations. We will use the Clean Up Complexes panel for this task.
- With the entry group titled phenylpyridine_enum-out (165) selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries, go to Tasks > Materials > Tools > Clean Up Complexes
- The Clean Up Complexes panel opens
Note: The Clean Up Complexes panel works on small molecules as well as organometallic complexes. For a more comprehensive overview of the panel, see the help documentation. Or, visit the Help by clicking on
on the bottom right.
- For Use structures from, ensure that Project Table (165 selected entries) is listed; this corresponds with the R-group enumeration group produced in Section 4
- Click Clean Up
- The selected structures are minimized in place of the existing group in the entry lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
Note: Clean Up Complexes panel uses the OPLS_2005 force field for minimization
Any of the enumerated structures can be visualized in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed by including an entry of interest
6. Conclusion and References
In this tutorial, we built a new custom fragment library that can be used on a core for R-group enumeration. The output enumerated structures were incorporated into the Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data and are ready for further analysis within the Materials Science Suite.
For further learning:
For introductory content, focused on navigating the Schrödinger Materials Science interface, an Introduction to Materials Science Maestro tutorial is available. Please visit the materials science training website for access to 70+ tutorials. For scientific inquiries or technical troubleshooting, submit a ticket to our Technical Support Scientists at help@schrodinger.com.
For self-paced, asynchronous, online courses in Materials Science modeling, including access to Schrödinger software, please visit the Schrödinger Online Learning portal on our website.
For some related practice, proceed to explore other relevant tutorials:
- Introduction to Geometry Optimizations, Functionals and Basis Sets
- Introduction to Multistage Quantum Mechanical Workflows
- Optoelectronics
- Organometallic Complexes
- Band Shape
- Calculating Transition Dipole Moments (TDM), TDM Distributions, and Order Parameter
- Computing Atomic Charges
- Rigid and Relaxed Coordinate Scans
- Locating Transition States: Part 1
- Bond and Ligand Dissociation Energy
- Beta Elimination Reactions
- Optoelectronics Active Learning
For further reading:
- See the help documentation on the R-Group Creator panel
- See the help documentation on the Custom R-Group Enumeration panel
- See the help documentation on the Clean up Complexes panel
- Virtual screening of electron acceptor materials for organic photovoltaic applications. DOI:10.1088/1367-2630/15/10/105029
7. Glossary of Terms
Entry List - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
Included - the entry is represented in the Workspace, the circle in the In column is blue
Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data
Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project
Selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries
Working Directory - the location where files are saved
Workspace - the 3D display area in the center of the main window, where molecular structures are displayed