Crystal Structure Prediction
Tutorial Created with Software Release: 2025-2
Topics: Pharmaceutical Formulations
Products Used: Crystal Structure Prediction , MS Maestro
|
13 MB |
This tutorial is written for use with a 3-button mouse with a scroll wheel.
Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Abstract:
In this tutorial, we will learn how to run the crystal structure prediction workflow using acetaminophen.
Tutorial Content
-
Introduction to Crystal Structure Prediction
1. Introduction to Crystal Structure Prediction
The way a material's atoms arrange themselves in a solid, known as its crystal structure, can have various forms called 'polymorphs'. This is a crucial factor in many industries, including pharmaceuticals, agriculture, nutrition, batteries, and aviation. In drug development, the emergence of a more stable polymorph later in the process can lead to significant complications. Traditionally, finding all these polymorphs through experiments is expensive, time-consuming, and may not discover all possibilities. The crystal structure prediction (CSP) method in the Schrödinger Suite employs a unique search algorithm and machine learning to predict and rank possible crystal structures by their energy. CSP not only identifies known polymorphs but also suggests new, potentially more stable ones that could impact the development of existing forms.
In this tutorial, we will perform a CSP workflow on acetaminophen. First, the acetaminophen molecule will be sketched using the 2D Sketcher. Second, various conformers of the acetaminophen molecule will be generated using the Crystal Structure Prediction panel. Third, a CSP workflow will be performed on the acetaminophen conformers using the Crystal Structure Prediction panel. Fourth, the CSP results will be analyzed using the Crystal Structure Prediction Viewer.
2. Creating Projects and Importing Structures
At the start of the session, change the file path to your chosen Working Directorythe location where files are saved in MS Maestro to make file navigation easier. Each session in MS Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A MS Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is saved, the project is automatically saved each time a change is made.
Structures can be built in MS Maestro or can be imported using File > Import Structures (or drag-and-dropped), and are added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion is located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data.
-
Double-click the Materials Science icon
- (No icon? See Starting Maestro)
- Go to File > Change Working Directory
- Find your directory, and click Choose
- Pre-generated files are included for running jobs or examining output. Download the zip file here: schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/csp.zip
- After downloading the zip file, unzip the contents in your Working Directorythe location where files are saved for ease of access throughout the tutorial
- Go to File > Save Project As
-
Change the File name to csp_tutorial, click Save
-
The project is now named
csp_tutorial.prj
-
The project is now named
3. Running the Crystal Structure Prediction Workflow
In this section, a CSP workflow will be performed on an acetaminophen molecule using the Crystal Structure Prediction panel.
We first need to draw a starting molecule. The molecule could be drawn in 2D or 3D using various MS Maestro tools, or can be imported from a file. In this case we will use the 2D Sketcher:
-
Go to Edit > 2D Sketcher
- The 2D Sketcher panel opens
- Draw the structure of acetaminophen
The 2D sketcher functions like many standard 2D molecular drawing tools. For a complete overview of using the sketcher panel, see the 2D Sketcher Panel documentation or watch the Building Small Molecules video in the Getting Going with Materials Science Maestro Video Series.
- Click on Save as New and for Input Entry Title write acetaminophen. Click OK
- Close the 2D Sketcher panel
- The acetaminophen molecule is selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries in the entry lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and includedthe entry is represented in the Workspace, the circle in the In column is blue in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed
If preferred, change the representation to ball-and-stick by clicking on the Style menu and choosing Apply ball-and-stick representation
-
With the acetaminophen entry selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries and includedthe entry is represented in the Workspace, the circle in the In column is blue, go to Tasks > Materials > Polymorph Prediction > Crystal Structure Prediction
- The Crystal Structure Prediction panel opens
-
Change the Task to Generate conformers only
- Running a conformers job first is best practices to ensure the conformers are what you’d expect before moving to the much longer packing and ranking calculation
-
Click Load to load the molecule
- Acetaminophen will now be loaded into the panel
- Change the Job name to csp_conformers
-
Adjust the job settings (
) as needed
- This job requires a CPU host and should complete in a couple of minutes on 8 CPUs
- Job Server is required for submission of Crystal Structure Prediction calculations
- Click Run or import the file from the provided tutorial files
- Close the Crystal Structure Prediction panel
- When the job is finished, a new entry titled output_confgen is automatically included in the entry list
-
With the output_confgen group selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries, use the Workflow Action Menu (WAM) button (
) or go to Tasks > Materials > Polymorph Prediction > Crystal Structure Prediction- The Crystal Structure Prediction panel opens
- The panel will automatically populate by default with all 11 acetaminophen conformers
- Change the Job name to csp_acetaminophen
-
Adjust the job settings (
) as needed
- This job requires a CPU and GPU host. The job can be completed in five days using 32 CPUs and 1 GPU. However, this calculation runs thousands of subjobs. Specifying 10,000 CPUs is ideal but will use many resources at once. A balance needs to be achieved between wall time and the use of resources.
- Job Server is required for submission of Crystal Structure Prediction calculations.
- If you would like to run the job, click Run, otherwise, we will proceed with pre-generated results in the next section
- Close the Crystal Structure Prediction panel
The calculation will pack the conformers into unit cells by using various space groups and then relaxing the structures and filtering out the highest-energy ones. The relaxation part does a series of relaxations, each of which is followed by a filter. The first uses molecular mechanics (MM) (cheapest), then machine learning force field (MLFF) (medium), then density functional theory (DFT) (most expensive). So there are less crystals that need to do DFT run on them than MM. When the include DFT ranking box is checked the DFT ranking results are returned.
In this example, the Generate missing parameters option is automatically removed upon loading the molecule because no torsions are missing. However, if the molecule had torsions missing, that box would have the option to be checked. Then there would be an option to use a customized version of the forcefield.
4. Analyzing the Crystal Structure Prediction Workflow
In this section, CSP workflow results for acetaminophen will be viewed using the Crystal Structure Prediction Viewer panel.
- From the main menu, choose File > Import Structures
-
Navigate to where you downloaded the tutorial files and choose
Section_03 > csp_acetaminophen > output_qrnn_relaxation.maegzfile -
Click Open
- A new group with 2415 structures is added to the entry lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion. The entire group is automatically selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries and the first entry is includedthe entry is represented in the Workspace, the circle in the In column is blue in the workspacethe 3D display area in the center of the main window, where molecular structures are displayed
- Maintain the entire entry group selection
-
Use the WAM (workflow action menu) button (
) to open the CSP Viewer- Alternatively, go to Tasks > Materials > Polymorph Prediction > Crystal Structure Prediction Analysis
- The Crystal Structure Prediction Viewer panel opens
- Click the data points to view information about the structure
Each dot corresponds to one crystal structure and appears as a new entry in the entry list and the packing arrangement is shown in the workspace
- Go to the Data Table tab
Scroll across the data table to view information about each crystal structure. There are two known experimental crystal structures and both are found here. The first has density 1.349 and spacegroup P121/n1. The second has density 1.382 and spacegroup Pbca. Feel free to explore the results and close the panel once finished.
5. Conclusion and References
In this tutorial, we learned how to run the crystal structure prediction workflow for acetaminophen.
For further learning:
For introductory content, focused on navigating the Schrödinger Materials Science interface, an Introduction to Materials Science Maestro tutorial is available. Please visit the materials science training website for access to 70+ tutorials. For scientific inquiries or technical troubleshooting, submit a ticket to our Technical Support Scientists at help@schrodinger.com.
For self-paced, asynchronous, online courses in Materials Science modeling, including access to Schrödinger software, please visit the Schrödinger Online Learning portal on our website.
For some related practice, proceed to explore other relevant tutorials:
For further reading:
- See the help documentation
- A robust crystal structure prediction method to support small molecule drug development with large scale validation and blind study, DOI: 10.1038/s41467-025-57479-1.
- Polymorphism in Molecular Crystals (2nd edn).
6. Glossary of Terms
Entry List - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
Included - the entry is represented in the Workspace, the circle in the In column is blue
Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data
Recent actions - This is a list of your recent actions, which you can use to reopen a panel, displayed below the Browse row. (Right-click to delete.)
Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project
Selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries
Working Directory - the location where files are saved
Workspace - the 3D display area in the center of the main window, where molecular structures are displayed