Simulating Complex Protein Solutions
Tutorial Created with Software Release: 2025-4
Topics: Pharmaceutical Formulations
Methodology: All-Atom Molecular Dynamics
Products Used: Desmond , MS Maestro
|
134 MB |
This tutorial is written for use with a 3-button mouse with a scroll wheel.
Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Abstract:
In this tutorial, we will learn to prepare a complex protein system for a Molecular Dynamics (MD) simulation.
Tutorial Content
1. Introduction to Complex Protein Solutions
Understanding protein behavior in solution is crucial for science and medicine. Accurate modeling of protein solutions is essential for comprehending protein function, stability, and designing therapeutics. Various computational methods can be used to analyze protein interactions in water based solutions. Atomistic simulations, like molecular dynamics, offer high-resolution detail, tracking atomic movements to reveal conformational changes, solvent interactions, and stability at the atomic level. Coarse-grained (CG) models simplify systems by representing atom groups as "particles," reducing computational cost for larger systems and longer timescales. These models are effective for aggregation and membrane interactions, where overall protein shape is more important than exact atomic positions.
This tutorial outlines the process of preparing a complex protein solution for molecular dynamics (MD) simulation. First, we will prepare the small protein 2JOF using the Protein Preparation Workflow panel. Next, we will use the Disordered System Builder panel to immerse the protein in a solution containing excipients: Tween 20, sucrose, water, chlorine anions, and sodium cations. Members of the Tween surfactant family are frequently incorporated into protein formulations. They serve multiple purposes: solubilizing proteins, mitigating protein aggregation, and passivating surfaces. Sucrose is a common excipient in lyophilization, where it safeguards proteins during the freezing process and adds bulk to the freeze-dried product. Sodium chloride (NaCl) is also routinely included in protein formulations to modulate interactions between proteins. Finally, an MD simulation will be run on this complex protein solution using the MD Multistage Workflow to analyze the protein's interactions within the solution. These steps are foundational for any workflow involving protein MD simulation.
To learn about modeling protein formulations with CG models see the Creating a Coarse-Grained Model for Protein Formulations tutorial.
2. Creating Projects and Importing Structures
At the start of the session, change the file path to your chosen Working Directorythe location where files are saved in MS Maestro to make file navigation easier. Each session in MS Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A MS Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is saved, the project is automatically saved each time a change is made.
Structures can be built in MS Maestro or can be imported using File > Import Structures (or drag-and-dropped), and are added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion is located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data.
-
Double click the Maestro or Materials Science icon to start Maestro or MS Maestro
- No icon? See Starting Maestro
- This tutorial uses MS Maestro, but this workflow can be performed in Maestro or MS Maestro. Use whichever interface you are comfortable with or typically use for your projects.
- Go to File > Change Working Directory
- Find your directory, and click Choose
- Pre-generated files are included for running jobs or examining output. Download the zip file here: schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/complex_protein_solutions.zip
- After downloading the zip file, unzip the contents in your Working Directorythe location where files are saved for ease of access throughout the tutorial
- Go to File > Save Project As
- Change the File name to protein_solutions_tutorial, click Save
- The project is now named
protein_solutions_tutorial.prj
- The project is now named
We will proceed to import the components that we will use in the subsequent sections: Tween 20, sucrose, water, a chlorine anion, and a sodium cation. If you would prefer to draw or build these components yourself using the 2D Sketcher and the Polymer Builder, feel free to do so following similar steps outlined in the Introduction to Materials Science Maestro tutorial. Otherwise:
- Go to File > Import Structures
- Navigate to where you have downloaded the provided files (presumably in your working directorythe location where files are saved), and choose the
excipients_water.maegzfile - Click Open
- A new entry group is added to the entry lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion entitled components (5) containing the five entries
3. Protein Structure Preparation
Structure files obtained from the Protein Data Bank (PDB), vendors, and other sources often lack necessary information for performing modeling-related tasks. Typically, these files are missing hydrogens, partial charges, side chains, and/or whole loop regions. Proteins in their raw state may also have incorrect bond order assignments and group orientations. To make these structures suitable for modeling tasks, we will use the Protein Preparation Workflow panel to resolve common structural issues.
Let’s start by importing and preparing the protein of interest.
- Go to Tasks > Browse All > Protein Preparation and Refinement > Protein Preparation Workflow
- The Protein Preparation Workflow panel opens
- Click Get PBD
- For PDB IDs, type 2JOF
- Trp-cage (2JOF) is a small protein
- Click Download
28 configurations of the PDB structure 2JOF have been loaded into the workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The top entry will be used in this panel setup.
A new entry group and structure have appeared in the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion titled 2JOF-prepared, and is automatically Selected(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries and Includedthe entry is represented in the Workspace, the circle in the In column is blue in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. This structure is the fully prepared protein structure.
Feel free to visualize and stylize the structure in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. In the new entry only polar hydrogens are shown. Going to Style > Show all Hydrogens to view all the hydrogen atoms.
4. Building and Running an MD Simulation
In this section, we will build a multi-component box with the Disordered System Builder panel containing Tween 20, sucrose, water, chlorine anions, and sodium cations, representing a 0.5 M sucrose, 0.15 M NaCl, and a 0.05 M Tween 20 solution. In the build, we will immerse our protein structure in the solution. We will then use the MD Multistage Workflow panel to perform a molecular dynamics simulation.
- Select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the five excipients_water entries: tween 20, sucrose, water, Cl_ion and Na_ion from the entry list
- Includethe entry is represented in the Workspace, the circle in the In column is blue the 2JOF-prepared entry
- Go to Tasks > Materials > Structure Builders > Disordered System
- The Disordered System Builder panel opens
- The five selected components are by default loaded into the panel
- If you see different components or additional components, revisit Step 1
- For Initial state, choose Amorphous
- Change Number of molecules to 4569
- In the Components table, change the Number of molecules for tween 20 to 4, sucrose to 40, water to 4500, Cl_ion to 12, and Na_ion to 13
- An additional Na ion is required to neutralize the overall system charge due to the protein structure's -1 charge.
- The wt% appears in the panel and the table updates interactively
- Check the Substrate box
- Import the included entry 2JOF - prepared
- Ensure that Substrate type is Immersed
- For Periodic Boundary Conditions (PBC), choose Create new cubic PBC
Note: Building complex protein systems, especially larger ones, presents significant challenges. For instance, if protein concentrations are above 20 wt%, the calculations can take significantly longer. Simply adding more water molecules to the system would likely increase the building time linearly, assuming a consistent target density.
However, the introduction of excipients complicates the system. If excipients are large, flexible, or present at high concentrations, the process can quickly become impractical. As protein levels, or the concentration and size of excipients, increase, calculations become more difficult.
If issues arise, consider separating the excipients from the solvent addition or building the system at lower densities. In such cases, the Solvation Builder can efficiently handle solvent addition for large systems.
- Go to the Disorder tab
- Uncheck the Color molecules by component box
- Set the Initial density to 0.200
- Change the Job name to disordered_system_trp_cage_excipients
- Adjust the job settings (
) as needed
- This job requires a CPU host. The job can be completed in about 5 minutes on a CPU host.
- If you would like to run the job yourself, click Run. Otherwise, import the pregenerated
Section_04 > disordered_system_trp_cage_excipients > disordered_system_trp_csge_excipients-out.cmsfile from the provided tutorial files via File > Import Structures - Close the Disordered System Builder
When the job is complete, a new entry group will be incorporated titled MD: disordered_system_trp_cage_excipients_system (1) containing one entry titled disordered_system_trp_cage_excipients_all_components_amorphous
- Includethe entry is represented in the Workspace, the circle in the In column is blue the new entry
- The box is visible in the workspace
- The components are colored by default, but feel free to stylize as you wish
- Use the WAM button (
) to open the MD Multistage Workflow panel
- Alternatively, access the panel via Tasks > Materials > Classical Mechanics > MD Simulations > MD Multistage Workflow
We will next equilibrate this disordered system in order to prepare it for an MD simulation at constant volume.
First, the Constant pressure protocol will be applied, which is a robust three-stage scheme for equilibrating materials. It consists of a Brownian stage, a low temperature MD stage, and a regular MD stage.
The system will then undergo 80 ns of MD equilibration in the isothermal-isobaric (NPT) ensemble. Although 80 ns might be more than necessary for this particular system to reach equilibrium, we are opting for a conservative approach.
- Check Relaxation protocol and choose Constant pressure
- Change the next stage (Stage 6) to Molecular Dynamics
- Set the Simulation time (ns) to 80
- Set the Trajectory Recording interval (ps) to 160
- Change the Job name to multistage_simulation_trp_cage_excipients
- Adjust the job settings (
) as needed
- This job requires a GPU host. The job can be completed in about 3 hours on a GPU host
- If you would like to run the job yourself, click Run. Otherwise, import the pre-generated
Section_04 > multistage_simulation_trp_cage_excipients > multistage_simulation_trp_cage_excipients-out.cmsfile from the provided tutorial files via File > Import Structures - Close the MD Multistage Workflow panel
- When the job is finished or after importing, select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries and includethe entry is represented in the Workspace, the circle in the In column is blue the new disordered_system_trp_cage_excipients_all_components_amorphous entry from the entry list
- In the Figure the hydrogen atoms are hidden to better visualize the protein and excipients
Feel free to visualize the Trajectory if you are interested by clicking the
. For additional analysis, refer to the Diffusion or Cluster Analysis tutorials. Alternatively, proceed to the Creating a Coarse-Grained Model for Protein Formulations tutorial to run a CG simulation on the system.
5. Conclusion and References
In this tutorial we learned the preparation of a protein for constructing complex protein systems with various excipients.
For further learning:
For introductory content, focused on navigating the Schrödinger Materials Science interface, an Introduction to Materials Science Maestro tutorial is available. Please visit the materials science training website for access to 100+ tutorials. For scientific inquiries or technical troubleshooting, submit a ticket to our Technical Support Scientists at help@schrodinger.com.
For self-paced, asynchronous, online courses in Materials Science modeling, including access to Schrödinger software, please visit the Schrödinger Online Learning portal on our website.
For some related practice, proceed to explore other relevant tutorials:
- Creating a Coarse-Grained Model for Protein Formulations
- Building Solvated Systems
- Building, Equilibrating and Analyzing Amorphous Polymers
- Building a Semicrystalline Polymer
- Building a Coarse-Grained Surfactant Model with Martini Force Field
- Building a Coarse-Grained Skin Model using Martini Force Field
- Ibuprofen Cyclodextrin Inclusion Complexes with the Martini Coarse-Grained Force Field
- Diffusion
- Cluster Analysis
For further reading:
- See the help documentation on the Protein Preparation Workflow, Disordered System Builder, and MD Multistage Workflow panels
- A complete introduction to Desmond in the help documentation
6. Glossary of Terms
Entry List - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
Included - the entry is represented in the Workspace, the circle in the In column is blue
Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data
Recent actions - This is a list of your recent actions, which you can use to reopen a panel, displayed below the Browse row. (Right-click to delete.)
Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project
Selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries
Working Directory - the location where files are saved
Workspace - the 3D display area in the center of the main window, where molecular structures are displayed