Introduction to BioLuminate and the Multiple Sequence Viewer/Editor
Tutorial Created with Software Release: 2023-4
Topics: Antibody Design , Biologics Drug Discovery , Enzyme Engineering , Structure Prediction & Target Enablement
Products Used: BioLuminate
|
1.2 MB |
This tutorial is written for use with a 3-button mouse with a scroll wheel.
Words found in the Glossary of Terms are shown like this: Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Abstract:
In this tutorial, we will use BioLuminate to analyze structures of the influenza enzyme neuraminidase, the target of the FDA-approved flu medicines Tamiflu™ (oseltamivir) and Relenza™ (zanamivir). First, we will explore different ways of visualizing and comparing structures. Second, we will analyze sequences of neuraminidase variants and look for residue conservation in the binding site. Finally, we will build a basic homology model of a neuraminidase homolog and examine the quality of the model.
Tutorial Content
1. Creating Projects and Importing Structures
At the start of the session, change the file path to your chosen Working Directorythe location that files are saved in Maestro to make file navigation easier. Each session in Maestro begins with a default Scratch Projecta temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project, which is not saved. A Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is created, the project is automatically saved each time a change is made.
Structures can be imported from the PDB directly, or from your Working Directorythe location that files are saved using File > Import Structures, and are added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion and Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data. The Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion is located to the left of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. The Project Tabledisplays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data can be accessed by Ctrl+T (Cmd+T) or Window > Project Table if you would like to see an expanded view of your project data. The Toggle Table is another way to interact with structures and can be accessed via Window > Toggle Table.
- Double-click the BioLuminate icon
- (No icon? See Starting Maestro)
- Go to File > Change Working Directory
- Find your directory, and click Choose
- Pre-generated input and results files are included for running jobs or examining output. Download the zip file here: https://www.schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/intro_bioluminate.zip
- After downloading the zip file, unzip the contents in your Working Directory for ease of access throughout the tutorial
- Go to File > Open Project
- Choose BioLuminate_basics.prjzip
- Click Open
- Structures are now shown in the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
2. Visualizing and Comparing Structures
In this section, we will use the Protein Preparation Workflow to fix common issues from a raw PDB structure. After, we will visualize the zanamivir-bound structure of the 1918 H1N1 “Spanish flu” neuraminidase (PDB 3B7E) using different Presets options and use Protein Structure Alignment to compare the drug-bound and apo structures (PDB 3BEQ). Please see the Introduction to Structure Preparation and Visualization and Best Practices for Protein Preparation for more information on the protein preparation process in general as well as the Protein Preparation Workflow..
2.1 Prepare the protein structures
- Includethe entry is represented in the Workspace, the circle in the In column is blue entry 3B7E_chainA
- Click Protein Preparation in the Favorites Toolbar
- The Protein Preparation Workflow opens
Note: The Protein Preparation Wizard can also be accessed by going to Tasks > Browse All > Protein Preparation and Refinement > Protein Preparation Workflow
Note: Click on the star next to items in Tasks to add it to the Favorites Toolbar
- Under Specify Protein, click Review Structure
- The Substructures tab opens
Use the Substructures tab to remove unwanted items from the structure, such as chains, waters, or crystallization agents (e.g. GOL)
- Shift-click to select(1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries the GOL (Glycerol) rows in the Ligands, Metals, Other table
- Click Delete from Entry
- Click Diagnostics
- The Diagnostics tab opens
Note: The only issues found for the structure is that some of the residues and waters have alternate positions
- Select the 2 residues and 4 waters and click Commit
- Click Preparation Workflow
- The Preparation Workflow Tab opens
- For Job name type proteinprep_3B7E_A
-
Click Run
- This step takes ~5 minutes
- A new entry is added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
2.2 Visualize the prepared protein structure using Presets
- Includethe entry is represented in the Workspace, the circle in the In column is blue entry 3B7E_chainA_prepared
- Click Presets
- Choose BioLuminate Default
- The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed is rendered to show the protein as a Green ribbon with the small molecule in grey
Note: The Toggle Table on the right of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed can also be used to change rendering. The Toggle Table mimics PyMOL functionality.
Note: More information on Workspacethe 3D display area in the center of the main window, where molecular structures are displayed visualization can be found in the Introduction to Structure Preparation and Visualization
2.3 Generate a binding site surface
- Go to Workspace > Surface > Binding Site
- The Create Binding Site Surfaces panel opens
- Unselect Ligand
- Click OK
- A surface is shown in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
- Negative regions are red and positive regions are blue
- An S icon is next to the entry in the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
- Type L
- The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed zooms to the ligand
2.4 Analyze the binding site
-
Type M
- Selection mode is changed to Molecule
- The cursor has an M icon
- In the Workspace Configuration Toolbar, turn off Surfaces
-
Click on the ligand in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
- The ligand is highlighted
- The (Selection) entry appears in the Toggle Table
-
Click Expand and choose 5Å
- Residues within 5 Å of the ligand are highlighted
Note: This can also be done using the Toggle Table (Selection) entry A > Modify > Expand > Residues within 5A
- In the Toggle Table, for (Selection), choose S > Sticks
- The selection is rendered in sticks
- In the Toggle Table, for (Selection), choose H > Surfaces and H > Waters
- The surface and waters are hidden
- In the Toggle Table, for (Selection), choose L > Residues
- Residue information is labeled in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
- In the Quick Select toolbar, click Clear Selection
- Click on the ligand
- In the Toggle Table, for (Selection), navigate to A > Polar Contacts > Find > To Any Atoms Excluding Solvent
- Hydrogen bonds are shown in yellow, salt bridges in pink
2.5 Align the drug-bound and apo structures
- Shift-click to includethe entry is represented in the Workspace, the circle in the In column is blue the entry 3BEQ_chainA - prepared (3B7E_chainA - prepared should already be included)
- Both the drug-bound (green) and apo structures (blue) are visualized in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
- Go to Tasks > Browse All > Protein Preparation and Refinement > Protein Structure Alignment
- The Protein Structure Alignment panel opens on the right side of the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed, along with a banner
- By default, all residues are chosen for alignment
- Click Align
- The structures are aligned
- The Protein Structure Alignment Results panel opens, showing an Alignment Score and RMSD
To more easily compare the two structures, we will want to render them identically (though the very low RMSD should already indicate that they will be exceedingly similar).
- Click Presets
- Choose Binding Site Comparison
- The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed is rendered in a way that you can easily compare the binding site across the two structures
Note: This will undo some of the previous changes we made to the visualization of the the 3B7E_chainA - prepared structure
- Hover over the structures to identify regions of interest
- The 150 loop region has a different conformation between the drug-bound and apo structures
- Go to Workspace > Save Image As
- The Save Image panel opens
- Change File name to aligned_structures
- Click Save
- A .png image is saved to your Working Directorythe location that files are saved
Note: Click Options >> in the Save Image panel to adjust image properties
3. Analyzing Sequences using the Multiple Sequence Viewer/Editor
In this section, we will align and analyze a dataset of neuraminidase sequences using the Multiple Sequence Viewer/Editor. We will inspect sequences derived from PDB structures for residue characteristics and examine the residue conservation of the binding site.
3.1 Perform a Multiple Sequence Alignment
- Go to Workspace > Clear Workspace
- The Workspacethe 3D display area in the center of the main window, where molecular structures are displayed and Toggle Table are empty
- Go to Tasks > Biologics > Multiple Sequence Viewer/Editor
- The Multiple Sequence Viewer/Editor panel opens
- Go to File > Import Sequences from File
- Choose neuraminidase_InfluenzaA_ PDBsequences.fasta
- Click Open
- 54 sequences are now in the MSV
To better understand the conservation of residues compared to the reference (which by default is the top sequence) we are going to try some styling that will point out areas where there is variation and areas that are more conserved. This will be more useful after we perform the multiple sequence alignment
- Hover over the brush icon and click the …
- For Apply to, choose Residues matching reference
Note: By default, the top sequence will be considered as the reference
- Click Align
Note: Make sure Selected only is unchecked
- Click Align
- Sequences are aligned
- Click the plus icon
- Deactivate Wrap sequences
- All sequences can now be found on a single row
- Scroll to the right of the sequence viewer to look at the sequences starting with residue 84
In addition to the styling, we would want to use a Sequence Logo chart to better visualize and (when hovering over) quantify the variation at different positions.
- Hover over the graph icon and click the …
- Select Sequence Logo
- A logo plot is added above the reference sequence
- Click Sequence Logo again to toggle it off
3.2 Analyze the binding site
In order to do some binding site analysis, we will first need to bring in the 3B7E sequence into the Multiple Sequence Viewer/Editor and then set it as a the reference so we can align the rest of the sequences to it
- In the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion, includethe entry is represented in the Workspace, the circle in the In column is blue 3B7E_chainA_prepared
- In the Multiple Sequence Viewer/Editor panel, for Load from, choose Workspace and click the download icon
- The sequence is added to the bottom of the MSV
- Right-click on 3B7E_chainA_prepared
- Click Set as Reference
- Click Align
- Unselect Selected only
- Click Align
- Sequences are re-aligned to 3B7E_chainA_prepared
- Scroll to the right of the sequence viewer to look at the sequences starting with residue 84
- Go to Select > Binding Site Residues
- The binding site residues for the 3B7E_chainA_prepared structure are selected in the Multiple Sequence Viewer/Editor
- Hover over the graph icon and click the …
- Select Binding Site
- The binding site region is annotated in the Multiple Sequence Viewer/Editor
We will now color the 3B7E_chainA_prepared structure such that the binding site residues are all red (based on the sequence).
- Make sure the binding site residues are still selected, otherwise go to Select > Binding Site Residues to select them again.
- Hover over the brush icon and click the …
- Click Apply to and choose No residues on tab
- For Paint selected, choose Red
- Click Color entire residues
- Click Color Residues
- Click the Presets dropdown
- Choose Legacy > Pretty
- The binding site residues are all colored red in the Workspace
4. Building a Homology Model
In this section, we will build a structural model of 1918 H1N1 neuraminidase using a template derived from a homolog. We will choose a homolog from a BLAST search and visualize the results in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed. Then, we will examine the quality of the model and use the Protein Preparation Wizard to refine the structure.
4.1 Choose a template and generate a model
- In the Multiple Sequence Viewer/Editor, right click on the second listed sequence gi|407943793|pdb|4B7Q|A and select Set as Reference
- Click Other Tasks
- Choose Build Homology Model
- The Build Homology Model panel opens
While we could use the 3B7E_chainA - prepared structure as a reference, we would like to simulate finding a homolog from a BLAST search
- Make sure that Use is set to 1 target, 1 template
- For Find a template structure, click Find
- Click the cog
- Uncheck Use local server only
Note: This requires internet access. The ‘Use local server only option’ is checked by default to prevent your BLAST searches and related tasks from going out to remote servers. To allow remote access, uncheck this option. You will be prompted to confirm the online search every time to avoid accidentally leaking information. This option is also available from the top-level Edit → Settings and Defaults menu.
- Click Run Search
While there are many templates with 100% sequence identity which would surely be the most ideal if we were running this as part of a project, we will choose one with ~96% sequence identity, which though not perfect is still very good to better demonstrate the homology modeling process.
- Choose ID 2HTY_H
- Click Import
- A new structure is includedthe entry is represented in the Workspace, the circle in the In column is blue in the Workspace the 3D display area in the center of the main window, where molecular structures are displayedand as an entry in the Multiple Sequence Viewer/Editor
- The Find a template structure section in the Build Homology Model panel is now populated
Note: With the high sequence identity between the two sequences we don’t need to run any alignment beyond the auto-alignment
Note: Make sure no ligands are selected in the Include ligands & cofactors (optional) section
- For Job name, write homology_modeling_2HTY
- Click Generate Model
- This step takes ~1 minute
- The structure is updated in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed
Note: Purple indicates regions that overlap directly with the template, cyan indicates regions where the target and template sequence differ
4.2 Examine the model quality
Now that we have predicted the structure of the protein via homology, we want to assess its quality compared to other structures in the Protein Data Bank. See the documentation page for Protein Reliability Report panel for more information.
-
Go to Biologics > Structure Reliability Report
- The Protein Reliability Report opens
- Change job name to homology_reliability
- Click Run
- This step takes ~2 minutes
- A new entry is added to the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
- A banner appears
- On the banner, click Protein Reliability Report
- The Protein Reliability Report Panel Opens
Note: The Protein Reliability Report output can also be viewed by including homology_reliability-out in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed and reopening the Protein Reliability Report Panel through Tasks > Biologics > Reliability Report
Note: In the Protein Reliability Report, small green circles represent properties within a reasonable distance of the average PDB structure, while large red circles are considered potential matters of concern. In general, the three outputs that are looked at to determine the quality of a protein structure are ‘Steric Clashes’, ‘Missing Atoms’ and ‘Missing Loops’
To address some of the issues found in the Protein Reliability Report we are going to do some structural refinement using the Protein Preparation Workflow.
- Go to Tasks > Browse All > Protein Preparation and Refinement > Protein Preparation Workflow
- The Protein Preparation Workflow panel opens
- Click INTERACTIVE
- Under Minimize and Delete Waters, go click Settings
- Under Delete waters, uncheck Distant from ligands (hets)
- Click Clean Up
- This job takes ~2 minutes
- A new entry is in the Entry Lista simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
- In the Protein Reliability Report, change the job name to homology_reliability_prepared
- Click Run
- On the banner, click Protein Reliability Report
- The Protein Reliability Report Panel Opens
Note: The biggest issues present in the Protein Reliability Report of the unprepared structure (steric clashes, and side chain and backbone dihedrals) have been fixed with the minimization.
5. Conclusion and References
In this tutorial, we used BioLuminate to analyze structures of the influenza enzyme, neuraminidase. We prepared the structures, visualized them in the Workspacethe 3D display area in the center of the main window, where molecular structures are displayed, and aligned the binding sites. Then, we analyzed sequences of neuraminidase variants, inspecting the binding site for residue conservation. Finally, we built a homology model of a neuraminidase homolog, analyzed the model quality, and refined the structure.
For further learning:
- Introduction to Structure Preparation and Visualization
- Sequence Annotation of Antibodies with the Multiple Sequence Viewer/Editor
- Peptide Modeling with BioLuminate
- Antibody Visualization and Modeling in BioLuminate
- Batch Homology Modeling Using the Multiple Sequence Viewer/Editor
- Chimeric Homology Modeling Using the Multiple Sequence Viewer/Editor
- Introduction to Computational Antibody Engineering online course (Course Page | Preview)
- Introduction to Molecular Modeling in Drug Discovery online course (Course Page | Preview)
For further reading:
6. Glossary of Terms
Entry List - a simplified view of the Project Table that allows you to perform basic operations such as selection and inclusion
included - the entry is represented in the Workspace, the circle in the In column is blue
incorporated - once a job is finished, output files from the working directory are added to the project and shown in the Entry List and Project Table
Project Table - displays the contents of a project and is also an interface for performing operations on selected entries, viewing properties, and organizing structures and data
Scratch Project - a temporary project in which work is not saved, closing a scratch project removes all current work and begins a new scratch project
selected - (1) the atoms are chosen in the Workspace. These atoms are referred to as "the selection" or "the atom selection". Workspace operations are performed on the selected atoms. (2) The entry is chosen in the Entry List (and Project Table) and the row for the entry is highlighted. Project operations are performed on all selected entries
Working Directory - the location that files are saved
Workspace - the 3D display area in the center of the main window, where molecular structures are displayed