Transcript - Using the NCBI database to visualise the 3D structures of proteins Welcome to Bioinformatics for Biologists. IÕm Stevie Bain, a researcher from the University of Edinburgh. In this video, we are going to use the NCBI (National Center for Biotechnology Information) to visualize 3-dimensional protein structures. Firstly, we need to navigate to the NCBI homepage by typing ncbi.nlm.nih.gov into the address bar of your web browser. The NCBI homepage has a search bar that allows us to search the databases using a keyword search. In this video, we are going to focus on the 3-dimensional protein structure database. So, we click on the dropdown menu to the left of the search bar and select ÔStructureÕ. Now we type the name of the protein whose structure we want to search for in the search bar. In this video, weÕll type deoxyhaemoglobin AND Homo sapiens [ORGN]. The addition of [ORGN] after Homo sapiens means that we only want to search for deoxyhaemoglobin found in the species, Homo sapiens. It basically allows us to perform a more specific search of the database. We then click search, and in a few moments, we should see our results page. We can see that we have 106 results for deoxyhaemoglobin in Homo sapiens in this database. For each of these results, the line in blue is the description, it tells us the full name of the structure. And importantly, if we look underneath this we can see ÒtaxonomyÓ which tells us what species this structure comes from. When using this database, it is important to take time look through the descriptions of results to ensure that you find the most suitable structure. You can always refine your search terms and search again if necessary. If we click on the description line of our chosen result it will take us to a page with more details. We can see the name of the structure and also some information about the scientific literature associated with this database entry. When we scroll down, we can see an image of the structure and also the molecular components. In this example, we have 4 protein chains: 2 that are Hemoglobin S alpha chains and 2 that are Hemoglobin S beta chains. Underneath the chain name, we have the gene symbol for each. If we click on this, we are taken to a page that gives us some info about the gene. For example, we can see the HBA1 gene is located on chromosome 16. On this page, we also have some more information about the chemicals and molecules that bind to these proteins. Here we can see that we have 4 molecules that contain iron - these are the haem groups. Once again, we can click on this description to get more information about this structure. If we want to look at this deoxyhaemoglobin protein structure in more detail, we can scroll back up and click full-featured 3D viewer. This may take a while to load but itÕs worth having patience as this feature will allow us to interact with a 3D view of our structure. Here we can hover the mouse over the structure to see which amino acids are in the protein. If we click down on the structure, we can also move it to focus on different parts. In this viewer, we can also very easily visualize secondary structures, for example, alpha helices are shown as curled ribbons. On the right-hand side, we can once again see each of the protein chains in this structure and the molecules. On the right-hand side, each of the protein chains and their associated conserved domains are represented by colored boxes. If we wish to highlight any of these regions in the 3D viewer, we simply click on the colored box and a yellow highlight will appear around that area. If we wish to remove this highlight, we simply go to the toggle at the top that says ÔselectionÕ and click. As you can see this selection has now been removed so we can click on a different protein chain. At the top of the viewer, there are a number of menus. One particularly useful menu is the ÔcolorÕ menu. This allows us to color the 3D structure based on particular properties or features. Right now, ÔchainÕ is selected. That means that each of the protein chains in the structure is uniquely colored. LetÕs change this to charge. Now we can very clearly see areas of the structure that have different charges: negatively charged areas are red, positively charged areas are blue and those that are grey are neutral. Once again, we can use the boxes at the side to make specific selections. We can also change the style of the structure. If we go to proteins, you will see that there are many options that allow us to change how the structure is styled. We can change this to lines for example or show the proteins as spheres. LetÕs go back to proteins and change their style to ribbon. Then we can then move along to the side chains option - which you will see right now is hidden - and change this to show the side chains. There are many options in this viewer, and we recommend that you have a go at exploring these yourself. We hope you found this video useful. If you would like some more information about our project and our free resources: 4273pi.org or @4273pi