|
CCP4 Tutorial: Session 22a) Data processingThe ProblemThis example will start with intensity data that has already been scaled and merged (e.g. with scala or scalepack). The data is from the crystal structure of GerE, a transcription activator from Bacillus subtilis, which was solved by MAD phasing using the Se signal (V.M.A. Ducros, R.J. Lewis, C.S. Verma, E.J. Dodson, G. Leonard, J.P. Turkenburg, G.N. Murshudov, A.J. Wilkinson and J.A. Brannigan, J. Mol. Biol. (2001) 306 759-771).We are going to convert the intensities to structure factor amplitudes, and discuss some statistics that are generated. These statistics are essential for assessing the quality of the data, whether there is anisotropy, and whether there is twinning. Exercise2.1 Select the Data Reduction module, and open the Convert Intensities to SFs task window.2.2 On the first line, enter a suitable job title such as
2.3 Change the next line to:
2.4 Now enter the input file as:
2.5 In the section Data Harvesting, leave as:
2.6 In the section Required Parameters, we need to enter an estimate of the number of residues in the asymmetric unit. This is used in Wilson scaling, which allows one to put the data on an approximate absolute scale. Enter:
(there are 6 chains of 74 residues each in the asymmetric unit). 2.7 Do not change any of the remaining options, and click on Run -> Run Now. 2.8 When the job has finished, return to the main window, highlight the job in the Job List, and select View Files from Job -> View Log Graphs. This opens up the loggraph viewer. Graphs are selected by first clicking in the middle window to select a group of graphs, and then clicking in the bottom window to select a particular graph. 2.9 The graphs in Acentric Moments of ..., Centric Moments of ... and Cumulative intensity distribution are useful for deciding whether twinning is present. Have a look at these graphs. Use the cross-wires to estimate values. Compare the plotted values of the moments with the Expected values shown at the top of the window. These plots confirm there is no problem with twinning. The graph of the 2nd moment is the clearest. (See the accompanying document for an example where twinning occurs.) 2.10 Next, look at the graphs in Anisotropy analysis (FALLOFF). The graph of Mn(F/sd) v. resolution suggests that the data is slightly poorer along direction 3, which is defined to be perpendicular to a* and b*, i.e. there is some anisotropy in the data. 2.11 Close the loggraph window using File -> Exit. Close all other windows except the main window. 2b) Standardise MTZ fileThe ProblemYou now have a file of structure factor amplitudes for the reflections that were collected. It is considered good practice to add in all other reflections appropriate to the spacegroup and resolution, even if there is no data for them ("completing the dataset"). It is also good practice to add a column of free-R flags at this stage.Exercise2.20 Select the Reflection Data Utilities module, and open the Convert to MTZ and Standardise task window.2.21 On the first line, enter a suitable job title such as
2.22 On the second line, select MTZ from the pull-down menu:
2.23 On the 3rd line, select:
2.24 Now enter the input file as:
The output file will be automatically set to:
2.25 In the section MTZ Project & Dataset Names, the correct project and dataset names will have been inherited from the input file, so do not change these. 2.26 The remainder of the task window can be left unchanged, so go to the bottom of the task window and click on Run -> Run Now.
2.27 When the job has finished, view the output file by selecting
in the main window View Files from Job ->
gere_nat_unique1.mtz. First, notice that there is now
an extra column holding FreeR flags:
2.28 Click on List More Info at the
bottom of the display window. Accept the defaults
and click Apply&Exit. Now
look at the table of statistics at the bottom of the display window:
2.29 Close all windows except the main window. 2c) Combine native data with MAD dataThe ProblemYou now have a file of structure factors suitable for using in structure solution. Often you will have several files, obtained from different crystals or heavy atom derivatives. It is usually convenient to combine all these files into one MTZ file. In this example, we will combine the native data we have just processed with some MAD data for a selenomethionine derivative of GerE.Exercise2.40 Select the Reflection Data Utilities module, and open the Merge MTZ Files (Cad) task window.2.41 On the first line, enter a suitable job title such as
2.42 Now enter the first input file as:
2.43 Click on Add input MTZ file. Enter the second input file as:
2.44 Enter the output file as:
2.45 In the File completion and freeR extension section, make sure that FreeR_flag is declared:
If this is not declared, add this manually by selecting Enter label from the pull down menu and declaring FreeR_flag in the dialogue box. 2.46 The remainder of the task window can be left unchanged, so go to the bottom of the task window and click on Run -> Run Now.
2.47 When the job has finished, view the output file by selecting
in the main window View Files from Job ->
gere_MAD.mtz. The output file has 38 columns:
2.48 Close all windows except the main window. |