[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

how to choose same test set for related data sets

***  For details on how to be removed from this list visit the  ***
***    CCP4 home page http://www.dl.ac.uk/CCP/CCP4/main.html    ***

Dear all,

I apologize in advance for asking a question that I'm sure has been
asked before, but I haven't been able to find that previous answer...

Anyway, our situation is that we have several data sets with different
ligands bound to the same protein, all in the same spacegroup and
essentially the same unit cell.  After solving one of these structures
by molecular replacement, we intend to use that model as the starting
model for the rest of the datasets, (and then of course look for the
ligand in each).  My understanding is that we should choose the same
reflections to be the test set for all datasets in order to maintain
true cross-validation in the later refinements.  

My question is:  how do we go about ensuring that the same reflections
are chosen for the test set in all cases?  We are processing the data
with Mosflm/Scala, and doing refinement with X-PLOR, so a strategy that
works either with mtz files or X-PLOR cv files would be fine.

                             Thanks in advance.

Susan Heffron
Department of Physiology and Biophysics
University of California, Irvine
e-mail:   sheffron@uci.edu