[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: [ccp4bb]: Amore Rotation Function Scoring?
*** For details on how to be removed from this list visit the ***
*** CCP4 home page http://www.ccp4.ac.uk ***
Dirk-
You can always just grep out the SOLUTIONRC lines and sort them using shell
commands:
cat amore.output | grep SOLUTIONRC | sort -r +8 > sort.list
Hope that is useful-
Steve
---------------------------------------
Stephen M. Soisson, Ph.D.
X-ray Crystallography
Merck Research Laboratories
Phone: (732)594-4349
-----Original Message-----
From: Dirk Kostrewa [mailto:dirk.kostrewa@psi.ch]
Sent: Monday, September 23, 2002 5:44 AM
To: CCP4BB
Subject: [ccp4bb]: Amore Rotation Function Scoring?
*** For details on how to be removed from this list visit the ***
*** CCP4 home page http://www.ccp4.ac.uk ***
Dear CCP4 users,
the Amore cross-rotation function basically calculates a correlation
coefficient between the observed and calculated Patterson function (CC_P).
However, the output of the cross-rotation search is for some dubious reason
sorted on the correlation coefficient between calculated and observed F
(CC_F). This doesn't make much sense to me for the following reasons:
(1) The search function is the CC_P, thus, from a methodological point of
view, the output should be sorted on this value and not on something else.
(2) Both, the calculated F and I of the model only make sense after it has
been correctly positioned, which is not the case in the cross-rotation
search.
(3) Accordingly, the signal-to-noise must be much better for CC_P than for
either CC_F or CC_I. To illustrate this, I have run a cross-rotation search
with the refined protein-only model of the A. niger phytase (Kostrewa et
al.,
NSB, 4, 185ff, 1995) against its observed data. The top 10 of the amore
cross-rotation output looks like this (I've removed the TX,TY,TZ columns for
better readability):
ITAB ALPHA BETA GAMMA CC_F RF_F CC_I CC_P Icp
SOLUTIONRC 1 3.28 85.77 237.92 27.9 55.3 42.8 26.8 1
SOLUTIONRC 1 117.85 90.00 58.64 22.2 57.2 34.3 16.3 2
SOLUTIONRC 1 90.57 80.41 235.17 18.2 58.3 26.3 5.6 3
SOLUTIONRC 1 60.20 85.07 240.47 17.9 58.5 25.9 4.9 4
SOLUTIONRC 1 22.57 57.12 223.13 17.9 58.5 26.6 4.1 5
SOLUTIONRC 1 47.85 86.10 237.71 17.8 58.5 25.8 5.2 6
SOLUTIONRC 1 87.65 60.22 71.67 17.8 58.4 25.9 4.4 7
SOLUTIONRC 1 80.37 85.82 235.99 17.7 58.4 25.1 4.5 8
SOLUTIONRC 1 44.86 24.72 48.00 17.7 58.5 26.0 5.6 9
SOLUTIONRC 1 41.18 58.25 87.29 17.7 58.4 25.7 6.4 10
Interestingly, the correct top peak appears to be also the top peak in CC_F
and CC_I. However, as you can clearly see, the signal-to-noise ratio is MUCH
better for CC_P. Now, imagine that you do not have a perfect search model.
In
this case, I think, the chances to find the correct peak are much poorer if
the output is sorted on CC_F rather than on CC_P. I don't know what you
other
users of CCP4 think about this, but I would strongly prefer a sorting on the
real search function values rather than on something else in order to get
the
best chances to find the correct molecular replacement solution.
Unfortunately, CCP4 Amore apparently does not give the user the choice on
which values he/she wants to sort the output. Thus, the request from my side
to the CCP4 developers is to give the user the choice on which values the
output should be sorted, and to set the sorting on CC_P as the default, and
not the sorting on CC_F.
Best regards,
Dirk.
--
***************************************************************
Dirk Kostrewa
Paul Scherrer Institut E-mail: dirk.kostrewa@psi.ch
Life Sciences, OSRA/007 Phone: +41-56-310-4722
CH-5232 Villigen PSI Fax: +41-56-310-4556
Switzerland Internet: http://www.sb.psi.ch
***************************************************************
------------------------------------------------------------------------------
Notice: This e-mail message, together with any attachments, contains information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA) that may be confidential, proprietary copyrighted and/or legally privileged, and is intended solely for the use of the individual or entity named in this message. If you are not the intended recipient, and have received this message in error, please immediately return this by e-mail and then delete it.
==============================================================================