CINF 38 |
| In QSAR modeling of property/ bioactivity of chemicals using calculated molecular descriptors, we are faced with the usual problem of “few compounds and many descriptors.” Hence, variable-selection (descriptor-thinning) methods are used to select a proper subset of descriptors to develop QSAR models. It is vital to incorporate the descriptor selection, as well as any parameter selection, as part of the modeling procedure to be cross-validated for assessment of the model. When the cross-validation step does not include all such elements of the modeling procedure, the “naïve q2” thus estimated suffers from an upward bias. Application of proper cross-validation that includes descriptor thinning is necessary for developing QSAR models with good predictive ability. The importance of embedding descriptor selection as well as parameter selection inside the cross-validation step, resulting in calculation of the “true q2”, is highlighted by a comparison of true q2 with naïve q2 for a few sets of compounds. |
|
Sci-Mix
8:00 PM-10:00 PM, Monday, August 20, 2007 BCEC -- Exhibit Hall - B2, Sci-Mix
Division of Chemical Information |