Abstract
Quantitative structure - property/activity relationships (QSPRs/QSARs) can be used to predict physicochemical and/or biochemical behavior of substances which were not studied experimentally. Typically predicted values for chemicals in the training set are accurate since they were used to build the model. QSPR/QSAR models must be validated before they are used in practice. Unfortunately, the majority of the suggested approaches of the validation of QSPR/QSAR models are based on consideration of geometrical features of clusters of data points in the plot of experimental versus calculated values of an endpoint. We believe these geometrical criteria can be more useful if they are analyzed for several splits into the training and test sets. In this way, one can estimate the reproducibility of the model with various splits and better evaluate model reliability. The probability of the correct prediction of an endpoint for external validation set (in the series of the above-mentioned splits) can provide an useful way to evaluate the domain of applicability of the model.
Original language | English |
---|---|
Pages (from-to) | 1223-1230 |
Number of pages | 8 |
Journal | Bioorganic and Medicinal Chemistry |
Volume | 23 |
Issue number | 6 |
DOIs | |
Publication status | Published - Mar 15 2015 |
Keywords
- CORAL software
- Domain of applicability
- Oral rat toxicity
- QSAR
- Validation
ASJC Scopus subject areas
- Biochemistry
- Clinical Biochemistry
- Molecular Biology
- Molecular Medicine
- Organic Chemistry
- Drug Discovery
- Pharmaceutical Science
- Medicine(all)