TY - JOUR
T1 - SMILES-based optimal descriptors
T2 - QSAR modeling of estrogen receptor binding affinity by correlation balance
AU - Toropov, Andrey A.
AU - Toropova, Alla P.
AU - Diaza, Rodolfo Gonella
AU - Benfenati, Emilio
AU - Gini, Giuseppina
PY - 2012/4
Y1 - 2012/4
N2 - Quantitative structure-activity relationships for model of estrogen receptor relative binding affinity (pRBA) have been built. These models are one-variable correlations between pRBA and optimal descriptor calculated with simplified molecular input line entry system. These models were obtained by means of the correlation balance: one subset of the training set (sub-training set) plays role of the training; the second subset (calibration set) plays role of the preliminary check of the models. Three splits into the sub-training set, calibration set, and external test set were examined. It has been shown that the correlation balance is a more robust predictor for the endpoint than classic scheme (training set-test set: without of the calibration). The statistical characteristics of the model are n = 59, r 2 = 0.8792, s = 0.643, F = 415 (sub-training set); n = 39, r 2 = 0.8805, s = 0.637, F = 273 (calibration set); and n = 31, r 2 = 0.8132, s = 0.781, F = 126 (test set).
AB - Quantitative structure-activity relationships for model of estrogen receptor relative binding affinity (pRBA) have been built. These models are one-variable correlations between pRBA and optimal descriptor calculated with simplified molecular input line entry system. These models were obtained by means of the correlation balance: one subset of the training set (sub-training set) plays role of the training; the second subset (calibration set) plays role of the preliminary check of the models. Three splits into the sub-training set, calibration set, and external test set were examined. It has been shown that the correlation balance is a more robust predictor for the endpoint than classic scheme (training set-test set: without of the calibration). The statistical characteristics of the model are n = 59, r 2 = 0.8792, s = 0.643, F = 415 (sub-training set); n = 39, r 2 = 0.8805, s = 0.637, F = 273 (calibration set); and n = 31, r 2 = 0.8132, s = 0.781, F = 126 (test set).
KW - Binding affinity
KW - Correlation balance
KW - Estrogen receptor
KW - Optimal descriptor
KW - QSAR
KW - SMILES
UR - http://www.scopus.com/inward/record.url?scp=84862688225&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862688225&partnerID=8YFLogxK
U2 - 10.1007/s11224-011-9892-y
DO - 10.1007/s11224-011-9892-y
M3 - Article
AN - SCOPUS:84862688225
SN - 1040-0400
VL - 23
SP - 529
EP - 544
JO - Structural Chemistry
JF - Structural Chemistry
IS - 2
ER -