Artículos de revistas
Is your QSAR/QSPR descriptor real or trash?
Registro en:
Journal Of Chemometrics. Wiley-blackwell, v. 24, n. 41984, n. 681, n. 693, 2010.
0886-9383
WOS:000286291500007
10.1002/cem.1331
Autor
Kiralj, R
Ferreira, MMC
Institución
Resumen
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) The sign change problem in quantitative structure-activity relationship (QSAR), quantitative structure-property relationship (QSPR) and related studies is the controversy related to the signs of correlation coefficients and regression coefficients of a descriptor in univariate and multivariate regressions, before and after the data split. Among 50 investigated regression models with 227 descriptors extracted from the literature, the sign change problem was shown to have a very high frequency, according to four new criteria proposed in this work for its assessment. The sign change problem can be substantially reduced and even eliminated for a given dataset by statistically based variable selection and by checking for the sign change problem before model validation and interpretation. Knowing the fundamentals of statistics related to the sign change problem, its identification and understanding aid in finding effective means to remedy regression models with this deficiency. Copyright (C) 2010 John Wiley & Sons, Ltd. Supporting information may be found in the online version of this article 24 41984 SI 681 693 Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)