We found a match
Your institution may have rights to this item. Sign in to continue.
- Title
Incorporating PLS model information into particle swarm optimization for descriptor selection in QSAR/QSPR.
- Authors
Wang, Yong; Huang, Jing‐Jing; Zhou, Neng; Cao, Dong‐Sheng; Dong, Jie; Li, Han‐Xiong
- Abstract
As a representative paradigm of evolutionary algorithms, particle swarm optimization (PSO) has been combined with partial least square (PLS) (called PSO-PLS) to select informative descriptors in quantitative structure-activity/property relationship (QSAR/QSPR). However, one of the main limitations of PSO-PLS is that it ignores PLS model information. In this paper, by incorporating the PLS model information into PSO-PLS, we present a novel weighted sampling method (called WS-PSO-PLS) to choose the optimal descriptor subset. Due to the fact that the regression coefficients of the PLS model reflect the importance of descriptors in the model development, we firstly obtain the normalized regression coefficients by establishing the PLS model with all the descriptors. Afterward, weighted sampling is used to generate some individuals according to the aforementioned normalized regression coefficients. Finally, we employ some dimensions of the generated individuals to replace the corresponding dimensions of the individuals with poor quality in the population at each generation. WS-PSO-PLS has been assessed through three QSAR/QSPR datasets and the experimental results suggest that WS-PSO-PLS has the capability to effectively guide the search process by introducing the PLS model coefficients into PSO during the evolution and, therefore, performs better than PSO-PLS. WS-PSO-PLS could be considered as a general and promising mechanism to introduce extra information to improve the performance of PSO for descriptor selection in QSAR/QSPR.
- Subjects
LEAST squares; INFORMATION theory; PARTICLE swarm optimization; DESCRIPTOR systems; QSAR models; REGRESSION analysis
- Publication
Journal of Chemometrics, 2015, Vol 29, Issue 12, p627
- ISSN
0886-9383
- Publication type
Article
- DOI
10.1002/cem.2746