Lorentz JÄNTSCHI (lori) works ?id=215
- [id] => 215
- [recorddate] => 2010:11:07:16:31:22
- [lastupdate] => 2010:11:07:16:31:22
- [type] => article
- [place] => Weinheim, Switzerland
- [subject] => biology - evolution; chemistry - organic; mathematics - modeling; mathematics - statistics
- [relatedworks] =>
- 3 (low):
- Supervised evolution: research concerning the number of evolutions that occur under certain constraints, ?id=270

- 4 (some):
- Distribution of QSARs correlation coefficients, ?id=234

- [file] => ?f=215
- [mime] => application/pdf
- [size] => 620247
- [pubname] => Chemistry & Biodiversity
- [pubinfo] => Wiley-VCH Verlag GmbH & Co. KGaA
- [pubkey] => ISSN 1612-1872, eISSN 1612-1880
- [workinfo] => 7(8): 1978-1989, DOI: 10.1002/cbdv.200900356
- [year] => 2010
- [title] => A study of genetic algorithm evolution on the lipophilicity of polychlorinated biphenyls
- [authors] => Lorentz JÄNTSCHI, Sorana D. BOLBOACĂ, Radu E. SESTRAŞ
- [abstract] =>

The search for a multivariate linear regression (MLR) model in quantitative structure–property relationships (QSPR) is a hard problem because of the size of the search space. A genetic algorithm (GA) was developed and assessed for selecting suitable descriptors to predict the octan-1-ol/H2O partition coefficient of polychlorinated biphenyls. The GA was implemented as a Windows-based FreePascal application with MySQL connectivity for fetching the data. An outcome study based on 30 runs was carried out with all parameters held constant: sample size, 8; number of variables in the MLR, 2; adaptation-imposed requirements; maximum number of generations, 1000; selection strategy, proportional; probability of mutation, 0.05; number of genes involved in mutation, 2; optimization parameter, r²; optimization score, minimum in sample; and optimization objective, maximum. The results showed that the number of evolutions followed the Poisson distribution with the sample size as its parameter. The average determination coefficient was higher than 98% of the determination coefficient obtained through complete search, and followed the Gaussian distribution. The correlation coefficients obtained by the best-performing GA-MLR models were not statistically different from the correlation coefficient of the QSPR model obtained by complete search.
- [keywords] => water partition-coefficients; neural networks; aqueous solubility; variable selection; qsar models; congeners; hydrophobicity; chromatography; visualization; hydrocarbons
- [acknowledgment] => Financial support from CNCSIS-UEFISCSU Romania (project PNIIIDEI1051/202/2007) is gratefully acknowledged.
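
The descriptor-selection procedure summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' FreePascal/MySQL implementation. The data are synthetic (not the polychlorinated biphenyl set), all function names are hypothetical, and the crossover and replacement rules are assumptions, since the abstract does not specify them. Only the parameter values (sample size 8, 2 variables, at most 1000 generations, proportional selection, mutation probability 0.05) come from the abstract. Here an "evolution" is counted, by assumption, each time a child replaces the worst member of the sample.

```python
import random
from itertools import combinations
from statistics import mean


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    sab = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    saa = sum((x - ma) ** 2 for x in a)
    sbb = sum((y - mb) ** 2 for y in b)
    return sab / (saa * sbb) ** 0.5


def r2_two_descriptors(x1, x2, y):
    """Determination coefficient of the MLR y ~ x1 + x2 (closed form)."""
    r1, r2, r12 = pearson(x1, y), pearson(x2, y), pearson(x1, x2)
    denom = 1.0 - r12 * r12
    if denom < 1e-12:  # collinear descriptors: fall back to one variable
        return max(r1 * r1, r2 * r2)
    return max(0.0, (r1 * r1 + r2 * r2 - 2.0 * r1 * r2 * r12) / denom)


def make_synthetic(rng, n_molecules=30, n_descriptors=20):
    """Synthetic stand-in data: y depends on descriptors 3 and 11 plus noise."""
    X = [[rng.gauss(0.0, 1.0) for _ in range(n_molecules)]
         for _ in range(n_descriptors)]
    y = [2.0 * X[3][k] - X[11][k] + rng.gauss(0.0, 0.1)
         for k in range(n_molecules)]
    return X, y


def complete_search(X, y):
    """Exhaustive search over all descriptor pairs, for comparison."""
    pairs = combinations(range(len(X)), 2)
    best = max(pairs, key=lambda p: r2_two_descriptors(X[p[0]], X[p[1]], y))
    return best, r2_two_descriptors(X[best[0]], X[best[1]], y)


def run_ga(X, y, rng, pop_size=8, max_generations=1000, p_mutation=0.05):
    """GA over pairs of descriptor indices, maximizing r^2."""
    m = len(X)

    def fitness(ch):
        return r2_two_descriptors(X[ch[0]], X[ch[1]], y)

    pop = [tuple(sorted(rng.sample(range(m), 2))) for _ in range(pop_size)]
    scores = [fitness(c) for c in pop]
    evolutions = 0
    for _ in range(max_generations):
        # Proportional (roulette-wheel) selection of two parents.
        weights = [s + 1e-12 for s in scores]
        pa, pb = rng.choices(pop, weights=weights, k=2)
        # Assumed crossover: the child takes one gene from each parent.
        child = [rng.choice(pa), rng.choice(pb)]
        # Mutation may affect each of the 2 genes with probability p_mutation.
        for g in range(2):
            if rng.random() < p_mutation:
                child[g] = rng.randrange(m)
        if child[0] == child[1]:
            continue  # not a valid two-descriptor model
        child = tuple(sorted(child))
        if child in pop:
            continue  # already present in the sample
        s = fitness(child)
        # Assumed replacement rule: the child replaces the worst member
        # if it scores higher, which raises the minimum score in the sample.
        worst = min(range(pop_size), key=scores.__getitem__)
        if s > scores[worst]:
            pop[worst], scores[worst] = child, s
            evolutions += 1
    best = max(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best], evolutions


rng = random.Random(2010)
X, y = make_synthetic(rng)
ga_pair, ga_r2, n_evolutions = run_ga(X, y, rng)
full_pair, full_r2 = complete_search(X, y)
print("GA:", ga_pair, round(ga_r2, 4), "evolutions:", n_evolutions)
print("Complete search:", full_pair, round(full_r2, 4))
```

Repeating `run_ga` with different seeds (the study used 30 runs) and collecting `n_evolutions` and the ratio `ga_r2 / full_r2` gives the kind of samples whose distributions the abstract reports. This sketch makes no claim to reproduce those results.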