QSPR Models to Predict Thermodynamic Properties of Cycloalkanes Using Molecular Descriptors and GA-MLR Method

(E-pub Ahead of Print)

Author(s): Daryoush Joudaki, Fatemeh Shafiei*.

Journal Name: Current Computer-Aided Drug Design

Become EABM
Become Reviewer


Aim and Objective: QSPR models establish relationships between different types of structural information to their observed properties. In the present study the relationship between the molecular descriptors and quantum properties of cycloalkanes is represented.

Materials and Methods: Genetic algorithm (GA) and multiple linear regressions (MLR) were successfully developed to predict quantum properties of cycloalkanes. A large number of molecular descriptors were calculated with Dragon software and a subset of calculated descriptors was selected with a genetic algorithm as a feature selection technique. The quantum properties consist of the heat capacity(Cv)/ Jmol-1K-1 entropy(S)/ Jmol-1K-1 and thermal energy(Eth)/ kJmol-1 were obtained from quantum-chemistry technique at the Hartree-Fock (HF) level using the ab initio 6-31G* basis sets.

Results: The genetic algorithm (GA) method was used to selected important molecular descriptors and then they were used as inputs for SPSS software package. The predictive powers of the MLR models were discussed using leave-one-out (LOO) cross-validation, leave-group (5-fold)-out (LGO) and external prediction series. The statistical parameters of the training, and test sets for GA–MLR models were calculated.

Conclusion: The resulting quantitative GA-MLR models of Cv, S, and Eth were obtained:[r2=0.950, Q2=0.989, r2ext=0.969, MAE(overall,5-flod)=0.6825 Jmol-1K-1], [r2=0.980, Q2=0.947, r2ext=0.943, MAE(overall,5-flod)=0.5891Jmol-1K-1], and [r2=0.980, Q2=0.809, r2ext=0.985, MAE(overall,5-flod)=2.0284 kJmol-1]. The Rresults showed that the predictive ability of the models was satisfactory, and the constitutional, topological indices and ring descriptor could be used to predict the mentioned properties of 103 cycloalkanes.

Keywords: Multiple linear regression, Molecular descriptors, Genetic algorithm, validation, cycloalkanes.

Rights & PermissionsPrintExport Cite as

Article Details

(E-pub Ahead of Print)
DOI: 10.2174/1573409915666190227230744
Price: $95