Developing a Support Vector Machine Based QSPR Model to Predict Gas-to-Benzene Solvation Enthalpy of

Source: Acta Physico-Chimica Sinica (物理化学学报)
This paper presents a novel approach to building quantitative structure-property relationship (QSPR) models for predicting the gas-to-benzene solvation enthalpy (ΔHSolv) of 158 organic compounds from molecular descriptors calculated from the structure alone. Different kinds of descriptors were calculated for each compound using the Dragon package. The enhanced replacement method (ERM), a variable selection technique, was employed to select an optimal subset of descriptors. Our investigation reveals that the relationship between the physico-chemical properties and the solvation enthalpy is nonlinear and that the ERM method is unable to model the solvation enthalpy accurately. The standard error of the prediction set is 1.681 kJ·mol⁻¹ for the support vector machine (SVM), compared with 4.624 kJ·mol⁻¹ for ERM. The ΔHSolv values calculated by SVM were in good agreement with the experimental ones, and the SVM models outperformed the ERM model. This indicates that SVM can be used as an alternative modeling tool for QSPR studies.
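The comparison above rests on the standard error over the prediction set. The abstract does not give the exact formula, so the following is only a minimal sketch that assumes the root-mean-square form, SE = sqrt(Σ(y − ŷ)² / n); the paper may use a different denominator. The function name `standard_error` and all numeric values are hypothetical illustrations, not data from the study.

```python
import math


def standard_error(y_true, y_pred):
    """Standard error between experimental and predicted values.

    Assumption: SE = sqrt(sum((y - y_hat)^2) / n). The source does not
    state its formula, so the denominator here is an illustrative choice.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_residuals = sum((y - y_hat) ** 2 for y, y_hat in zip(y_true, y_pred))
    return math.sqrt(squared_residuals / len(y_true))


# Hypothetical solvation enthalpies in kJ/mol (NOT values from the paper).
experimental = [-30.0, -42.5, -25.1, -51.3]
model_a = [-31.0, -41.5, -26.1, -50.3]  # every prediction off by about 1 kJ/mol
model_b = [-34.0, -38.5, -29.1, -47.3]  # every prediction off by about 4 kJ/mol

se_a = standard_error(experimental, model_a)
se_b = standard_error(experimental, model_b)
print(f"model A: {se_a:.3f} kJ/mol, model B: {se_b:.3f} kJ/mol")
```

Under this assumed definition, a lower value on held-out compounds means predictions closer to experiment, which is the sense in which the 1.681 kJ·mol⁻¹ reported for SVM is better than the 4.624 kJ·mol⁻¹ reported for ERM.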