Combinatorial Chemistry & High Throughput Screening

Author(s): Joanna Procelewska, Javier Llamas Galilea, Frederic Clerc, David Farrusseng and Ferdi Schuth

DOI: 10.2174/138620707779802805

DownloadDownload PDF Flyer Cite As
Computational Methods in the Development of a Knowledge-Based System for the Prediction of Solid Catalyst Performance

Page: [37 - 50] Pages: 14

  • * (Excluding Mailing and Handling)

Abstract

The objective of this work is the construction of a correlation between characteristics of heterogeneous catalysts, encoded in a descriptor vector, and their experimentally measured performances in the propene oxidation reaction. In this paper the key issue in the modeling process, namely the selection of adequate input variables, is explored. Several data-driven feature selection strategies were applied in order to obtain an estimate of the differences in variance and information content of various attributes, furthermore to compare their relative importance. Quantitative property activity relationship techniques using probabilistic neural networks have been used for the creation of various semi-empirical models. Finally, a robust classification model, assigning selected attributes of solid compounds as input to an appropriate performance class in the model reaction was obtained. It has been evident that the mathematical support for the primary attributes set proposed by chemists can be highly desirable.

Keywords: heterogeneous catalysts, Descriptors, Shannon Entropy, Unsupervised Forward Selection algorithm, Collinearity