Is Feature Selection Essential for ANN Modeling?

Show simple item record

dc.contributor.author Goodarzi, Mohammad
dc.contributor.author Deshpande, Shreekant
dc.contributor.author Murugesan, Vanangamudi
dc.contributor.author Katti, S B
dc.contributor.author Prabhakar, Y S
dc.date.accessioned 2010-09-09T08:53:55Z
dc.date.available 2010-09-09T08:53:55Z
dc.date.issued 2009
dc.identifier.citation QSAR & Combinatorial Science, 28,11-12, 1487–1499,2009 en
dc.identifier.uri http://hdl.handle.net/123456789/595
dc.description.abstract In modeling approaches, artificial neural networks (ANNs) have a special place to address the nonlinear phenomena or curved manifold. Often one or other feature selection approach is used prior to ANN to feed the input variables for its models. The function of ‘selected’ versus ‘arbitrary’ features on the outcome of ANN models is investigated with a variety of objectively selected and arbitrarily chosen variables from chemical databases namely thiazolidinones, anilinoquinolines and piperazinoquinolines. For each database, its biological activity is considered as the dependent variable and the molecular descriptors from DRAGON software are used as explanatory variables. The selection sets are obtained from feature selection approaches namely, combinatorial protocol in multiple linear regression, stepwise regression and genetic algorithm. Apart from these, a large number of arbitrary sets have been created by randomly picking the descriptors from corresponding databases. The features of all sets have shown a variety of inter- and intra- set diversities. A three-layer back propagation ANN with Levenberg-Marquardt optimization algorithm has been used for modeling the phenomena. Regardless of the origin of the feature sets, the ANN models from a very large number of sets have well explained the activity and qualified themselves to be predictive models. Also, no specific pattern is apparent between the quality of ANN model and the origin of its input feature set. Since these results are unusual, the study is extended to a few more databases. All the results emphasized the innate ability of ANN in developing complex network of relations among features to estimate the target variable. This has prompted us to suggest that prior feature selection is not essential for ANN and it is a desirable option for meaningful outputs in terms of the rationale behind the inputs. en
dc.format.extent 215515 bytes
dc.format.mimetype application/pdf
dc.language.iso en en
dc.subject Artificial neural networks en
dc.subject Feature selection en
dc.subject DRAGON descriptors en
dc.subject Thiazolidin-4-ones en
dc.subject HIV-1 RT en
dc.subject Anilinoquinolines en
dc.subject Piperazinoquinolines en
dc.subject Antimalarials. en
dc.title Is Feature Selection Essential for ANN Modeling? en
dc.type Article en


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account