Subset selection in multiple linear regression models: A hybrid of genetic and simulated annealing algorithms


ÖRKCÜ H. H.

APPLIED MATHEMATICS AND COMPUTATION, vol.219, no.23, pp.11018-11028, 2013 (Journal Indexed in SCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 219 Issue: 23
  • Publication Date: 2013
  • Doi Number: 10.1016/j.amc.2013.05.016
  • Title of Journal : APPLIED MATHEMATICS AND COMPUTATION
  • Page Numbers: pp.11018-11028
  • Keywords: Regression analysis, Subset selection problem, Genetic algorithm, Simulated annealing algorithm, Hybrid heuristic optimization, VARIABLE SELECTION, OPTIMIZATION, CROSSOVER

Abstract

The question of variable selection in a multiple linear regression model is a major open research topic in statistics. The subset selection problem in multiple linear regression deals with the selection of a minimal subset of input variables without loss of explanatory power. In this paper, we adapt the genetic and simulated annealing algorithms for variable selection in multiple linear regression. The performance of this hybrid heuristic method is compared to those obtained by forward selection, backward elimination and classical genetic algorithm search. A comparative analysis on the literature data sets and simulation data shows that our hybrid heuristic method may suggest efficient alternative to traditional subset selection methods for the variable selection problem in multiple linear regression models. (C) 2013 Elsevier Inc. All rights reserved.