print print


Evolutionary Data Mining Approaches for Rule-based and Tree-based Classifiers

Authors

Authors: Thomas Weise and Raymond Chiong

Abstract

Data mining is an important process, with applications found in many business, science and industrial problems. While a wide variety of algorithms have already been proposed in the literature for classification tasks in large data sets, and the majority of them have been proven to be very effective, not all of them are flexible and easily extensible. In this paper, we introduce two new approaches for synthesizing classifiers with Evolutionary Algorithms (EAs) in supervised data mining scenarios. The first method is based on encoding rule sets with bit string genomes and the second one utilizes Genetic Programming to create decision trees with arbitrary expressions attached to the nodes. Comparisons with some sophisticated standard approaches, such as C4.5 and Random-Forest, show that the performance of the evolved classifiers can be very competitive. We further demonstrate that both proposed approaches work well across different configurations of the EAs.

Keywords

Genetic Algorithms, GAs, Standard Genetic Programming, SGP, Rule-based Classification, Classification, Data Mining, Learning Classifier Systems, LCS, Decision Trees, Iris Dataset, Hepatitis Dataset, Wisconsin Breast Cancer Dataset, Wine Dataset, Heart Disease Dataset, Weka

BibTeX

@inproceedings{WC2010EDMAFRBATBC,
  author                    = {Thomas Weise and Raymond Chiong},
  title                     = {{Evolutionary Data Mining Approaches for Rule-based and Tree-based Classifiers}},
  booktitle                 = {Proceedings of the 9th IEEE International Conference on Cognitive Informatics (ICCI'10)},
  editor                    = {Fuchun Sun and Yingxu Wang and Jianhua Lu and Bo Zhang and Witold Kinsner and Lotfi A. Zadeh},
  publisher                 = {{IEEE Computer Society Press: {Los Alamitos, CA, USA}}},
  pages                     = {696--703},
  year                      = {2010},
  location                  = {{Tsinghua University: {B{\v{e}}ij{\={\i}}ng, China}}},
  url                       = {http://www.it-weise.de/documents/files/WC2010EDMAFRBATBC.pdf},
  doi                       = {10.1109/COGINF.2010.5599821},
  key                       = {WC2010EDMAFRBATBC},
},

Links

Metadata: http://www.it-weise.de/documents/metaWC2010EDMAFRBATBC.html
 
Full document: http://www.it-weise.de/documents/files/WC2010EDMAFRBATBC.pdf (224 kiB)
 
Presentation: http://www.it-weise.de/documents/files/WC2010EDMAFRBATBC_slides.pdf (3 MiB)

back to the publication