A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning

In this paper, we address the binary classification problem, in which one is given a set of observations, characterized by a number of (binary and non-binary) attributes and wants to determine which class each observation belongs to. The proposed classification algorithm is based on the Logical Anal...

Full description

Bibliographic Details
Main Authors: Caserta, M., Reiners, Torsten
Format: Journal Article
Published: Elsevier BV * North-Holland 2015
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/11799
_version_ 1848747902509252608
author Caserta, M.
Reiners, Torsten
author_facet Caserta, M.
Reiners, Torsten
author_sort Caserta, M.
building Curtin Institutional Repository
collection Online Access
description In this paper, we address the binary classification problem, in which one is given a set of observations, characterized by a number of (binary and non-binary) attributes and wants to determine which class each observation belongs to. The proposed classification algorithm is based on the Logical Analysis of Data (LAD) technique and belongs to the class of supervised learning algorithms. We introduce a novel metaheuristic-based approach for pattern generation within LAD. The key idea relies on the generation of a pool of patterns for each given observation of the training set. Such a pool is built with one or more criteria in mind (e.g., diversity, homogeneity, coverage, etc.), and is paramount in the achievement of high classification accuracy, as shown by the computational results we obtained. In addition, we address one of the major concerns of many data mining algorithms, i.e., the fine-tuning and calibration of parameters. We employ here a novel technique, called biased Random-Key Genetic Algorithm that allows the calibration of all the parameters of the algorithm in an automatic fashion, hence reducing the fine-tuning effort required and enhancing the performance of the algorithm itself. We tested the proposed approach on 10 benchmark instances from the UCI repository and we proved that the algorithm is competitive, both in terms of classification accuracy and running time.
first_indexed 2025-11-14T06:56:32Z
format Journal Article
id curtin-20.500.11937-11799
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T06:56:32Z
publishDate 2015
publisher Elsevier BV * North-Holland
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-117992017-09-13T16:02:37Z A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning Caserta, M. Reiners, Torsten Data mining bRKGA Machine learning Fine-tuning Logical Analysis of Data In this paper, we address the binary classification problem, in which one is given a set of observations, characterized by a number of (binary and non-binary) attributes and wants to determine which class each observation belongs to. The proposed classification algorithm is based on the Logical Analysis of Data (LAD) technique and belongs to the class of supervised learning algorithms. We introduce a novel metaheuristic-based approach for pattern generation within LAD. The key idea relies on the generation of a pool of patterns for each given observation of the training set. Such a pool is built with one or more criteria in mind (e.g., diversity, homogeneity, coverage, etc.), and is paramount in the achievement of high classification accuracy, as shown by the computational results we obtained. In addition, we address one of the major concerns of many data mining algorithms, i.e., the fine-tuning and calibration of parameters. We employ here a novel technique, called biased Random-Key Genetic Algorithm that allows the calibration of all the parameters of the algorithm in an automatic fashion, hence reducing the fine-tuning effort required and enhancing the performance of the algorithm itself. We tested the proposed approach on 10 benchmark instances from the UCI repository and we proved that the algorithm is competitive, both in terms of classification accuracy and running time. 2015 Journal Article http://hdl.handle.net/20.500.11937/11799 10.1016/j.ejor.2015.05.078 Elsevier BV * North-Holland restricted
spellingShingle Data mining
bRKGA
Machine learning
Fine-tuning
Logical Analysis of Data
Caserta, M.
Reiners, Torsten
A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
title A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
title_full A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
title_fullStr A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
title_full_unstemmed A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
title_short A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
title_sort pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning
topic Data mining
bRKGA
Machine learning
Fine-tuning
Logical Analysis of Data
url http://hdl.handle.net/20.500.11937/11799