Reinforcement learning hyper-heuristics for optimisation

Hyper-heuristics are search algorithms which operate on a set of heuristics with the goal of solving a wide range of optimisation problems. It has been observed that different heuristics perform differently between different optimisation problems. A hyper-heuristic combines a set of predefined heuri...

Full description

Bibliographic Details
Main Author: Alanazi, Fawaz
Format: Thesis (University of Nottingham only)
Language:English
Published: 2017
Subjects:
Online Access:https://eprints.nottingham.ac.uk/42204/
_version_ 1848796444089122816
author Alanazi, Fawaz
author_facet Alanazi, Fawaz
author_sort Alanazi, Fawaz
building Nottingham Research Data Repository
collection Online Access
description Hyper-heuristics are search algorithms which operate on a set of heuristics with the goal of solving a wide range of optimisation problems. It has been observed that different heuristics perform differently between different optimisation problems. A hyper-heuristic combines a set of predefined heuristics, and applies a machine learning technique to predict which heuristic is the most suitable to apply at a given point in time while solving a given problem. A variety of machine learning techniques have been proposed in the literature. Most of the existing machine learning techniques are reinforcement learning mechanisms interacting with the search environment with the goal of adapting the selection of heuristics during the search process. The literature on the theoretical foundation of reinforcement learning hyper-heuristics is almost nonexisting. This work provides theoretical analyses of reinforcement learning hyper-heuristics. The goal is to shed light on the learning capabilities and limitations of reinforcement learning hyper-heuristics. This improves our understanding of these hyper-heuristics, and aid the design of better reinforcement learning hyper-heuristics. It is revealed that the commonly used additive reinforcement learning mechanism, under a mild assumption, chooses asymptotically heuristics uniformly at random. This thesis also proposes the problem of identifying the most suitable heuristic with a given error probability. We show a general lower bound on the time that "every" reinforcement learning hyper-heuristic needs to identify the most suitable heuristic with a given error probability. The results reveal a general limitation to learning achieved by this computational approach. Following our theoretical analysis, different reusable and easyto-implement reinforcement learning hyper-heuristics are proposed in this thesis. The proposed hyper-heuristics are evaluated on well-known combinatorial optimisation problems. One of the proposed reinforcement learning hyper-heuristics outperformed a state-of-the-art algorithm on several benchmark problems of the well-known CHeSC 2011.
first_indexed 2025-11-14T19:48:04Z
format Thesis (University of Nottingham only)
id nottingham-42204
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T19:48:04Z
publishDate 2017
recordtype eprints
repository_type Digital Repository
spelling nottingham-422042025-02-28T13:45:00Z https://eprints.nottingham.ac.uk/42204/ Reinforcement learning hyper-heuristics for optimisation Alanazi, Fawaz Hyper-heuristics are search algorithms which operate on a set of heuristics with the goal of solving a wide range of optimisation problems. It has been observed that different heuristics perform differently between different optimisation problems. A hyper-heuristic combines a set of predefined heuristics, and applies a machine learning technique to predict which heuristic is the most suitable to apply at a given point in time while solving a given problem. A variety of machine learning techniques have been proposed in the literature. Most of the existing machine learning techniques are reinforcement learning mechanisms interacting with the search environment with the goal of adapting the selection of heuristics during the search process. The literature on the theoretical foundation of reinforcement learning hyper-heuristics is almost nonexisting. This work provides theoretical analyses of reinforcement learning hyper-heuristics. The goal is to shed light on the learning capabilities and limitations of reinforcement learning hyper-heuristics. This improves our understanding of these hyper-heuristics, and aid the design of better reinforcement learning hyper-heuristics. It is revealed that the commonly used additive reinforcement learning mechanism, under a mild assumption, chooses asymptotically heuristics uniformly at random. This thesis also proposes the problem of identifying the most suitable heuristic with a given error probability. We show a general lower bound on the time that "every" reinforcement learning hyper-heuristic needs to identify the most suitable heuristic with a given error probability. The results reveal a general limitation to learning achieved by this computational approach. Following our theoretical analysis, different reusable and easyto-implement reinforcement learning hyper-heuristics are proposed in this thesis. The proposed hyper-heuristics are evaluated on well-known combinatorial optimisation problems. One of the proposed reinforcement learning hyper-heuristics outperformed a state-of-the-art algorithm on several benchmark problems of the well-known CHeSC 2011. 2017-07-18 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en arr https://eprints.nottingham.ac.uk/42204/1/Thesis.pdf Alanazi, Fawaz (2017) Reinforcement learning hyper-heuristics for optimisation. PhD thesis, University of Nottingham. Hyper-heuristics Reinforcement Learning Runtime Analysis
spellingShingle Hyper-heuristics
Reinforcement Learning
Runtime Analysis
Alanazi, Fawaz
Reinforcement learning hyper-heuristics for optimisation
title Reinforcement learning hyper-heuristics for optimisation
title_full Reinforcement learning hyper-heuristics for optimisation
title_fullStr Reinforcement learning hyper-heuristics for optimisation
title_full_unstemmed Reinforcement learning hyper-heuristics for optimisation
title_short Reinforcement learning hyper-heuristics for optimisation
title_sort reinforcement learning hyper-heuristics for optimisation
topic Hyper-heuristics
Reinforcement Learning
Runtime Analysis
url https://eprints.nottingham.ac.uk/42204/