Handling Class Imbalance Using Swarm Intelligence Techniques, Hybrid Data and Algorithmic Level Solutions

Alhakbani, Haya. 2019. Handling Class Imbalance Using Swarm Intelligence Techniques, Hybrid Data and Algorithmic Level Solutions. Doctoral thesis, Goldsmiths, University of London [Thesis]

[img]
Preview
Text (Handling Class Imbalance Using Swarm Intelligence Techniques, Hybrid Data and Algorithmic Level Solutions)
COM_thesis_AlhakbaniH_2019.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB) | Preview

Abstract or Description

This research focuses mainly on the binary class imbalance problem in data mining. It investigates the use of combined approaches of data and algorithmic level solutions. Moreover, it examines the use of swarm intelligence and population-based techniques to combat the class imbalance problem at all levels, including at the data, algorithmic, and feature level. It also introduces various solutions to the class imbalance problem, in which swarm intelligence techniques like Stochastic Diffusion Search (SDS) and Dispersive Flies Optimisation (DFO) are used. The algorithms were evaluated using experiments on imbalanced datasets, in which the Support Vector Machine (SVM) was used as a classifier. SDS was used to perform informed undersampling of the majority class to balance the dataset. The results indicate that this algorithm improves the classifier performance and can be used on imbalanced datasets. Moreover, SDS was extended further to perform feature selection on high dimensional datasets. Experimental results show that SDS can be used to perform feature selection and improve the classifier performance on imbalanced datasets. Further experiments evaluated DFO as an algorithmic level solution to optimise the SVM kernel parameters when learning from imbalanced datasets. Based on the promising results of DFO in these experiments, the novel approach was extended further to provide a hybrid algorithm that simultaneously optimises the kernel parameters and performs feature selection.

Item Type:

Thesis (Doctoral)

Identification Number (DOI):

https://doi.org/10.25602/GOLD.00026351

Keywords:

Class imbalance, SDS, DFO, Classification, SVM, Feature selection

Departments, Centres and Research Units:

Computing

Date:

31 March 2019

Item ID:

26351

Date Deposited:

21 May 2019 09:07

Last Modified:

07 Sep 2022 17:14

URI:

https://research.gold.ac.uk/id/eprint/26351

View statistics for this item...

Edit Record Edit Record (login required)