Handling Class Imbalance In Direct Marketing Dataset Using A Hybrid Data and Algorithmic Level Solutions

Alhakbani, Haya Abdullah and al-Rifaie, Mohammad Majid. 2016. 'Handling Class Imbalance In Direct Marketing Dataset Using A Hybrid Data and Algorithmic Level Solutions'. In: SAI Computing Conference, 2016. London, United Kingdom 13-15 July 2016. [Conference or Workshop Item]

[img]
Preview
Text
2016_SAI_Computing_IEEE_Class_imbalance.pdf

Download (132kB) | Preview

Abstract or Description

Class imbalance is a major problem in machine learning. It occurs when the number of instances in the majority class is significantly more than the number of instances in the minority class. This is a common problem which is recurring in most datasets, including the one used in this paper (i.e. direct marketing dataset). In direct marketing, businesses are interested in identifying potential buyers, or charities wish to identify potential givers. Several solutions have been suggested in the literature to address this problem, amongst which are data-level techniques, algorithmic-level techniques and a combination of both. In this paper, a model is proposed to solve imbalanced data using a Hybrid of Data-level and Algorithmic-level solutions (HybridDA), which involves oversampling the minority class, undersampling the majority class, and additionally, optimising the cost parameter, the gamma and the kernel type of Support Vector Machines (SVM) using a grid search. The proposed model perfomed competitively compared with other models on the same dataset. The dataset used in this work are real-world data collected from a Portuguese marketing campaign for bank-deposit subscriptions and are available from the University of California, Irvine (UCI) Machine Learning Repository.

Item Type:

Conference or Workshop Item (Paper)

Identification Number (DOI):

https://doi.org/10.1109/SAI.2016.7556019

Departments, Centres and Research Units:

Computing

Dates:

DateEvent
1 September 2016Published

Event Location:

London, United Kingdom

Date range:

13-15 July 2016

Item ID:

17248

Date Deposited:

21 Mar 2016 14:17

Last Modified:

29 Apr 2020 16:15

URI:

https://research.gold.ac.uk/id/eprint/17248

View statistics for this item...

Edit Record Edit Record (login required)