Handling Class Imbalance In Direct Marketing Dataset Using A Hybrid Data and Algorithmic Level Solutions
Alhakbani, Haya Abdullah and al-Rifaie, Mohammad Majid. 2016. 'Handling Class Imbalance In Direct Marketing Dataset Using A Hybrid Data and Algorithmic Level Solutions'. In: SAI Computing Conference, 2016. London, United Kingdom 13-15 July 2016. [Conference or Workshop Item]
Abstract or Description
Class imbalance is a major problem in machine learning. It occurs when the number of instances in the majority class is significantly larger than the number of instances in the minority class. This is a common problem that recurs in most datasets, including the one used in this paper (i.e. a direct marketing dataset). In direct marketing, businesses are interested in identifying potential buyers, while charities wish to identify potential donors. Several solutions have been suggested in the literature to address this problem, amongst which are data-level techniques, algorithmic-level techniques and a combination of both. In this paper, a model is proposed to handle imbalanced data using a Hybrid of Data-level and Algorithmic-level solutions (HybridDA), which involves oversampling the minority class, undersampling the majority class and, additionally, optimising the cost parameter, the gamma and the kernel type of Support Vector Machines (SVM) using a grid search. The proposed model performed competitively compared with other models on the same dataset. The dataset used in this work consists of real-world data collected from a Portuguese marketing campaign for bank-deposit subscriptions and is available from the University of California, Irvine (UCI) Machine Learning Repository.
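The hybrid approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (rebalance, tune_svm), the plain random resampling, the per-class target size, the parameter grid values, the F1 scoring, and the 0/1 label encoding are all assumptions made for illustration. The SVM step assumes scikit-learn is installed and uses its GridSearchCV and SVC classes.

```python
import random
from collections import Counter


def rebalance(X, y, minority_label, target_per_class, seed=0):
    """Hypothetical helper: random over/undersampling to a common class size.

    Keeps every minority row and duplicates random ones until the class has
    `target_per_class` rows, then keeps a random subset of the majority rows.
    """
    rng = random.Random(seed)
    minority = [i for i, label in enumerate(y) if label == minority_label]
    majority = [i for i, label in enumerate(y) if label != minority_label]

    # Data level, step 1: oversample the minority class (with replacement).
    extra = max(0, target_per_class - len(minority))
    over = minority + [rng.choice(minority) for _ in range(extra)]

    # Data level, step 2: undersample the majority class (without replacement).
    under = rng.sample(majority, min(target_per_class, len(majority)))

    idx = over + under
    rng.shuffle(idx)
    return [X[i] for i in idx], [y[i] for i in idx]


def tune_svm(X, y):
    """Hypothetical helper: grid search over SVM cost, gamma and kernel.

    Assumes binary labels encoded as 0/1, with 1 as the positive class.
    """
    # Imported here so that rebalance() can be used without scikit-learn.
    from sklearn.model_selection import GridSearchCV
    from sklearn.svm import SVC

    # Illustrative grid only; the values used in the paper are not given here.
    param_grid = {
        "C": [0.1, 1, 10],
        "gamma": [0.01, 0.1, 1],
        "kernel": ["rbf", "linear", "poly"],
    }
    search = GridSearchCV(SVC(), param_grid, scoring="f1", cv=3)
    search.fit(X, y)
    return search.best_params_, search.best_score_


if __name__ == "__main__":
    # Toy data: 10 positive rows out of 100, one numeric feature each.
    X = [[float(i)] for i in range(100)]
    y = [1 if i < 10 else 0 for i in range(100)]
    X_bal, y_bal = rebalance(X, y, minority_label=1, target_per_class=30)
    print(Counter(y_bal))
    print(tune_svm(X_bal, y_bal))
```

In practice the resampling would normally be applied to the training split only, so that evaluation still reflects the original class distribution; the sketch skips this for brevity.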
Item Type: Conference or Workshop Item (Paper)
Event Location: London, United Kingdom
Date range: 13-15 July 2016
Item ID: 17248
Date Deposited: 21 Mar 2016 14:17
Last Modified: 29 Apr 2020 16:15