Classification Methods - Episode 8: Handling Imbalanced Data

  • 27/10/2021
    1:00 pm - 2:30 pm

Course details

affiliation: Ghent University


Class imbalance is a common problem in classification: the observations are distributed very unevenly across the response classes. It arises in many areas, including medical diagnosis, spam filtering, and fraud detection. The main problem is that imbalanced data can significantly compromise the performance of most standard learning algorithms. For example, many classifiers aim to minimize global quantities such as the overall error rate without taking the class distribution into account, so they tend to favor the majority class at the expense of the minority class.
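To make the point about the error rate concrete, here is a minimal sketch in plain Python (the seminar itself uses R; this is only an illustration). The data are hypothetical: 990 observations in the majority class and 10 in the minority class, with a trivial rule that always predicts the majority class.

```python
# Hypothetical data: 990 majority-class (0) and 10 minority-class (1) observations.
y_true = [0] * 990 + [1] * 10

# A trivial "classifier" that ignores the inputs and always predicts the majority class.
y_pred = [0] * len(y_true)

# Global error rate: fraction of observations that are misclassified.
correct = sum(t == p for t, p in zip(y_true, y_pred))
error_rate = 1 - correct / len(y_true)

# Minority-class recall: fraction of true minority cases that are detected.
true_pos = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
actual_pos = sum(t == 1 for t in y_true)
recall = true_pos / actual_pos

print(f"error rate: {error_rate:.3f}")
print(f"minority recall: {recall:.3f}")
```

The error rate is only 1%, yet no minority case is ever detected, which is why a global error rate alone can be misleading on imbalanced data.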

Class imbalance can be tackled from different angles:

• at the algorithm level,
• at the data level,
• using ensemble-based learning.
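As an illustration of the data-level angle, the following is a minimal sketch of random oversampling, one simple resampling technique: minority-class rows are duplicated at random until every class is as large as the largest one. It is written in plain Python rather than R and does not use the package covered in the seminar; the function name `random_oversample` and the toy data are assumptions made for this example.

```python
import random

def random_oversample(rows, labels, seed=0):
    """Duplicate rows of the smaller classes at random until all classes
    have as many rows as the largest class (hypothetical helper)."""
    rng = random.Random(seed)

    # Group the rows by their class label.
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)

    # Every class is grown to the size of the largest class.
    target = max(len(members) for members in by_class.values())

    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # Sample with replacement to fill the gap; the majority class gets no extras.
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels

# Toy data: 95 majority-class rows and 5 minority-class rows.
rows = [[float(i)] for i in range(100)]
labels = [0] * 95 + [1] * 5

new_rows, new_labels = random_oversample(rows, labels)
print(new_labels.count(0), new_labels.count(1))
```

Resampling of this kind should be applied to the training data only, so that evaluation still reflects the original class distribution.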

In this seminar, we will discuss methods for handling the two-class imbalanced learning problem using the IRC package in R.


Background readings

Dr Emmanuel Abatih

