In this work, we investigate the impact of synthetic data generation on fairness-aware classification and demonstrate that conventional sampling methods amplify unfairness. We propose a data sampling method combined with boosting that accounts for fairness in a cumulative manner, called FairSMOTEBoost, to tackle the combined problem of class imbalance and unfairness.

We conducted a large number of experiments comparing FairSMOTEBoost against four competitors. Our results indicate that combining synthetic oversampling with a fairness-aware boosting algorithm is effective in terms of both the predictive performance and the fairness of the method.

The method has two variations: 1) In the first variation, we extend the vanilla SMOTEBoost.
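As a rough illustration of this first variation, the sketch below combines per-round SMOTE with an AdaBoost-style weight update that additionally boosts misclassified protected positives. This is only an illustrative sketch: the weak learner, the oversampling target, and the extra boost factor are assumptions, not the exact FairSMOTEBoost update rule.

```python
import numpy as np
from imblearn.over_sampling import SMOTE
from sklearn.tree import DecisionTreeClassifier

def fair_smoteboost_sketch(X, y, protected, n_rounds=10, ratio=0.1):
    """Illustrative SMOTEBoost-style loop with a fairness-aware weight update.

    protected : boolean array marking the protected group.
    NOTE: the actual FairSMOTEBoost update differs; this only shows the
    overall structure (per-round SMOTE + cumulative, fairness-aware boosting).
    """
    n = len(y)
    w = np.full(n, 1.0 / n)                # boosting weights on the original data
    learners, alphas = [], []
    n_min = int(np.sum(y == 1))            # assuming 1 = minority class
    for _ in range(n_rounds):
        # 1) generate synthetic minority samples for this round
        sm = SMOTE(sampling_strategy={1: n_min + max(1, int(ratio * n_min))})
        X_res, y_res = sm.fit_resample(X, y)

        # 2) fit a weak learner on the augmented data (crude weighting:
        #    originals carry the boosting distribution, synthetic samples weight 1)
        sw = np.concatenate([w * n, np.ones(len(X_res) - n)])
        h = DecisionTreeClassifier(max_depth=2).fit(X_res, y_res, sample_weight=sw)
        pred = h.predict(X)
        miss = pred != y

        # 3) weighted error on the ORIGINAL data, as in AdaBoost
        err = np.sum(w * miss)
        if err <= 0 or err >= 0.5:
            break
        alpha = 0.5 * np.log((1 - err) / err)

        # 4) fairness-aware part (illustrative factor of 2): push extra
        #    weight onto misclassified protected positives
        boost = np.where(protected & (y == 1) & miss, 2.0, 1.0)
        w *= np.exp(alpha * miss) * boost
        w /= w.sum()

        learners.append(h)
        alphas.append(alpha)
    return learners, alphas
```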

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Experiments were conducted on the following datasets:

  1. Adult-gender
  2. Adult-race
  3. Bank-gender
  4. Credit-gender
  5. Credit-marriage
  6. KDD census
  7. NYPD complaints-race
  8. NYPD complaints-gender
  9. Compas-gender
  10. Compas-race
  11. Dutch census-age
  12. Dutch census-gender

-------------------------------------------------------------------

Experiments include:

  1. A set of bar charts comparing 8 metrics for each of the methods, as shown in the images.
  2. A set of convergence charts showing how the measures change over the boosting rounds.
  3. A set of weight distribution charts showing how the synthetic generation augments the weights of the minority groups of data over the boosting rounds.
  4. ABROCA charts showing the area between the two ROC curves of each method.
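For reference, the group-wise rates, Eq.Op, and ABROCA can be computed along the following lines (a sketch assuming binary labels, a binary protected attribute, and score outputs; the uniform FPR grid for the ABROCA integral is an implementation choice):

```python
import numpy as np
from sklearn.metrics import roc_curve

def group_rates(y_true, y_pred, protected):
    """TPR/TNR per group; Eq.Op is taken as the TPR gap between the groups."""
    rates = {}
    for name, mask in [("protected", protected), ("non_protected", ~protected)]:
        yt, yp = y_true[mask], y_pred[mask]
        rates[name] = {"TPR": np.mean(yp[yt == 1] == 1),
                       "TNR": np.mean(yp[yt == 0] == 0)}
    eq_op = abs(rates["protected"]["TPR"] - rates["non_protected"]["TPR"])
    return rates, eq_op

def abroca(y_true, y_score, protected, grid_size=1001):
    """Area between the two groups' ROC curves on a common FPR grid."""
    grid = np.linspace(0.0, 1.0, grid_size)
    tprs = []
    for mask in (protected, ~protected):
        fpr, tpr, _ = roc_curve(y_true[mask], y_score[mask])
        tprs.append(np.interp(grid, fpr, tpr))   # align the curves
    # uniform grid on [0, 1], so the integral is just the mean gap
    return float(np.mean(np.abs(tprs[0] - tprs[1])))
```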


Also, for the vanilla setup of the algorithm, the results are reported for four different oversampling ratios, namely 1%, 5%, 10%, and 20% of the number of minority samples in the original training set. In the following, you can see the results for the Bank and Adult-gender datasets.
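As a concrete example of what these ratios mean in terms of generated samples (the stand-in labels are illustrative):

```python
import numpy as np

y_train = np.array([1] * 500 + [0] * 4500)   # stand-in labels, 1 = minority class
n_minority = int((y_train == 1).sum())
for ratio in (0.01, 0.05, 0.10, 0.20):
    print(f"ratio {ratio:.0%}: {int(ratio * n_minority)} synthetic samples")
# -> 5, 25, 50, and 100 synthetic minority samples respectively
```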

Bank dataset results


1- Performance charts

 

2- Internal behavior                                              

[Charts: AdaBoost (Bank), RUSBoost (Bank)]

[Charts: SMOTEBoost (Bank), FairSMOTEBoost (Bank)]

From the internal behaviour charts for the different methods, we can see that RUSBoost and FairSMOTEBoost show similar trends. In both methods, the performance for the Protected_Positives and non_Protected_Positives groups increases during the learning phase, but the final values for our method are slightly better than those of RUSBoost and the others. For the other two methods, the increase is smaller. The TNRs for the protected and non-protected groups remain almost the same in all methods. Also note that the error rates increase slightly in all methods; however, for RUSBoost and the other two methods this increase includes a sharp jump, with final values larger than our method's, whereas for FairSMOTEBoost there is only a gentle increase with a lower final value. The balanced error rates, on the other hand, decrease substantially, again with the best final values for our method.
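Convergence curves of this kind can be produced by evaluating the rates after every boosting round. The sketch below uses scikit-learn's AdaBoostClassifier with staged_predict as a stand-in learner on synthetic data; FairSMOTEBoost exposes the same kind of per-round predictions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier

# stand-in data: an imbalanced binary task plus a synthetic protected attribute
X, y = make_classification(n_samples=2000, weights=[0.9], random_state=0)
protected = np.random.default_rng(0).random(len(y)) < 0.3

clf = AdaBoostClassifier(n_estimators=10, random_state=0).fit(X, y)
history = []
for pred in clf.staged_predict(X):                        # predictions after each round
    tpr_p = np.mean(pred[(y == 1) & protected] == 1)      # protected positives
    tpr_np = np.mean(pred[(y == 1) & ~protected] == 1)    # non-protected positives
    tpr, tnr = np.mean(pred[y == 1] == 1), np.mean(pred[y == 0] == 0)
    history.append({
        "error": np.mean(pred != y),
        "balanced_error": 1.0 - 0.5 * (tpr + tnr),
        "eq_op": abs(tpr_np - tpr_p),
        "TPR_prot": tpr_p,
        "TPR_nonprot": tpr_np,
    })
# `history` holds one dict per boosting round -- the series plotted in the charts
```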

3- Weight Distribution charts

[Charts: cumulative number of instances per group over 10 boosting rounds; 4 group weights per boosting round; 6 group weights per boosting round]



-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Adult-gender data results

1- Performance charts






As can be inferred from the charts, as the oversampling ratio increases, the TPR for the minority class increases and the TNRs decrease. Because the negative class is the majority class, decreasing its ratio causes the accuracy and balanced accuracy to drop, although the decrease in balanced accuracy is slighter. A similar trend can be seen for the fairness metrics, where Eq.Op decreases (which in this case means approaching optimality, since Eq.Op is to be minimized). We obtain the best Eq.Op at the 10% ratio.
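For reference, the two quantities behind this trade-off (taking Eq.Op as the absolute TPR gap between the non-protected and protected groups, i.e. the standard equal-opportunity difference, which is to be minimized; the paper's exact definition may differ):

```latex
\mathrm{Balanced\ Accuracy} = \tfrac{1}{2}\left(\mathrm{TPR} + \mathrm{TNR}\right),
\qquad
\mathrm{Eq.Op} = \left|\,\mathrm{TPR}_{\text{non-prot}} - \mathrm{TPR}_{\text{prot}}\,\right|
```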

2- Internal behavior

 

3- Weight Distribution charts



Cumulative number of instances per group over 10 boosting rounds

4 group weights per boosting round

6 group weights per boosting round

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Protected groups augmentation (protected positives and protected negatives)


Adult 10%, 20 boosting rounds

Adult 10%, 20 boosting rounds

Compas-race 10%, 20 boosting rounds

Compas-gender 10%, 20 boosting rounds

For datasets with less imbalance, this approach seems to work better (compare the results above). It is clear that augmenting the protected negatives increases the TNR, and not only for the protected subgroup of instances but also for the non-protected group. Compare these results with the ones below, where we apply SMOTE based on the minority class; a sketch of both augmentation strategies follows at the end of this section.





Adult 10%, 100 boosting rounds

Adult 10%, 100 boosting rounds

Compas-race 10%, 100 boosting rounds

Compas-gender 10%, 100 boosting rounds
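The two augmentation strategies compared above can be sketched as follows. Restricting SMOTE to a subgroup (e.g. the protected negatives) is done here with an auxiliary-label trick; this is one possible implementation, not necessarily the one used in FairSMOTEBoost.

```python
import numpy as np
from imblearn.over_sampling import SMOTE

def oversample_subgroup(X, y, mask, n_new):
    """Generate n_new synthetic samples for the rows selected by `mask`
    (e.g. protected negatives). `mask` must lie within a single class."""
    aux = mask.astype(int)                    # auxiliary label: 1 = target subgroup
    sm = SMOTE(sampling_strategy={1: int(mask.sum()) + n_new})
    X_res, _ = sm.fit_resample(X, aux)        # originals first, synthetic rows last
    X_new = X_res[len(X):]
    y_new = np.full(len(X_new), y[mask][0])   # inherit the subgroup's class label
    return X_new, y_new

# strategy 1 (first set of charts): augment the protected negatives
#   X_new, y_new = oversample_subgroup(X, y, protected & (y == 0), n_new)
# strategy 2 (second set of charts): plain class-based SMOTE on the minority class
#   X_res, y_res = SMOTE(sampling_strategy={1: target_count}).fit_resample(X, y)
```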