Step 4: Clean acquired data

80 %

* We've missed an important step in AI training: data cleaning, i.e., fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset. By cleaning the training data, we're likely to reduce biases contained.

* Please modify the data points or add entries in the dataset which contains more gender bias. Your main goal is to make the dataset more fair in terms of gender.

Dataset1
Dataset2

* Now let's run the code again and see what changes! Again, please run Google Colab below to see the results on a new batch of candidates. After you go to Google Colab, select "Runtime->Run all".

Colab

* If you worked on Dataset1, the results are shown under the section Training With Dataset1. If you selected Dataset2, the results are shown under the section Training With Dataset2. Any biases noticed?

We're not done yet!

BackNext stepGo back to the homepage