[GSOC]: Fair Data Reweighting

**Project Description:** Fair data reweighting is a simple and effective pre-processing to ensure model fairness. The goal of this project is to design and develop an open source implementation of fair data reweighting, demonstrate its effectiveness on publicly available datasets and perform a study of how the algorithm performance and efficiency scales with increasing the number of protected subgroups. Time permitting, the project will also involve exploring algorithmic techniques for improving the reweighting algorithm and/or making it faster.

Towards the above goal we encourage participants to first complete the following starter tasks.

**Note:** The following tasks are open for multiple submissions. Anyone interested in the project is encouraged so solve the tasks below and submit their work.

**Starter Task 1:** Train a one layer fully connected network in TensorFlow to solve the binary classification problem of income prediction for the [[UCI Adult Dataset](https://archive.ics.uci.edu/ml/datasets/adult)]. Compute the overall model accuracy on the test set and also compute the performance gap in the model accuracy for points with sex=Male vs. points with sex=Female. 

**Starter Task 2:** Recompute the same metrics and performance gap as above via the [TensorFlow Fairness Indicators](https://github.com/tensorflow/fairness-indicators) library.

**Starter Task 3:** Partition the training set into two subsets based on sex. Compute a new training set of the same size with half of the points coming from each set (implement uniform sampling with replacement). Re-train the model on the new training set and recompute the metrics and performance gap. Report your observations. Suggest one or two ways to improve the sampling of the training set that may result in a reduced performance gap.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GSOC]: Fair Data Reweighting #28

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[GSOC]: Fair Data Reweighting #28

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions