Ethical and Socially-Aware Data Labels

Information Management and Big Data. SIMBig 2018. Communications in Computer and Information Science, vol 898. Springer, Cham
Elena Beretta, Antonio Vetrò, Bruno Lepri, Juan Carlos De Martin
08 February 2019
PDF icon Beretta_Data_Ethics.pdf255.56 KB

Many software systems today make use of large amount of personal data to make recommendations or decisions that affect our daily lives. These software systems generally operate without guarantees of non-discriminatory practices, as instead often required to human decision-makers, and therefore are attracting increasing scrutiny. Our research is focused on the specific problem of biased software-based decisions caused from biased input data. In this regard, we propose a data labeling framework based on the identification of measurable data characteristics that could lead to downstream discriminating effects. We test the proposed framework on a real dataset, which allowed us to detect risks of discrimination for the case of population groups.