TY - JOUR
T1 - Imperfect ImaGANation
T2 - Implications of GANs exacerbating biases on facial data augmentation and snapchat face lenses
AU - Jain, Niharika
AU - Olmo, Alberto
AU - Sengupta, Sailik
AU - Manikonda, Lydia
AU - Kambhampati, Subbarao
N1 - Funding Information:
Kambhampati's research is supported in part by ONR grants N00014-16-1-2892 , N00014-18-1-2442 , and N00014-18-1-2840 , AFOSR grant FA9550-18-1-0067 , NASA grant NNX17AD06G and a JP Morgan AI Faculty Research grant.
Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2022/3
Y1 - 2022/3
N2 - In this paper, we show that popular Generative Adversarial Network (GAN) variants exacerbate biases along the axes of gender and skin tone in the generated data. The use of synthetic data generated by GANs is widely used for a variety of tasks ranging from data augmentation to stylizing images. While practitioners celebrate this method as an economical way to obtain synthetic data to train data-hungry machine learning models or provide new features to users of mobile applications, it is unclear whether they recognize the perils of such techniques when applied to real world datasets biased along latent dimensions. Although one expects GANs to replicate the distribution of the original data, in real-world settings with limited data and finite network capacity, GANs suffer from mode collapse. First, we show readily-accessible GAN variants such as DCGANs ‘imagine’ faces of synthetic engineering professors that have masculine facial features and fair skin tones. When using popular GAN architectures that attempt to address mode-collapse, we observe that these variants either provide a false sense of security or suffer from other inherent limitations due to their design choice. Second, we show that a conditional GAN variant transforms input images of female and nonwhite faces to have more masculine features and lighter skin when asked to generate faces of engineering professors. Worse yet, prevalent filters on Snapchat end up consistently lightening the skin tones in people of color when trying to make face images appear more feminine. Thus, our study is meant to serve as a cautionary tale for practitioners and educate them about the side-effect of bias amplification when applying GAN-based techniques.
AB - In this paper, we show that popular Generative Adversarial Network (GAN) variants exacerbate biases along the axes of gender and skin tone in the generated data. The use of synthetic data generated by GANs is widely used for a variety of tasks ranging from data augmentation to stylizing images. While practitioners celebrate this method as an economical way to obtain synthetic data to train data-hungry machine learning models or provide new features to users of mobile applications, it is unclear whether they recognize the perils of such techniques when applied to real world datasets biased along latent dimensions. Although one expects GANs to replicate the distribution of the original data, in real-world settings with limited data and finite network capacity, GANs suffer from mode collapse. First, we show readily-accessible GAN variants such as DCGANs ‘imagine’ faces of synthetic engineering professors that have masculine facial features and fair skin tones. When using popular GAN architectures that attempt to address mode-collapse, we observe that these variants either provide a false sense of security or suffer from other inherent limitations due to their design choice. Second, we show that a conditional GAN variant transforms input images of female and nonwhite faces to have more masculine features and lighter skin when asked to generate faces of engineering professors. Worse yet, prevalent filters on Snapchat end up consistently lightening the skin tones in people of color when trying to make face images appear more feminine. Thus, our study is meant to serve as a cautionary tale for practitioners and educate them about the side-effect of bias amplification when applying GAN-based techniques.
KW - Algorithmic bias
KW - Data augmentation
KW - Generative adversarial networks (GANs)
KW - Social media
KW - Societal impacts
UR - http://www.scopus.com/inward/record.url?scp=85122493539&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122493539&partnerID=8YFLogxK
U2 - 10.1016/j.artint.2021.103652
DO - 10.1016/j.artint.2021.103652
M3 - Article
AN - SCOPUS:85122493539
SN - 0004-3702
VL - 304
JO - Artificial Intelligence
JF - Artificial Intelligence
M1 - 103652
ER -