TY - JOUR
T1 - Using public data to measure diversity in computer science research communities
T2 - A critical data governance perspective
AU - Bosua, Rachelle
AU - Cheong, Marc
AU - Clark, Karin
AU - Clifford, Damian
AU - Coghlan, Simon
AU - Culnane, Chris
AU - Leins, Kobi
AU - Richardson, Megan
N1 - Publisher Copyright:
© 2022 Rachelle Bosua, Marc Cheong, Karin Clark, Damian Clifford, Simon Coghlan, Chris Culnane, Kobi Leins, Megan Richardson
PY - 2022/4
Y1 - 2022/4
N2 - Encouraging and supporting diversity and inclusion in computer science research communities is a critical issue for many reasons, including the ethical and robust design, delivery and publication of research that addresses real-world situations ranging from the use of digital tools in health to predictive policing to workplace hiring practices, just to name a few. One way to measure diversity is to apply analytical research methods to data sourced from the public domain for use in research. However, attempts to measure diversity using public data may themselves raise legal and ethical questions about the provenance of the data, research methods adopted, and treatment of diversity in the publication of results. This article interrogates the challenges of measuring diversity using public data, examining an illustrative case study framed around an academic research project at an Australian university using a public data set to identify gender representation in computer science communities. Employing a critical data governance perspective, we point to a range of ethical and legal concerns and recommend greater regulatory guardrails to better balance public interests in research and the privacy, data protection and other ethical interests of research subjects.
AB - Encouraging and supporting diversity and inclusion in computer science research communities is a critical issue for many reasons, including the ethical and robust design, delivery and publication of research that addresses real-world situations ranging from the use of digital tools in health to predictive policing to workplace hiring practices, just to name a few. One way to measure diversity is to apply analytical research methods to data sourced from the public domain for use in research. However, attempts to measure diversity using public data may themselves raise legal and ethical questions about the provenance of the data, research methods adopted, and treatment of diversity in the publication of results. This article interrogates the challenges of measuring diversity using public data, examining an illustrative case study framed around an academic research project at an Australian university using a public data set to identify gender representation in computer science communities. Employing a critical data governance perspective, we point to a range of ethical and legal concerns and recommend greater regulatory guardrails to better balance public interests in research and the privacy, data protection and other ethical interests of research subjects.
KW - Critical data governance
KW - Data protection
KW - Diversity
KW - Privacy
KW - Public data
UR - http://www.scopus.com/inward/record.url?scp=85124472390&partnerID=8YFLogxK
U2 - 10.1016/j.clsr.2022.105655
DO - 10.1016/j.clsr.2022.105655
M3 - Article
SN - 0267-3649
VL - 44
JO - Computer Law and Security Review
JF - Computer Law and Security Review
M1 - 105655
ER -