Abstract
Ground-truth labeling is an important activity in machine learning. Many studies have examined how crowdworkers apply labels to records in machine learning datasets. However, there have been few studies that have examined the work of domain experts when their knowledge and expertise are needed to apply labels. We provide a grounded account of the work of labeling teams with domain experts, including the experiences of labeling, collaborative configurations and work-practices, and quality issues. We show three major patterns in the social design of ground truth data: Principled design, Iterative design, and Improvisational design. We interpret our results through theories of from Human Centered Data Science, and particularly work on human interventions in data science work through the design and creation of data.
Original language | English |
---|---|
Title of host publication | CHI '21: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems |
Editors | Pernille Bjrn, Steven Drucker |
Place of Publication | Online |
Publisher | Association for Computing Machinery (ACM) |
ISBN (Print) | 9781450380966 |
DOIs | |
Publication status | Published - 2021 |
Event | 2021 CHI Conference on Human Factors in Computing Systems - Yokohama, Japan, Japan Duration: 1 Jan 2021 → … https://dl.acm.org/action/showFmPdf?doi=10.1145%2F3411764 |
Conference
Conference | 2021 CHI Conference on Human Factors in Computing Systems |
---|---|
Country/Territory | Japan |
Period | 1/01/21 → … |
Other | May 8-13 |
Internet address |