PPIG 2024 - 35th Annual Workshop
Automatic Bias Detection in Source Code Review
Yoseph Berhanu Alebachew, Chris Brown
Abstract: Bias is an inherent threat to human decision-making, including in decisions made during software development. Extensive research has demonstrated the presence of biases at various stages of the software development life-cycle. Notably, code reviews are highly susceptible to prejudice-induced biases, and individuals are often unaware of these biases as they occur. Developing methods to automatically detect these biases is crucial for addressing the challenges they pose. Recent advancements in visual data analytics have shown promising results in detecting potential biases by analyzing user interaction patterns. In this project, we propose a controlled experiment to extend this approach to detect potentially biased outcomes in code reviews by observing how reviewers interact with the code. We employ the "spotlight model of attention", a cognitive framework in which a reviewer’s gaze is tracked to determine their focus areas on the review screen. This focus, identified through gaze tracking, serves as an indicator of the reviewer’s areas of interest or concern. We plan to analyze the sequence of gaze focus using advanced sequence modeling techniques, including Markov Models, Recurrent Neural Networks (RNNs), and Conditional Random Fields (CRFs). These techniques will help us identify patterns that may suggest biased interactions. We anticipate that the ability to automatically detect potentially biased interactions in code reviews will significantly reduce unnecessary push-backs, enhance operational efficiency, and foster greater diversity and inclusion in software development. This approach not only helps identify biases but also helps create a more equitable development environment by mitigating them effectively.
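As a rough illustration of the kind of sequence analysis the abstract describes, the sketch below fits a first-order Markov model to gaze-focus sequences over screen regions and scores a new review session against it. The area-of-interest labels, the toy sequences, and the helper names are all hypothetical, not taken from the paper; the actual pipeline (and its RNN and CRF variants) may differ substantially.

```python
import numpy as np

# Hypothetical areas of interest (AOIs) on the review screen; the real AOI
# set would come from the study's gaze-tracking setup, not from this sketch.
AOIS = ["code", "diff_context", "author_info", "comments"]
IDX = {aoi: i for i, aoi in enumerate(AOIS)}

def fit_markov(sequences, n_states, alpha=1.0):
    """Estimate a first-order Markov transition matrix from gaze-focus
    sequences, with add-alpha smoothing to avoid zero probabilities."""
    counts = np.full((n_states, n_states), alpha)
    for seq in sequences:
        for src, dst in zip(seq, seq[1:]):
            counts[IDX[src], IDX[dst]] += 1
    return counts / counts.sum(axis=1, keepdims=True)

def log_likelihood(seq, trans):
    """Log-probability of a gaze sequence under a fitted transition matrix."""
    return sum(np.log(trans[IDX[s], IDX[d]]) for s, d in zip(seq, seq[1:]))

# Toy data: gaze sequences recorded while reviewing earlier pull requests.
baseline = [
    ["code", "diff_context", "code", "comments", "code"],
    ["code", "code", "diff_context", "code", "comments"],
]
candidate = ["author_info", "comments", "author_info", "code", "author_info"]

trans = fit_markov(baseline, len(AOIS))
print(f"log-likelihood under baseline model: {log_likelihood(candidate, trans):.2f}")
# A low likelihood relative to baseline reviews could flag the session for
# closer inspection, e.g., disproportionate focus on author identity.
```

In this framing, the RNN and CRF variants mentioned in the abstract would replace the fixed transition matrix with a learned sequence scorer, but the underlying idea of flagging gaze-transition patterns that deviate from typical review behavior is the same.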