Machine learning used to study risk factors for chronic diseases: A scoping review
Machine learning (ML) has received significant attention for its potential to process and learn from vast amounts of data. Our aim was to perform a scoping review to identify studies that used ML to study risk factors for chronic diseases at a population level, notably those that incorporated methods to mitigate algorithmic bias.
We focused on ML applications for the most common risk factors for chronic disease: tobacco use, alcohol use, unhealthy eating, physical activity, and psychological stress.
We searched the peer-reviewed, indexed literature and examined whether bias was considered and identified strategies employed to mitigate bias.
We found nine studies (45%) included some discussion of algorithmic bias. Studies that incorporated a broad array of sociodemographic variables did so primarily to improve the performance of a ML model rather than to mitigate potential harms to populations made vulnerable by social and economic policies.
Published in Canadian Journal of Public Health