Machine learning used to study risk factors for chronic diseases: A scoping review

Machine learning (ML) has received significant attention for its potential to process and learn from vast amounts of data. Our aim was to perform a scoping review to identify studies that used ML to study risk factors for chronic diseases at a population level, notably those that incorporated methods to mitigate algorithmic bias.

We focused on ML applications for the most common risk factors for chronic disease: tobacco use, alcohol use, unhealthy eating, physical activity, and psychological stress.

We searched the peer-reviewed, indexed literature and examined whether bias was considered and identified strategies employed to mitigate bias.

We found nine studies (45%) included some discussion of algorithmic bias. Studies that incorporated a broad array of sociodemographic variables did so primarily to improve the performance of a ML model rather than to mitigate potential harms to populations made vulnerable by social and economic policies.

Published in Canadian Journal of Public Health

READ Article

 

Authors: Mahek Shergill, Steve Durant, Sharon Birdi, Roxana Rabet, Carolyn Ziegler, Shehzad Ali, David Buckeridge, Marzyeh Ghassemi, Jennifer Gibson, Ava John-Baptiste, Jillian Macklin, Melissa McCradden, Kwame McKenzie, Parisa Naraei, Akwasi Owusu-Bempah, Laura C. Rosella, James Shaw, Ross Upshur, Sharmistha Mishra & Andrew D. Pinto

Year: 2025