PERFORM Publications

Simulating federated learning for steatosis detection using ultrasound images

Authors: Qi Y, Vianna P, Cadrin-Chênevert A, Blanchet K, Montagnon E, Belilovsky E, Wolf G, Mullie LA, Cloutier G, Chassé M, Tang A

Affiliations

1 Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CRCHUM), Montréal, QC, Canada.
2 Institute of Biomedical Engineering, Université de Montréal, Montréal, QC, Canada.
3 Laboratory of Biorheology and Medical Ultrasonics - CRCHUM, Montréal, QC, Canada.
4 Radiology, Radiation Oncology and Nuclear Medicine Department, Université de Montréal, Montréal, QC, Canada.
5 CISSS de Lanaudière, Joliette, QC, Canada.
6 Clinical Laboratory of Image Processing - CRCHUM, Montréal, QC, Canada.
7 Mila - Quebec Artificial Intelligence Institute, Montréal, QC, Canada.
8 Concordia University, Montréal, QC, Canada.
9 Department of Mathematics and Statistics, Université de Montréal, Montréal, QC, Canada.
10 Department of Medicine, Division of Critical Care Medicine, Centre Hospitalier de l'Université de Montréal (CHUM), Montréal, QC, Canada.
11 Faculty of Medicine, Université de Montréal, Montréal, QC, Canada.
12 Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CRCHUM), Montréal, QC, Canada. an.tang@umontreal.ca.
13 Clinical Laboratory of Image Processing - CRCHUM, Montréal, QC, Canada. an.tang@umontreal.ca.
14 Faculty of Medicine, Université de Montréal, Montréal, QC, Canada. an.tang@umontreal.ca.
15 Département de Radiologie, Centre Hospitalier de l'Université de Montréal (CHUM), 1058 Rue Saint-Denis, Montréal, QC, H2X 3J4, Canada. an.tang@umontreal.ca.

Description

We aimed to implement four data partitioning strategies evaluated with four federated learning (FL) algorithms and investigate the impact of data distribution on FL model performance in detecting steatosis using B-mode US images. A private dataset (153 patients; 1530 images) and a public dataset (55 patient; 550 images) were included in this retrospective study. The datasets contained patients with metabolic dysfunction-associated fatty liver disease (MAFLD) with biopsy-proven steatosis grades and control individuals without steatosis. We employed four data partitioning strategies to simulate FL scenarios and we assessed four FL algorithms. We investigated the impact of class imbalance and the mismatch between the global and local data distributions on the learning outcome. Classification performance was assessed with area under the receiver operating characteristic curve (AUC) on a separate test set. AUCs were 0.93 (95% CI 0.92, 0.94) for source-based partitioning scenario with FedAvg, 0.90 (95% CI 0.89, 0.91) for a centralized model, and 0.83 (95% CI 0.81, 0.85) for a model trained in a single-center scenario. When data was perfectly balanced on the global level and each site had an identical data distribution, the model yielded an AUC of 0.90 (95% CI 0.88, 0.92). When each site contained data exclusively from one single class, irrespective of the global data distribution, the AUC fell in the range of 0.34-0.70. FL applied to B-mode US images provide performance comparable to a centralized model and higher than single-center scenario. Global data imbalance and local data heterogeneity influenced the learning outcome.

Keywords: B-mode ultrasound image; Class imbalance; Data partition; Federated learning; Steatosis;

Links

PubMed: https://pubmed.ncbi.nlm.nih.gov/38858500/

DOI: 10.1038/s41598-024-63969-x

Search publications

No publications found.

Simulating federated learning for steatosis detection using ultrasound images

Affiliations

Description

Links