| Keyword search (4,163 papers available) | ![]() |
"reinforcement learning" Keyword-tagged Publications:
| Title | Authors | PubMed ID | |
|---|---|---|---|
| 1 | Disentangling prediction error and value in a formal test of dopamine s role in reinforcement learning | Usypchuk AA; Maes EJP; Lozzi M; Avramidis DK; Schoenbaum G; Esber GR; Gardner MPH; Iordanova MD; | 40738112 CSBN |
| 2 | Comprehensive review of reinforcement learning for medical ultrasound imaging | Elmekki H; Islam S; Alagha A; Sami H; Spilkin A; Zakeri E; Zanuttini AM; Bentahar J; Kadem L; Xie WF; Pibarot P; Mizouni R; Otrok H; Singh S; Mourad A; | 40567264 ENCS |
| 3 | Machine learning innovations in CPR: a comprehensive survey on enhanced resuscitation techniques | Islam S; Rjoub G; Elmekki H; Bentahar J; Pedrycz W; Cohen R; | 40336660 ENCS |
| 4 | Computational neuroscience across the lifespan: Promises and pitfalls | van den Bos W; Bruckner R; Nassar MR; Mata R; Eppinger B; | 29066078 PSYCHOLOGY |
| 5 | Does phasic dopamine release cause policy updates? | Carter F; Cossette MP; Trujillo-Pisanty I; Pallikaras V; Breton YA; Conover K; Caplan J; Solis P; Voisard J; Yaksich A; Shizgal P; | 38039083 PSYCHOLOGY |
| 6 | Nonlinear dynamic modeling and model-based AI-driven control of a magnetoactive soft continuum robot in a fluidic environment | Moezi SA; Sedaghati R; Rakheja S; | 37932207 ENCS |
| 7 | Sub-hourly measurement datasets from 6 real buildings: Energy use and indoor climate | Sartori I; Walnum HT; Skeie KS; Georges L; Knudsen MD; Bacher P; Candanedo J; Sigounis AM; Prakash AK; Pritoni M; Granderson J; Yang S; Wan MP; | 37153123 ENCS |
| 8 | Reinforcement learning for automatic quadrilateral mesh generation: A soft actor-critic approach | Pan J; Huang J; Cheng G; Zeng Y; | 36375347 ENCS |
| 9 | Trust-Augmented Deep Reinforcement Learning for Federated Learning Client Selection | Rjoub G; Wahab OA; Bentahar J; Cohen R; Bataineh AS; | 35875592 ENCS |
| 10 | Designing a hybrid reinforcement learning based algorithm with application in prediction of the COVID-19 pandemic in Quebec. | Khalilpourazari S, Hashemi Doulabi H | 33424076 ENCS |
| 11 | Cue-Evoked Dopamine Neuron Activity Helps Maintain but Does Not Encode Expected Value. | Mendoza JA, Lafferty CK, Yang AK, Britt JP | 31693885 CSBN |
| 12 | Metacontrol of decision-making strategies in human aging. | Bolenz F, Kool W, Reiter AM, Eppinger B | 31397670 PERFORM |
| 13 | Developmental Changes in Learning: Computational Mechanisms and Social Influences. | Bolenz F, Reiter AMF, Eppinger B | 29250006 PERFORM |
| Title: | Trust-Augmented Deep Reinforcement Learning for Federated Learning Client Selection | ||||
| Authors: | Rjoub G, Wahab OA, Bentahar J, Cohen R, Bataineh AS | ||||
| Link: | https://pubmed.ncbi.nlm.nih.gov/35875592/ | ||||
| DOI: | 10.1007/s10796-022-10307-z | ||||
| Publication: | Information systems frontiers : a journal of research and innovation | ||||
| Keywords: | COVID-19 detection; Deep reinforcement learning; Edge computing; Federated learning; Internet of things (IoT); Transfer learning; | ||||
| PMID: | 35875592 | Category: | Date Added: | 2022-07-25 | |
| Dept Affiliation: |
ENCS
1 Concordia Institute for Information Systems Engineering, Concordia University, 1455 De Maisonneuve Blvd. W.2, Montreal, H3G 1M8 Quebec Canada. 2 Department of Computer Science and Engineering, Université du Québec en Outaouais, 101, Saint-Jean-Bosco, C.P. 1250, succursale Hull, Gatineau, J8X 3X7 Quebec Canada. 3 David R. Cheriton School of Computer Science, University of Waterloo, 200 University Avenue West, Waterloo, N2L 3G1 ON Canada. |
||||
Description: |
In the context of distributed machine learning, the concept of federated learning (FL) has emerged as a solution to the privacy concerns that users have about sharing their own data with a third-party server. FL allows a group of users (often referred to as clients) to locally train a single machine learning model on their devices without sharing their raw data. One of the main challenges in FL is how to select the most appropriate clients to participate in the training of a certain task. In this paper, we address this challenge and propose a trust-based deep reinforcement learning approach to select the most adequate clients in terms of resource consumption and training time. On top of the client selection mechanism, we embed a transfer learning approach to handle the scarcity of data in some regions and compensate potential lack of learning at some servers. We apply our solution in the healthcare domain in a COVID-19 detection scenario over IoT devices. In the considered scenario, edge servers collaborate with IoT devices to train a COVID-19 detection model using FL without having to share any raw confidential data. Experiments conducted on a real-world COVID-19 dataset reveal that our solution achieves a good trade-off between detection accuracy and model execution time compared to existing approaches. |



