| Keyword search (4,163 papers available) | ![]() |
"Reinforcement learning" Keyword-tagged Publications:
| Title | Authors | PubMed ID | |
|---|---|---|---|
| 1 | Disentangling prediction error and value in a formal test of dopamine s role in reinforcement learning | Usypchuk AA; Maes EJP; Lozzi M; Avramidis DK; Schoenbaum G; Esber GR; Gardner MPH; Iordanova MD; | 40738112 CSBN |
| 2 | Comprehensive review of reinforcement learning for medical ultrasound imaging | Elmekki H; Islam S; Alagha A; Sami H; Spilkin A; Zakeri E; Zanuttini AM; Bentahar J; Kadem L; Xie WF; Pibarot P; Mizouni R; Otrok H; Singh S; Mourad A; | 40567264 ENCS |
| 3 | Machine learning innovations in CPR: a comprehensive survey on enhanced resuscitation techniques | Islam S; Rjoub G; Elmekki H; Bentahar J; Pedrycz W; Cohen R; | 40336660 ENCS |
| 4 | Computational neuroscience across the lifespan: Promises and pitfalls | van den Bos W; Bruckner R; Nassar MR; Mata R; Eppinger B; | 29066078 PSYCHOLOGY |
| 5 | Does phasic dopamine release cause policy updates? | Carter F; Cossette MP; Trujillo-Pisanty I; Pallikaras V; Breton YA; Conover K; Caplan J; Solis P; Voisard J; Yaksich A; Shizgal P; | 38039083 PSYCHOLOGY |
| 6 | Nonlinear dynamic modeling and model-based AI-driven control of a magnetoactive soft continuum robot in a fluidic environment | Moezi SA; Sedaghati R; Rakheja S; | 37932207 ENCS |
| 7 | Sub-hourly measurement datasets from 6 real buildings: Energy use and indoor climate | Sartori I; Walnum HT; Skeie KS; Georges L; Knudsen MD; Bacher P; Candanedo J; Sigounis AM; Prakash AK; Pritoni M; Granderson J; Yang S; Wan MP; | 37153123 ENCS |
| 8 | Reinforcement learning for automatic quadrilateral mesh generation: A soft actor-critic approach | Pan J; Huang J; Cheng G; Zeng Y; | 36375347 ENCS |
| 9 | Trust-Augmented Deep Reinforcement Learning for Federated Learning Client Selection | Rjoub G; Wahab OA; Bentahar J; Cohen R; Bataineh AS; | 35875592 ENCS |
| 10 | Designing a hybrid reinforcement learning based algorithm with application in prediction of the COVID-19 pandemic in Quebec. | Khalilpourazari S, Hashemi Doulabi H | 33424076 ENCS |
| 11 | Cue-Evoked Dopamine Neuron Activity Helps Maintain but Does Not Encode Expected Value. | Mendoza JA, Lafferty CK, Yang AK, Britt JP | 31693885 CSBN |
| 12 | Metacontrol of decision-making strategies in human aging. | Bolenz F, Kool W, Reiter AM, Eppinger B | 31397670 PERFORM |
| 13 | Developmental Changes in Learning: Computational Mechanisms and Social Influences. | Bolenz F, Reiter AMF, Eppinger B | 29250006 PERFORM |
| Title: | Does phasic dopamine release cause policy updates? | ||||
| Authors: | Carter F, Cossette MP, Trujillo-Pisanty I, Pallikaras V, Breton YA, Conover K, Caplan J, Solis P, Voisard J, Yaksich A, Shizgal P | ||||
| Link: | https://pubmed.ncbi.nlm.nih.gov/38039083/ | ||||
| DOI: | 10.1111/ejn.16199 | ||||
| Publication: | The European journal of neuroscience | ||||
| Keywords: | intracranial self-stimulation; operant conditioning; reinforcement learning; reward; | ||||
| PMID: | 38039083 | Category: | Date Added: | 2023-12-01 | |
| Dept Affiliation: |
PSYCHOLOGY
1 Department of Psychology, Concordia University, Montreal, Quebec, Canada. 2 Montreal Institute for Learning Algorithms, Université de Montréal, Montreal, Quebec, Canada. 3 Department of Psychology, Langara College, Vancouver, British Columbia, Canada. |
||||
Description: |
Phasic dopamine activity is believed to both encode reward-prediction errors (RPEs) and to cause the adaptations that these errors engender. If so, a rat working for optogenetic stimulation of dopamine neurons will repeatedly update its policy and/or action values, thus iteratively increasing its work rate. Here, we challenge this view by demonstrating stable, non-maximal work rates in the face of repeated optogenetic stimulation of midbrain dopamine neurons. Furthermore, we show that rats learn to discriminate between world states distinguished only by their history of dopamine activation. Comparison of these results to reinforcement learning simulations suggests that the induced dopamine transients acted more as rewards than RPEs. However, pursuit of dopaminergic stimulation drifted upwards over a time scale of days and weeks, despite its stability within trials. To reconcile the results with prior findings, we consider multiple roles for dopamine signalling. |



