Trial-by-trial probability that a participant responds with an action for each action–valence condition separated by DBS of the STN site. (A) Action–reward. (B) Action–avoid punishment. (C) Inhibition–reward. The GLM showed improved inhibition–reward with dorsal versus ventral DBS (OR = 2.49, t = 2.6, P < 0.05). (D) Inhibition–avoid punishment. Note that when learning takes place, the probability to act across trials is expected be high at the end of the 40 trials for the action conditions (A and B) and expected to be low at the end of the 40 trials for the inhibition learning conditions (C and D).