News

Drift diffusion models (DDM) are fundamental to our understanding of perceptual decision-making. Here, the authors show that DDM can implement optimal choice strategies in value-based decisions ...
The first paper, The Value Function Polytope in Reinforcement Learning, is written by Google Brain's Robert Dadashi, Adrien Ali Taïga, Nicolas Le Roux, Dale Schuurmans, and Marc G. Bellemare ...
We then develop two least squares Monte Carlo algorithms based on the dynamic programming principle to efficiently value these contracts. In these methods, conditional expectations are classically ...