Gain/variability tradeoffs in undiscounted Markov Decision Processes
(Institute of Electrical and Electronics Engineers, 1985)
We consider a finite state/action Markov Decision Process over an infinite time horizon with the limiting average reward criterion. However, we are interested not only in maximizing this reward criterion but ...