Dimitrakakis, Christos

Voici les éléments 1 - 1 sur 1

Publication

Accès libre

Minimax-Bayes Reinforcement Learning

2023, Thomas Kleine Buening, Dimitrakakis, Christos, Hannes Eriksson, Divya Grover, Emilio Jorge

While the Bayesian decision-theoretic framework offers an elegant solution to the problem of decision making under uncertainty, one question is how to appropriately select the prior distribution. One idea is to employ a worst-case prior. However, this is not as easy to specify in sequential decision making as in simple statistical estimation problems. This paper studies (sometimes approximate) minimax-Bayes solutions for various reinforcement learning problems to gain insights into the properties of the corresponding priors and policies. We find that while the worst-case prior depends on the setting, the corresponding minimax policies are more robust than those that assume a standard (i.e. uniform) prior.

Afficher

Dimitrakakis, Christos

Résultat de la recherche

Filtres

Auteur

Institution

Fichier(s) présent(s)

Type

Paramètres

Trier par

Résultats par page

Minimax-Bayes Reinforcement Learning

Options

Dimitrakakis, Christos

Résultat de la recherche

Filtres

Auteur

Institution

Fichier(s) présent(s)

Type

Paramètres

Trier par

Résultats par page

Minimax-Bayes Reinforcement Learning