Dimitrakakis, Christos
Name
Dimitrakakis, Christos
Main affiliation
Position
Professor
Email
christos.dimitrakakis@unine.ch
Identifiers
Search results
Showing items 1 - 3 of 3
- Publication (open access): On Meritocracy in Optimal Set Selection (2021-02-23)
  Thomas Kleine Buening; Meirav Segal; Debabrota Basu; Anne-Marie George
  Typically, merit is defined with respect to some intrinsic measure of worth. We instead consider a setting where an individual's worth is *relative*: when a Decision Maker (DM) selects a set of individuals from a population to maximise expected utility, it is natural to consider the *Expected Marginal Contribution* (EMC) of each person to the utility. We show that this notion satisfies an axiomatic definition of fairness for this setting. We also show that for certain policy structures, this notion of fairness is aligned with maximising expected utility, while for linear utility functions it is identical to the Shapley value. However, for certain natural policies, such as those that select individuals with a specific set of attributes (e.g. high enough test scores for college admissions), there is a trade-off between meritocracy and utility maximisation. We analyse the effect of policy constraints on both utility and fairness in extensive experiments based on college admissions and outcomes at Norwegian universities.
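  As an illustration of the EMC notion described in this abstract, here is a minimal Monte Carlo sketch (not the paper's code); the `utility` and `policy` functions and the toy scores are assumptions made for illustration. Because the toy utility is linear, the estimate reduces to the individual's own score, consistent with the Shapley-value remark above.

  ```python
  import random

  def expected_marginal_contribution(individual, population, utility, policy,
                                     n_samples=1000, rng=None):
      """Monte Carlo estimate of the Expected Marginal Contribution (EMC):
      the average change in utility from adding `individual` to a set that
      the Decision Maker's `policy` draws from the rest of the population."""
      rng = rng or random.Random(0)
      others = [p for p in population if p != individual]
      total = 0.0
      for _ in range(n_samples):
          selected = policy(others, rng)  # set chosen without the individual
          total += utility(selected | {individual}) - utility(selected)
      return total / n_samples

  # Toy example: utility is the sum of test scores; the policy admits a random subset.
  scores = {"a": 1.0, "b": 2.0, "c": 3.0}
  utility = lambda s: sum(scores[i] for i in s)
  policy = lambda pool, rng: {i for i in pool if rng.random() < 0.5}
  print(expected_marginal_contribution("c", list(scores), utility, policy))  # ~3.0
  ```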
- Publication (open access): Bayesian Reinforcement Learning via Deep, Sparse Sampling (2020)
  Divya Grover; Debabrota Basu
  We address the problem of Bayesian reinforcement learning using efficient model-based online planning. We propose an optimism-free Bayes-adaptive algorithm to induce deeper and sparser exploration, with a theoretical bound on its performance relative to the Bayes-optimal policy and a lower computational complexity. The main novelty is the use of a candidate policy generator to generate long-term options in the planning tree (over beliefs), which allows us to create much sparser and deeper trees. Experimental results on different environments show that, compared to the state of the art, our algorithm is more computationally efficient and obtains significantly higher reward in discrete environments.
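  A toy sketch of the sparse, deep planning idea, under assumptions not in the paper: a two-armed Bernoulli bandit with Beta posteriors stands in for the belief MDP, and three hand-written candidate policies stand in for the paper's candidate policy generator. At each node the planner rolls a few candidate policies forward for several steps instead of branching on every primitive action, which is what makes the tree sparser and deeper.

  ```python
  import random
  from dataclasses import dataclass

  @dataclass(frozen=True)
  class BetaBelief:
      """Posterior over two Bernoulli arms as (alpha, beta) pairs."""
      params: tuple

      def step(self, arm, rng):
          a, b = self.params[arm]
          r = 1.0 if rng.random() < a / (a + b) else 0.0  # marginal reward prob.
          new = list(self.params)
          new[arm] = (a + r, b + 1.0 - r)                 # Bayesian posterior update
          return BetaBelief(tuple(new)), r

  def plan(belief, policies, depth, width=3, horizon=2, gamma=0.95, rng=None):
      """Sparse sampling over beliefs: expand only a few candidate policies
      (long-term options) per node, each followed for `horizon` steps."""
      rng = rng or random.Random(0)
      if depth == 0:
          return 0.0
      best = float("-inf")
      for pi in policies:
          value = 0.0
          for _ in range(width):                          # sparse outcome samples
              b, ret, discount = belief, 0.0, 1.0
              for _ in range(horizon):                    # follow the option
                  b, r = b.step(pi(b, rng), rng)
                  ret += discount * r
                  discount *= gamma
              ret += discount * plan(b, policies, depth - 1, width, horizon, gamma, rng)
              value += ret / width
          best = max(best, value)
      return best

  # Candidate policies: each fixed arm, plus greedy on the posterior mean.
  arm0 = lambda b, rng: 0
  arm1 = lambda b, rng: 1
  greedy = lambda b, rng: max((0, 1), key=lambda i: b.params[i][0] / sum(b.params[i]))
  print(plan(BetaBelief(((1.0, 1.0), (2.0, 1.0))), [arm0, arm1, greedy], depth=2))
  ```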
- Publication (open access): Near-optimal Optimistic Reinforcement Learning using Empirical Bernstein Inequalities (2019)
  Aristide Tossou; Debabrota Basu
  We study model-based reinforcement learning in an unknown finite communicating Markov decision process. We propose a simple algorithm that leverages a variance-based confidence interval. We show that the proposed algorithm, UCRL-V, achieves the optimal regret $\tilde{O}(\sqrt{DSAT})$ up to logarithmic factors, so our work closes the gap with the lower bound without additional assumptions on the MDP. We perform experiments in a variety of environments that validate the theoretical bounds and show UCRL-V to outperform state-of-the-art algorithms.
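  The variance-based confidence interval mentioned in this abstract can be sketched with an empirical Bernstein bound, here in the form given by Maurer and Pontil (2009); the constants differ slightly across statements in the literature, and this is a generic illustration rather than UCRL-V itself. The key property is that the radius involves the sample variance, so it tightens quickly for low-variance quantities.

  ```python
  import math

  def empirical_bernstein_radius(samples, value_range, delta):
      """Radius of an empirical Bernstein confidence interval: with probability
      at least 1 - delta, the true mean lies within this radius of the sample
      mean. Requires at least two samples bounded in an interval of length
      `value_range`."""
      n = len(samples)
      mean = sum(samples) / n
      var = sum((x - mean) ** 2 for x in samples) / (n - 1)  # unbiased variance
      log_term = math.log(2.0 / delta)
      return (math.sqrt(2.0 * var * log_term / n)
              + 7.0 * value_range * log_term / (3.0 * (n - 1)))

  # Example: a 95% interval around the empirical mean of bounded rewards.
  rewards = [0, 1, 1, 0, 1, 1, 1, 0, 1, 1]
  print(empirical_bernstein_radius(rewards, value_range=1.0, delta=0.05))
  ```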