Rollout sampling approximate policy iteration

Author(s)
Dimitrakakis, Christos  
Chaire de science des données  
Michail G. Lagoudakis
Date issued
2008
In
Machine Learning
Vol
72
No
3
Subjects
Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC)
Abstract
Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervised learning problem. This paper proposes variants of an improved policy iteration scheme that treats the core sampling problem in evaluating a policy through simulation as a multi-armed bandit machine. The resulting algorithm offers performance comparable to that of the previous algorithm, but with significantly less computational effort. An order of magnitude improvement is demonstrated experimentally in two standard reinforcement learning domains: inverted pendulum and mountain-car.
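The abstract's central idea, treating the allocation of simulation rollouts across states as a multi-armed bandit problem, can be illustrated with a minimal Python sketch. This is not the paper's algorithm: the function name `allocate_rollouts`, the UCB-style score, the Hoeffding-style stopping test (which is not corrected for repeated checks), and the toy simulator are all assumptions made for illustration. Each state is an "arm"; sampling a state means running one rollout per action, and a state stops being sampled once one action appears to dominate the others with confidence.

```python
import math
import random


def allocate_rollouts(states, actions, rollout, budget,
                      delta=0.05, value_range=1.0, seed=0):
    """Illustrative bandit-style rollout allocation (hypothetical sketch).

    rollout(state, action, rng) must return the return of one simulated
    trajectory, assumed to lie in [0, value_range]. At least two actions
    are required. Returns (labels, used): labels maps each confidently
    resolved state to its apparently dominating action, and used is the
    number of rollouts spent.
    """
    rng = random.Random(seed)
    sums = {s: {a: 0.0 for a in actions} for s in states}
    pulls = {s: 0 for s in states}   # rollouts per action performed at s
    labeled = {}
    used = 0
    total_pulls = 0

    def gap(s):
        # Empirical gap between the best and second-best action at s.
        means = sorted((sums[s][a] / pulls[s] for a in actions), reverse=True)
        return means[0] - means[1]

    def score(s):
        # UCB-style score: unsampled states first, then large gap plus bonus.
        if pulls[s] == 0:
            return math.inf
        bonus = value_range * math.sqrt(
            2.0 * math.log(total_pulls + 1) / pulls[s])
        return gap(s) + bonus

    while used + len(actions) <= budget and len(labeled) < len(states):
        s = max((x for x in states if x not in labeled), key=score)
        for a in actions:
            sums[s][a] += rollout(s, a, rng)
        pulls[s] += 1
        total_pulls += 1
        used += len(actions)
        # Hoeffding-style test: stop sampling s once the gap is significant.
        eps = value_range * math.sqrt(
            2.0 * math.log(2.0 * len(actions) / delta) / pulls[s])
        if gap(s) > eps:
            labeled[s] = max(actions, key=lambda a: sums[s][a])
    return labeled, used


def noisy_rollout(state, action, rng):
    # Toy simulator (assumption): the "right" action succeeds 90% of the time.
    p = 0.9 if action == state % 2 else 0.1
    return 1.0 if rng.random() < p else 0.0


labels, used = allocate_rollouts(list(range(5)), (0, 1), noisy_rollout,
                                 budget=2000)
print(labels, used)
```

In a classifier-based policy iteration scheme, the resulting `labels` would serve as training examples (state, dominating action) for the next policy; states left unlabeled when the budget runs out are simply omitted from the training set in this sketch.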
Publication type
journal article
Identifiers
https://libra.unine.ch/handle/20.500.14713/64430
DOI
10.1007/s10994-008-5069-3
arXiv
0805.2027v2
File(s)
Name
0805.2027.pdf
Type
Main Article
Size
253.08 KB
Format
Adobe PDF
