EWRL-5 Home Page

WORKSHOP PRELIMINARY PROGRAM

Utrecht, The Netherlands, October 5-6, 2001

Friday 5 October 2001

09:30 - Registration and Coffee

10:00 - Introductory remarks

10.10 - 11.10 Policy Search Session chair: Marco Wiering

10:10 - Policy Search using a State-Policy Evaluation Function
Malcolm Strens, QinetiQ Center for Robotics & Machine Vision, mjstrens@QinetiQ.com

10:30 - Gradient-based Reinforcement Planning in Policy-Search Methods
Ivo Kwee, Marcus Hutter and Juergen Schmidhuber, IDSIA, Lugano, ivo@idsia.ch

10:50 - Policy Improvement for Several Environments
Andreas Matt and Georg Regensburger, University of Innsbruck, andreas.matt@uibk.ac.at and georg.regensburger@uibk.ac.at

11:10 - Coffee break

11.40 - 12.40 POMDPs and Combinatorial Optimization Session chair: Marco Dorigo

11:40 - Learning to use Contextual Information for Solving Partially Observed Markov Decision Problems
Alain Dutech and Bruno Scherrer, Loria/Inria, France, dutech@loria.fr

12:00 - Reinforcement learning in non-Markovian domains using LSTM recurrent neural networks
Bram Bakker, Dept. of Psychology, Leiden University, bbakker@fsw.leidenuniv.nl

12:20 - Reinforcement Learning of Combinatorial Optimization Problem with Ant algorithm
Nicolas Meuleau, MIT, nm@ai.mit.edu

12:40 - Lunch

14.00 - 14.40 Function Approximation Session chair: Jeremy Wyatt

14:00 - Hippocampal Spatial Model for State Space Representation in Robotic Reinforcement Learning
Angelo Arleo and Wulfram Gerstner, EPFL Lausanne, Switzerland, angelo.arleo@epfl.ch

14:20 - Reinforcement Learning and the Perception of Time Intervals
Jon Shapiro, University of Manchester, jls@cs.man.ac.uk

14:40 - Coffee break

15.10 - 16.10 Multi-Agent RL Session chair: Juergen Schmidhuber

15:10 - A Markov Model for Dyadic Interaction Learning
Walter Gutjahr and Anselm Eder, University of Vienna, Austria, walter.gutjahr@univie.ac.at

15:30 - Learning Fair Periodical Policies
Katja Verbeeck, Ann Nowé, and Johan Parent, Vrije Universiteit Brussel, Belgium, kaverbee@vub.ac.be

15:50 - A non supervised multi-reinforcement agents architecture to model the development of behavior of living organisms
Philippe Preux, C. Cassagnabere, S. Delepoulle and J-C Darcheville, Laboratoire d'Informatique du Littoral, France, philippe.preux@lil.univ-littoral.fr

16:10 - Coffee break

16.40 - 17.20 Exploration Session chair: Malcolm Strens

16:40 - Advances in exploration control in reinforcement learning
Jeremy Wyatt and Funlade Sunmola, University of Birmingham, jlw@cs.bham.ac.uk

17:00 - The Curse of Optimism
Stuart Reynolds, The University of Birmingham, sir@cs.bham.ac.uk

20:00 - Social dinner (organized by EWRL-5)

Saturday 6 October 2001

09:40 - Robotic Reinforcement learning and the use of teacher signals (Invited talk)
Leslie Kaelbling, MIT, USA

10:30 - Coffee break

11.00 - 12.20 Hierarchy Session chair: Stuart Reynolds

11:00 - Looking for Scalable Agents
Olivier Buffet and Alain Dutech, LORIA/INRIA, France, buffet@loria.fr

11:20 - Using Multi-step Actions for Faster Reinforcement Learning
Ralf Schoknecht and Martin Riedmiller, University of Karlsruhe, Germany, schokn@ira.uka.de

11:40 - Learning Digger using Hierarchical Reinforcement Learning for Concurrent Goals
Kurt Driessens and Hendrik Blockeel, Katholieke Universiteit Leuven, Belgium, kurt.driessens@cs.kuleuven.ac.be

12:00 - ATD and AQ-learning: reward baseline reinforcement learning algorithms for discounted reward problems
Frederick Garcia, INRA/BIA, France, fgarcia@toulouse.inra.fr

12:20 - Lunch

13.50 - 14.50 Different Session chair: Leslie Kaelbling

13:50 - Experience Stack Reinforcement Learning: An Online Forward lambda-Return Method
Stuart Reynolds, The University of Birmingham, sir@cs.bham.ac.uk

14:10 - Kolmogorov Complexity and universal learners
Juergen Schmidhuber, IDSIA, Switzerland, juergen@idsia.ch

14:30 - Universal Sequential Decisions in Unknown Environments
Marcus Hutter, IDSIA, Switzerland, marcus@idsia.ch

14:50 - Coffee break

15.20 - 16.20 Applications Session chair: Ralf Schoknecht

15:20 - Scheduling with adaptive agents - an empirical evaluation
Werner Hunger and Martin Riedmiller, University of Karlsruhe, Germany, riedml@ira.uka.de

15:40 - Order Acceptance with Reinforcement Learning
Marisela Mainegra Hing and Aart van Harten, University of Twente, The Netherlands, M.MainegraHing@sms.utwente.nl

16:00 - Applying Reinforcement Learning To ITS For Adapting Learning Situations
Abdellah Bennane, I. Michiels, B. Manderick, and T. D'Hondt, Vrije Universiteit Brussel, Belgium, abennane@vub.ac.be

16:20 - Final Discussion