Publications
- N. Stolt Anso, A.O. Wiehe, M.M. Drugan and M.A. Wiering  Deep Reinforcement Learning for Pellet Eating in Agar.io  International Conference on Agents and Artificial Intelligence (ICAART), Prague, 2019
 - F. van Beers, A. Lindstrom, E. Okafor and M.A. Wiering  Deep Neural Networks with Intersection over Union Loss for Binary Image Segmentation  International Conference on Pattern Recognition Applications and Methods (ICPRAM), Prague, 2019
 - L. Boulogne, K. Dijkstra and M.A. Wiering  Extra Domain Data Generation with Generative Adversarial Nets  IEEE Symposium Series on Computational Intelligence (SSCI): Deep Learning (DL), 2018
 - J. Schilperoort, I. Mak, M.M. Drugan and M.A. Wiering  Learning to Play Pac-Xon with Q-Learning and Two Double Q-Learning Variants  IEEE Symposium Series on Computational Intelligence (SSCI): Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2018
 - R. Niel and M.A. Wiering  Hierarchical Reinforcement Learning for Playing a Dynamic Dungeon Crawler Game  IEEE Symposium Series on Computational Intelligence (SSCI): Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2018
 - M. Sabatelli, G. Louppe, P. Geurts, and M.A. Wiering  Deep Quality-Value (DQV) Learning. arXiv:1810.00368 [stat.ML], September 2018
 - F. Dal Canton, V.M. Quinten, and M.A. Wiering  Early Detection of Sepsis Induced Deterioration Using Machine Learning. Belgian-Dutch Conference on Machine Learning (Benelearn), 2018
 - M. Aslani, S. Seipel, M.S. Mesgari and M.A. Wiering Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown Tehran. Advanced Engineering Informatics, Vol 38, 639-655, 2018
 - K. Dijkstra, J. van de Loosdrecht, L.R.B. Schomaker and M.A. Wiering  CentroidNet: A Deep Neural Network for Joint Object Localization and Counting. The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, September 2018
 - A.O. Wiehe, N. Stolt Anso, M.M. Drugan and  M.A. Wiering  Sampled Policy Gradient for Learning to Play the Game Agar.io. arXiv:1809.05763 [cs.AI], September 2018
 - M.A. Wiering  Reinforcement learning: from methods to applications. Nieuw Archief voor Wiskunde, guest Eds. M. Wiering, J. Portegies and S. Bohte, vijfde serie, deel 19, nummer 3, pp 157-167, 2018
 - K. Dijkstra, J. van de Loosdrecht, L.R.B. Schomaker and M.A. Wiering  Hyperspectral demosaicking and crosstalk correction using deep learning. Machine Vision and Applications, pp 1-21, 2018
 - E. Okafor, L.R.B. Schomaker and  M.A. Wiering  An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals  Journal of Information and Telecommunication, pp 1-27, 2018 (best JIT paper 2018)
 - M. Aslani, S. Seipel and M.A. Wiering  Continuous Residual Reinforcement Learning for Traffic Signal Control Optimization. Canadian Journal of Civil Engineering, 2018
 - B. Verheij and M. Wiering Artificial Intelligence: 29th Benelux Conference, BNAIC 2017, Groningen, The Netherlands, Revised Selected Papers, Springer Book, 2018.
 - M. Pieters and M.A. Wiering  Comparing Generative Adversarial Network Techniques for Image Creation and Modification. arXiv:1803.09093 [cs.LG], March 2018
 - M. Pieters and M.A. Wiering  Comparison of Machine Learning Techniques for Multi-label Genre Classification. Springer Book: Artificial Intelligence: 29th Benelux Conference, BNAIC 2017, Revised Selected Papers, Eds. B. Verheij and M.A. Wiering, 2018. DOI: 10.1007/978-3-319-76892-2\_11, 2018. BEST PAPER AWARD FROM BNAIC'2017 Conference
 - P. Ozkohen, J. Visser, M. van Otterlo and M.A. Wiering  Learning to Play Donkey Kong Using Neural Networks and Reinforcement Learning. Springer Book: Artificial Intelligence, 29th Benelux Conference, BNAIC 2017, Revised Selected Papers. Eds. B. Verheij and M.A. Wiering, 2018. DOI: 10.1007/978-3-319-76892-2\_11, 2018. 
 - M. Aslani, M. Saadi Mesgari, S. Seipel and M.A. Wiering  Developing adaptive traffic signal control by actor-critic and direct exploration methods. Proceedings of the Institution of Civil Engineers - Transport, pp: 1-25, 2018
 - S. Knegt, M.M. Drugan and M.A. Wiering Opponent Modelling in the Game of Tron using Reinforcement Learning. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART),  2018.
 - J. Groot Kormelink, M.M. Drugan and M.A. Wiering Exploration Methods for Connectionist Q-Learning in Bomberman. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART),  2018.
 - G. Leuenberger and M.A. Wiering Actor-Critic Reinforcement Learning with Neural Networks in Continuous Games. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART),  2018, BEST PAPER AWARD.
 - J. van de Wolfshaar, M.A. Wiering and L. Schomaker Deep Learning Policy Quantization. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART),  2018.
 - R. Niel, J. Krebbers, M.M. Drugan and M.A. Wiering Hierarchical Reinforcement Learning for Real-Time Strategy Games. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART),  2018.
 - M. Sabatelli, F. Bidoia, V. Codreanu and M.A. Wiering Learning to Evaluate Chess Positions with Deep Neural Networks and Limited Lookahead. Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods (ICPRAM),  2018.
 - F. Bidoia, M. Sabatelli, A. Shantia, M.A. Wiering and L. Schomaker A Deep Convolutional Neural Network for Location Recognition and Geometry based Information. Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods (ICPRAM),  2018.
 - M. Aslani, M. Saadi Mesgari and M.A. Wiering Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events. Transportation Research Part C: Emerging Technologies, vol. 85, pp: 732-752, 2017
 - J. Hogervorst, E. Okafor and  M.A. Wiering Deep Colorization for Facial Gender Recognition. Preproceedings of the 29th Benelux Conference on Artificial Intelligence (BNAIC'2017), pp: 317-325, 2017.
 - H. Maathuis, L. Boulogne, M.A. Wiering and A. Sterk Predicting Chaotic Time Series using Machine Learning Techniques. Preproceedings of the 29th Benelux Conference on Artificial Intelligence (BNAIC'2017), pp: 326-340, 2017.
 - M. Schutten, M.A. Wiering and P. MacDougall Balancing Imbalances: On using reinforcement learning to increase stability in smart electricity grids. Preproceedings of the 29th Benelux Conference on Artificial Intelligence (BNAIC'2017), pp: 423-424, 2017.
 - J.C. Forte, M.A. Wiering, H.R. Bouma, F. de Geus, A.H. Epema. Predicting long-term mortality with first week post-operative data after Coronary Artery Bypass Grafting using Machine Learning models. Proceedings of Machine Learning for Healthcare, Vol. 68, 2017.
 - M.M. Drugan, M.A. Wiering, P. Vamplew and M. Chetty. Special issue on multi-objective reinforcement learning. Neurocomputing, 263, pp 1-2, 2017
 - L.H. Boulogne, B.J. Wolf, M.A. Wiering and S.M. van Netten. Performance of neural networks for localizing moving objects with an artificial lateral line. Bioinspiration & Biomimetics, 2017
 - P. Pawara, E. Okafor, L.R.B. Schomaker and  M.A. Wiering   Data Augmentation for Plant Classification  Advanced Concepts for Intelligent Vision Systems (Acivs), 2017
 - E. Okafor, R. Smit, L.R.B. Schomaker and  M.A. Wiering   Operational Data Augmentation in Classifying Single Aerial Images of Animals  IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA), 2017
 - K. Dijkstra, J. van de Loosdrecht, L.R.B. Schomaker and  M.A. Wiering   Hyper-spectral frequency selection for the classification of vegetation diseases European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), pages 483-488, 2017
 - M.H. van der Ree, J.B.T.M. Roerdink, C. Phillips, G. Garraux, E. Salmon and  M.A. Wiering   Support Vector Components Analysis European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), 2017
 - P. Pawara, E. Okafor, O. Surinta, L. Schomaker and M.A. Wiering   Comparing Local Descriptors and Bags of Visual Words to Deep Convolutional Neural Networks for Plant Recognition International Conference on Pattern Recognition Applications and Methods, 2017
 - M. Wagenaar, E. Okafor, W. Frencken and M.A. Wiering   Using Deep Convolutional Neural Networks to Predict Goal-Scoring Opportunities in Soccer   International Conference on Pattern Recognition Applications and Methods, 2017
 - R. Elderman, L.J.J. Pater, A.S. Thie, M.M. Drugan and M.A. Wiering   Adversarial Reinforcement Learning in a Cyber Security Simulation  International Conference on Agents and Artificial Intelligence, 2017
 - A. Shantia, F. Bidoia, L. Schomaker and M.A. Wiering   Dynamic Parameter Update for Robot Navigation Systems through Unsupervised Environmental Situational Analysis  Symposium Series on Computational Intelligence (IEEE-SSCI), Athens, 2016. BEST PAPER of SSCI-2016 
 - E. Okafor, P. Pawara, F. Karaaba, O. Surinta, V. Codreanu, L. Schomaker and M.A. Wiering   Comparative Study Between Deep Learning and
Bag of Visual Words for Wild-Animal Recognition Symposium Series on Computational Intelligence (IEEE-SSCI), Athens, 2016.
 - A. Tijsma, M.M. Drugan and M.A. Wiering   Comparing Exploration Strategies for Q-learning in Random Stochastic Mazes Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-2016), Athens, 2016.
 - M. Pieters and M.A. Wiering   Q-learning with Experience Replay in a Dynamic Environment Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-2016), Athens, 2016.
 - M. Schutten and M.A. Wiering  An Analysis on Better Testing than Training Performances on the Iris Dataset Belgian-Dutch Artificial Intelligence Conference, BNAIC 2016, Amsterdam, 2016.
 - J.L. Maas, E. Okafor and M.A. Wiering  The Dual Codebook:  Combining Bags of Visual Words in Image Classification Belgian-Dutch Artificial Intelligence Conference, BNAIC 2016, Amsterdam, 2016.
 - F.N. Martins, M. de Groot, X. Stokkel and M.A. Wiering  Human Detection and Classification of Landing Sites for Search and Rescue Drones. European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2016, Bruges, Belgium, 2016.
 - J. van de Wolfshaar, M.F. Karaaba and M.A. Wiering  Deep Convolutional Neural Networks and Support Vector Machines for Gender Recognition. IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (IEEE CIBIM'15), 2015.
 - F. Schimbinschi, L.R.B. Schomaker and M.A. Wiering  Ensemble Methods for Robust 3D Face Recognition Using Commodity Depth Sensors. IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (IEEE CIBIM'15), 2015.
 
 - M.F. Karaaba, O. Surinta, L.R.B. Schomaker and M.A. Wiering  Robust Face Recognition by Computing Distances from Multiple Histograms of Oriented Gradients. IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (IEEE CIBIM'15), 2015.
 
 - M. van de Steeg, M.M. Drugan and M.A. Wiering  Temporal Difference Learning for the Game Tic-Tac-Toe 3D: Applying Structure to Neural Networks. IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (IEEE ADPRL'15), 2015.
 - O. Surinta, M.F. Karaaba, T.K. Mishra, L.R.B. Schomaker and M.A. Wiering  Recognizing Handwritten Characters with Local Descriptors and Bags of Visual Words. 16th International Conference on Engineering Applications of Neural Networks (EANN), 2015.
 -  S. He, M.A. Wiering and L.R.B. Schomaker
 Junction detection in handwritten documents and its application to writer identification. Pattern Recognition, 2015.
 -  O. Surinta, M.F. Karaaba, L.R.B. Schomaker and M.A. Wiering  Recognition of handwritten characters using local gradient feature descriptors. Engineering Applications of Artificial Intelligence, 45, pp. 405-414, 2015.
 - A. Shantia, R. Timmers, L.R.B. Schomaker and M.A. Wiering  Indoor Localization by Denoising Autoencoders and Semi-supervised Learning in 3D Simulated Environment. International Joint Conference on Neural Networks (IJCNN), 2015.
 - S. Jansen, A. Shantia and M.A. Wiering The Neural-SIFT Feature Descriptor for Visual Vocabulary Object Recognition. International Joint Conference on Neural Networks (IJCNN), 2015.
 - M.F. Karaaba, O. Surinta, L.R.B. Schomaker and M.A. Wiering  In-Plane Rotational Alignment of Faces by Eye and Eye-Pair Detection. 11th International Conference on Computer Vision Theory and Applications (VISAPP), 2015.
 - M.A. Wiering, M. Withagen and M.M. Drugan  Model-Based Multi-Objective Reinforcement Learning. IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014.
 - M.F. Karaaba, L.R.B. Schomaker, and M.A. Wiering.  Machine Learning for multi-view eye-pair detection. In Engineering Applications of Artificial Intelligence, 33: 69-79,  2014.
 - V. Codreanu, B. Droge, D. Williams, B. Yasar, P. Yang, B. Liu, F. Dong, O. Surinta,  L.R.B. Schomaker, J.B.T.M. Roerdink and M.A. Wiering.   Evaluating automatically parallelized versions of the support vector machine. In Concurrency and Computation: Practice and Experience 10, 2014
 - O. Surinta, M. Holtkamp, F. Karaaba, J-P. van Oosten, L.R.B. Schomaker and M.A. Wiering  A* Path Planning for Line Segmentation of Handwritten Documents. International Conference on Frontiers in Handwritten Recognition (ICFHR), 2014.
 - Z. Sun, M.A. Wiering, and N. Petkov  Classification System for Mortgage Arrear Management. IEEE Computational Intelligence for Financial Engineering and Economics (CIFER), pp 478-496, 2014.
 - M.A. Wiering and L.R.B. Schomaker Multi-Layer Support Vector Machines  
In book: Regularization, Optimization, Kernels, and Support Vector Machines, Edition: CRC Machine Learning and Pattern Recognition Series, Publisher: Chapman & Hall, Editors: Johan A.K. Suykens, Marco Signoretto, Andreas Argyriou, 2013 
 - M.A. Wiering, M.H. van der Ree, M.J. Embrechts, M.F. Stollenga, A. Meijster, A. Nolte and L.R.B. Schomaker The Neural Support Vector Machine. The 25th Benelux Artificial Intelligence Conference (BNAIC), 2013. (BEST PAPER AWARD)
 - O. Surinta, L. Schomaker, and M.A. Wiering.  A Comparison of Feature and Pixel-based Methods for Recognizing Handwritten Bangla Digits. The Twelfth International Conference on Document Analysis and Recognition (ICDAR), 2013.
 - M.A. Wiering, M. Schutten, A. Millea, A. Meijster, and L. Schomaker.  Deep Support Vector Machines for Regression Problems.  International Workshop on Advances in Regularization, Optimization, Kernel Methods, and Support Vector Machines: theory and applications, Leuven Belgium, pages 53-54, 2013.
 - F. Puglierin, M.M. Drugan and M.A. Wiering.  Bandit-Inspired Memetic Algorithms for Solving Quadratic Assignment Problems.  Proceedings of IEEE International Conference on Evolutionary Computation (CEC'13), Cancun Mexico, 2013.
 - L. Bom, R. Henken and M.A. Wiering.  Reinforcement Learning to Train Ms. Pac-Man Using Higher-order Action-relative Inputs.  Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Singapore, 2013.
 - M. van der Ree and M.A. Wiering.  Reinforcement Learning in the Game of Othello: Learning Against a Fixed Opponent and Learning from Self-Play. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Singapore, 2013.
 - S. van den Dries and M.A. Wiering. Neural-Fitted TD-Leaf learning for Playing Othello with Structured Neural Networks. IEEE Journal of Transactions on Neural Networks and Learning Systems, Volume 23(11), pages: 1701-1713, 2012.
 - F. Schimbinschi, M.A. Wiering, R.E. Mohan, and J. K. Sheba.  4D Unconstrained Real-time Face Recognition Using a Commodity Depth Camera, 7th IEEE Conference on Industrial Electronics and Applications (ICIEA), 2012.
 - O. Surinta, L. Schomaker and M.A. Wiering.  Handwritten Character Classification Using the Hotspot Feature Extraction Technique, International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2012.
 - M.A. Wiering  and M. van Otterlo, editors. Reinforcement Learning: State of the Art.  Springer Verlag Berlin Heidelberg. isbn: 978-3-642-27644-6. doi: 10.1007/978-3-642-27645-3, 2012.
 - H. van Hoof, T. van der Zant, and M.A. Wiering.  Adaptive Visual Face Tracking for an Autonomous Robot.
BNAIC'11: Belgian Dutch Artificial Intelligence Conference, 2011.
 - A.D. Pietersma, L. Schomaker, and M.A. Wiering.  Kernel Learning in Support Vector Machines using Dual-Objective Optimization.
BNAIC'11: Belgian Dutch Artificial Intelligence Conference, 2011.
 -  H. van Seijen, S. Whiteson, H. van Hasselt, M.A. Wiering.  Exploiting Best-Match Equations for Efficient Reinforcement Learning. Journal of Machine Learning Research (JMLR), 12, 2045-2094, 2011.
 - J. de Vries, I. Hooge, M.A. Wiering, F. Verstraten. How longer saccade latencies lead to a competition for salience.  Psychological Science, 2011.
 - A. Shantia, E. Begue, M.A. Wiering.  Connectionist Reinforcement Learning for Intelligent Unit Micro
Management in StarCraft.  International Joint Conference on Neural Networks, 2011.
 - J. de Vries, I. Hooge, M.A. Wiering, F. Verstraten. Saccadic selection and crowding in visual search: Stronger lateral masking leads to shorter search times. Experimental Brain Research, 2011.
 - M.A. Wiering, H. van Hasselt, A.D. Pietersma, L. Schomaker.  Reinforcement Learning Algorithms for solving Classification Problems.  
Proceedings of IEEE International Symposium on
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Paris, 2011.
 - A. Abdullah, R.C. Veltkamp and M.A. Wiering.  Ensembles of Novel Visual Keywords Descriptors for Image Categorization. ICARV, 2010.
 -  M.A. Wiering.  Zelflerende verkeerslichtnetwerken, BLIND (interdisciplinair tijdschrift), special issue about networks, 2010.
 -  T.P. Schmidt, M.A. Wiering, A.C. van Rossum, R.A.J. van Elburg, T.C. Andringa, B. Valkenier. Robust Real-Time Vowel Classification with an Echo State Network, Workshop on "Cognitive and neural models for automated processing of speech and text" 2010 (CONAS).
 - M.A. Wiering and T. Kooi.  Region Enhanced Neural Q-learning for Solving Model-based POMDPs. International Joint Conference on Neural Networks, 2010.
 - M.A. Wiering.  Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning. Journal of Intelligent Learning Systems and Applications, 2010, 2, pp: 57-68.
 - M.M. Drugan and M.A. Wiering. Feature selection for Bayesian Network Classifiers using the MDL-FS score,  International Journal of Approximate Reasoning, Elsevier, 2010.
 - A. Abdullah, R.C. Veltkamp, and M.A. Wiering. 
Fixed Partitioning and Salient Points with MPEG-7 Cluster Correlograms for Image Categorization.
Pattern Recognition, Volume 43, Issue 3, Pages 650-662, March 2010.
 - A. Abdullah, R.C. Veltkamp, and M.A. Wiering. An Ensemble of Deep Support Vector Machines for Image Categorization. Proceedings of the International Conference on Soft Computing and Pattern Recognition (SocPar), pp. 301-306, 2009.  BEST PAPER AWARD 
 - H. van Hasselt and Marco Wiering. Using Continuous Action Spaces to Solve Discrete Problems. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Atlanta, USA, 2009. 
 -  A. Abdullah, R.C. Veltkamp, and M.A. Wiering. Spatial Pyramids and Two-layer Stacking SVM classifiers for Image Categorization: A Comparative Study. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Atlanta, USA, 2009. 
 - M.A. Wiering and H. van Hasselt.   The QV Family Compared to Other Reinforcement Learning Algorithms.
Proceedings of IEEE International Symposium on
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Nashville,
USA, pp. 101-108, 2009.
 - H. van Seijen, H. van Hasselt, S. Whiteson, and M. Wiering. A Theoretical and Empirical Analysis of Expected Sarsa.  
In ADPRL 2009: Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp. 177-184, 2009.
 - M.A. Wiering and H. van Hasselt.
 Ensemble Algorithms in Reinforcement Learning. IEEE Transactions on
Systems, Man, and Cybernetics, Part B, Volume 38, 4, 930-936, 2008.
 - M. van Otterlo, M. Wiering, M. Dastani, and J-J. Meyer.
A Characterization of Sapient Agents,  In R.V. Mayorga and L. Perlovsky 
(Eds.) Toward Artificial Sapience, Principles and Methods for Wise Systems,
pp. 129-141. Berlin: Springer, 2008.
 - R. Opsomer, P. Knoth, F. van Polen, J. Trapman and M.A. Wiering.
Categorizing Children: Automated Text Classification of CHILDES files.
BNAIC'08: Proceedings of the 20th Belgium-Netherlands Conference on
Artificial Intelligence, A. Nijholt, M. Pantic, M. Poel and H. Hondorp (eds.), 
pp. 209-216, 2008.
 - L. Pape, J. de Gruijl, and M.A. Wiering, 2008. Democratic Liquid
State Machines for Music Recognition. In: Speech, Audio, Image and
Biomedical Signal Processing using Neural Networks, Bookseries:
Studies in Computational Intelligence, vol 83. B. Prasad and S.R.M. Prasanna
(Eds.), 2008.
 - L. Lefakis and M.A. Wiering. 
Semi-Supervised Methods for Handwritten Character Recognition using Active
Learning. 
BNAIC'07: Proceedings of the 19th Belgium-Netherlands Conference on
Artificial Intelligence, Mehdi Dastani and Edwin de Jong (eds.), pp. 205-212, 2007.
 - L. Pape, B.G. Ruessink, M.A. Wiering and I.L. Turner. 
Recurrent Neural Network Modeling of Nearshore Sandbar Behavior. Neural
Networks, Special Issue on Earth and Environmental Sciences, 20,
509-518, 2007.
 - Marco Wiering and Edwin D. de Jong. 
Computing Optimal Stationary Policies for Multi-objective Markov Decision
Processes.  
Proceedings of IEEE International Symposium on
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Honolulu,
HI, USA, pp. 158-165, 2007.
 - Azizi Abdullah and Marco Wiering. 
CIREC: Cluster Correlogram Image Retrieval and Categorization using MPEG-7
Descriptors. 
Proceedings of IEEE International Symposium on
Computational Intelligence in Image and Signal Processing (CIISP), Honolulu,
HI, USA, pp. 431-437, 2007.
 - Hado van Hasselt and Marco Wiering. 
Reinforcement Learning in Continuous Action Spaces. 
Proceedings of IEEE International Symposium on
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Honolulu,
HI, USA, pp. 272-279, 2007.
 - Hado van Hasselt and Marco Wiering. 
Convergence of Model-Based Temporal Difference Learning for Control. 
Proceedings of IEEE International Symposium on Adaptive Dynamic Programming
and Reinforcement Learning (ADPRL), Honolulu, HI, USA, pp. 60-67, 2007.
 -  Marco Wiering and Hado van Hasselt. 
Two Novel On-policy Reinforcement Learning Algorithms based on
TD(lambda)-methods.
  Proceedings of IEEE International Symposium on Adaptive Dynamic
  Programming and Reinforcement Learning (ADPRL), Honolulu, HI, USA, pp.
  280-287, 2007.
 -  Wilco Moerman, Bram Bakker and Marco Wiering. 
Hierarchical Assignment of Behaviors to Subpolicies.
  NIPS'2007 workshop on Hierarchical Organization of Behavior: Computational, Psychological and Neural Perspectives, 2007.  
 -  Tijn van der Zant, Lambert Schomaker, Marco Wiering, Axel Brink. 
Cognitive Developmental Pattern Recognition: Learning to Learn
Proceedings of the IEEE International Conference on Systems, Man and
Cybernetics, pp. 1208-1213, 2006
 - L. Pape, B.G. Ruessink, M.A. Wiering and I.L. Turner, 
Neural network modeling of nearshore sandbar behavior.
Proceedings of the 2006 International Joint Conference on Neural Networks,
Vancouver, Canada, pp. 8735-8742, 2006.
 - W. de Back, E.D. de Jong, and M.A. Wiering.
Red Queen Dynamics in a predator-prey ecosystem
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-06),
Maarten Keijzer et al. editors, pp. 381-382, 2006
 - J. van Diggelen, E.D. de Jong, and M.A. Wiering.
Strategies for Ontology Negotiation: Finding the Right Level of Generality
International workshop on agent communication, held with AAMAS'06, 2006 
 - J.R. de Gruijl and M.A. Wiering.
Musical Instrument Classification using Democratic Liquid State Machines
Benelearn'06: Proceedings of the 15th Belgian-Dutch Conference on Machine
Learning, pp. 33-40, edited by Y. Saeys, E. Tsiporkova, B. De Baets, and
Y. Van de Peer,  2006 
 - L. Zwanepol Klinkmeijer, E.D. de Jong, and M.A. Wiering.
A Serial Population Genetic Algorithm for Dynamic Optimization Problems  
Benelearn'06: Proceedings of the 15th Belgian-Dutch Conference on Machine
Learning, pp. 41-48, edited by Y. Saeys, E. Tsiporkova, B. De Baets, and
Y. Van de Peer,  2006 
 - M. Wiering, J.P. Patist, and H. Mannen
Learning to Play Board Games using Temporal Difference Methods.
Technical Report, Utrecht University, UU-CS-2005-048, 30 pages, 2005.
 - M. Wiering.
QV(lambda)-learning: A New On-policy Reinforcement Learning Algorithm.
Proceedings of the 7th European Workshop on Reinforcement Learning,
D. Leone (editor), pages 17-18, 2005.
 - M. Wiering.
Comparing Training Paradigms for Learning to Play Backgammon 
Proceedings of the 7th European Workshop on Reinforcement Learning,
D. Leone (editor), pages 29-30, 2005.
 -  T. van der Zant, M. Wiering, and J. van Eijck.
On-line robot learning using the interval estimation algorithm
Proceedings of the 7th European Workshop on Reinforcement Learning,
D. Leone (editor), pages 11-12, 2005.
 -  S. Maas, M. Wiering, and B. Verhaar.
Reinforcement Learning of a Pneumatic Robot Arm Controller
Proceedings of the 7th European Workshop on Reinforcement Learning,
D. Leone (editor), pages 23-24, 2005.
 -  H. van Kuilenburg, M. Wiering, and M. den Uyl.
A Model Based Method for Automatic Facial Expression Recognition.
Proceedings of the 
16th European Conference on Machine Learning (ECML'05), J. Gama et al. (eds),
Springer-Verlag Berlin Heidelberg, pages 194-205, 2005.
 -  R.R. Negenborn, B. De Schutter, M.A. Wiering, and H. Hellendoorn. Learning-based
model predictive control for Markov decision processes. 
Proceedings of the 16th IFAC World Congress, Prague, Czech Republic, July
2005. 
 - M. Wiering, F. Mignogna, and B. Maassen.
Evolving Neural Networks for Forest Fire Control  
Benelearn'05: Proceedings of the 14th Belgian-Dutch Conference on Machine
Learning, pages 113 - 120, edited by M. van Otterlo, M. Poel, and A. Nijholt, 2005 
 - M. Sindlar and M. Wiering. 
A Modular Approach to Facial Expression Recognition 
Benelearn'05: Proceedings of the 14th Belgian-Dutch Conference on Machine
Learning, pages 81 - 88, edited by M. van Otterlo, M. Poel, and A. Nijholt, 2005 
 -  R.R. Negenborn, B. De Schutter, M.A. Wiering, and H. Hellendoorn. Learning-based
model predictive control for Markov decision processes. Tech. rep. 04-021,
Delft Center for Systems and Control, Delft University of Technology, Delft,
The Netherlands, Sept. 2004. 
 - R.R. Negenborn, B. De Schutter, M.A. Wiering, and J. Hellendoorn, 
Experience-based model predictive control using reinforcement 
learning. Proceedings of the 8th TRAIL Congress 2004 - A World of 
Transport, Infrastructure and Logistics - CD-ROM, Rotterdam, The 
Netherlands, Nov. 2004.
 - M. Wiering. 
 Convergence and Divergence in Standard and Averaging Reinforcement Learning
 Proceedings of
the 15th European Conference on Machine Learning (ECML'04), edited by J-F Boulicaut, F. Esposito, F. Giannotti, and D. Pedreschi, pp. 477-488, Springer-Verlag Berlin Heidelberg, 2004.
 - M. Wiering, S. Leijnen, A. Koster, S. van Weers, and W. de Back.
 
Autonomous Intelligent Robots at Utrecht University. International
Journal of Advanced Robotic Systems, 1(2), pages 125-128, 2004.
 - M. Wiering, J. van Veenen, J. Vreeken, and A. Koopman.
 
Intelligent Traffic Light Control.  Technical Report UU-CS-2004-029,
University Utrecht, 2004.
 - D. Wierstra and M. Wiering. 
 
Utile Distinction Hidden Markov Models.  Proceedings of
the Twenty-first International Conference on Machine Learning (ICML'04), pp. 855-862, ACM Press, 2004.
 - M. Wiering, J. Vreeken, J. van Veenen, and A. Koopman.
 Simulation and Optimization of Traffic in a City.  
IEEE Intelligent Vehicles symposium (IV'04), 2004.
 - H. Mannen and M. Wiering. 
 
Learning to play chess using TD(lambda)-learning with database games 
Benelearn'04: Proceedings of the Thirteenth Belgian-Dutch Conference on Machine
Learning, edited by A. Nowe, T. Lenaerts, and K. Steenhout, pp.72-79, 2004 
 - J-P. Patist and M. Wiering. 
 
Learning to play draughts using temporal difference learning with neural
networks and databases
Benelearn'04: Proceedings of the Thirteenth Belgian-Dutch Conference on Machine
Learning, edited by A. Nowe, T. Lenaerts, and K. Steenhout, pp.87-94, 2004 
 - M. Wiering. 
 
Memory-based Memetic Algorithms
Benelearn'04: Proceedings of the Thirteenth Belgian-Dutch Conference on Machine
Learning, edited by A. Nowe, T. Lenaerts, and K. Steenhout, pp.191-198, 2004 
 - M. van Otterlo, M. Wiering, M. Dastani, and J-J. Meyer.
 
A Characterization of Sapient Agents, 
First International Conference on Integration of Knowledge Intensive 
Multi-Agent Systems (KIMAS-03), edited by H. Hexmoor, IEEE Press, Boston, MA,
pages 172-177, 2003.
 - M. Wiering. 
 
Hierarchical Mixtures of Naive Bayesian Classifiers.
European Conference on Machine Learning (ECML'2003) Workshop on 
Probabilistic Graphical Models for Classification, edited by
P. Larranaga, J.A. Lozano, J.M. Pena, and I. Inza, pages 93-104, 2003.
 - M. Wiering and F. Mignogna. 
 
Learning to Control Forest Fires with ESP.  
Proceedings of the Sixth European Workshop on Reinforcement Learning, edited by
Alain Dutech and Olivier Buffet, pp. 22-23, 2003. 
 - M. Wiering. Intelligent Traffic Light Control. ERCIM News Special:
Cognitive Systems, 53, pp. 40-41, 2003. 
 - M. Wiering.
 
Evolving Causal Neural Networks.
Benelearn'02: Proceedings of the Twelfth Belgian-Dutch Conference on Machine
Learning, edited by Marco Wiering, pp. 103-108, 2002 
 - M. Wiering.
 
Hierarchical Mixtures of Naive Bayesian Classifiers.
BNAIC'02: Proceedings of the Fourteenth Belgium-Netherlands Conference on
Artificial Intelligence, Hendrik Blockeel and Marc Denecker (eds.), pp. 363-370, 2002. 
 - M. Wiering.
 
Model-based Reinforcement Learning in Dynamic Environments.
Technical Report CS-UU-2002-029, Utrecht University, 2002.
 - S. Reynolds and M. Wiering.
 
Fast Q(lambda) revisited.
Technical Report CSRP-02-2, University of Birmingham, School of Computer
Science, 2002.
 - M. Wiering.
 
Hierarchical Mixtures of Naive Bayes Classifiers.
Technical Report CS-UU-2002-003, Utrecht University, 2002.
 - J. de Jong and M. Wiering.
 
Multiple Ant Colony Systems for the Busstop Allocation Problem.
BNAIC'01: Proceedings of the Thirteenth Belgium-Netherlands Conference on
Artificial Intelligence, pp. 141-148, 2001. 
 - Marco Wiering, R.P. Salustowicz, and Juergen Schmidhuber. 
 
Model-based Reinforcement Learning for Evolving Soccer Strategies. 
In Computational Intelligence in Games, chapter 5. Editors N. Baba and L.
Jain. pp. 99-131, 2001.
 - Marco Wiering.
 
Reinforcement Learning in Dynamic Environments using Instantiated Information.
Machine Learning: Proceedings of the Eighteenth International Conference
(ICML'2001), pp. 585-592, 2001. 
 -  K. ten Tusscher, S. ten Hagen and M. Wiering
 
The influence of communication on the choice to behave cooperatively.
Proceedings of the Tenth Belgian-Dutch Conference on Machine Learning.
Editor Ad Feelders. pp. 39-46, 2000.
 - Marco Wiering.
 
Multi-Agent Reinforcement Learning for Traffic Light control.
Machine Learning: Proceedings of the Seventeenth International Conference
(ICML'2000), pp. 1151-1158, 2000. 
 - Marco Wiering, Ben Krose, and Frans Groen.
 
Learning in Multi-Agent Systems. Technical Report, University of
Amsterdam, 1999.
 - Marco Wiering, R.P. Salustowicz, and Juergen Schmidhuber. 
 
Reinforcement Learning Soccer Teams with Incomplete World Models.
Neural Networks for Robot Learning. Special issue of Autonomous Robots,
Vol 7(1), pp. 77-88, 1999.
 - Marco Wiering. 
 
Explorations in Efficient Reinforcement Learning.
Ph.D. thesis, February 1999.
 - Marco Wiering, R.P. Salustowicz, and Juergen Schmidhuber. 
CMAC models learn to play soccer.
Proceedings of the 8th International Conference on Artificial
Neural Networks (ICANN'98), L. Niklasson, M. Boden and T. Ziemke (eds.),
pp. 443-448, Springer-Verlag, London, 1998.
 -  Marco Wiering and Marco Dorigo.
Learning to Control Forest Fires. Umweltinformatik'98: Vernetzte
Strukturen in Informatik, Umwelt und Wirtschaft, Proceedings des 12.
Internationalen Symposiums 'Informatik für den Umweltschutz', H.-D. Haasis, K.C.
Ranze (eds.), pp. 378-388, 1998.
 - Marco Wiering and Juergen Schmidhuber. 
 
Efficient Model-Based Exploration.
Proceedings of the Fifth International Conference on Simulation of
Adaptive Behavior (SAB'98): From Animals to Animats 5,
R. Pfeifer, B. Blumberg, J. A. Meyer and S. W. Wilson (eds.), pp. 223-228,
MIT Press/Bradford Books, 1998.
 - Marco Wiering and Juergen Schmidhuber. 
 
Learning Exploration Policies with Models.
 Conference on Automated Learning and Discovery (CONALD'98), 1998. 
 - Marco Wiering and Juergen Schmidhuber. 
 Fast Online Q(lambda).
 Machine Learning, 33(1), 105-115, 1998. 
 - Marco Wiering and Juergen Schmidhuber. 
 Speeding Up Q(lambda)-learning. In Proceedings of the Tenth European
Conference on Machine Learning (ECML'98), pp. 352-363, 1998. (13 pages).
 - Marco Wiering and Juergen Schmidhuber.  HQ-Learning. 
Adaptive Behavior, 6:2, 219-246, 1997. 
 -  Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber.
 Learning Team Strategies: Soccer Case Studies.
 Machine Learning, 33(2/3), 1-19, 1998.
 
        
 -  Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber.
 Evolving soccer strategies.
 In N. Kasabov, R. Kozma, K. Ko, R. O'Shea, G. Coghill, and T. Gedeon,
 editors, Progress in Connectionist-based Information Systems:
 Proceedings of the Fourth International Conference on Neural
 Information Processing ICONIP'97, volume 1, 502-505, 1997.
        
 -  Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber.
 On learning soccer strategies.
 In W. Gerstner, A. Germond, M. Hasler, and J.-D. Nicoud, editors,
 Proceedings of the Seventh International Conference on Artificial
 Neural Networks (ICANN'97), volume 1327 of Lecture Notes in
 Computer Science, 769-774. Springer-Verlag Berlin Heidelberg, 1997.
 
 - Juergen Schmidhuber, Jieyu Zhao and Marco Wiering.  Shifting Inductive Bias with Success-Story Algorithm, Adaptive Levin Search, and Incremental Self-Improvement. 
Machine Learning, 28:1, 105-130, 1997.
 - Marco Wiering and Juergen Schmidhuber.  HQ-Learning: Discovering Markovian Subgoals for
non-Markovian Reinforcement Learning.
Technical Report IDSIA-95-96, October 1996.
 - Juergen Schmidhuber, Jieyu Zhao and Marco Wiering.  Simple principles of metalearning.
Technical Report IDSIA-69-96, June 1996.
 -  Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber.
 Learning team strategies with multiple policy-sharing agents:
 A soccer case study.
 Technical Report IDSIA-29-97, IDSIA, Lugano, Switzerland, 20 pages, 1997.
 
       
 - Marco Wiering and Juergen Schmidhuber.  Solving POMDPs with Levin Search and EIRA.
Machine Learning: Proceedings of the Thirteenth International Conference,
pp. 534-542, 1996.
 - Marco Wiering.  TD Learning of Game Evaluation Functions with Hierarchical
   Neural Architectures.
   Master's thesis, University of Amsterdam, Holland, April 1995.
 - Marco Wiering and Ben Kroese.  TD Learning of Game Evaluation Functions with Hierarchies
of Adaptive Experts.  University of Amsterdam, Holland, April 1995.