Marco Wiering's publications page

Publications

N. Stolt Anso, A.O. Wiehe, M.M. Drugan and M.A. Wiering Deep Reinforcement Learning for Pellet Eating in Agar.io International Conference on Agents and Artificial Intelligence (ICAART), Prague, 2019
F. van Beers, A. Lindstrom, E. Okafor and M.A. Wiering Deep Neural Networks with Intersection over Union Loss for Binary Image Segmentation International Conference on Pattern Recognition Applications and Methods (ICPRAM), Prague, 2019
L. Boulogne, K. Dijkstra and M.A. Wiering Extra Domain Data Generation with Generative Adversarial Nets IEEE Symposium Series on Computational Intelligence (SSCI): Deep Learning (DL), 2018
J. Schilperoort, I. Mak, M.M. Drugan and M.A. Wiering Learning to Play Pac-Xon with Q-Learning and Two Double Q-Learning Variants IEEE Symposium Series on Computational Intelligence (SSCI): Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2018
R. Niel and M.A. Wiering Hierarchical Reinforcement Learning for Playing a Dynamic Dungeon Crawler Game IEEE Symposium Series on Computational Intelligence (SSCI): Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2018
M. Sabatelli, G. Louppe, P. Geurts, and M.A. Wiering Deep Quality-Value (DQV) Learning. arXiv:1810.00368 [stat.ML], September 2018
F. Dal Canton, V.M. Quinten, and M.A. Wiering Early Detection of Sepsis Induced Deterioration Using Machine Learning. Belgian-Dutch Conference on Machine Learning (Benelearn), 2018
M. Aslani, S. Seipel, M.S. Mesgari and M.A. Wiering Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown Tehran . Advanced Engineering Informatics, Vol 38, 639-655, 2018
K. Dijkstra, J. van de Loosdrecht, L.R.B. Schomaker and M.A. Wiering CentroidNet: A Deep Neural Network for Joint Object Localization and Counting. The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, September 2018
A.O. Wiehe, N. Stolt Anso, M.M. Drugan and M.A. Wiering Sampled Policy Gradient for Learning to Play the Game Agar.io. arXiv:1809.05763 [cs.AI], September 2018
M.A. Wiering Reinforcement learning: from methods to applications. Nieuw Archief voor Wiskunde, guest Eds. M. Wiering, J. Portegies and S. Bohte, vijfde serie, deel 19, nummer 3, pp 157-167, 2018
K. Dijkstra, J. van de Loosdrecht, L.R.B. Schomaker and M.A. Wiering Hyperspectral demosaicking and crosstalk correction using deep learning. Machine Vision and Applications, pp 1-21, 2018
E. Okafor, L.R.B. Schomaker and M.A. Wiering An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals Journal of Information and Telecommunication, pp 1-27, 2018 (best JIT paper 2018)
M. Aslani, S. Seipel and M.A. Wiering Continuous Residual Reinforcement Learning for Traffic Signal Control Optimization. Canadian Journal of Civil Engineering, 2018
B. Verheij and M. Wiering Artificial Intelligence: 29th Benelux Conference, BNAIC 2017, Groningen, The Netherlands, Revised Selected Paper, Springer Book, 2018.
M. Pieters and M.A. Wiering Comparing Generative Adversarial Network Techniques for Image Creation and Modification. arXiv:1803.09093 [cs.LG], March 2018
M. Pieters and M.A. Wiering Comparison of Machine Learning Techniques for Multi-label Genre Classification. Springer Book: Artificial Intelligence: 29th Benelux Conference, BNAIC 2017, Revised Selected Papers, Eds. B. Verheij and M.A. Wiering, 2018. DOI: 10.1007/978-3-319-76892-2\_11, 2018. BEST PAPER AWARD FROM BNAIC'2017 Conference
P. Ozkohen, J. Visser, M. van Otterlo and M.A. Wiering Learning to Play Donkey Kong Using Neural Networks and Reinforcement Learning. Springer Book: Artificial Intelligence, 29th Benelux Conference, BNAIC 2017, Revised Selected Papers. Eds. B. Verheij and M.A. Wiering, 2018. DOI: 10.1007/978-3-319-76892-2\_11, 2018.
M. Aslani, M. Saadi Mesgari, S. Seipel and M.A. Wiering Developing adaptive traffic signal control by actor-critic and direct exploration methods . Proceedings of the institution of civil engineers - Transport, pp: 1-25, 2018
S. Knegt, M.M. Drugan and M.A. Wiering Opponent Modelling in the Game of Tron using Reinforcement Learning. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART), 2018.
J. Groot Kormelink, M.M. Drugan and M.A. Wiering Exploration Methods for Connectionist Q-Learning in Bomberman. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART), 2018.
G. Leuenberger and M.A. Wiering Actor-Critic Reinforcement Learning with Neural Networks in Continuous Games. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART), 2018, BEST PAPER AWARD.
J. van de Wolfshaar, M.A. Wiering and L. Schomaker Deep Learning Policy Quantization. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART), 2018.
R. Niel, J. Krebbers, M.M. Drugan and M.A. Wiering Hierarchical Reinforcement Learning for Real-Time Strategy Games. Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART), 2018.
M. Sabatelli, F. Bidoia, V. Codreanu and M.A. Wiering Learning to Evaluate Chess Positions with Deep Neural Networks and Limited Lookahead. Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2018.
F. Bidoia, M. Sabatelli, A. Shantia, M.A. Wiering and L. Schomaker A Deep Convolutional Neural Network for Location Recognition and Geometry based Information. Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2018.
M. Aslani, M. Saadi Mesgari and M.A. Wiering Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events. Transportation Research Part C: Emerging Technologies, vol. 85, pp: 732-752, 2017
J. Hogervorst, E. Okafor and M.A. Wiering Deep Colorization for Facial Gender Recognition. Preproceedings of the 29th Benelux Conference on Artificial Intelligence (BNAIC'2017), pp: 317-325, 2017.
H. Maathuis, L. Boulogne, M.A. Wiering and A. Sterk Predicting Chaotic Time Series using Machine Learning Techniques. Preproceedings of the 29th Benelux Conference on Artificial Intelligence (BNAIC'2017), pp: 326-340, 2017.
M. Schutten, M.A. Wiering and P. MacDougall Balancing Imbalances: On using reinforcement learning to increase stability in smart electricity grids. Preproceedings of the 29th Benelux Conference on Artificial Intelligence (BNAIC'2017), pp: 423-424, 2017.
J.C. Forte. M.A. Wiering, H.R. Bouma, F. de Geus, A.H. Epema. Predicting long-term mortality with first week post-operative data after Coronary Artery Bypass Grafting using Machine Learning models. Proceedings of Machine Learning for Healthcare, Vol. 68, 2017.
M.M. Drugan, M.A. Wiering, P. Vamplew and M. Chetty. Special issue on multi-objective reinforcement learning. Neurocomputing, 263, pp 1-2, 2017
L.H. Boulogne, B.J. Wolf, M.A. Wiering and S.M. van Netten. Performance of neural networks for localizing moving objects with an artificial lateral line. Bioinspiration & Biomimetics, 2017
P. Pawara, E. Okafor, L.R.B. Schomaker and M.A. Wiering Data Augmentation for Plant Classification Advanced Concepts for Intelligent Vision Systems (Acivs), 2017
E. Okafor, R. Smit, L.R.B. Schomaker and M.A. Wiering Operational Data Augmentation in Classifying Single Aerial Images of Animals IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA), 2017
K. Dijkstra, J. van de Loosdrecht, L.R.B. Schomaker and M.A. Wiering Hyper-spectral frequency selection for the classification of vegetation diseases European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), pages 483-488, 2017
M.H. van der Ree, J.B.T.M. Roerdink, C. Phillips, G. Garraux, E. Salmon and M.A. Wiering Support Vector Components Analysis European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), 2017
P. Pawara, E. Okafor, O. Surinta, L. Schomaker and M.A. Wiering Comparing Local Descriptors and Bags of Visual Words to Deep Convolutional Neural Networks for Plant Recognition International Conference on Pattern Recognition Applications and Methods, 2017
M. Wagenaar, E. Okafor, W. Frencken and M.A. Wiering Using Deep Convolutional Neural Networks to Predict Goal-Scoring Opportunities in Soccer International Conference on Pattern Recognition Applications and Methods, 2017
R. Elderman, L.J.J. Pater, A.S. Thie, M.M. Drugan and M.A. Wiering Adversarial Reinforcement Learning in a Cyber Security Simulation International Conference on Agents and Artificial Intelligence, 2017
A. Shantia, F. Bidoia, L. Schomaker and M.A. Wiering Dynamic Parameter Update for Robot Navigation Systems through Unsupervised Environmental Situational Analysis Symposium Series on Computational Intelligence (IEEE-SSCI), Athens, 2016. BEST PAPER of SSCI-2016
E. Okafor, P. Pawara, F. Karaaba, O. Surinta, V. Codreanu, L. Schomaker and M.A. Wiering Comparative Study Between Deep Learning and Bag of Visual Words for Wild-Animal Recognition Symposium Series on Computational Intelligence (IEEE-SSCI), Athens, 2016.
A. Tijsma, M.M. Drugan and M.A. Wiering Comparing Exploration Strategies for Q-learning in Random Stochastic Mazes Adaptive Dynamic Programming and Reinrforcement Learning (ADPRL-2016), Athens, 2016.
M. Pieters and M.A. Wiering Q-learning with Experience Replay in a Dynamic Environment Adaptive Dynamic Programming and Reinrforcement Learning (ADPRL-2016), Athens, 2016.
M. Schutten and M.A. Wiering An Analysis on Better Testing than Training Performances on the Iris Dataset Belgian-Dutch Artificical Intelligence Conference, BNAIC 2016, Amsterdam, 2016.
J.L. Maas, E. Okafor and M.A. Wiering The Dual Codebook: Combining Bags of Visual Words in Image Classification Belgian-Dutch Artificical Intelligence Conference, BNAIC 2016, Amsterdam, 2016.
F.N. Martins, M. de Groot, X. Stokkel and M.A. Wiering Human Detection and Classification of Landing Sites for Search and Rescue Drones.European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2016, Bruges Belgium, 2016.
J. van de Wolfshaar, M.F. Karaaba and M.A. Wiering Deep Convolutional Neural Networks and Support Vector Machines for Gender Recognition .IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (IEEE CIBIM'15), 2015.
F. Schimbinschi, L.R.B. Schomaker and M.A. Wiering Ensemble Methods for Robust 3D Face Recognition Using Commodity Depth Sensors .IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (IEEE CIBIM'15), 2015
M.F. Karaaba, O. Surinta, L.R.B. Schomaker and M.A. Wiering Robust Face Recognition by Computing Distances from Multiple Histograms of Oriented Gradients .IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (IEEE CIBIM'15), 2015
M. van de Steeg, M.M. Drugan and M.A. Wiering Temporal Difference Learning for the Game Tic-Tac-Toe 3D: Applying Structure to Neural Networks . IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (IEEE ADPRL'15), 2015.
O. Surinta, M.F. Karaaba, T.K. Mishra, L.R.B. Schomaker and M.A. Wiering Recognizing Handwritten Characters with Local Descriptors and Bags of Visual Words . 16th International Conference on Engineering Applications of Neural Networks (EANN), 2015.
S. He, M.A. Wiering and L.R.B. Schomaker Junction detection in handwritten documents and its application to writer identification. Pattern Recognition, 2015.
O. Surinta, M.F. Karaaba, L.R.B. Schomaker and M.A. Wiering Recognition of handwritten characters using local gradient feature descriptors. in Engineering Applications of Artificial Intelligence, (45)2015, pp. 405-414
A. Shantia, R. Timmers, L.R.B. Schomaker and M.A. Wiering Indoor Localization by Denoising Autoencoders and Semi-supervised Learning in 3D Simulated Environment. International Joint Conference on Neural Networks (IJCNN), 2015.
S. Jansen, A. Shantia and M.A. Wiering The Neural-SIFT Feature Descriptor for Visual Vocabulary Object Recognition . International Joint Conference on Neural Networks (IJCNN), 2015.
M.F. Karaaba, O. Surinta, L.R.B. Schomaker and M.A. Wiering In-Plane Rotational Alignment of Faces by Eye and Eye-Pair Detection . 11th International Conference on Computer Vision Theory and Applications (VISAPP), 2015.
M.A. Wiering, M. Withagen and M.M. Drugan Model-Based Multi-Objective Reinforcement Learning. IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014.
M.F. Karaaba, L.R.B. Schomaker, and M.A. Wiering. Machine Learning for multi-view eye-pair detection. In Engineering Applications of Artificial Intelligence, 33: 69-79, 2014.
V. Codreanu, B. Droge, D. Williams, B. Yasar, P. Yang, B. Liu, F. Dong, O. Surinta, L.R.B. Schomaker, J.B.T.M. Roerdink and M.A. Wiering. Evaluating automatically parallelized versions of the support vector machine . In Concurrency and Computation Practice and Experience 10, 2014
O. Surinta, M. Holtkamp, F. Karaaba, J-P. van Oosten, L.R.B. Schomaker and M.A. Wiering A* Path Planning for Line Segmentation of Handwritten Documents. International Conference on Frontiers in Handwritten Recognition (ICFHR), 2014.
Z. Sun, M.A. Wiering, and N. Petkov Classification System for Mortgage Arrear Management. IEEE Computational Intelligence for Financial Engineering and Economics (CIFER), pp 478-496, 2014.
M.A. Wiering and L.R.B. Schomaker Multi-Layer Support Vector Machines In book: Regularization, Optimization, Kernels, and Support Vector Machines, Edition: CRC Machine Learning and Pattern Recognition Series, Publisher: Chapman & Hall, Editors: Johan A.K. Suykens, Marco Signoretto, Andreas Argyriou, 2013
M.A. Wiering, M.H. van der Ree, M.J. Embrechts, M.F. Stollenga, A. Meijster, A. Nolte and L.R.B. Schomaker The Neural Support Vector Machine . The 25th Benelux Artificial Intelligence Conference (BNAIC), 2013. (BEST PAPER AWARD)
O. Surinta, L. Schomaker, and M.A. Wiering. A Comparison of Feature and Pixel-based Methods for Recognizing Handwritten Bangla Digits. The Twelfth International Conference on Document Analysis and Recognition (ICDAR), 2013.
M.A. Wiering, M. Schutten, A. Millea, A. Meijster, and L. Schomaker. Deep Support Vector Machines for Regression Problems. International Workshop on Advances in Regularization, Optimization, Kernel Methods, and Support Vector Machines: theory and applications, Leuven Belgium, pages 53-54, 2013.
F. Puglierin, M.M. Drugan and M.A. Wiering. Bandit-Inspired Memetic Algorithms for Solving Quadratic Assignment Problems. Proceedings of IEEE International Conference on Evolutionary Computation (CEC'13), Cancun Mexico, 2013.
L. Bom, R. Henken and M.A. Wiering. Reinforcement Learning to Train Ms. Pac-Man Using Higher-order Action-relative Inputs. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Singapore, 2013.
M. van der Ree and M.A. Wiering. Reinforcement Learning in the Game of Othello: Learning Against a Fixed Opponent and Learning from Self-Play. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Singapore, 2013.
S. van den Dries and M.A. Wiering. Neural-Fitted TD-Leaf learning for Playing Othello with Structured Neural Networks. IEEE Journal of Transactions on Neural Networks and Learning Systems, Volume 23(11), pages: 1701-1713, 2012.
F. Schimbinschi, M.A. Wiering, R.E. Mohan, and J. K. Sheba. 4D Unconstrained Real-time Face Recognition Using a Commodity Depthh Camera, 7th IEEE Conference on Industrial Electronics and Applications (ICIEA), 2012.
O. Surinta, L. Schomaker and M.A. Wiering. Handwritten Character Classification Using the Hotspot Feature Extraction Technqiue, International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2012.
M.A. Wiering and M. van Otterlo, editors. Reinforcement Learning: State of the Art. Springer Verlag Berlin Heidelberg. isbn: 978-3-642-27644-6. doi: 10.1007/978-3-642-27645-3, 2012.
H. van Hoof, T. van der Zant, and M.A. Wiering. Adaptive Visual Face Tracking for an Autonomous Robot. BNAIC'11: Belgian Dutch Artificial Intelligence Conference, 2011.
A.D. Pietersma, L. Schomaker, and M.A. Wiering. Kernel Learning in Support Vector Machines using Dual-Objective Optimization. BNAIC'11: Belgian Dutch Artificial Intelligence Conference, 2011.
H. van Seijen, S. Whiteson, H. van Hasselt, M.A. Wiering. Exploiting Best-Match Equations for Efficient Reinforcement Learning. Journal of Machine Learning Research (JMLR), 12, 2045-2094, 2011.
J. de Vries, I. Hooge, M.A. Wiering, F. Verstraten. How longer saccade latencies lead to a competition for salience. Psychological Science, 2011.
A. Shantia, E. Begue, M.A. Wiering. Connectionist Reinforcement Learning for Intelligent Unit Micro Management in StarCraft. International Joint Conference on Neural Networks, 2011.
J. de Vries, I. Hooge, M.A. Wiering, F. Verstraten. Saccadic selection and crowding in visual search: Stronger lateral masking leads to shorter search times. Experimental Brain Research, 2011.
M.A. Wiering, H. van Hasselt, A.D. Pietersma, L. Schomaker. Reinforcement Learning Algorithms for solving Classification Problems. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Paris, 2011,
A. Abdullah, R.C. Veltkamp and M.A. Wiering. Ensembles of Novel Visual Keywords Descriptors for Image Categorization . ICARV, 2010.
M.A. Wiering. Zelflerende verkeerslichtnetwerken, BLIND (interdisciplinair tijdschrift), special issue about networks, 2010.
T.P. Schmidt, M.A. Wiering, A.C. van Rossum, R.A.J. van Elburg, T.C. Andringa, B. Valkenier. Robust Real-Time Vowel Classification with an Echo State Network, Workshop on "Cognitive and neural models for automated processing of speech and text" 2010 (CONAS).
M.A. Wiering and T. Kooi. Region Enhanced Neural Q-learning for Solving Model-based POMDPs. International Joint Conference on Neural Networks, 2010.
M.A. Wiering. Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning . Journal of Intelligent Learning Systems and Applications, 2010, 2, pp: 57-68.
M.M. Drugan and M.A. Wiering. Feature selection for Bayesian Network Classifiers using the MDL-FS score, International Journal of Approximate Reasoning, Elsevier, 2010.
A. Abdullah, R.C. Veltkamp, and M.A. Wiering. Fixed Partitioning and Salient Points with MPEG-7 Cluster Correlograms for Image Categorization. Pattern Recognition, Volume 43. Issue 3, Pages 650-662, March 2010.
A. Abdullah, R.C. Veltkamp, and M.A. Wiering. An Ensemble of Deep Support Vector Machines for Image Categorization. Proceedings of the International Conference on Soft Computing and Pattern Recognition (SocPar), pp. 301-306, 2009. BEST PAPER AWARD
H. van Hasselt and Marco Wiering. Using Continuous Action Spaces to Solve Discrete Problems. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Atlanta, USA, 2009.
A. Abdullah, R.C. Veltkamp, and M.A. Wiering. Spatial Pyramids and Two-layer Stacking SVM classifiers for Image Categorization: A Comparative Study. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Atlanta, USA, 2009.
M.A. Wiering and H. van Hasselt. The QV Family Compared to Other Reinforcement Learning Algorithms. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Nashville, USA, pp. 101-108, 2009.
H. van Seijen, H. van Hasselt, S. Whiteson, and M. Wiering. A Theoretical and Empirical Analysis of Expected Sarsa. In ADPRL 2009: Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp. 177-184, 2009.
M.A. Wiering and H. van Hasselt. Ensemble Algorithms in Reinforcement Learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B, Volume 38, 4, 930-936, 2008.
M. van Otterlo, M. Wiering, M. Dastani, and J-J. Meyer. A Characterization of Sapient Agents, In R.V. Mayorga and L. Perlovsky (Eds.) Toward Artificial Sapience, Principles and Methods for Wise Systems, pp. 129-141. Berlin: Springer, 2008.
R. Opsomer, P. Knoth, F. van Polen, J. Trapman and M.A. Wiering. Categorizing Children: Automated Text Classification of CHILDES files. BNAIC'08: Proceedings of the 20 Belgium-Netherlands Conference on Artificial Intelligence, A. Nijholt, M. Pantic, M. Poel and H. Hondorp (eds.), pp. 209-216, 2008.
L. Pape, J. de Gruijl, and M.A. Wiering, 2008. Democratic Liquid State Machines for Music Recognition. In: Speech, Audio, Image and Biomedical Signal Processing using Neural Networks, Bookseries: Studies in Computational Intelligence, vol 83. B. Prasad and S.R.M. Prasanna (Eds.), 2008.
L. Lefakis and M.A. Wiering. Semi-Supervised Methods for Handwritten Character Recognition using Active Learning. BNAIC'07: Proceedings of the 19th Belgium-Netherlands Conference on Artificial Intelligence, Mehdi Dastani and Edwin de Jong (eds.), pp. 205-212, 2007.
L. Pape, B.G. Ruessink, M.A. Wiering and I.L. Turner. Recurrent Neural Network Modeling of Nearshore Sandbar Behavior. Neural Networks, Special Issue on Earth and Environmental Sciences, 20, 509-518, 2007.
Marco Wiering and Edwin D. de Jong. Computing Optimal Stationary Policies for Multi-objective Markov Decision Processes. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Honolulu, HI, USA, pp. 158-165, 2007.
Azizi Abdullah and Marco Wiering. CIREC: Cluster Correlogram Image Retrieval and Categorization using MPEG-7 Descriptors. Proceedings of IEEE International Symposium on Computational Intelligence in Image and Signal Processing (CIISP), Honolulu, HI, USA, pp. 431-437, 2007.
Hado van Hasselt and Marco Wiering. Reinforcement Learning in Continuous Action Spaces. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Honolulu, HI, USA, pp. 272-279, 2007.
Hado van Hasselt and Marco Wiering. Convergence of Model-Based Temporal Difference Learning for Control. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Honolulu, HI, USA, pp. 60-67, 2007.
Marco Wiering and Hado van Hasselt. Two Novel On-policy Reinforcement Learning Algorithms based on TD(lambda)-methods. Proceedings of IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), Honolulu, HI, USA, pp. 280-287, 2007.
Wilco Moerman, Bram Bakker and Marco Wiering. Hierarchical Assignment of Behaviors to Subpolicies. NIPS'2007 workshop on Hierarchical Organization of Behavior: Computational, Psychological and Neural Perspectives, 2007.
Tijn van der Zant, Lambert Schomaker, Marco Wiering, Axel Brink. Cognitive Developmental Pattern Recognition: Learning to Learn Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pp. 1208-1213, 2006
L. Pape, B.G. Ruessink, M.A. Wiering and I.L. Turner, Neural network modeling of nearshore sandbar behavior . Proceedings of the 2006 International Joint Conference on Neural Networks, Vancouver, Canada, pp. 8735-8742, 2006.
W. de Back, E.D. de Jong, and M.A. Wiering. Red Queen Dynamics in a predator-prey ecosystem Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-06), Maarten Keijzer et al. editors, pp. 381-382, 2006
J. van Diggelen, E.D. de Jong, and M.A. Wiering. Strategies for Ontology Negotiation: Finding the Right Level of Generality International workshop on agent communication, held with AAMAS'06, 2006
J.R. de Gruijl and M.A. Wiering. Musical Instrument Classification using Democratic Liquid State Machines Benelearn'06: Proceedings of the 15th Belgian-Dutch Conference on Machine Learning, pp. 33-40, edited by Y. Saeys, E. Tsiporkova, B. De Baets, and Y. Van de Peer, 2006
L. Zwanepol Klinkmeijer, E.D. de Jong, and M.A. Wiering. A Serial Population Genetic Algorithm for Dynamic Optimization Problems Benelearn'06: Proceedings of the 15th Belgian-Dutch Conference on Machine Learning, pp. 41-48, edited by Y. Saeys, E. Tsiporkova, B. De Baets, and Y. Van de Peer, 2006
M. Wiering, J.P. Patist, and H. Mannen Learning to PLay Board Games using Temporal Difference Methods . Technical Report, Utrecht University, UU-CS-2005-048, 30 pages, 2005.
M. Wiering. QV(lambda)-learning: A New On-policy Reinforcement Learning Algorithm. Proceedings of the 7th European Workshop on Reinforcement Learning, D. Leone (editor), pages 17-18, 2005.
M. Wiering. Comparing Training Paradigms for Learning to Play Backgammon Proceedings of the 7th European Workshop on Reinforcement Learning, D. Leone (editor), pages 29-30, 2005.
T. van der Zant, M. Wiering, and J. van Eijck. On-line robot learning using the interval estimation algorithm Proceedings of the 7th European Workshop on Reinforcement Learning, D. Leone (editor), pages 11-12, 2005.
S. Maas, M. Wiering, and B. Verhaar. Reinforcement Learning of a Pneumatic Robot Arm Controller Proceedings of the 7th European Workshop on Reinforcement Learning, D. Leone (editor), pages 23-24, 2005.
H. van Kuilenburg, M. Wiering, and M. den Uyl. A Model Based Method for Automatic Facial Expression Recognition. Proceedings of the 16th European Conference on Machine Learning (ECML'05), J. Gama et a. (eds), Springer-Verlag Berlin Heidelberg, pages 194-205,2005.
R.R. Negenborn, B. De Schutter, M.A. Wiering, and H. Hellendoorn. Learning-based model predictive control for Markov decision processes. Proceedings of the 16th IFAC World Congress, Prague, Czech Republic, July 2005.
M. Wiering, F. Mignogna, and B, Maassen Evolving Neural Networks for Forest Fire Control Benelearn'05: Proceedings of the 14th Belgian-Dutch Conference on Machine Learning, pages 113 - 120, edited by M. van Otterlo, M. Poel, and A. Nijholt, 2005
M. Sindlar and M. Wiering. A Modular Approach to Facial Expression Recognition Benelearn'05: Proceedings of the 14th Belgian-Dutch Conference on Machine Learning, pages 81 - 88, edited by M. van Otterlo, M. Poel, and A. Nijholt, 2005
R.R. Negenborn, B. De Schutter, M.A. Wiering, and H. Hellendoorn. Learning-based model predictive control for Markov decision processes. Tech. rep. 04-021, Delft Center for Systems and Control, Delft University of Technology, Delft, The Netherlands, Sept. 2004.
R.R. Negenborn, B. De Schutter, M.A. Wiering, and J. Hellendoorn, Experience-based model predictive control using reinforcement learning. Proceedings of the 8th TRAIL Congress 2004 - A World of Transport, Infrastructure and Logistics - CD-ROM, Rotterdam, The Netherlands, Nov. 2004.
M. Wiering. Convergence and Divergence in Standard and Averaging Reinforcement Learning Proceedings of the 15th European Conference on Machine Learning (ECML'04), edited by J-F Boulicaut, F. Esposito, F. Giannotti, and D. Pedreschi, pp. 477-488, Springer-Verlag Berlin Heidelberg, 2004.
M. Wiering, S. Leijnen, A. Koster, S. van Weers, and W. de Back. Autonomous Intelligent Robots at Utrecht University. International Journal of Advanced Robotic Systems, 1(2), pages 125-128, 2004.
M. Wiering, J. van Veenen, J. Vreeken, and A. Koopman. Intelligent Traffic Light Control. Technical Report UU-CS-2004-029, University Utrecht, 2004.
D. Wierstra and M. Wiering. Utile Distinction Hidden Markov Models. Proceedings of the Twenty-first International Conference on Machine Learning (ICML'04), pp. 855-862, ACM Press, 2004.
M. Wiering, J. Vreeken, J. van Veenen, and A. Koopman. Simulation and Optimization of Traffic in a City. IEEE Intelligent Vehicles symposium (IV'04), 2004.
H. Mannen and M. Wiering. Learning to play chess using TD(lambda)-learning with database games Benelearn'04: Proceedings of the Thirteenth Belgian-Dutch Conference on Machine Learning, edited by A. Nowe, T. Lenaerts, and K. Steenhout, pp.72-79, 2004
J-P. Patist and M. Wiering. Learning to play draughts using temporal difference learning with neural networks and databases Benelearn'04: Proceedings of the Thirteenth Belgian-Dutch Conference on Machine Learning, edited by A. Nowe, T. Lenaerts, and K. Steenhout, pp.87-94, 2004
M. Wiering. Memory-based Memetic Algorithms Benelearn'04: Proceedings of the Thirteenth Belgian-Dutch Conference on Machine Learning, edited by A. Nowe, T. Lenaerts, and K. Steenhout, pp.191-198, 2004
M. van Otterlo, M. Wiering, M. Dastani, and J-J. Meyer. A Characterization of Sapient Agents, First International Conference on Integration of Knowledge Intensive Multi-Agent Systems (KIMAS-03), edited by H. Hexmoor, IEEE Press, Boston, MA, pages 172-177, 2003.
M. Wiering. Hierarchical Mixtures of Naive Bayesian Classifiers. European Conference on Machine Learning (ECML'2003) Workshop on Probabilistic Graphical Models for Classification, edited by P. Larranaga, J.A> Lozano, J.M. Pena, and I. Inza, pages 93-104, 2003.
M. Wiering and F. Mignogna. Learning to Control Forest Fires with ESP. Proceedings of the Sixth European Workshop on Reinforcement Learning, edited by Alain Dutech and Olivier Buffet, pp. 22-23, 2003.
M. Wiering. Intelligent Traffic Light Control. ERCIM News Special: Cognitive Systems, 53, pp. 40-41, 2003.
M. Wiering. Evolving Causal Neural Networks. Benelearn'02: Proceedings of the Twelfth Belgian-Dutch Conference on Machine Learning, edited by Marco Wiering, pp. 103-108, 2002
M. Wiering. Hierarchical Mixtures of Naive Bayesian Classifiers. BNAIC'02: Proceedings of the Thirteenth Belgium-Netherlands Conference on Artificial Intelligence, Hendrik Blockeel and Marc Denecker (eds.), pp. 363-370, 2002.
M. Wiering. Model-based Reinforcement Learning in Dynamic Environments. Technical Report CS-UU-2002-029, Utrecht University, 2002.
S. Reynolds and M. Wiering. Fast Q(lambda) revisited. Technical Report CSRP-02-2, University of Birmingham, School of Computer Science, 2002.
M. Wiering. Hierarchical Mixtures of Naive Bayes Classifiers. Technical Report CS-UU-2002-003, Utrecht University, 2002.
J. de Jong and M. Wiering. Multiple Ant Colony Systems for the Busstop Allocation Problem. BNAIC'01: Proceedings of the Thirteenth Belgium-Netherlands Conference on Artificial Intelligence, pp. 141-148, 2001.
Marco Wiering, R.P. Salustowicz, and Juergen Schmidhuber. Model-based Reinforcement Learning for Evolving Soccer Strategies. In Computational Intelligence in Games, chapter 5. Editors N. Baba and L. Jain. pp. 99-131, 2001.
Marco Wiering. Reinforcement Learning in Dynamic Environments using Instantiated Information. Machine Learning: Proceedings of the Eighteenth International Conference (ICML'2001), pp. 585-592, 2001.
K. ten Tusscher, S. ten Hagen and M. Wiering The influence of commmunication on the choice to behave cooperatively. Proceedings of the Tenth Belgian-Dutch Conference on Machine Learning. Editor Ad Feelders. pp. 39-46, 2000.
Marco Wiering. Multi-Agent Reinforcement Learning for Traffic Light control. Machine Learning: Proceedings of the Seventeenth International Conference (ICML'2000), pp. 1151-1158, 2000.
Marco Wiering, Ben Krose, and Frans Groen. Learning in Multi-Agent Systems. Technical Report, University of Amsterdam, 1999.
Marco Wiering, R.P. Salustowicz, and Juergen Schmidhuber. Reinforcement Learning Soccer Teams with Incomplete World Models Neural Networks for Robot Learning. Special issue of Autonomous Robots, Vol 7(1), pp. 77-88, 1999.
Marco Wiering. Explorations in Efficient Reinforcement Learning. Ph.D. thesis. February 1999 (784K).
Marco Wiering, R.P. Salustowicz, and Juergen Schmidhuber. CMAC models learn to play soccer. Proceedings of the 8th International Conference on Artificial Neural Networks (ICANN'98), 443-448. In L. Niklasson and M. Boden and T. Ziemke (eds.), Springer-Verlag, London, 1998.
Marco Wiering and Marco Dorigo. Learning to Control Forest Fires. Umweltinformatik'98: Vernetzte Strukturen in Informatik, Umwelt und Wirtschaft, Proceedings des 12. Internationalen Symposiums 'Informatik den Umweltschutz', H.-D. Haasis, K.C. Ranze (eds.), pp 378-388, 1998.
Marco Wiering and Juergen Schmidhuber. Efficient Model-Based Exploration. Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (SAB'98): From Animals to Animats 5, 223-228, R. Pfeiffer, B. Blumberg, J. A. Meyer and S. W. Wilson (eds.), MIT Press/Bradford Books, 1998.
Marco Wiering and Juergen Schmidhuber. Learning Exploration Policies with Models. Conference on Automated Learning and Discovery (CONALD'98), 1998.
Marco Wiering and Juergen Schmidhuber. Fast Online Q(lambda). Machine Learning, 33(1), 105-115, 1998.
Marco Wiering and Juergen Schmidhuber. Speeding Up Q(lambda)-learning . In Proceedings of the Tenth European Conference on Machine Learning (ECML'98), pp. 352-363, 1998. (13 pages).
Marco Wiering and Juergen Schmidhuber. HQ-Learning. Adaptive Behavior, 6:2, 219-246, 1997.
Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber. Learning Team Strategies: Soccer Case Studies. Machine Learning, 33, (2/3), 1-19, 1998.

abstract

bibtex

Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber. Evolving soccer strategies. In N. Kasabov, R. Kozma, K. Ko, R. O'Shea, G. Coghill, and T. Gedeon, editors, Progress in Connectionist-based Information Systems: Proceedings of the Fourth International Conference on Neural Information Processing ICONIP'97, volume 1, 502-505,

abstract

bibtex

Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber. On learning soccer strategies. In W. Gerstner, A. Germond, M. Hasler, and J.-D. Nicoud, editors, Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), volume 1327 of Lecture Notes in Computer Science, 769-774. Springer-Verlag Berlin Heidelberg, 1997 (68K)
You can also just check the abstract or pick up the bibtex entry.
Juergen Schmidhuber, Jieyu Zhao and Marco Wiering. Shifting Inductive Bias with Success-Story Algorithm, Adaptive Levin Search, and Incremental Self-Improvement. Machine Learning, 28:1, 105-130, 1997.
Marco Wiering and Juergen Schmidhuber. HQ-Learning: Discovering Markovian Subgoals for non-Markovian Reinforcement Learning. Technical Report IDSIA-95-96, October 1996 (108K).
Juergen Schmidhuber and Jieyu Zhao and Marco Wiering. Simple principles of metalearning. Technical Report IDSIA-69-96, June 1996 (195 K).
Rafal Salustowicz, Marco Wiering and Juergen Schmidhuber. Learning team strategies with multiple policy-sharing agents: A soccer case study. Technical Report IDSIA-29-97, IDSIA, Lugano, Switzerland. -- 20 pages, 1997 (134K).

abstract

bibtex

Marco Wiering and Juergen Schmidhuber. Solving POMDPs with Levin Search and EIRA. Machine Learning: Proceedings of the thirteenth International Conference. 534-542, 1996 (86K).
Marco Wiering. TD Learning of Game Evaluation Functions with Hierarchical Neural Architectures. Master's thesis, University of Amsterdam, Holland, April 1995 (241K).
Marco Wiering and Ben Kroese. TD Learning of Game Evaluation Functions with Hierarchies of Adaptive Experts. University of Amsterdam, Holland, April 1995 (59K).