Jan Hendrik Metzen's publications

[1] Jan Hendrik Metzen. Online skill discovery using graph-based clustering. In 10th European Workshop on Reinforcement Learning, (EWRL 2012), June 2012. [ bib | poster |slides |.pdf |supplement ]
We introduce a new online skill discovery method for reinforcement learning in discrete domains. The method is based on the bottleneck principle and identifies skills using a bottom-up hierarchical clustering of the estimated transition graph. In contrast to prior clustering approaches, it can be used incrementally and thus several times during the learning process. Our empirical evaluation shows that “assuming high connectivity in the face of uncertainty” can prevent premature identification of skills. Furthermore, we show that the choice of the linkage criterion is crucial for dealing with non-random sampling policies and stochastic environments.

[2] Jan Hendrik Metzen. Model-based evolutionary policy search for skill learning in continuous domains. In 10th European Workshop on Reinforcement Learning, (EWRL 2012), June 2012. [ bib | .pdf ]
[3] Sirko Straube, Jan Hendrik Metzen, Anett Seeland, Mario Krell, and Elsa Andrea Kirchner. Choosing an appropriate performance measure: Classification of EEG-data with varying class distribution. In Proceedings of the 41st Meeting of the Society for Neuroscience 2011, November 2011. [ bib ]
[4] Jan Hendrik Metzen and Elsa Andrea Kirchner. Rapid adaptation of brain reading interfaces based on threshold adjustment. In Proceedings of the 2011 Conference of the German Classification Society, (GfKl-2011), page 138, August 2011. [ bib | .pdf |slides ]
[5] Jan Hendrik Metzen, Su-Kyoung Kim, and Elsa Andrea Kirchner. Minimizing calibration time for brain reading. In Pattern Recognition, volume 6835 of Lecture Notes in Computer Science, pages 366-375. Springer Berlin / Heidelberg, August 2011. The original publication is available at www.springerlink.com under http://www.springerlink.com/content/731775n33wg062w2/fulltext.pdf. [ bib | .pdf |slides ]
Machine learning is increasingly used to autonomously adapt brain-machine interfaces to user-specific brain patterns. In order to minimize the preparation time of the system, it is highly desirable to reduce the length of the calibration procedure, during which training data is acquired from the user, to a minimum. One recently proposed approach is to reuse models that have been trained in historic usage sessions of the same or other users by utilizing an ensemble-based approach. In this work, we propose two extensions of this approach which are based on the idea to combine predictions made by the historic ensemble with session-specific predictions that become available once a small amount of training data has been collected. These extensions are particularly useful for Brain Reading Interfaces (BRIs), a specific kind of brain-machine interfaces. BRIs do not require that user feedback is given and thus, additional training data may be acquired concurrently to the usage session. Accordingly, BRIs should initially perform well when only a small amount of training data acquired in a short calibration procedure is available and allow an increased performance when more training data becomes available during the usage session. An empirical offline-study in a testbed for the use of BRIs to support robotic telemanipulation shows that the proposed extensions allow to achieve this kind of behavior.

[6] Jan Hendrik Metzen, Su Kyoung Kim, Timo Duchrow, Elsa Andrea Kirchner, and Frank Kirchner. On transferring spatial filters in a brain reading scenario. In Statistical Signal Processing Workshop (SSP), 2011 IEEE, pages 797-800, June 2011. [ bib |poster ]
Machine learning approaches are increasingly used in brain-machine-interfaces to allow automatic adaptation to user-specific brain patterns. One of the most crucial factors for the practical success of these systems is that this adaptation can be achieved with a minimum amount of training data since training data needs to be recorded during a calibration procedure prior to the actual usage session. To this end, one promising approach is to reuse models based on data recorded in preceding sessions of the same or of other users. In this paper, we investigate under which conditions it is favorable to reuse models (more specifically spatial filters) trained on data from historic sessions compared to learning new spatial filters on the current session's calibration data. We present an empirical study in a scenario in which Brain Reading, a particular kind of brain-machine-interface, is used to support robotic telemanipulation.

[7] Elsa Andrea Kirchner, Hendrik Wöhrle, Constantin Bergatt, Su-Kyoung Kim, Jan Hendrik Metzen, David Feess, and Frank Kirchner. Towards operator monitoring via brain reading - an EEG-based approach for space applications. In Proceedings of the 10th International Symposium on Artificial Intelligence, Robotics and Automation in Space (iSAIRAS-10), pages 448-455, September 2010. [ bib | http ]
[8] Jan Hendrik Metzen and Frank Kirchner. Model-based direct policy search. In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '10, pages 1589-1590, Richland, SC, 2010. International Foundation for Autonomous Agents and Multiagent Systems. [ bib | http |poster ]
[9] Jan Hendrik Metzen, Elsa Andrea Kirchner, Larbi Abdenebaoui, and Frank Kirchner. Learning to play the BRIO labyrinth game. Zeitschrift für Künstliche Intelligenz, Themenheft Reinforcement Learning:34-37, 2009. [ bib | .pdf ]
[10] Jan Hendrik Metzen, Tim Kröger, Andrea Schenk, Stephan Zidowitz, Heinz-Otto Peitgen, and Xiaoyi Jiang. Matching of anatomical tree structures for registration of medical images. Image and Vision Computing, 27(7):923-933, June 2009. [ bib | DOI | http ]
Many medical applications require a registration of different images of the same organ. In many cases, such a registration is accomplished by manual placement of landmarks in the images. In this paper we propose a method which is able to find reasonable landmarks automatically. To achieve this, bifurcations of the vessel systems, which have been extracted from the images by a segmentation algorithm, are assigned by the so-called association graph method and the coordinates of these matched bifurcations can be used as landmarks for a non-rigid registration algorithm. Several constraints to be used in combination with the association graph method are proposed and evaluated on a ground truth consisting of anatomical trees from liver and lung. Furthermore, a method for preprocessing (tree pruning) as well as for postprocessing (clique augmentation) are proposed and evaluated on this ground truth. The proposed method achieves promising results for anatomical trees of liver and lung and for medical images obtained with different modalities and at different points in time.

[11] Constantin Bergatt, Jan Hendrik Metzen, Elsa Andrea Kirchner, and Frank Kirchner. Quantification and minimization of the Simulation-Reality-Gap on a BRIO labyrinth game. In Proceedings of the first International Workshop on Learning and Data Mining for Robotics (LEMIR-09), Bled, Slovenia, 2009. [ bib ]
[12] Elsa Andrea Kirchner, Jan Hendrik Metzen, Timo Duchrow, Su Kyong Kim, and Frank Kirchner. Assisting telemanipulation operators via Real-Time brain reading. In Lemgoer Schriftenreihe zur industriellen Informationstechnik, Paderborn, September 2009. [ bib | .pdf ]
[13] Yohannes Kassahun, Jan Hendrik Metzen, Mark Edgington, and Frank Kirchner. Incremental acquisition of neural structures through evolution. In Design and Control of Intelligent Robotic Systems, pages 187-208. 2009. [ bib | http ]
[14] Jan Hendrik Metzen, Mark Edgington, Yohannes Kassahun, and Frank Kirchner. Evolving neural networks for online reinforcement learning. In Parallel Problem Solving from Nature - PPSN X, pages 518-527, September 2008. [ bib | http ]
For many complex Reinforcement Learning problems with large and continuous state spaces, neuroevolution (the evolution of artificial neural networks) has achieved promising results. This is especially true when there is noise in sensor and/or actuator signals. These results have mainly been obtained in offline learning settings, where the training and evaluation phase of the system are separated. In contrast, in online Reinforcement Learning tasks where the actual performance of the systems during its learning phase matters, the results of neuroevolution are significantly impaired by its purely exploratory nature, meaning that it does not use (i.e. exploit) its knowledge of the performance of single individuals in order to improve its performance during learning. In this paper we describe modifications which significantly improve the online performance of the neuroevolutionary method Evolutionary Acquisition of Neural Topologies (EANT) and discuss the results obtained on two benchmark problems.

[15] Malte Römmermann, Mark Edgington, Jan Hendrik Metzen, Jose de Gea, Yohannes Kassahun, and Frank Kirchner. Learning walking patterns for kinematically complex robots using evolution strategies. In Parallel Problem Solving from Nature - PPSN X, pages 1091-1100, 2008. [ bib | http ]
[16] Yohannes Kassahun, Jose de Gea, Mark Edgington, Jan Hendrik Metzen, and Frank Kirchner. Accelerating neuroevolutionary methods using a kalman filter. In GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pages 1397-1404, New York, NY, USA, 2008. ACM. [ bib | http ]
[17] Jan Hendrik Metzen, Frank Kirchner, Mark Edgington, and Yohannes Kassahun. Towards efficient online reinforcement learning using neuroevolution. In Proceedings of the 10th Annual Conference on Genetic and Evolutionary Computation, GECCO '08, pages 1425-1426, New York, NY, USA, 2008. ACM. [ bib | http ]
[18] Jan Hendrik Metzen, Mark Edgington, Yohannes Kassahun, and Frank Kirchner. Analysis of an evolutionary reinforcement learning method in a multiagent domain. In Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '08, pages 291-298, Richland, SC, May 2008. International Foundation for Autonomous Agents and Multiagent Systems. [ bib | http ]
Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algorithms are among the most promising approaches for such RL problems. The relative performance of these approaches in certain subdomains (e.g. multiagent learning) of the general RL problem remains an open question at this time. In addition to theoretical analysis, benchmarks are one of the most important tools for comparing different RL methods in certain problem domains. A recently proposed multiagent RL benchmark problem is the RoboCup Keepaway benchmark. This benchmark is one of the most challenging multiagent learning problems because its state-space is continuous and high dimensional, and both the sensors and the actuators are noisy. In this paper we analyze the performance of the neuroevolutionary approach called Evolutionary Acquisition of Neural Topologies (EANT) in the Keepaway benchmark, and compare the results obtained using EANT with the results of other algorithms tested on the same benchmark.

[19] Jan Hendrik Metzen, Mark Edgington, Yohannes Kassahun, and Frank Kirchner. Performance evaluation of EANT in the RoboCup keepaway benchmark. In ICMLA '07: Proceedings of the Sixth International Conference on Machine Learning and Applications, pages 342-347, Washington, DC, USA, 2007. IEEE Computer Society. [ bib | http ]
[20] Yohannes Kassahun, Mark Edgington, Jan H Metzen, Gerald Sommer, and Frank Kirchner. A common genetic encoding for both direct and indirect encodings of networks. In GECCO '07: Proceedings of the 9th annual conference on Genetic and evolutionary computation, pages 1029-1036, New York, NY, USA, July 2007. ACM. [ bib | http ]
[21] Yohannes Kassahun, Jan Metzen, Jose de Gea, Mark Edgington, and Frank Kirchner. A general framework for encoding and evolving neural networks. In KI 2007: Advances in Artificial Intelligence, pages 205-219, Osnabrück, Germany, 2007. [ bib | http ]
[22] Jan Metzen, Tim Kröger, Andrea Schenk, Stephan Zidowitz, Heinz-Otto Peitgen, and Xiaoyi Jiang. Matching of tree structures for registration of medical images. In Graph-Based Representations in Pattern Recognition, pages 13-24, Alicante, Spain, 2007. Springer Verlag. [ bib | http ]
Many medical applications require a registration of different images of the same organ. In many cases, such a registration is accomplished by manually placing landmarks in the images. In this paper we propose a method which is able to find reasonable landmarks automatically. To achieve this, nodes of the vessel systems, which have been extracted from the images by a segmentation algorithm, will be assigned by the so-called association graph method and the coordinates of these matched nodes can be used as landmarks for a non-rigid registration algorithm.

[23] Jan Hendrik Metzen, Tim Kröger, Andrea Schenk, Stephan Zidowitz, Heinz-Otto Peitgen, and Xiaoyi Jiang. Matching von baumstrukturen - zuordnung von gefäßsystemen aus leber und lunge. In Bildverarbeitung für die Medizin 2007, pages 116-120, March 2007. [ bib | http ]
[24] Jan Hendrik Metzen. Matching von Baumstrukturen in der medizinischen Bildverarbeitung. Master's thesis, Westfälische Wilhelms-Universität Münster, July 2006. [ bib | .pdf ]
[25] Jens Müller, Jan Hendrik Metzen, Alexander Ploss, Maraike Schellmann, and Sergei Gorlatch. Rokkatan: Scaling an RTS game design to the massively multiplayer realm. In ACM SIGHCHI International Conference on Advances in Computer Entertainment Technology (ACE 05), pages 125-132, Valencia, Spain, June 2005. ACM. [ bib | .pdf ]

This file has been generated by bibtex2html 1.88.