Research

OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning

Peter Henderson, Wei-Di Chang, Pierre-Luc Bacon, David Meger, Joelle Pineau, and Doina Precup
AAAI Conference on Artificial Intelligence (AAAI ’18), New Orleans, USA.
pdf | bibtex | code


Benchmark Environments for Multitask Learning in Continuous Domains

Peter Henderson, Wei-Di Chang, Florian Shkurti, Johanna Hansen, David Meger, and Gregory Dudek
Lifelong Learning: A Reinforcement Learning Approach Workshop (ICML ’17), Sydney, Australia.
pdf | bibtex | code


Underwater Multi-Robot Convoying using Visual Tracking by Detection

Florian Shkurti, Wei-Di Chang, Peter Henderson, Jahidul Islam, Juan Camilo Gamboa Higuera, Jimmy Li, Travis Manderson, Anqi Xu, Gregory Dudek, and Junaed Sattar
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS ’17), Vancouver, Canada.
pdf | bibtex | project page | dataset