Reinforcement Learning

RL is the study of learning decision making policies from experience with computers.

One of my core research areas is into understanding the computational mechanisms that can enable learning to perform complex tasks primarily from experience and feedback. This topic, called Reinforcement Learning, has a complex history tying fields as diverse as neuroscience, behavioural and development psychology, economics and computer science. I approach it as a computational researcher aiming to build Artificial Intelligence agents that learn to way Humans do, not by any correspondence of their “brain” and it “neural” structure by the algorithms they both use to learn to act in a complex, mysterious world.

Learning Resources from The Lab

External Resources

  • Revised Textbook by Sutton and Barto - http://incompleteideas.net/book/the-book-2nd.html
  • Martha White has a great RL Fundamentals Course
  • Sergey Levine has a very detailed Deep RL Course

Our Papers on Reinforcement Learning

  1. Balancing Information with Observation Costs in Deep Reinforcement Learning
    Bellinger, Colin, Drozdyuk, Andriy, Crowley, Mark, and Tamblyn, Isaac
    In Canadian Conference on Artificial Intelligence 2022
  2. Scientific Discovery and the Cost of Measurement – Balancing Information and Cost in Reinforcement Learning
    Bellinger, Colin, Drozdyuk, Andriy, Crowley, Mark, and Tamblyn, Isaac
    In 1st Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE) 2022
  3. Mean Field MARL
    Decentralized Mean Field Games
    Ganapathi Subramanian, Sriram, Taylor, Mathew, Crowley, Mark, and Poupart, Pascal
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI-2022) 2022
  4. MARLEmpircal
    Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments
    Frontiers in Artificial Intelligence 2022
  5. MARLEmpircal
    Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments
    In NeurIPS 2021 Deep Reinforcement Learning Workshop 2021
  6. Multi-Advisor-QL
    Multi-Agent Advisor Q-Learning
    Journal of Artificial Intelligence Research (JAIR) 2022
  7. A Complementary Approach to Improve WildFire Prediction Systems.
    Subramanian, Sriram Ganapathi, and Crowley, Mark
    In Neural Information Processing Systems (AI for social good workshop) 2018
  8. Partially Observable Mean Field Reinforcement Learning
    Ganapathi Subramanian, Sriram, Taylor, Matthew, Crowley, Mark, and Poupart, Pascal
    In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2021
  9. Amrl
    Active Measure Reinforcement Learning for Observation Cost Minimization: A framework for minimizing measurement costs in reinforcement learning
    Bellinger, Colin, Coles, Rory, Crowley, Mark, and Tamblyn, Isaac
    In Canadian Conference on Artificial Intelligence 2021
  10. Deep Multi Agent Reinforcement Learning for Autonomous Driving
    Bhalla, Sushrut, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In Canadian Conference on Artificial Intelligence 2020
  11. Learning Multi-Agent Communication with Reinforcement Learning
    Bhalla, Sushrut, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In Conference on Reinforcement Learning and Decision Making (RLDM-19) 2019
  12. Training Cooperative Agents for Multi-Agent Reinforcement Learning
    Bhalla, Sushrut, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019) 2019
  13. Learning Forest Wildfire Dynamics from Satellite Images Using Reinforcement Learning
    Subramanian, Sriram Ganapathi, and Crowley, Mark
    In Conference on Reinforcement Learning and Decision Making 2017
  14. Policy Gradient Optimization Using Equilibrium Policies for Spatial Planning Domains
    Crowley, Mark
    In 13th INFORMS Computing Society Conference 2013
  15. phd-thesis
    Equilibrium Policy Gradients for Spatiotemporal Planning
    Crowley, Mark
    2011