Reinforcement Learning

RL is the study of learning decision making policies from experience with computers.

One of my core research areas is into understanding the computational mechanisms that can enable learning to perform complex tasks primarily from experience and feedback. This topic, called Reinforcement Learning, has a complex history tying fields as diverse as neuroscience, behavioural and development psychology, economics and computer science. I approach it as a computational researcher aiming to build Artificial Intelligence agents that learn to way Humans do, not by any correspondence of their “brain” and it “neural” structure by the algorithms they both use to learn to act in a complex, mysterious world.

Learning Resources

Courses and Texts

Seminal Deep RL Papers

  • https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf

Tools

  • https://gym.openai.com/

Our Papers on Reinforcement Learning

  1. Scientific Discovery and the Cost of Measurement – Balancing Information and Cost in Reinforcement Learning
    Bellinger, Colin, Drozdyuk, Andriy, Crowley, Mark, and Tamblyn, Isaac
    In 1st Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE) 2022
  2. Mean Field MARL
    Decentralized Mean Field Games
    Ganapathi Subramanian, Sriram, Taylor, Mathew, Crowley, Mark, and Poupart, Pascal
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI-2022) 2022
  3. MARLEmpircal
    Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments
    Lee, Ken Ming, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In NeurIPS 2021 Deep Reinforcement Learning Workshop 2021
  4. Multi-Advisor-QL
    Multi-Agent Advisor Q-Learning
    Ganapathi Subramanian, Sriram, Larson, Kate, Taylor, Mathew, and Crowley, Mark
    Journal of Artificial Intelligence Research (JAIR) 2022
  5. A Complementary Approach to Improve WildFire Prediction Systems.
    Subramanian, Sriram Ganapathi, and Crowley, Mark
    In Neural Information Processing Systems (AI for social good workshop) 2018
  6. Partially Observable Mean Field Reinforcement Learning
    Ganapathi Subramanian, Sriram, Taylor, Matthew, Crowley, Mark, and Poupart, Pascal
    In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2021
  7. Amrl
    Active Measure Reinforcement Learning for Observation Cost Minimization: A framework for minimizing measurement costs in reinforcement learning
    Bellinger, Colin, Coles, Rory, Crowley, Mark, and Tamblyn, Isaac
    In Canadian Conference on Artificial Intelligence 2021
  8. Deep Multi Agent Reinforcement Learning for Autonomous Driving
    Bhalla, Sushrut, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In Canadian Conference on Artificial Intelligence 2020
  9. Learning Multi-Agent Communication with Reinforcement Learning
    Bhalla, Sushrut, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In Conference on Reinforcement Learning and Decision Making (RLDM-19) 2019
  10. Training Cooperative Agents for Multi-Agent Reinforcement Learning
    Bhalla, Sushrut, Ganapathi Subramanian, Sriram, and Crowley, Mark
    In Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019) 2019
  11. Learning Forest Wildfire Dynamics from Satellite Images Using Reinforcement Learning
    Subramanian, Sriram Ganapathi, and Crowley, Mark
    In Conference on Reinforcement Learning and Decision Making 2017
  12. Policy Gradient Optimization Using Equilibrium Policies for Spatial Planning Domains
    Crowley, Mark
    In 13th INFORMS Computing Society Conference 2013
  13. phd-thesis
    Equilibrium Policy Gradients for Spatiotemporal Planning
    Crowley, Mark
    2011