Mark Crowley | Full Schedule (Expected)

This is the expected schedule of topics to be covered in lecture along with the dates for various holiday’s, Assignment release and due dates. The final exam will be scheduled centrally during the exam period, all assignments will be done and handed in by then.

PART I: Fundamentals of Decision Making Under Uncertainty

Week	Lecture Date	Title	Topics Discussed	Textbook Sections
1	Monday, May 8, 2023	Classes Begin
1	Monday, May 8, 2023	Introduction	Introduction
1	Monday, May 8, 2023	Learning and Decision Making	Learning and Decision Making
1	Friday, May 12, 2023	Multi-Armed Bandits	Multi-Armed Bandits	1.1, 1.2, 17.6
2	Wednesday, May 17, 2023	Assignment 1 Released	Content up to MDP policy evaluation
2	Monday, May 15, 2023	Regret Minimization, UCB and Thompson Sampling	MAB and More
2	Friday, May 19, 2023	MDP Definitions	Definition and Motivation	3.1
2	Friday, May 19, 2023	MDP Rewards and Returns	Rewards and Returns	3.3-3.4
3	Monday, May 22, 2023	Victoria Day Holiday (no classes)
3	Tuesday, May 23, 2023	Make-up Day for Victoria Day
2	Friday, May 19, 2023	MDP Value Functions	Value Functions	3.5-3.6
3	Tuesday, May 23, 2023	Dynamic programming 1	Dynamic Programming	4.0-4.1
3	Friday, May 26, 2023	Dynamic programming 2	PI, VI	4.2-4.4
3	Friday, May 26, 2023	Dynamic programming 3	GPI, AsyncVI	4.4-

PART II: Iterative Approximately Optimal Methods

Week	Lecture Date	Title	Topics Discussed	Textbook Sections
4	Monday, May 29, 2023	MC- Monte Carlo Methods	Monte Carlo Methods	5
4	Friday, June 2, 2023	TD1 Temporal Difference Learning	Temporal Difference Learning	6.0
4	Friday, June 2, 2023	Assignment 2 Released	Requires up to : VI,PI, QL, SARSA, Double Qlearning, ExpSarsa
4	Friday, June 2, 2023	Assignment 1 Due	Content up to MDP policy evaluation
4	Friday, June 2, 2023	TD2 Comparison to MC on Random Walk	MC vs Random Walk	6.0-6.5
5	Monday, June 5, 2023	TD3 Sarsa and Qlearning	Sarsa and Qlearning	6.0-6.5
5	Friday, June 9, 2023	TD4 Expected Sarsa and Double Qlearning	Expected Sarsa and Double Qlearning	6.5?
5	Friday, June 9, 2023	ET1 One Step vs Direct Value Updates	One Step TD	12.1, 12.1
5	Friday, June 9, 2023	ET2 N Step TD Forward View	N step TD Forward View
6	Monday, June 12, 2023	ET3 N step TD backward View	N step TD backward View
6	Monday, June 12, 2023	ET4 Eligibility Trace on Policy	Eligibility Trace on Policy
6	Monday, June 12, 2023	ET5 Eligibility Traces Off Policy	Eligibility Traces Off Policy
6	Friday, June 16, 2023	Value Function Approximation	Value Function Approximation

PART III: Generalizing States and Policies

Week	Lecture Date	Title	Topics Discussed	Textbook Sections
7	Monday, June 19, 2023	Neural Networks basics and Deep RL as DQN	Deep RL (DQN)
7	Friday, June 16, 2023	Deep Learning Fundamentals	Neural Networks, activation fucntions, loss functions, training, gradient optimizers, regularization methods, CNNs, LSTMs
7	Monday, June 19, 2023	Midterm Week (no midterm, regular classes)		None
7	Friday, June 23, 2023	Assignment 2 Due	Content up to Monday Week 6	None
7	Friday, June 23, 2023	Assignment 3 Released	Content up to Deep RL basics plus review of some papers	None
7	Friday, June 23, 2023	Assignment 4 Released	Content up to Monday Week 10	None
8	Friday, June 30, 2023	Policy Gradient Algorithm	Policy Gradient Algorithm, REINFORCE	13.1,13.2,13.65
9	Monday, July 3, 2023	Canada Day Holiday (no classes)		None
9	Friday, July 7, 2023	REINFORCE, Actor-Critic Algorithm, A3C,A2C
10	Monday, July 10, 2023	PPO	TRPO, PPO
10	Friday, July 14, 2023	Evaluating RL Algorithms and Double DQN	Evaluation Methods, Double DQN
10	Friday, July 14, 2023	Prioritized Experience Replay and other New Deep RL algorithms
10	Sunday, July 16, 2023	Assignment 3 Due	Reviewing a Deep RL paper	None
11	Monday, July 17, 2023	DPG, DDPG, SAC and Prioritized Experience Replay	DPG, DDPG and SAC
11	Monday, July 17, 2023	AlphaGo and MonteCarlo Tree Search (MCTS)	AlphaGo and MonteCarlo Tree Search (MCTS)
12	Monday, July 24, 2023	RL Next Steps (pre-recorded)	MARL, Hierarchical RL, Supervised and Curriculum Learning, RL competitions, Free Energy, Big Ideas about AI	https://gingkoapp.com/rl-next-steps
12	Friday, July 28, 2023	Live from ICML! RLHF and Ongoing Research	(Pre-recorded or streamed) Reinforcement Learning with Human Feedback, other uses of RL for creative solutions.
13	Monday, July 31, 2023	Assignment 4 Due	Comparison of multiple Deep RL implementation on Gymnasium environments.
13	Monday, July 31, 2023	Review Lecture	All topics (TA will lead review)

End of Classes

Week	Date	Title
13	Wednesday, August 2, 2023	Pre-exam Study Day
14	Monday, August 7, 2023	Civic Holiday
15	Wednesday, August 9, 2023 at 7:30pm in MC4045	Final Exam (see LEARN for info)