Full Schedule (Expected)
Spring 2023 - ECE 457C
This is the expected schedule of topics to be covered in lecture along with the dates for various holiday’s, Assignment release and due dates. The final exam will be scheduled centrally during the exam period, all assignments will be done and handed in by then.
PART I: Fundamentals of Decision Making Under Uncertainty
Week | Lecture Date | Title | Topics Discussed | Textbook Sections |
---|---|---|---|---|
1 | Monday, May 8, 2023 | Classes Begin | ||
1 | Monday, May 8, 2023 | Introduction | Introduction | |
1 | Monday, May 8, 2023 | Learning and Decision Making | Learning and Decision Making | |
1 | Friday, May 12, 2023 | Multi-Armed Bandits | Multi-Armed Bandits | 1.1, 1.2, 17.6 |
2 | Wednesday, May 17, 2023 | Assignment 1 Released | Content up to MDP policy evaluation | |
2 | Monday, May 15, 2023 | Regret Minimization, UCB and Thompson Sampling | MAB and More | |
2 | Friday, May 19, 2023 | MDP Definitions | Definition and Motivation | 3.1 |
2 | Friday, May 19, 2023 | MDP Rewards and Returns | Rewards and Returns | 3.3-3.4 |
3 | Monday, May 22, 2023 | Victoria Day Holiday (no classes) | ||
3 | Tuesday, May 23, 2023 | Make-up Day for Victoria Day | ||
2 | Friday, May 19, 2023 | MDP Value Functions | Value Functions | 3.5-3.6 |
3 | Tuesday, May 23, 2023 | Dynamic programming 1 | Dynamic Programming | 4.0-4.1 |
3 | Friday, May 26, 2023 | Dynamic programming 2 | PI, VI | 4.2-4.4 |
3 | Friday, May 26, 2023 | Dynamic programming 3 | GPI, AsyncVI | 4.4- |
PART II: Iterative Approximately Optimal Methods
Week | Lecture Date | Title | Topics Discussed | Textbook Sections |
---|---|---|---|---|
4 | Monday, May 29, 2023 | MC- Monte Carlo Methods | Monte Carlo Methods | 5 |
4 | Friday, June 2, 2023 | TD1 Temporal Difference Learning | Temporal Difference Learning | 6.0 |
4 | Friday, June 2, 2023 | Assignment 2 Released | Requires up to : VI,PI, QL, SARSA, Double Qlearning, ExpSarsa | |
4 | Friday, June 2, 2023 | Assignment 1 Due | Content up to MDP policy evaluation | |
4 | Friday, June 2, 2023 | TD2 Comparison to MC on Random Walk | MC vs Random Walk | 6.0-6.5 |
5 | Monday, June 5, 2023 | TD3 Sarsa and Qlearning | Sarsa and Qlearning | 6.0-6.5 |
5 | Friday, June 9, 2023 | TD4 Expected Sarsa and Double Qlearning | Expected Sarsa and Double Qlearning | 6.5? |
5 | Friday, June 9, 2023 | ET1 One Step vs Direct Value Updates | One Step TD | 12.1, 12.1 |
5 | Friday, June 9, 2023 | ET2 N Step TD Forward View | N step TD Forward View | |
6 | Monday, June 12, 2023 | ET3 N step TD backward View | N step TD backward View | |
6 | Monday, June 12, 2023 | ET4 Eligibility Trace on Policy | Eligibility Trace on Policy | |
6 | Monday, June 12, 2023 | ET5 Eligibility Traces Off Policy | Eligibility Traces Off Policy | |
6 | Friday, June 16, 2023 | Value Function Approximation | Value Function Approximation |
PART III: Generalizing States and Policies
Week | Lecture Date | Title | Topics Discussed | Textbook Sections |
---|---|---|---|---|
7 | Monday, June 19, 2023 | Neural Networks basics and Deep RL as DQN | Deep RL (DQN) | |
7 | Friday, June 16, 2023 | Deep Learning Fundamentals | Neural Networks, activation fucntions, loss functions, training, gradient optimizers, regularization methods, CNNs, LSTMs | |
7 | Monday, June 19, 2023 | Midterm Week (no midterm, regular classes) | None | |
7 | Friday, June 23, 2023 | Assignment 2 Due | Content up to Monday Week 6 | None |
7 | Friday, June 23, 2023 | Assignment 3 Released | Content up to Deep RL basics plus review of some papers | None |
7 | Friday, June 23, 2023 | Assignment 4 Released | Content up to Monday Week 10 | None |
8 | Friday, June 30, 2023 | Policy Gradient Algorithm | Policy Gradient Algorithm, REINFORCE | 13.1,13.2,13.65 |
9 | Monday, July 3, 2023 | Canada Day Holiday (no classes) | None | |
9 | Friday, July 7, 2023 | REINFORCE, Actor-Critic Algorithm, A3C,A2C | ||
10 | Monday, July 10, 2023 | PPO | TRPO, PPO | |
10 | Friday, July 14, 2023 | Evaluating RL Algorithms and Double DQN | Evaluation Methods, Double DQN | |
10 | Friday, July 14, 2023 | Prioritized Experience Replay and other New Deep RL algorithms | ||
10 | Sunday, July 16, 2023 | Assignment 3 Due | Reviewing a Deep RL paper | None |
11 | Monday, July 17, 2023 | DPG, DDPG, SAC and Prioritized Experience Replay | DPG, DDPG and SAC | |
11 | Monday, July 17, 2023 | AlphaGo and MonteCarlo Tree Search (MCTS) | AlphaGo and MonteCarlo Tree Search (MCTS) | |
12 | Monday, July 24, 2023 | RL Next Steps (pre-recorded) | MARL, Hierarchical RL, Supervised and Curriculum Learning, RL competitions, Free Energy, Big Ideas about AI | https://gingkoapp.com/rl-next-steps |
12 | Friday, July 28, 2023 | Live from ICML! RLHF and Ongoing Research | (Pre-recorded or streamed) Reinforcement Learning with Human Feedback, other uses of RL for creative solutions. | |
13 | Monday, July 31, 2023 | Assignment 4 Due | Comparison of multiple Deep RL implementation on Gymnasium environments. | |
13 | Monday, July 31, 2023 | Review Lecture | All topics (TA will lead review) |
End of Classes
Week | Date | Title | |
---|---|---|---|
13 | Wednesday, August 2, 2023 | Pre-exam Study Day | |
14 | Monday, August 7, 2023 | Civic Holiday | |
15 | Wednesday, August 9, 2023 at 7:30pm in MC4045 | Final Exam (see LEARN for info) |