name | office hour | ||
---|---|---|---|
Instructor | Shuai Li | shuaili8@sjtu.edu.cn | Tue 1-2 PM Rm 1406-2, Software College |
Chief TA | Fang Kong | fangkong@sjtu.edu.cn | Thu 7-9 PM Online |
TA | Canzhe Zhao | canzhezhao@sjtu.edu.cn | Wed 7-9 PM Online |
Ruofeng Yang | wanshuiyin@sjtu.edu.cn | Tue 7-9 PM Rm 1119, Software College |
|
Zilong Wang | wangzilong@sjtu.edu.cn | Mon 7-9 PM Rm 1119, Software College |
Lecture times
Grading
References
week | date | topic | materials |
---|---|---|---|
1 | Feb 14 | 1 Introduction | |
2 | Feb 21 | 2 Markov Decision Process | |
3 | Feb 28 | 3 Value-based Methods | |
4 | Mar 7 | 4 Policy-based Methods | |
5 | Mar 14 | 5 Planning | |
6 | Mar 21 | 6 Approximation | |
7 | Mar 28 | 7 Deep Reinforcement Learning | |
8 | Apr 4 | 8 Deep Policy Methods | |
9 | Apr 11 | 9 Model-based Reinforcement Learning | |
10 | Apr 18 | 10 Immitation Learning | |
11 | Apr 25 | 11 Offline Reinforcement Learning | |
12 | May 2 | Holiday | |
13 | May 9 | 12 Multi-agent Reinforcement Learning | |
14 | May 16 | 13 Parametrized Reinforcement Learning | |
16 | May 30 May 31 | Presentations |