| name | office hour | ||
|---|---|---|---|
| Instructor | Shuai Li | shuaili8@sjtu.edu.cn | Tue 1-2 PM Rm 1406-2, Software College |
| Chief TA | Fang Kong | fangkong@sjtu.edu.cn | Thu 7-9 PM Online |
| TA | Canzhe Zhao | canzhezhao@sjtu.edu.cn | Wed 7-9 PM Online |
| Ruofeng Yang | wanshuiyin@sjtu.edu.cn | Tue 7-9 PM Rm 1119, Software College |
|
| Zilong Wang | wangzilong@sjtu.edu.cn | Mon 7-9 PM Rm 1119, Software College |
Lecture times
Grading
References
| week | date | topic | materials |
|---|---|---|---|
| 1 | Feb 14 | 1 Introduction | |
| 2 | Feb 21 | 2 Markov Decision Process | |
| 3 | Feb 28 | 3 Value-based Methods | |
| 4 | Mar 7 | 4 Policy-based Methods | |
| 5 | Mar 14 | 5 Planning | |
| 6 | Mar 21 | 6 Approximation | |
| 7 | Mar 28 | 7 Deep Reinforcement Learning | |
| 8 | Apr 4 | 8 Deep Policy Methods | |
| 9 | Apr 11 | 9 Model-based Reinforcement Learning | |
| 10 | Apr 18 | 10 Immitation Learning | |
| 11 | Apr 25 | 11 Offline Reinforcement Learning | |
| 12 | May 2 | Holiday | |
| 13 | May 9 | 12 Multi-agent Reinforcement Learning | |
| 14 | May 16 | 13 Parametrized Reinforcement Learning | |
| 16 | May 30 May 31 | Presentations |