role | name | email | office hour |
---|---|---|---|
Instructor | Shuai Li | shuaili8@sjtu.edu.cn | Thu 1-2 PM Rm 1406-2, Software College |
Chief TA | Ruofeng Yang | wanshuiyin@sjtu.edu.cn | Thu 7-9 PM Rm 1119, Software College |
TA | Zilong Wang | wangzilong@sjtu.edu.cn | Wed 7-9 PM Rm 1119, Software College |
TA | Haitong Ma | mahaitong@sjtu.edu.cn | Fri 7-9 PM Online |
TA | Letian Yang | moekid101@sjtu.edu.cn | Tue 7-9 PM Rm 1119, Software College |
Lecture times
References
week | date | topic | materials |
---|---|---|---|
1 | Feb 20 | 1 Introduction | |
2 | Feb 27 | 2 Markov Decision Process | |
3 | Mar 6 | 3 Value Function Estimation | |
4 | Mar 13 | | |
5 | Mar 20 | 4 Model-free Control | |
6 | Mar 27 | 5 Planning | |
7 | Apr 3 | 6 Approximation | |
8 | Apr 10 | 7 Deep Reinforcement Learning | |
9 | Apr 17 | 8 Deep Policy Methods | |
10 | Apr 24 | 9 Model-based Reinforcement Learning | |