| role | name | email | office hour |
|---|---|---|---|
| Instructor | Shuai Li | shuaili8@sjtu.edu.cn | Thu 1-2 PM Rm 1406-2, Software College |
| Chief TA | Ruofeng Yang | wanshuiyin@sjtu.edu.cn | Thu 7-9 PM Rm 1119, Software College |
| TA | Zilong Wang | wangzilong@sjtu.edu.cn | Wed 7-9 PM Rm 1119, Software College |
| | Haitong Ma | mahaitong@sjtu.edu.cn | Fri 7-9 PM Online |
| | Letian Yang | moekid101@sjtu.edu.cn | Tue 7-9 PM Rm 1119, Software College |
Lecture times
References
| week | date | topic | materials |
|---|---|---|---|
| 1 | Feb 20 | 1 Introduction | |
| 2 | Feb 27 | 2 Markov Decision Process | |
| 3 | Mar 6 | 3 Value Function Estimation | |
| 4 | Mar 13 | | |
| 5 | Mar 20 | 4 Model-free Control | |
| 6 | Mar 27 | 5 Planning | |
| 7 | Apr 3 | 6 Approximation | |
| 8 | Apr 10 | 7 Deep Reinforcement Learning | |
| 9 | Apr 17 | 8 Deep Policy Methods | |
| 10 | Apr 24 | 9 Model-based Reinforcement Learning | |