논문 제목 : Maximum Margin Planning(2006)
● 논문 저자 : Nathan D. Ratliff, J. Andrew Bagnell, Martin A. Zinkevich
● 논문 링크 : https://www.ri.cmu.edu/pub_files/pub4/ratliff_nathan_2006_1/ratliff_nathan_2006_1.pdf
● 이전에 보면 좋은 논문 :
○ Apprenticeship Learning via Inverse Reinforcement Learning(2004)
Inverse Reinforcement Learning Travel
- Algorithms for Inverse Reinforcement Learning(2000)
- Apprenticeship Learning via Inverse Reinforcement Learning(2004)
- Maximum Margin Planning(2006) - Selected
- Maximum Entropy Inverse Reinforcement Learning(2008)
- Generative Adversarial Imitation Learning(2016)
- Variational Discriminator Bottleneck(2018)
리뷰 : https://reinforcement-learning-kr.github.io/2019/02/07/3_mmp/
'Artificial Intelligence > Reinforcement Learning' 카테고리의 다른 글
Generative Adversarial Imitation Learning (0) | 2018.12.26 |
---|---|
Maximum Entropy Inverse Reinforcement Learning (0) | 2018.12.26 |
Apprenticeship Learning via Inverse Reinforcement Learning (0) | 2018.11.12 |
Algorithms for Inverse Reinforcement Learning (2) | 2018.09.18 |
High-Dimensional Continuous Control using Generalized Advantage Estimation (0) | 2018.07.03 |