[정리] Variational Discriminator Bottleneck
Variational Discriminator Bottleneck:Improving Imitation Learning, Inverse RL, and GANs by Constrainting Information Flow (Peng et al., 2018)
[정리] Maximum Margin Planning
Maximum Margin Planning (Ratliff et al., 2006)
[정리] Maximum Entropy Inverse Reinforcement Learning
Maximum Entropy Inverse Reinforcement Learning (Ziebart et al., 2008)
[정리] Maximum Entropy Deep Inverse Reinforcement Learning
Maximum Entropy Deep Inverse Reinforcement Learning (Wulfmeier et al., 2015)
[정리] Guided Cost Learning
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization (Finn et al., 2016)