From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses. Alexey Naumov, HSE University

  • 33 views
  • 20 minutes, 54 seconds
  • Uploaded by Сбер

Related videos

You might be interested