22.11.2023_Reinforcement Learning в мультиязычных моделях перевода: PPO и DPO. Дмитрий Акимов, Сбер

  • 141 views
  • 17 minutes, 51 seconds
  • Uploaded by Сбер

Related videos

You might be interested