ページ

ホーム
ウェブページについて
訳語
講義ビデオ
課題とコースプロジェクト

講義ノート

MDPでのプランニング
バッチ強化学習
- 17. イントロダクション
- 18.有限MDPのサンプル効率
オンライン強化学習

Website of the course CMPUT 653: Theoretical Foundations of Reinforcement Learning.

MDPでのプランニング
12. TensorPlan and eluder sequences

12. TensorPlan and eluder sequences

Under construction.

Copyright © 2020 RL Theory.