TODO: a model-free algorithm which is Probably Approximately Correct (PAC).
Strehl, Alexander L., et al. "PAC model-free reinforcement learning." Proceedings of the 23rd international conference on Machine learning. ACM, 2006.
TODO: a model-free algorithm which is Probably Approximately Correct (PAC).
Strehl, Alexander L., et al. "PAC model-free reinforcement learning." Proceedings of the 23rd international conference on Machine learning. ACM, 2006.