Statistics of An Optimistic Posterior Sampling Strategy for Bayesian Reinforcement Learning
