Off-line Estimation of Controlled Markov Chains: Minimaxity and Sample Complexity - BizPub.ai