Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis - BizPub.ai