Adversarial reinforcement learning