reinforcement learning for prediction