Overcoming Model-Bias In Reinforcement Learning