Convergence Of A Reinforcement Learning Algorithm In Continuous Domains