Bhatnagar, Shalabh and Babu, Mohan K (2008) New algorithms of the Q-learning type. In: Automatica, 44 (4). pp. 1111-1119.