Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. In: Discrete Event Dynamic Systems - Theory and Applications, 17 (1). pp. 23-52.