Browse by IISc Authors

Number of items: 3.

Conference Paper

Lakshmanan, K and Bhatnagar, Shalabh (2012) A novel Q-learning algorithm with function approximation for constrained Markov decision processes. In: 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 1-5 Oct. 2012, Monticello, IL, USA.

Lakshmanan, K and Bhatnagar, Shalabh (2011) Smoothed functional and Quasi-Newton algorithms for routing in multi-stage queueing network with constraints. In: ICDCIT'11 Proceedings of the 7th International Conference on Distributed Computing and Internet Technology, 2011, Heidelberg.

Journal Article

Bhatnagar, Shalabh and Lakshmanan, K (2012) An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. In: Journal of Optimization Theory and Applications, 153 (3). pp. 688-708.

This list was generated on Sat Dec 21 16:52:40 2024 IST.


A service from The J.R.D. Tata Memorial Library, Indian Institute of Science, Bengaluru-560012, India