ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Browse by Author

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | No Grouping
Number of items: 10.

Conference Paper

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Network flow-control using asynchronous stochastic approximation. In: 46th IEEE Conference on Decision and Control, DEC 12-14, 2007, New Orleans, LA.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Parametrized actor-critic algorithms for finite-horizon MDPs. In: American Control Conference 2007, JUL 09-13, 2007, New York,.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Solving MDPs using two-timescale simulated annealing with multiplicative weights. In: American Control Conference 2007, JUL 09-13, 2007, New York, NY.

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2006) An Actor-Critic Algorithm for Finite Horizon Markov Decision Processes. In: Proceedings of the 45th IEEE Conference on Decision & Control Manchester Grand Hyatt Hotel, December 13-15, 2006, San Diego, CA.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2006) SPSA algorithms with measurement reuse. In: 2006 Winter Simulation Conference,, Dec 03-06, 2006, Monterey, CA,, pp. 319-327.

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2006) A reinforcement learning based algorithm for finite horizon Markov decision processes. In: 45th IEEE Conference on Decision and Control,, Dec 13-15, 2006, San Diego, CA, pp. 5519-5524.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2005) Solution of MDPS using simulation-based value iteration. In: 2nd International Conference on Artificial Intelligence Applications and Innovations, SEP 07-09, 2005, Beijing.

Journal Article

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2016) MULTI-ARMED BANDITS BASED ON A VARIANT OF SIMULATED ANNEALING. In: INDIAN JOURNAL OF PURE & APPLIED MATHEMATICS, 47 (2). pp. 195-212.

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2008) Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes. In: Simulation- Transactions of the Society for Modeling and Simulation international, 84 (12). pp. 577-600.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. In: Discrete Event Dynamic Systems - Theory and Applications, 17 (1). pp. 23-52.

This list was generated on Thu Apr 25 00:03:40 2024 IST.