Up a level |
Ramaswamy, Arunselvan and Bhatnagar, Shalabh (2019) Stability of Stochastic Approximations With ``Controlled Markov'' Noise and Temporal Difference Learning. In: IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 64 (6). pp. 2614-2620.
Joseph, Ajin George and Bhatnagar, Shalabh (2018) An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method. In: MACHINE LEARNING, 107 (8-10, ). pp. 1385-1429.
Joseph, Ajin George and Bhatnagar, Shalabh (2018) An incremental off-policy search in a model-free Markov decision process using a single sample path. In: MACHINE LEARNING, 107 (6). pp. 969-1011.
Ramaswamy, Arunselvan and Bhatnagar, Shalabh (2018) Analysis of Gradient Descent Methods With Nondiminishing Bounded Errors. In: IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 63 (5). pp. 1465-1471.
Karmakar, Prasenjit and Bhatnagar, Shalabh (2018) Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning. In: MATHEMATICS OF OPERATIONS RESEARCH, 43 (1). pp. 130-151.
Zhou, Enlu and Bhatnagar, Shalabh (2018) Gradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous Space. In: INFORMS JOURNAL ON COMPUTING, 30 (1). pp. 154-167.
Yaji, Vinayaka G and Bhatnagar, Shalabh (2018) Stochastic recursive inclusions with non-additive iterate-dependent Markov noise. In: STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC PROCESSES, 90 (3). pp. 330-363.
Bharadwaj, Raghuram D and Reddy, Sai Koti and Narayanam, Krishnasuri and Bhatnagar, Shalabh (2018) A unified decision making framework for supply and demand management in microgrid networks. In: IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), OCT 29-31, 2018, Aalborg, DENMARK.
Joseph, Ajin George and Bhatnagar, Shalabh (2017) An Incremental Fast Policy Search Using a Single Sample Path. In: PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, DEC 05-08, 2017, Kolkata, INDIA, pp. 3-10.
Karmeshu, Karmeshu and Patel, Sanjeev and Bhatnagar, Shalabh (2017) Adaptive mean queue size and its rate of change: queue management with random dropping. In: Telecommunication Systems, 65 (2). pp. 281-295. ISSN 1018-4864
Prashanth, L A and Bhatnagar, Shalabh and Fu, Michael and Marcus, Steve (2017) Adaptive System Optimization Using Random Directions Stochastic Approximation. In: IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 62 (5). pp. 2223-2238.
Ramaswamy, Arunselvan and Bhatnagar, Shalabh (2017) A Generalization of the Borkar-Meyn Theorem for Stochastic Recursive Inclusions. In: MATHEMATICS OF OPERATIONS RESEARCH, 42 (3). pp. 648-661.
Reddy, D Sai Koti and Prashanth, L A and Bhatnagar, Shalabh (2017) Improved Hessian estimation for adaptive random directions stochastic approximation. In: 55th IEEE Conference on Decision and Control (CDC), DEC 12-14, 2016, Las Vegas, NV, pp. 3682-3687.
Joseph, Ajin George and Bhatnagar, Shalabh (2017) A Model based Search Method for Prediction in Model-free Markov Decision Process. In: International Joint Conference on Neural Networks (IJCNN), MAY 14-19, 2017, Anchorage, AK, pp. 170-177.
Lakshmanan, K and Bhatnagar, Shalabh (2017) Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization. In: COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 66 (3). pp. 533-556.
Kumar, Sandeep and Padakandla, Sindhu and Chandrashekar, L and Parihar, Priyank and Gopinath, K and Bhatnagar, Shalabh (2017) Scalable Performance Tuning of Hadoop MapReduce: A Noisy Gradient Approach. In: 10th IEEE International Conference on Cloud Computing (CLOUD), JUN 25-30, 2017, Honolulu, HI, pp. 375-382.
Lakshminarayanan, Chandrashekar and Bhatnagar, Shalabh (2017) A stability criterion for two timescale stochastic approximation schemes. In: AUTOMATICA, 79 . pp. 108-114.
Prabuchandran, KJ and Bhatnagar, Shalabh and Borkar, VS (2016) Actor-Critic Algorithms with Online Feature Adaptation. In: ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 26 (4).
Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2016) MULTI-ARMED BANDITS BASED ON A VARIANT OF SIMULATED ANNEALING. In: INDIAN JOURNAL OF PURE & APPLIED MATHEMATICS, 47 (2). pp. 195-212.
Bhatnagar, Shalabh and Lakshmanan, K (2016) Multiscale Q-learning with linear function approximation. In: DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 26 (3). pp. 477-509.
Joseph, Ajin George and Bhatnagar, Shalabh (2016) A RANDOMIZED ALGORITHM FOR CONTINUOUS OPTIMIZATION. In: Winter Simulation Conference (WSC), DEC 11-14, 2016, Arlington, VA, pp. 907-918.
Joseph, Ajin George and Bhatnagar, Shalabh (2016) Revisiting the Cross Entropy Method with Applications in Stochastic Global Optimization and Reinforcement Learning. In: 22nd European Conference on Artificial Intelligence (ECAI), AUG 29-SEP 02, 2016, Hague, NETHERLANDS, pp. 1026-1034.
Maity, Raj Kumar and Lakshminarayanan, Chandrashekar and Padakandla, Sindhu and Bhatnagar, Shalabh (2016) Shaping Proto-Value Functions Using Rewards. In: 22nd European Conference on Artificial Intelligence (ECAI), AUG 29-SEP 02, 2016, Hague, NETHERLANDS, pp. 1690-1691.
Joseph, Ajin George and Bhatnagar, Shalabh (2016) A Stochastic Approximation Algorithm for Quantile Estimation. In: 22nd International Conference on Neural Information Processing (ICONIP), NOV 09-12, 2015, Istanbul, TURKEY, pp. 311-319.
Ramaswamy, Arunselvan and Bhatnagar, Shalabh (2016) Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem. In: STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC REPORTS, 88 (8). pp. 1173-1187.
Prashanth, LA and Prasad, HL and Bhatnagar, Shalabh and Chandra, Prakash (2016) A constrained optimization perspective on actor-critic algorithms and application to network routing. In: SYSTEMS & CONTROL LETTERS, 92 . pp. 46-51.
Prabuchandran, KJ and Hemanth, Kumar AN and Bhatnagar, Shalabh (2015) Decentralized Learning for Traffic Signal Control. In: 7th International Conference on Communication Systems and Networks, JAN 06-10, 2015, Bangalore, INDIA.
Padakandla, Sindhu and Prabuchandran, KJ and Bhatnagar, Shalabh (2015) Energy Sharing for Multiple Sensor Nodes With Finite Buffers. In: IEEE TRANSACTIONS ON COMMUNICATIONS, 63 (5). pp. 1811-1823.
Yaji, Vinayaka G and Bhatnagar, Shalabh (2015) Necessary and sufficient conditions for optimality in constrained general sum stochastic games. In: SYSTEMS & CONTROL LETTERS, 85 . pp. 8-15.
Bhatnagar, Shalabh and Prashanth, LA (2015) Simultaneous Perturbation Newton Algorithms for Simulation Optimization. In: JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 164 (2). pp. 621-643.
Prashanth, LA and Prasad, HL and Desai, Nirmit and Bhatnagar, Shalabh and Dasgupta, Gargi (2015) Simultaneous perturbation methods for adaptive labor staffing in service systems. In: SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 91 (5). pp. 432-455.
Prashanth, LA and Chatterjee, Abhranil and Bhatnagar, Shalabh (2014) Adaptive Sleep-Wake Control using Reinforcement Learning in Sensor Networks. In: 6th International Conference on Communication Systems and Networks (COMSNETS), JAN 07-10, 2014, Bangalore, INDIA.
Prabuchandran, KJ and Kumar, Hemanth AN and Bhatnagar, Shalabh (2014) Multi-agent Reinforcement Learning for Traffic Signal Control. In: IEEE 17th International Conference on Intelligent Transportation Systems (ITSC), OCT 08-11, 2014, Qingdao, PEOPLES R CHINA, pp. 2529-2534.
Ghoshdastidar, Debarghya and Dukkipati, Ambedkar and Bhatnagar, Shalabh (2014) Newton-based stochastic optimization using q-Gaussian smoothed functional algorithms. In: AUTOMATICA, 50 (10). pp. 2606-2614.
Ghoshdastidar, Debarghya and Dukkipati, Ambedkar and Bhatnagar, Shalabh (2014) Smoothed Functional Algorithms for Stochastic Optimization Using q-Gaussian Distributions. In: ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 24 (3).
Prashanth, LA and Chatterjee, Abhranil and Bhatnagar, Shalabh (2014) Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks. In: WIRELESS NETWORKS, 20 (8). pp. 2589-2604.
Chakravarty, Saswata and Padakandla, Sindhu and Bhatnagar, Shalabh (2014) A simulation-based algorithm for optimal pricing policy under demand uncertainty. In: INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 21 (5). pp. 737-760.
Bhatnagar, Shalabh and Borkar, VS and Prabuchandran, KJ (2013) Feature Search in the Grassmanian in Online Reinforcement Learning. In: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 7 (5). pp. 746-758.
Vemu, Koteswara Rao and Bhatnagar, Shalabh and Hemachandra, N (2012) Optimal multi-layered congestion based pricing schemes for enhanced QoS. In: Computer Networks, 56 (4). pp. 1249-1262.
Bhatnagar, Shalabh and Lakshmanan, K (2012) An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. In: JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 153 (3). pp. 688-708.
Ghoshdastidar, Debarghya and Dukkipati, Ambedkar and Bhatnagar, Shalabh (2012) q-Gaussian based Smoothed Functional Algorithms for Stochastic Optimization. In: IEEE International Symposium on Information Theory, JUL 01-06, 2012 , Cambridge, MA .
Lakshmanan, K and Bhatnagar, Shalabh (2011) Smoothed functional and Quasi-Newton algorithms for routing in multi-stage queueing network with constraints. In: ICDCIT'11 Proceedings of the 7th international conference on Distributed Computing and Internet Technology, 2011, Heidelberg.
Prashanth, LA and Bhatnagar, Shalabh and Desai, Nirmit and Prasad, HL and Dasgupta, Gargi (2011) Stochastic optimization for adaptive labor staffing in service systems. In: ICSOC'11 Proceedings of the 9th international conference on Service-Oriented Computing, 2011, Heidelberg.