Up a level |
Sen, D and Prashanth, LA and Gopalan, A (2023) Adaptive Estimation of Random Vectors with Bandit Feedback: A Mean-Squared Error Viewpoint. In: UNSPECIFIED, pp. 180-181.
Prashanth, LA and Chatterjee, Abhranil and Bhatnagar, Shalabh (2014) Adaptive Sleep-Wake Control using Reinforcement Learning in Sensor Networks. In: 6th International Conference on Communication Systems and Networks (COMSNETS), JAN 07-10, 2014, Bangalore, INDIA.
Prashanth, LA and Bhatnagar, Shalabh (2011) Reinforcement learning with average cost for adaptive control of traffic lights at intersections. In: 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), 5-7 Oct. 2011, Washington, DC, USA.
Prashanth, LA and Bhatnagar, Shalabh and Desai, Nirmit and Prasad, HL and Dasgupta, Gargi (2011) Stochastic optimization for adaptive labor staffing in service systems. In: ICSOC'11 Proceedings of the 9th international conference on Service-Oriented Computing, 2011, Heidelberg.
Prashanth, LA and Das, Sajal Kumar and Gopinath, K (2008) MAC design for heterogeneous application support in OFDM based wireless systems. In: 5th IEEE Consumer Communications and Networking Conference, JAN 10-12, 2008, Las Vegas.
Prashanth, LA and Gopinath, K (2008) OFDM-MAC algorithms and their impact on TCP performance in next generation mobile networks. In: 3rd International Conference on Communication System Software and Middleware and Workshop, JAN 06-10, 2008, Bangalore.
Mondal, A and Prashanth, LA and Bhatnagar, S (2024) Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization. In: Automatica, 162 .
Prashanth, LA and Bhatnagar, S and Bhavsar, N and Fu, M and Marcus, SI (2020) Random Directions Stochastic Approximation with Deterministic Perturbations. In: IEEE Transactions on Automatic Control, 65 (6). pp. 2450-2465.
Prashanth, LA and Prasad, HL and Bhatnagar, Shalabh and Chandra, Prakash (2016) A constrained optimization perspective on actor-critic algorithms and application to network routing. In: SYSTEMS & CONTROL LETTERS, 92 . pp. 46-51.
Bhatnagar, Shalabh and Prashanth, LA (2015) Simultaneous Perturbation Newton Algorithms for Simulation Optimization. In: JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 164 (2). pp. 621-643.
Prashanth, LA and Prasad, HL and Desai, Nirmit and Bhatnagar, Shalabh and Dasgupta, Gargi (2015) Simultaneous perturbation methods for adaptive labor staffing in service systems. In: SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 91 (5). pp. 432-455.
Prashanth, LA and Chatterjee, Abhranil and Bhatnagar, Shalabh (2014) Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks. In: WIRELESS NETWORKS, 20 (8). pp. 2589-2604.
Prashanth, LA and Bhatnagar, Shalabh (2011) Reinforcement Learning With Function Approximation for Traffic Signal Control. In: IEEE Transactions onIntelligent Transportation Systems, 12 (2, Sp.). pp. 412-421.