Up a level |
Karumanchi, SH and Diddigi, RB and Prabuchandran, KJ and Bhatnagar, S (2023) Autonomous UAV Navigation in Complex Environments using Human Feedback. In: UNSPECIFIED, pp. 499-506.
Guin, S and Bhatnagar, S (2023) A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes. In: UNSPECIFIED, pp. 3353-3359.
Deb, R and Gandhi, M and Bhatnagar, S (2022) Schedule Based Temporal Difference Algorithms. In: 58th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2022, 27 - 30 September 2022, Monticello.
Mishra, UA and Samineni, SR and Goel, P and Kunjeti, C and Lodha, H and Singh, A and Sagi, A and Bhatnagar, S and Kolathaya, S (2022) Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning. In: 39th IEEE International Conference on Robotics and Automation, ICRA 2022, 23 - 27 May 2022, Philadelphia, pp. 1631-1637.
Diddigi, RB and Jain, P and Prabuchandran, JK and Bhatnagar, S (2022) Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm. In: 2022 International Joint Conference on Neural Networks, IJCNN 2022, 18 - 23 July 2022, Padua.
Deb, R and Bhatnagar, S (2022) Gradient Temporal Difference with Momentum: Stability and Convergence. In: 36th AAAI Conference on Artificial Intelligence, AAAI 2022, 22 February - 1 March 2022, Virtual, Online, pp. 6488-6496.
Shanmugasundaram, P and Bhatnagar, S (2022) Co-operative Multi-agent Twin Delayed DDPG for Robust Phase Duration Optimization of Large Road Networks. In: 14th International Conference on Agents and Artificial Intelligence, ICAART 2022, 3 - 5 February 2022, Virtual, Online, pp. 122-142.
Bhatnagar, S and Chakraborti, B and Kumar, PV (2021) Streaming Codes for Handling a Combination of Burst and Random Erasures. In: 2021 IEEE Information Theory Workshop, ITW 2021, 17-21 Oct 2021, Virtual, Online.
Paigwar, K and Krishna, L and Tirumala, S and Khetan, N and Sagi, A and Joglekar, A and Bhatnagar, S and Ghosal, A and Amrutur, B and Kolathaya, S (2020) Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy Approach. In: UNSPECIFIED, pp. 2257-2267.
John, I and Bhatnagar, S (2020) Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources. In: Proceedings of the International Joint Conference on Neural Networks, 19-24 July 2020, Virtual, Glasgow.
Tirumala, S and Gubbi, S and Paigwar, K and Sagi, A and Joglekar, A and Bhatnagar, S and Ghosal, A and Amrutur, B and Kolathaya, S (2020) Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations. In: 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020, 31 Aug - 4 Sept 2020, Virtual, Naples; Italy, pp. 1107-1112.
Dharmavaram, A and Riemer, M and Bhatnagar, S (2020) Hierarchical Average Reward Policy Gradient Algorithms. In: 34th AAAI Conference on Artificial Intelligence, AAAI 2020, 7-12 Feb 2020, New York, pp. 13777-13778.
Padakandla, S and Rao, S and Bhatnagar, S (2020) Learning-based resource allocation in industrial IoT systems. In: IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC, 31 August - 3 September 2020, United Kingdom.
Bharadwaj Diddigi, R and Kamanchi, C and Bhatnagar, S (2020) A convergent off-policy temporal difference algorithm. In: Frontiers in Artificial Intelligence and Applications, 29 August-8 September 2020, Online; Spain, pp. 1103-1110.
Joseph, AG and Bhatnagar, S (2019) An Incremental Algorithm for Estimating Extreme Quantiles. In: 2019 Sixth Indian Control Conference (ICC)Proceedings, 18-20 Dec. 2019, Hyderabad, India, pp. 286-291.
John, I and Karumanchi, R and Bhatnagar, S (2019) Predictive and prescriptive analytics for performance optimization: Framework and a case study on a large-scale enterprise system. In: Proceedings-18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019, 16-19, December 2019, United States, pp. 876-881.
Kolathaya, S and Ghosal, A and Amrutur, B and Joglekar, A and Shetty, S and Dholakiya, D and Abhimanyu, . and Sagi, A and Bhattacharya, S and Singla, A and Bhatnagar, S (2019) Trajectory based Deep Policy Search for Quadrupedal Walking. In: 28th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019, 14-18 October 2019, New Delhi; India.
Joseph, AG and Bhatnagar, S (2019) Stochastic Approximation Trackers for Model-Based Search. In: 57th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2019, 24 -27 September 2019, Monticello, IL, USA, USA, pp. 741-748.
John, I and Sreekantan, A and Bhatnagar, S (2019) Efficient adaptive resource provisioning for cloud applications using reinforcement learning. In: 4th IEEE International Workshops on Foundations and Applications of Self* Systems, FAS*W 2019, 16 June 2019 - 20 June 2019, Umea, pp. 271-272.
Diddigi, RB and Prabuchandran, KJ and Sai Koti Reddy, D and Bhatnagar, S (2019) Actor-critic algorithms for constrained multi-agent reinforcement learning. In: 18th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2019, 13 May 2019through 17 May 2019, Montreal, pp. 1931-1933.
Joseph, AG and Bhatnagar, S (2019) An Adaptive Sampling Algorithm for Policy Evaluation. In: 5th Indian Control Conference, ICC 2019, 9 January 2019- 11 January 2019, Delhi, pp. 2-7.
Dholakiya, D and Bhattacharya, S and Gunalan, A and Singla, A and Bhatnagar, S and Amrutur, B and Ghosal, A and Kolathaya, S (2019) Design, Development and Experimental Realization of A Quadrupedal Research Platform: Stoch. In: 5th International Conference on Control, Automation and Robotics, ICCAR 2019, 19 - 22 April 2019, Beijing, pp. 229-234.
John, I and Sreekantan, A and Bhatnagar, S (2019) Auto-scaling Resources for Cloud Applications using Reinforcement learning. In: 2019 Grace Hopper Celebration India (GHCI), 6-8 Nov. 2019, Bangalore, India.
Bhatnagar, S The Reinforce Policy Gradient Algorithm Revisited. In: UNSPECIFIED, p. 177.
VP, V and Bhatnagar, S (2024) Efficient energy management in smart grids with finite horizon Q-learning. In: Sustainable Energy, Grids and Networks, 38 .
Mondal, A and Prashanth, LA and Bhatnagar, S (2024) Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization. In: Automatica, 162 .
Bhatnagar, S and Rambha, T and Ramadurai, G (2022) An agent-based fleet management model for first- and last-mile services. In: Transportation .
Singla, A and Padakandla, S and Bhatnagar, S (2021) Memory-Based Deep Reinforcement Learning for Obstacle Avoidance in UAV with Limited Environment Knowledge. In: IEEE Transactions on Intelligent Transportation Systems, 22 (1). pp. 107-118.
Padakandla, S and Prabuchandran, KJ and Bhatnagar, S (2020) Reinforcement learning algorithm for non-stationary environments. In: Applied Intelligence, 50 (11). pp. 3590-3606.
Yaji, VG and Bhatnagar, S (2020) Stochastic recursive inclusions in two timescales with nonadditive iterate-dependent markov noise. In: Mathematics of Operations Research, 45 (4). pp. 1405-1444.
J, PK and Penubothula, S and Kamanchi, C and Bhatnagar, S (2020) Novel First Order Bayesian Optimization with an Application to Reinforcement Learning. In: Applied Intelligence .
John, I and Kamanchi, C and Bhatnagar, S (2020) Generalized Speedy Q-Learning. In: IEEE Control Systems Letters, 4 (3). pp. 524-529.
Prashanth, LA and Bhatnagar, S and Bhavsar, N and Fu, M and Marcus, SI (2020) Random Directions Stochastic Approximation with Deterministic Perturbations. In: IEEE Transactions on Automatic Control, 65 (6). pp. 2450-2465.
Yaji, VG and Bhatnagar, S (2020) Analysis of Stochastic Approximation Schemes with Set-Valued Maps in the Absence of a Stability Guarantee and Their Stabilization. In: IEEE Transactions on Automatic Control, 65 (3). pp. 1100-1115.
Kamanchi, C and Diddigi, RB and Prabuchandran, KJ and Bhatnagar, S (2019) An Online Sample-Based Method for Mode Estimation Using ODE Analysis of Stochastic Approximation Algorithms. In: IEEE Control Systems Letters, 3 (3). pp. 697-702.
Bhatnagar, S and Patel, S and Karmeshu, Karmeshu (2018) A stochastic approximation approach to active queue management. In: Telecommunication Systems, 68 (1). pp. 89-104.
Prasad, HL and Bhatnagar, S (2012) General-sum stochastic games: Verifiability conditions for Nash equilibria. In: AUTOMATICA, 48 (11). pp. 2923-2930.