ePrints@IISc

Browse by Author

Number of items: 47.

Conference Paper

Saxena, N and Khastagir, S and Kolathaya, S and Bhatnagar, S (2023) Off-Policy Average Reward Actor-Critic with Deterministic Policy Search. In: Proceedings of Machine Learning Research, 23 - 29 July 2023, Honolulu, pp. 30130-30203.

Deb, R and Gandhi, M and Bhatnagar, S (2022) Schedule Based Temporal Difference Algorithms. In: 58th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2022, 27 - 30 September 2022, Monticello.

Mishra, UA and Samineni, SR and Goel, P and Kunjeti, C and Lodha, H and Singh, A and Sagi, A and Bhatnagar, S and Kolathaya, S (2022) Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning. In: 39th IEEE International Conference on Robotics and Automation, ICRA 2022, 23 - 27 May 2022, Philadelphia, pp. 1631-1637.

Diddigi, RB and Jain, P and Prabuchandran, JK and Bhatnagar, S (2022) Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm. In: 2022 International Joint Conference on Neural Networks, IJCNN 2022, 18 - 23 July 2022, Padua.

Deb, R and Bhatnagar, S (2022) Gradient Temporal Difference with Momentum: Stability and Convergence. In: 36th AAAI Conference on Artificial Intelligence, AAAI 2022, 22 February - 1 March 2022, Virtual, Online, pp. 6488-6496.

Shanmugasundaram, P and Bhatnagar, S (2022) Co-operative Multi-agent Twin Delayed DDPG for Robust Phase Duration Optimization of Large Road Networks. In: 14th International Conference on Agents and Artificial Intelligence, ICAART 2022, 3 - 5 February 2022, Virtual, Online, pp. 122-142.

Bhatnagar, S and Chakraborti, B and Kumar, PV (2021) Streaming Codes for Handling a Combination of Burst and Random Erasures. In: 2021 IEEE Information Theory Workshop, ITW 2021, 17-21 Oct 2021, Virtual, Online.

Paigwar, K and Krishna, L and Tirumala, S and Khetan, N and Sagi, A and Joglekar, A and Bhatnagar, S and Ghosal, A and Amrutur, B and Kolathaya, S (2020) Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy Approach. In: UNSPECIFIED, pp. 2257-2267.

John, I and Bhatnagar, S (2020) Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources. In: Proceedings of the International Joint Conference on Neural Networks, 19-24 July 2020, Virtual, Glasgow.

Tirumala, S and Gubbi, S and Paigwar, K and Sagi, A and Joglekar, A and Bhatnagar, S and Ghosal, A and Amrutur, B and Kolathaya, S (2020) Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations. In: 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020, 31 Aug - 4 Sept 2020, Virtual, Naples; Italy, pp. 1107-1112.

Dharmavaram, A and Riemer, M and Bhatnagar, S (2020) Hierarchical Average Reward Policy Gradient Algorithms. In: 34th AAAI Conference on Artificial Intelligence, AAAI 2020, 7-12 Feb 2020, New York, pp. 13777-13778.

Padakandla, S and Rao, S and Bhatnagar, S (2020) Learning-based resource allocation in industrial IoT systems. In: IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC, 31 August - 3 September 2020, United Kingdom.

Bharadwaj Diddigi, R and Kamanchi, C and Bhatnagar, S (2020) A convergent off-policy temporal difference algorithm. In: Frontiers in Artificial Intelligence and Applications, 29 August-8 September 2020, Online; Spain, pp. 1103-1110.

Joseph, AG and Bhatnagar, S (2019) An Incremental Algorithm for Estimating Extreme Quantiles. In: 2019 Sixth Indian Control Conference (ICC) Proceedings, 18-20 Dec. 2019, Hyderabad, India, pp. 286-291.

John, I and Karumanchi, R and Bhatnagar, S (2019) Predictive and prescriptive analytics for performance optimization: Framework and a case study on a large-scale enterprise system. In: Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019, 16-19 December 2019, United States, pp. 876-881.

Kolathaya, S and Ghosal, A and Amrutur, B and Joglekar, A and Shetty, S and Dholakiya, D and Abhimanyu, . and Sagi, A and Bhattacharya, S and Singla, A and Bhatnagar, S (2019) Trajectory based Deep Policy Search for Quadrupedal Walking. In: 28th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019, 14-18 October 2019, New Delhi; India.

Joseph, AG and Bhatnagar, S (2019) Stochastic Approximation Trackers for Model-Based Search. In: 57th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2019, 24 - 27 September 2019, Monticello, IL, USA, pp. 741-748.

John, I and Sreekantan, A and Bhatnagar, S (2019) Efficient adaptive resource provisioning for cloud applications using reinforcement learning. In: 4th IEEE International Workshops on Foundations and Applications of Self* Systems, FAS*W 2019, 16 June 2019 - 20 June 2019, Umea, pp. 271-272.

Diddigi, RB and Prabuchandran, KJ and Sai Koti Reddy, D and Bhatnagar, S (2019) Actor-critic algorithms for constrained multi-agent reinforcement learning. In: 18th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2019, 13 May 2019 through 17 May 2019, Montreal, pp. 1931-1933.

Joseph, AG and Bhatnagar, S (2019) An Adaptive Sampling Algorithm for Policy Evaluation. In: 5th Indian Control Conference, ICC 2019, 9 January 2019 - 11 January 2019, Delhi, pp. 2-7.

Dholakiya, D and Bhattacharya, S and Gunalan, A and Singla, A and Bhatnagar, S and Amrutur, B and Ghosal, A and Kolathaya, S (2019) Design, Development and Experimental Realization of A Quadrupedal Research Platform: Stoch. In: 5th International Conference on Control, Automation and Robotics, ICCAR 2019, 19 - 22 April 2019, Beijing, pp. 229-234.

John, I and Sreekantan, A and Bhatnagar, S (2019) Auto-scaling Resources for Cloud Applications using Reinforcement learning. In: 2019 Grace Hopper Celebration India (GHCI), 6-8 Nov. 2019, Bangalore, India.

Patro, R and Bhatnagar, S (2008) An optimal RIO with statistical delay assurances. In: Proceedings of National Conference on Communications (NCC), Mumbai.

Raju Chinthalapati, VL and Bhatnagar, S (2006) A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing. In: Proceedings of the 45th IEEE Conference on Decision & Control, Manchester Grand Hyatt Hotel, 13-15 Dec. 2006, San Diego, CA.

Journal Article

VP, V and Bhatnagar, S (2024) Efficient energy management in smart grids with finite horizon Q-learning. In: Sustainable Energy, Grids and Networks, 38.

Mondal, A and Prashanth, LA and Bhatnagar, S (2024) Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization. In: Automatica, 162.

Bhatnagar, S and Rambha, T and Ramadurai, G (2022) An agent-based fleet management model for first- and last-mile services. In: Transportation.

Bhatnagar, S and Taloor, AK and Roy, S and Bhattacharya, P (2022) Delineation of aquifers favorable for groundwater development using Schlumberger configuration resistivity survey techniques in Rajouri district of Jammu and Kashmir, India. In: Groundwater for Sustainable Development, 17.

Singla, A and Padakandla, S and Bhatnagar, S (2021) Memory-Based Deep Reinforcement Learning for Obstacle Avoidance in UAV with Limited Environment Knowledge. In: IEEE Transactions on Intelligent Transportation Systems, 22 (1). pp. 107-118.

Padakandla, S and Prabuchandran, KJ and Bhatnagar, S (2020) Reinforcement learning algorithm for non-stationary environments. In: Applied Intelligence, 50 (11). pp. 3590-3606.

Yaji, VG and Bhatnagar, S (2020) Stochastic recursive inclusions in two timescales with nonadditive iterate-dependent markov noise. In: Mathematics of Operations Research, 45 (4). pp. 1405-1444.

J, PK and Penubothula, S and Kamanchi, C and Bhatnagar, S (2020) Novel First Order Bayesian Optimization with an Application to Reinforcement Learning. In: Applied Intelligence.

John, I and Kamanchi, C and Bhatnagar, S (2020) Generalized Speedy Q-Learning. In: IEEE Control Systems Letters, 4 (3). pp. 524-529.

Prashanth, LA and Bhatnagar, S and Bhavsar, N and Fu, M and Marcus, SI (2020) Random Directions Stochastic Approximation with Deterministic Perturbations. In: IEEE Transactions on Automatic Control, 65 (6). pp. 2450-2465.

Yaji, VG and Bhatnagar, S (2020) Analysis of Stochastic Approximation Schemes with Set-Valued Maps in the Absence of a Stability Guarantee and Their Stabilization. In: IEEE Transactions on Automatic Control, 65 (3). pp. 1100-1115.

Kamanchi, C and Diddigi, RB and Prabuchandran, KJ and Bhatnagar, S (2019) An Online Sample-Based Method for Mode Estimation Using ODE Analysis of Stochastic Approximation Algorithms. In: IEEE Control Systems Letters, 3 (3). pp. 697-702.

Bhatnagar, S and Patel, S and Karmeshu, Karmeshu (2018) A stochastic approximation approach to active queue management. In: Telecommunication Systems, 68 (1). pp. 89-104.

Prasad, HL and Bhatnagar, S (2012) General-sum stochastic games: Verifiability conditions for Nash equilibria. In: Automatica, 48 (11). pp. 2923-2930.

Vignat, C and Bhatnagar, S (2008) An extension of Wick's theorem. In: Statistics & Probability Letters, 78 (15). pp. 2404-2407.

Dukkipati, A and Bhatnagar, S and Murty, MN (2007) Gelfand-Yaglom-Perez theorem for generalized relative entropy functionals. In: Information Sciences, 177 (24). pp. 5707-5714.

Chandra, P and Ray, A and Bhatnagar, S (2004) The Late-Time Radio Emission from SN 1993J at Meter Wavelengths. In: The Astrophysical Journal, 612 (2). pp. 974-987.

Sutaria, FK and Chandra, P and Bhatnagar, S and Ray, A (2003) The nature of the prompt X-ray and radio emission from SN 2002ap. In: Astronomy & Astrophysics, 397 (3). pp. 1011-1018.

Gupta, VH and Bhatnagar, S (1997) An optimal fuel-injection policy for performance enhancement in internal combustion engines. In: Sadhana-Academy Proceedings In Engineering Sciences, 22 (4). pp. 545-552.

Bhatnagar, S and Borkar, VS (1995) A convex analytic framework for ergodic control of semi-Markov processes. In: Mathematics of Operations Research, 20 (4). pp. 923-936.

Preprint

Chandra, P and Ray, A and Bhatnagar, S (2003) Low frequency observations of SN 1993J with Giant Meterwave Radio Telescope. [Preprint]

Ray, A and Chandra, P and Sutaria, F and Bhatnagar, S (2003) Low frequency radio and X-ray properties of core-collapse supernovae. [Preprint]

Sutaria, FK and Chandra, P and Bhatnagar, S and Ray, A (2002) The nature of the prompt X-ray and radio emission from SN2002ap. [Preprint]

This list was generated on Wed Apr 17 01:50:39 2024 IST.