Up a level |
Bharadwaj Diddigi, R and Kamanchi, C and Bhatnagar, S (2020) A convergent off-policy temporal difference algorithm. In: Frontiers in Artificial Intelligence and Applications, 29 August-8 September 2020, Online; Spain, pp. 1103-1110.
J, PK and Penubothula, S and Kamanchi, C and Bhatnagar, S (2020) Novel First Order Bayesian Optimization with an Application to Reinforcement Learning. In: Applied Intelligence .
John, I and Kamanchi, C and Bhatnagar, S (2020) Generalized Speedy Q-Learning. In: IEEE Control Systems Letters, 4 (3). pp. 524-529.
Kamanchi, C and Diddigi, RB and Prabuchandran, KJ and Bhatnagar, S (2019) An Online Sample-Based Method for Mode Estimation Using ODE Analysis of Stochastic Approximation Algorithms. In: IEEE Control Systems Letters, 3 (3). pp. 697-702.