Sen, D and Prashanth, LA and Gopalan, A (2023) Adaptive Estimation of Random Vectors with Bandit Feedback: A Mean-Squared Error Viewpoint. In: UNSPECIFIED, pp. 180-181.
Thontepu, P and Goswami, BG and Tayal, M and Singh, N and Shyam Sundar, PI and Shyam Sundar, MG and Sundaram, S and Katewa, V and Kolathaya, S (2023) Collision Cone Control Barrier Functions for Kinematic Obstacle Avoidance in UGVs. In: UNSPECIFIED, pp. 293-298.
Naskar, A and Thoppe, G (2023) Convergence of Momentum-based Distributed Stochastic Approximation with RL Applications. In: UNSPECIFIED, pp. 178-179.
Velhal, S and Krishna Kishore, VS and Sundaram, S (2023) A Non-iterative Spatio-Temporal Multi-Task Assignments based Collision-free Trajectories for Music Playing Robots. In: UNSPECIFIED, pp. 275-280.
Saxena, N and Sandeep, G and Jagtap, P (2023) Reinforcement Learning for Signal Temporal Logic using Funnel-Based Approach. In: UNSPECIFIED, pp. 1-6.
Bhatnagar, S The Reinforce Policy Gradient Algorithm Revisited. In: UNSPECIFIED, p. 177.