Saxena, N and Khastagir, S and Kolathaya, S and Bhatnagar, S (2023) Off-Policy Average Reward Actor-Critic with Deterministic Policy Search. In: Proceedings of Machine Learning Research, 23 - 29 July 2023, Honolulu, pp. 30130-30203.