Joseph, Ajin George and Bhatnagar, Shalabh (2016) Revisiting the Cross Entropy Method with Applications in Stochastic Global Optimization and Reinforcement Learning. In: 22nd European Conference on Artificial Intelligence (ECAI), AUG 29-SEP 02, 2016, Hague, NETHERLANDS, pp. 1026-1034.
PDF
Eur_Con_Art_Int_285_1026_2016.pdf - Published Version Restricted to Registered users only Download (2MB) | Request a copy |
Abstract
In this paper, we provide a new algorithm for the problem of stochastic global optimization where only noisy versions of the objective function are available. The algorithm is inspired by the well known cross entropy (CE) method. The algorithm takes the shape of a multi-timescale stochastic approximation algorithm, where we reuse the previous samples based on discounted averaging, and hence it saves the overall computational and storage cost. We provide proof of the stability and the global optimization property of our algorithm. The algorithm shows good performance on the noisy versions of global optimization benchmarks and outperforms a state-of-the-art algorithm for non-linear function approximation in reinforcement learning.
Item Type: | Conference Proceedings |
---|---|
Series.: | Frontiers in Artificial Intelligence and Applications |
Additional Information: | Copy right for this article belongs to the IOS PRESS, NIEUWE HEMWEG 6B, 1013 BG AMSTERDAM, NETHERLANDS |
Department/Centre: | Division of Electrical Sciences > Computer Science & Automation |
Date Deposited: | 03 Dec 2016 09:45 |
Last Modified: | 03 Dec 2016 09:45 |
URI: | http://eprints.iisc.ac.in/id/eprint/55368 |
Actions (login required)
View Item |