Thathachar, MAL and Phansalkar, VV (1995) Convergence of Teams and Hierarchies of Learning Automata in Connectionist Systems. In: IEEE Transactions on Systems, Man and Cybernetics, 25 (11). pp. 1459-1469.
|
PDF
Convergence_of_Teams.pdf Download (1MB) |
Abstract
Learning algorithms for feedforward connectionist systems in a reinforcement learning environment are developed and analyzed in this paper. The connectionist system is made of units of groups of learning automata. The learning algorithm used is the $L_{R-I}$ and the asymptotic behavior of this algorithm is approximated by an Ordinary Differential Equation (ODE) for low values of the learning parameter. This is done using weak convergence techniques. The reinforcement learning model is used to pose the goal of the system as a constrained optimization problem. It is shown that the ODE, and hence the algorithm exhibits local convergence properties, converging to local solutions of the related optimization problem. The three layer pattern recognition network is used as an example to show that the system does behave as predicted and reasonable rates of convergence are obtained. Simulations also show that the algorithm is robust to noise.
Item Type: | Journal Article |
---|---|
Publication: | IEEE Transactions on Systems, Man and Cybernetics |
Publisher: | IEEE |
Additional Information: | Copyright 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 22 Aug 2008 |
Last Modified: | 19 Sep 2010 04:35 |
URI: | http://eprints.iisc.ac.in/id/eprint/9812 |
Actions (login required)
View Item |