Learning algorithms for feedforward connectionist systems in a reinforcement learning environment are developed and analyzed in this paper. The connectionist system is made of units of groups of learning automata. The learning algorithm used is the $L_{R-I}$ and the asymptotic behavior of this algorithm is approximated by an Ordinary Differential Equation (ODE) for low values of the learning parameter. This is done using weak convergence techniques. The reinforcement learning model is used to pose the goal of the system as a constrained optimization problem. It is shown that the ODE, and hence the algorithm exhibits local convergence properties, converging to local solutions of the related optimization problem. The three layer pattern recog...
Any non-associative reinforcement learning algorithm can be viewed as a method for performing functi...
A key open problem in reinforcement learning is to assure convergence when using a compact hy-pothes...
This paper considers the problem of learning optimal discriminant functions for pattern classificati...
Learning algorithms for feedforward connectionist systems in a reinforcement learning environment ar...
This paper analyses the behaviour of a general class of learning automata algorithms for feedforward...
A model made of units of teams of learning automata is developed for the three layer pattern classif...
A feedforward network composed of units of teams of parametrised learning autmata is considered as a...
Analyzes the long-term behavior of the REINFORCE and related algorithms (Williams, 1986, 1988, 1992)...
A feedforward network composed of units of teams of parameterized learning automata is considered as...
The problem of learning using connectionist networks, in which network connection strengths are modi...
This thesis studies the use of connectionist algorithms for solving reinforcement learning problems....
Reinforcement learning algorithms comprise a class of learning algorithms for neural networks. Reinf...
The difficulties of learning in multilayered networks of computational units has limited the use of ...
Weak convergence methods are used to analyse generalized learning automata algorithms. The REINFORCE...
We consider stochastic automata models of learning systems in this article. Such learning automata s...
Any non-associative reinforcement learning algorithm can be viewed as a method for performing functi...
A key open problem in reinforcement learning is to assure convergence when using a compact hy-pothes...
This paper considers the problem of learning optimal discriminant functions for pattern classificati...
Learning algorithms for feedforward connectionist systems in a reinforcement learning environment ar...
This paper analyses the behaviour of a general class of learning automata algorithms for feedforward...
A model made of units of teams of learning automata is developed for the three layer pattern classif...
A feedforward network composed of units of teams of parametrised learning autmata is considered as a...
Analyzes the long-term behavior of the REINFORCE and related algorithms (Williams, 1986, 1988, 1992)...
A feedforward network composed of units of teams of parameterized learning automata is considered as...
The problem of learning using connectionist networks, in which network connection strengths are modi...
This thesis studies the use of connectionist algorithms for solving reinforcement learning problems....
Reinforcement learning algorithms comprise a class of learning algorithms for neural networks. Reinf...
The difficulties of learning in multilayered networks of computational units has limited the use of ...
Weak convergence methods are used to analyse generalized learning automata algorithms. The REINFORCE...
We consider stochastic automata models of learning systems in this article. Such learning automata s...
Any non-associative reinforcement learning algorithm can be viewed as a method for performing functi...
A key open problem in reinforcement learning is to assure convergence when using a compact hy-pothes...
This paper considers the problem of learning optimal discriminant functions for pattern classificati...