Many techniques have been developed in recent years to speed up the training of Multilayer Perceptrons (MLPs), including accelerated versions of gradient descent, second-order methods, Kalman-related algorithms, and block learning methods. Among these, block methods offer a reasonable way of reducing training effort while still obtaining good results with MLPs. However, they usually find suboptimal solutions and sometimes produce excessively large weight values. In this paper, we analyze the drawbacks of these block methods using a very simple test model, and we propose several modifications: discarding samples (DS-LSB), using controlled perturbation (CP-LSB), and a Reduced Sensitivity version (RS-LSB). We also extend these methods with some special characteristics: a robust training algorithm relying on Total Least Squares (TLS) minimizations (the so-called RS-TLS algorithm), useful for noisy training patterns, and a method for training MLPs with a general output cost function (the RS-WLSB algorithm). The advantages of these methods over related algorithms are illustrated on several test problems. Finally, we draw some conclusions and propose, as future work, the development of recursive implementations, able to learn on-line and to deal with non-stationary problems, and the application of block training methods to recurrent networks.
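The abstract's RS-TLS variant builds on the standard Total Least Squares solution, which, unlike ordinary least squares, accounts for noise in both the inputs and the targets. As a minimal sketch of that underlying TLS step (not the paper's full RS-TLS algorithm, whose details are not given here), the classical SVD-based solution of an overdetermined system A x ≈ b looks like this:

```python
import numpy as np

def tls_solve(A, b):
    """Total Least Squares solution of A @ x ≈ b via the SVD.

    Ordinary least squares assumes only b is noisy; TLS allows
    perturbations in both A and b, which is why it is suited to
    noisy training patterns.
    """
    n = A.shape[1]
    # SVD of the augmented matrix [A | b]
    _, _, Vt = np.linalg.svd(np.column_stack([A, b]))
    v = Vt[-1]            # right singular vector of the smallest singular value
    # The TLS estimate follows from the null-space direction [x; -1]
    return -v[:n] / v[n]

# Usage sketch: recover coefficients from a consistent system
rng = np.random.default_rng(0)
A = rng.normal(size=(50, 3))
x_true = np.array([1.0, -2.0, 0.5])
x_hat = tls_solve(A, A @ x_true)
```

When the system is exactly consistent, the TLS and ordinary least-squares solutions coincide; the two diverge once A itself is perturbed.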