Paper
1 September 1990 Multistaged neural network architecture for position invariant shape recognition
Jay I. Minnix, Eugene S. McVey, Rafael M. Inigo
Author Affiliations +
Proceedings Volume 1360, Visual Communications and Image Processing '90: Fifth in a Series; (1990) https://doi.org/10.1117/12.24106
Event: Visual Communications and Image Processing '90, 1990, Lausanne, Switzerland
Abstract
This paper presents a pattern recognition system that self-organizes to recognize objects by shape as part of an Integrated Visual Network (IVN) for autonomous flight control. The system uses a multistaged hierarchical neural network that exhibits insensitivity to the location of the object in the visual field. The network's three layers perform the functionally disjoint tasks of preprocessing (dynamic thresholding), invariance (position normalization), and recognition (identification of the shape). The Preprocessing stage uses a single layer of elements to dynamically threshold the grey level input image into a binary image. The Invariance stage is a multilayered neural network implementation of a modified Walsh-Hadamard transform that generates a representation of the object that is invariant with respect to the object’s position. The Recognition stage is a modified version of Fukushima's Neocognitron that identifies the position normalized representation by shape. The inclusion of the Preprocessing and Invariance stages allows reduction of the massively replicated processing structures used for translation invariance in the Neocognitron. This system offers roughly the same translation invariance capabilities as the Neocognitron with a dramatic reduction in the number of elements and the network's interconnection complexity.
© (1990) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jay I. Minnix, Eugene S. McVey, and Rafael M. Inigo "Multistaged neural network architecture for position invariant shape recognition", Proc. SPIE 1360, Visual Communications and Image Processing '90: Fifth in a Series, (1 September 1990); https://doi.org/10.1117/12.24106
Lens.org Logo
CITATIONS
Cited by 4 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image processing

Neural networks

Binary data

Visual communications

Chemical elements

Pattern recognition

Visualization

Back to Top