Naoki Saito, "Local Feature Extraction and Its Applications Using a Library of Bases," Ph.D., Department of Mathematics, Yale University, New Haven, CT, December 1994.
Advisor: Ronald R. Coifman
Extracting relevant features from signals is important for signal analysis such as compression, noise removal, classification, or regression (prediction). Often, important features for these problems, such as edges, spikes, or transients, are characterized by local information in the time (space) domain and the frequency (wave number) domain.
The conventional techniques are not efficient for extracting features
localized simultaneously in the time and frequency domains.
These methods include:
the Fourier transform for signal/noise separation,
the Karhunen-Loève transform for compression,
and the linear discriminant analysis for classification.
The features extracted by these methods are of global nature
either in time or in frequency domain so that the interpretation
of the results may not be straightforward.
Moreover, some of them require solving the eigenvalue systems
so that they are fragile to outliers or perturbations
and are computationally expensive, i.e.,
, where n is a
dimensionality of a signal.
The approach explored here is guided by the best-basis paradigm which consists of three steps: 1) select a ``best'' basis (or coordinate system) for the problem at hand from a library of bases (a fixed yet flexible set of bases consisting of wavelets, wavelet packets, local trigonometric bases, and the autocorrelation functions of wavelets), 2) sort the coordinates (features) by ``importance'' for the problem at hand and discard ``unimportant'' coordinates, and 3) use these survived coordinates to solve the problem at hand. What is ``best'' and ``important'' clearly depends on the problem: for example, minimizing a description length (or entropy) is important for signal compression whereas maximizing class separation (or relative entropy among classes) is important for classification.
These bases ``fill the gap'' between the standard Euclidean
basis and the Fourier basis so that they can capture the local features
and provide an array of tools unifying the conventional techniques.
Moreover, these tools provide efficient numerical algorithms, e.g.,
, where p=0,1,2, depending on the basis.
In this thesis, these methods have been applied usefully to a variety of
problems: simultaneous noise suppression and signal compression,
classification, regression, multiscale edge detection and representation,
and extraction of geological information from acoustic waveforms.