APL-UW

Lane Owsley

Principal Engineer

Email

lane@apl.washington.edu

Phone

206-685-3592

Research Interests

Image, Array, and Speech Processing, Feature Extraction and Classification

Department Affiliation

Environmental & Information Systems

Education

B.S. Engineering, Brown University, 1991

M.S. Electrical Engineering, University of Washington, 1994

Ph.D. Electrical Engineering, University of Washington, 1998

Publications

2000-present and while at APL-UW

A technique for adjusting Gaussian mixture model weights that improves speaker identification performance in the presence of phonemic train/test mismatch

McLaughlin, J., and L. Owsley, "A technique for adjusting Gaussian mixture model weights that improves speaker identification performance in the presence of phonemic train/test mismatch," J. Acoust. Soc. Am., 129, 2423, doi:10.1121/1.3587917, 2011.

More Info

1 Apr 2011

Speaker identification is complicated by cases where training material is phonemically deficient. Misclassifications can result either because subsequent test material from that speaker contains primarily the phonemes missing from the training data or because that test material is phonemically most consistent with another talker's model. This situation can arise in any dialog where, for reasons of brevity and clarity, conventions must be imposed on phraseology. We present here a technique for detecting phonemic deficiencies in a speaker model, and then correcting that model to partially compensate for the biased training data. This technique relies upon a specially constructed universal background model (UBM) from which speaker models are adapted. This UBM is formed by weighting several dozen phoneme GMMs using EM training. As a result, each Gaussian component of the UBM (and of the resulting speaker models) corresponds to a specific phoneme. Analysis of the speaker model weights reveals whether the training data had the typical phonemic variety found in ordinary speech, and if it did not, the weights are adjusted. Using a specially designed corpus created from the TIMIT utterances, we show that this reweighting technique improves performance over non-reweighted models. Results are also given for the Air Traffic Control Corpus.

Modeling acoustic target response by component

Owsley, L., and J. McLaughlin, "Modeling acoustic target response by component," In Proceedings, MTS/IEEE OCEANS 2010, Seattle, 20-23 September, doi:10.1109/OCEANS.2010.5664273 (MTS/IEEE, 2010).

More Info

20 Sep 2010

The scattered acoustic response of underwater objects due to active interrogation has been studied for decades for use in detection and classification applications. As a means of detection, fielded applications date back nearly a hundred years. However, use of responses for robust automated classification has lagged behind, particularly when the internal structure of the objects is of key importance and when the objects may be partially or fully buried. Analytic solutions for simple geometries have provided much understanding of certain physical mechanisms, but transfer to complex structures of practical importance has proven difficult.

In recent decades, finite element (FE) modeling has provided a method of accurate simulation of many structures previously considered intractable. However, simulation of such complex objects produces equally complex returns, with the result that the models are often simply considered as a "black box" where the physical interpretation of the response components is tenuous at best. Thus the state of the art is still short of a method for development of robust classification systems for complex objects based on the physics of the objects of interest and the varied conditions under which they may be found. This paper introduces an effort to use FE techniques to simulate individual components of a return by "turning off" most aspects of the physics and allowing the researcher to isolate one mechanism at a time. The goal is a true physical understanding of the complete response, a physically justifiable feature set for classification, and a much simpler path to environmental robustness.

Using speed technology to enhance isotope ID and classification

Owsley, L.M.D., J.J. McLaughlin, L.G. Cazzanti, and S.R. Salaymeh, "Using speed technology to enhance isotope ID and classification," Conference Record, Nuclear Science Symposium, Orlando, 24 October-1 November, 629-635, doi:10.1109/NSSMIC.2009.5402002 (IEEE, 2009).

More Info

24 Oct 2009

Scientific advances are often made when researchers identify mathematical or physical commonalities between different fields and are able to apply mature techniques or algorithms developed in one field to another field which shares some of the same challenges. The authors of this paper have identified similarities between the unsolved problems faced in gamma-spectroscopy for automated radioisotope identification and the challenges of the much larger body of research in speech processing. In this paper we describe such commonalities and use them as a motivation for a preliminary investigation of the applicability of speech processing methods to gamma-ray spectra. This approach enables the development of proof-of-concept isotope classifiers, whose performance is presented for both simulated and field-collected gamma-ray spectra.

More Publications

Close

 

Close