
|
Joint CS/ISE Seminar
Monday, November 6 Some Recent Advances in Near-neighbor Statistical LearningDr. Maya GuptaAssistant Professor of Electrical Engineering Adjunct Assistant Professor of Applied Mathematics University of Washington AbstractClassification is the problem of labeling an unknown test sample given a database of learned samples. For example, one might classify a patient as having diabetes, or classify an utterance as the digit "one". Arguably the most intuitive approach to classification is to find a sample in the database that is most similar to the test sample, and give it the same label. This approach is known as nearest-neighbor classification. In this talk, we discuss some recent advances in nearest-neighbor learning, in particular how to weight a set of nearest-neighbors to form a weighted vote about the class label, and how to choose a set of nearest-neighbors from the database. Specifically, it is shown that weights that solve linear interpolation equations minimize the first-order learning error, and when coupled with the principle of maximum entropy this results in significantly improved nearest-neighbor classification performance. New approaches to adaptively determining neighborhoods for local learning that enclose the test point are also discussed. A number of applications are discussed briefly, including finding corrosion in pipelines, classifying audio signals, and color management. Speaker BioMaya Gupta completed her Ph.D. in Electrical Engineering in 2003 at Stanford University as a National Science Foundation Graduate Fellow. Her undergraduate studies led to a BS in Electrical Engineering and a BA in Economics from Rice University in 1997. From 1999-2003 she worked for Ricoh's California Research Center as a color image processing research engineer. In the fall of 2003, she joined the EE faculty of the University of Washington as an Assistant Professor. More information about her research is available at her group's webpage: Information Design Lab. |