Department of Computer Science Faculty Scholarship and Creative Works

Component Selection to Optimize Distance Function Learning in Complex Scientific Data Sets

Aparna Varde, Montclair State University
Stephen Bique, Worcester Polytechnic Institute
Elke Rundensteiner, Worcester Polytechnic Institute
David Brown, Virginia State University
Jianyu Liang, Worcester Polytechnic Institute
Richard D. Sisson, Worcester Polytechnic Institute
Ehsan Sheybani, Virginia State University
Brian Sayre, Virginia State University

Document Type

Conference Proceeding

Publication Date

10-6-2008

Abstract

Analyzing complex scientific data, e.g., graphs and images, often requires comparing features such as regions on graphs, visual aspects of images, and related metadata, with some features being more important than others. The notion of similarity for comparison is typically the distance between data objects, which can be expressed as distance between features. We refer to the distance based on each feature as a component. Weights of components, representing the relative importance of features, can be learned using distance function learning algorithms. However, it is seldom known which components optimize learning, given criteria such as accuracy, efficiency and simplicity. This is the problem we address. We propose and theoretically compare four component selection approaches: Maximal Path Traversal, Minimal Path Traversal, Maximal Path Traversal with Pruning and Minimal Path Traversal with Pruning. Experimental evaluation is conducted using real data from Materials Science, Nanotechnology and Bioinformatics. A trademarked software tool is developed as a highlight of this work.
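
To make the idea of components and weights concrete, here is a minimal sketch of a distance built from per-feature components. It assumes the overall distance is a weighted sum of component distances, which the abstract does not state explicitly. The feature names (`curve`, `peak`), the helper functions, and the weight values are all hypothetical illustrations, not the authors' method or tool, and the four selection approaches named above are not implemented here.

```python
from math import sqrt
from typing import Callable, Dict

# A component maps two data objects to a non-negative distance
# based on a single feature. (Hypothetical type for illustration.)
Component = Callable[[dict, dict], float]


def curve_distance(a: dict, b: dict) -> float:
    """Euclidean distance between two equally sampled curves (assumed feature)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a["curve"], b["curve"])))


def peak_distance(a: dict, b: dict) -> float:
    """Absolute difference between two scalar peak values (assumed feature)."""
    return abs(a["peak"] - b["peak"])


# Registry of all available components (illustrative names).
COMPONENTS: Dict[str, Component] = {
    "curve": curve_distance,
    "peak": peak_distance,
}


def weighted_distance(a: dict, b: dict, weights: Dict[str, float]) -> float:
    """Weighted sum over the selected components only.

    Assumption: the overall distance is a weighted sum of component
    distances. Selecting a subset of components means passing weights
    for only those components.
    """
    return sum(w * COMPONENTS[name](a, b) for name, w in weights.items())


obj_a = {"curve": [0.0, 0.0], "peak": 1.0}
obj_b = {"curve": [3.0, 4.0], "peak": 3.0}

# Both components selected, with hypothetical weights.
d_full = weighted_distance(obj_a, obj_b, {"curve": 0.5, "peak": 2.0})

# Only the curve component selected.
d_curve_only = weighted_distance(obj_a, obj_b, {"curve": 0.5})
```

Under this assumption, component selection amounts to choosing which keys appear in `weights` before a learning algorithm tunes their values. Fewer components give a simpler and cheaper distance, possibly at some cost in accuracy.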

DOI

10.1007/978-3-540-85654-2_27

Montclair State University Digital Commons Citation

Varde, Aparna; Bique, Stephen; Rundensteiner, Elke; Brown, David; Liang, Jianyu; Sisson, Richard D.; Sheybani, Ehsan; and Sayre, Brian, "Component Selection to Optimize Distance Function Learning in Complex Scientific Data Sets" (2008). Department of Computer Science Faculty Scholarship and Creative Works. 177.
https://digitalcommons.montclair.edu/compusci-facpubs/177

This document is currently not available here.
