Distributed Information Management in the National HPCC Software Exchange

Copyright © 1995 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that new copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted.

To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request Permissions from Publications Dept, ACM Inc., Fax +1 (212) 869-0481, or permissions@acm.org.

Shirley Browne
(Corresponding Author)
University of Tennessee
107 Ayres Hall
Knoxville, TN 37996-1301
Office: 615-974-5886, FAX: 615-974-8296
http://www.cs.utk.edu/~browne/

browne@cs.utk.edu

Jack Dongarra
(Presenting Author)
University of Tennessee and Oak Ridge National Laboratory
http://www.netlib.org/utk/people/JackDongarra.html

dongarra@cs.utk.edu

Geoffrey C. Fox
Syracuse University
http://www.npac.syr.edu/users/gcf/homepage/index.html

gcf@npac.syr.edu

Ken Hawick
Syracuse University
http://www.npac.syr.edu/users/hawick/homepage/index.html

hawick@npac.syr.edu

Ken Kennedy
Rice University
http://www.cs.rice.edu/CS/faculty/ken.html

ken@cs.rice.edu

Rick Stevens
Argonne National Laboratory
http://www.mcs.anl.gov/people/stevens/

stevens@mcs.anl.gov

Robert Olson
Argonne National Laboratory
http://www.mcs.anl.gov/people/olson/

olson@mcs.anl.gov

Tom Rowan
Oak Ridge National Laboratory and University of Tennessee
http://www.epm.ornl.gov/~rowan/

rowan@msr.epm.ornl.gov

Keywords:
information management, information retrieval, HPCC, high performance computing, software repository

Abstract

The National HPCC Software Exchange (NHSE) is a collaborative effort by member institutions of the Center for Research on Parallel Computation to provide network access to HPCC-related software, documents, and data. Challenges for the NHSE include identifying, organizing, filtering, and indexing the rapidly growing wealth of relevant information available on the Web. The large quantity of information necessitates performing these tasks with automatic techniques, many of which make use of parallel and distributed computation, but human intervention is still needed for intelligent abstracting, analysis, and critical review. Thus, major goals of NHSE research are to find the right mix of manual and automated techniques and to leverage the results of manual efforts to the maximum extent possible. This paper describes our current information gathering and processing techniques, as well as our future plans for integrating the manual and automated approaches. The NHSE home page is accessible at http://www.netlib.org/nhse/.