Collaborative Research: SEI: Graph-based Mining of Public Health Data

Information

  • NSF Award
  • 0506635
Owner
  • Award Id
    0506635
  • Award Effective Date
    7/15/2005 - 18 years ago
  • Award Expiration Date
    6/30/2009 - 15 years ago
  • Award Amount
    $ 126,506.00
  • Award Instrument
    Standard Grant

Collaborative Research: SEI: Graph-based Mining of Public Health Data

Automated analysis of public health data represents a critical need, but effective analysis must look beyond individual data points. Much of the data that is collected is structural, consisting not only of entities but also of relationships (e.g., spatial,temporal) between the entities. As a result, a need exists to develop methods for discovering knowledge and learning concepts specifically for this type of structural data. A graph-based data mining technique that can perform pattern discovery, concept learning, and hierarchical clustering on data represented as graphs. This approach, implemented in the Subdue system, has demonstrated success in a numbeof scientific and industrial databases. The proposed effort will investigate the viability ofgraph-based data mining approach as a foundation for representing and ministructural data found in public health databases and related applications.<br/><br/>The effort will contribute 1) an analysis of public health data that explores data<br/>points, relationships between the data points, and integration of data from related<br/>domains to strengthen the results, 2) design of a graph-based mining system that can handle streaming data in an online fashion, 3) development of a new approach to concept learning that processes training examples embedded n a single interconnected graph, and 4) construction of a toolset that can provide early detection and assessment of epidemics and other public health crises.<br/>The project depends on a strong partnership between computer scientists and an expert in public health. A collaboration between the University of Texas at Arlington and the University of North Texas Health Science Center has already received initial support from the two schools The collaboration will be fostered through monthly seminars and research meetings. The results of this project will thus have an impact on the computerscience community and an equal, if not greater, impact on the domain community The code and data will be available for general dissemination over the Internet, and results will be integrated into the classroom and into a book on graph-based data mining.

  • Program Officer
    Sylvia J. Spengler
  • Min Amd Letter Date
    6/21/2005 - 19 years ago
  • Max Amd Letter Date
    6/25/2008 - 16 years ago
  • ARRA Amount

Institutions

  • Name
    University of North Texas Health Science Center at Fort Worth
  • City
    Fort Worth
  • State
    TX
  • Country
    United States
  • Address
    3500 Camp Bowie Blvd.
  • Postal Code
    761072699
  • Phone Number
    8177355073

Investigators

  • First Name
    Karan
  • Last Name
    Singh
  • Email Address
    ksingh@hsc.unt.edu
  • Start Date
    6/21/2005 12:00:00 AM

FOA Information

  • Name
    Information Systems
  • Code
    104000

Program Element

  • Text
    SCIENCE & ENGINEERING INFORMAT
  • Code
    7294

Program Reference

  • Text
    SCIENCE & ENGINEERING INFORMAT
  • Code
    7294
  • Text
    ADVANCED SOFTWARE TECH & ALGOR
  • Code
    9216
  • Text
    HIGH PERFORMANCE COMPUTING & COMM