Collaborative Research: Joint Analysis of Correlated Data

Information

  • NSF Award
  • 1521583
Owner
  • Award Id
    1521583
  • Award Effective Date
    9/15/2015
  • Award Expiration Date
    11/30/2016
  • Award Amount
    $ 109,920.00
  • Award Instrument
    Standard Grant

Collaborative Research: Joint Analysis of Correlated Data

Across science, engineering, medicine and business we face a deluge of data coming from sensors, from simulations, or from the activities of myriads of individuals on the Internet. Furthermore, the data sets we collect are frequently highly inter-correlated, reflecting information about the same or similar/related entities in the world, or echoing semantically important repetitions/symmetries or hierarchical structures common to both man-made and natural objects. This project will assist scientists and engineers working with correlated data sets in getting the most information and value out of their data. Key to the approach is the idea of joint data analysis, the notion that each piece of data is best understood not in isolation but in the context provided by its peers and partners in a collection of related data sets, using the web of relationships referred to above. The key aim is to complement the social networks of scientists and engineers as they exist today with parallel networks that interlink the data they base their work on, using domain-specific semantic links and aiming at mechanisms that allow algorithmic transport of information between data used by scientists working in the same domain. The resulting system amplifies scientific insights by allowing one scientist's observation on one piece of data to be transported automatically to other relevant data sets and aggregated. It also enables the automated discovery of shared structures or common abstractions that can inform multiple data sets.

To accomplish this joint analysis, the project interconnects data sets into networks along which information can be transported and aggregated. These data set links are based on efficient matching algorithms using domain-specific features. In this setting, the matchings (or maps) are used not to estimate distances or similarities but to build operators that can transport information between different data sets. The research team will exploit a functional analytic framework in which information is encoded as functions over the data, so that maps between data sets become linear operators. This enables the use of many powerful tools from linear algebra and optimization. Drawing inspiration from homological algebra, the team will join multiple related data sets into networks connected through these operators in a way that allows information transport, correction, and aggregation, with the ultimate goal of using the "wisdom of the collection" to provide as much information as possible for specific data sets to specific scientists.
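
The idea of encoding information as functions over the data and transporting it with linear operators can be illustrated with a minimal sketch. This is not the project's implementation: the three toy data sets, the point matchings `match_ab` and `match_bc`, and the helper `map_from_correspondence` are all hypothetical, and simple averaging stands in for whatever aggregation the project actually uses.

```python
import numpy as np

def map_from_correspondence(match, n_source, n_target):
    """Hypothetical helper: turn a point matching into a linear operator.

    match[i] is the index of the target point matched to source point i.
    The returned matrix T (n_target x n_source) sends a function on the
    source data set (a vector of per-point values) to the target data set.
    """
    T = np.zeros((n_target, n_source))
    T[match, np.arange(n_source)] = 1.0
    return T

# Three toy data sets A, B, C with 4 points each (assumed matchings).
match_ab = np.array([1, 2, 3, 0])  # A -> B
match_bc = np.array([2, 3, 0, 1])  # B -> C

T_ab = map_from_correspondence(match_ab, 4, 4)
T_bc = map_from_correspondence(match_bc, 4, 4)

# An observation on A, encoded as an indicator function on point 0.
f_a = np.array([1.0, 0.0, 0.0, 0.0])

# Transport along the network: A -> B, then B -> C.
f_b = T_ab @ f_a
f_c = T_bc @ f_b

# Because the maps are linear operators, a path in the network is a
# matrix product, so A -> C can be applied in a single step.
T_ac = T_bc @ T_ab

# A second observation made directly on B (indicator on point 1).
g_b = np.array([0.0, 1.0, 0.0, 0.0])

# Aggregate both observations on C by averaging the transported functions.
consensus_c = np.mean([T_ac @ f_a, T_bc @ g_b], axis=0)

print(f_b, f_c, consensus_c)
```

Representing the matchings as matrices is what makes composition, averaging, and other linear-algebra operations available; with noisy real-world matchings the operators would generally not be exact permutations as they are in this toy case.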

  • Program Officer
    Leland M. Jameson
  • Min Amd Letter Date
    9/9/2015
  • Max Amd Letter Date
    9/9/2015
  • ARRA Amount

Institutions

  • Name
    Toyota Technological Institute at Chicago
  • City
    Chicago
  • State
    IL
  • Country
    United States
  • Address
    6045 S. Kenwood Avenue
  • Postal Code
    60637-2902
  • Phone Number
    7738340409

Investigators

  • First Name
    Qixing
  • Last Name
    Huang
  • Email Address
    huangqx@cs.utexas.edu
  • Start Date
    9/9/2015 12:00:00 AM

Program Element

  • Text
    CDS&E-MSS
  • Code
    8069

Program Reference

  • Text
    CyberInfra Frmwrk 21st (CIF21)
  • Code
    7433
  • Text
    CDS&E
  • Code
    8084
  • Text
    COMPUTATIONAL SCIENCE & ENGING
  • Code
    9263