CIF21 DIBBs: User Driven Architecture for Data Discovery

Information

  • NSF Award
  • 1443070
Owner
  • Award Id
    1443070
  • Award Effective Date
    9/1/2014 - 10 years ago
  • Award Expiration Date
    8/31/2017 - 7 years ago
  • Award Amount
    $ 1,484,940.00
  • Award Instrument
    Standard Grant

CIF21 DIBBs: User Driven Architecture for Data Discovery

The number, size, and availability of scientific datasets have grown enormously over the last few years. As scientific activity becomes more data intensive and collaborative, a key challenge for cross-disciplinary research will be discovery of diverse data sets, managed within distributed repositories and registries. Currently, discovery of information on the Internet is largely performed through automated approaches, characterized by web crawling and associated algorithms, or labor intensive indexing and categorization, such as the National Library of Medicine index for medical literature. There are significant amounts of data housed in repositories where only researchers with expertise in the specific field know and access the data.<br/><br/>This project builds a user driven architecture for data discovery (UDADD), a capability that enhances discovery of scientific datasets by building a global index from diverse communities with minimal input. In the UDADD approach user actions, such as dataset queries or downloads, drive the construction of a global index. These actions are recorded and gathered automatically, through cooperation with repository managers. Two software plugins are provided to help the repositories interact with the UDADD system. The architecture includes ranking techniques based on frequency and recency of use of the datasets. <br/><br/>The pilot architecture will be demonstrated and evaluated using cooperating repositories within the DataNet Federation Consortium. Currently, six science and engineering communities participate in the consortium, including national scale projects in oceanography, social science, cognitive science, hydrology, engineering, and plant biology.

  • Program Officer
    Amy Walton
  • Min Amd Letter Date
    8/18/2014 - 10 years ago
  • Max Amd Letter Date
    8/18/2014 - 10 years ago
  • ARRA Amount

Institutions

  • Name
    Corporation for National Research Initiatives (NRI)
  • City
    Reston
  • State
    VA
  • Country
    United States
  • Address
    1895 Preston White Drive
  • Postal Code
    201915434
  • Phone Number
    7036208990

Investigators

  • First Name
    Allison
  • Last Name
    Powell
  • Email Address
    apowell@cnri.reston.va.us
  • Start Date
    8/18/2014 12:00:00 AM
  • First Name
    Laurence
  • Last Name
    Lannom
  • Email Address
    llannom@cnri.reston.va.us
  • Start Date
    8/18/2014 12:00:00 AM
  • First Name
    Giridhar
  • Last Name
    Manepalli
  • Email Address
    gmanepalli@cnri.reston.va.us
  • Start Date
    8/18/2014 12:00:00 AM

Program Element

  • Text
    INFO INTEGRATION & INFORMATICS
  • Code
    7364
  • Text
    DATANET
  • Code
    7726

Program Reference

  • Text
    CyberInfra Frmwrk 21st (CIF21)
  • Code
    7433
  • Text
    Data Infrstr Bldg Blocks-DIBBs
  • Code
    8048
  • Text
    Big Data Science &Engineering
  • Code
    8083