CIF21 DIBBs: User Driven Architecture for Data Discovery

Information

NSF Award
1443070

Owner

Corporation for National Research Initiatives

Award Id
1443070
Award Effective Date
9/1/2014
Award Expiration Date
8/31/2017
Award Amount
$ 1,484,940.00
Award Instrument
Standard Grant

The number, size, and availability of scientific datasets have grown enormously over the last few years. As scientific activity becomes more data intensive and collaborative, a key challenge for cross-disciplinary research will be discovery of diverse data sets, managed within distributed repositories and registries. Currently, discovery of information on the Internet is largely performed through automated approaches, characterized by web crawling and associated algorithms, or labor intensive indexing and categorization, such as the National Library of Medicine index for medical literature. There are significant amounts of data housed in repositories where only researchers with expertise in the specific field know and access the data.

This project builds a user driven architecture for data discovery (UDADD), a capability that enhances discovery of scientific datasets by building a global index from diverse communities with minimal input. In the UDADD approach user actions, such as dataset queries or downloads, drive the construction of a global index. These actions are recorded and gathered automatically, through cooperation with repository managers. Two software plugins are provided to help the repositories interact with the UDADD system. The architecture includes ranking techniques based on frequency and recency of use of the datasets.

The pilot architecture will be demonstrated and evaluated using cooperating repositories within the DataNet Federation Consortium. Currently, six science and engineering communities participate in the consortium, including national scale projects in oceanography, social science, cognitive science, hydrology, engineering, and plant biology.
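The abstract mentions ranking datasets by frequency and recency of use but does not say how the two are combined. The following is only an illustrative sketch, not the UDADD implementation: it assumes each recorded user action (a query or download) is a `(dataset_id, day)` pair and that an action's weight halves every `half_life_days`. The function names, the event format, and the 30-day half-life are all hypothetical.

```python
from collections import defaultdict

# Hypothetical parameter: an action loses half its weight after this many days.
HALF_LIFE_DAYS = 30.0


def score_datasets(events, now, half_life_days=HALF_LIFE_DAYS):
    """Sum exponentially decayed weights of usage events per dataset.

    events: iterable of (dataset_id, day) pairs, where day is a number of
            days on the same clock as `now` (an assumed format).
    Returns a dict mapping dataset_id to its score.
    """
    scores = defaultdict(float)
    for dataset_id, day in events:
        age = max(0.0, now - day)  # events dated in the future count as age 0
        scores[dataset_id] += 0.5 ** (age / half_life_days)
    return dict(scores)


def rank_datasets(events, now, half_life_days=HALF_LIFE_DAYS):
    """Return dataset ids ordered by descending score, ties broken by id."""
    scores = score_datasets(events, now, half_life_days)
    return sorted(scores, key=lambda d: (-scores[d], d))
```

Under these assumptions, frequent use raises a dataset's score because each action adds a term, while recent use raises it because newer actions decay less. A dataset used twice long ago can therefore rank below one used once today.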

Program Officer
Amy Walton
Min Amd Letter Date
8/18/2014
Max Amd Letter Date
8/18/2014
ARRA Amount

Institutions

Name
Corporation for National Research Initiatives (NRI)
City
Reston
State
VA
Country
United States
Address
1895 Preston White Drive
Postal Code
201915434
Phone Number
7036208990

Investigators

First Name
Allison
Last Name
Powell
Email Address
apowell@cnri.reston.va.us
Start Date
8/18/2014 12:00:00 AM

First Name
Laurence
Last Name
Lannom
Email Address
llannom@cnri.reston.va.us
Start Date
8/18/2014 12:00:00 AM

First Name
Giridhar
Last Name
Manepalli
Email Address
gmanepalli@cnri.reston.va.us
Start Date
8/18/2014 12:00:00 AM

Program Element

Text
INFO INTEGRATION & INFORMATICS
Code
7364

Text
DATANET
Code
7726

Program Reference

Text
CyberInfra Frmwrk 21st (CIF21)
Code
7433

Text
Data Infrstr Bldg Blocks-DIBBs
Code
8048

Text
Big Data Science & Engineering
Code
8083
