EAGER: Infrastructure for Research Data Registration and Interpretation

Information

NSF Award
1349985

Owner

Corporation for National Research Initiatives

Award Id
1349985
Award Effective Date
9/15/2013 - 11 years ago
Award Expiration Date
8/31/2014 - 10 years ago
Award Amount
$ 99,986.00
Award Instrument
Standard Grant

Information

EAGER: Infrastructure for Research Data Registration and Interpretation

CNRI is proposing to develop a framework and a related set of infrastructural tools that will greatly improve the ability of research organizations to register scientific data sets, either those they hold directly or those which they have funded and which are held elsewhere, and expose them for discovery, analysis, and further processing. The project will build the tools and low-level APIs to be suitable for use by data producers as well as organizations that are expert in metadata and data organization. There is no widely adopted infrastructure currently in place for sharing research data. Individual pieces certainly exist, especially within given domains, but transparent and seamless sharing of scientific data requires a level of standardization and acceptance that simply doesn't exist today. To realize the potential of widely available scientific data, it must be discoverable, reference-able, and understandable, and it must be so without the investment of enormous amounts of time and effort on the part of those who are providing the data or those consuming the data. Research institutions currently expose their data through institution-specific web sites and APIs. The PI propose to build a pair of registries that will enable the use of a common API as well as the ability to federate registries across institutions when it makes sense, without requiring the existing underlying storage and management systems to change. We also propose to design basic metadata schemas to be used in those registries. The first of the two registries is a metadata registry in which data sets can be registered and described. A common API will be built both for the registration process as well as for access to the resulting metadata objects. Each metadata object and, if required, each data set, will be given a unique, persistent identifier. These identifiers will resolve to the metadata objects and data sets respectively and their assignment will be part of the deposit API. We will also enable related objects to be associated with each other through the registry and through identifier resolution, depending on the specific cases in hand. This will be transparent to users of the access API. The second of the two registries is a type registry. The metadata objects and data sets will each be typed and the type registry will provide the information needed to decipher those types. The goal is to be able to answer the question of, given a specific identifier or piece of data, what does it represent and how should I interpret it. This interaction will be made as transparent as possible to the access API. The interaction between these two registries is key to the proposed framework. The proposed deliverables will include an open source release of the metadata registry and the type registry software, the basic metadata schemas applicable for those registries, and a prototype service that demonstrates the infrastructure capability by federating research data from at least two sources.

Program Officer
Robert Chadduck
Min Amd Letter Date
9/4/2013 - 11 years ago
Max Amd Letter Date
9/4/2013 - 11 years ago
ARRA Amount

Institutions

Name
Corporation for National Research Initiatives (NRI)
City
Reston
State
VA
Country
United States
Address
1895 Preston White Drive
Postal Code
201915434
Phone Number
7036208990

Investigators

First Name
Laurence
Last Name
Lannom
Email Address
llannom@cnri.reston.va.us
Start Date
9/4/2013 12:00:00 AM

First Name
Giridhar
Last Name
Manepalli
Email Address
gmanepalli@cnri.reston.va.us
Start Date
9/4/2013 12:00:00 AM

Program Element

Text
INFORMATION TECHNOLOGY RESEARC
Code
1640

Program Reference

Text
EAGER
Code
7916

Text
INFORMATION TECHNOLOGY RESEARC
Code
1640

EAGER: Infrastructure for Research Data Registration and Interpretation

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

EAGER: Infrastructure for Research Data Registration and Interpretation

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

First Name

Last Name

Email Address

Start Date

Program Element

Text

Code

Program Reference

Text

Code

Text

Code