EAGER: PBI: Designing Public Use Microdata Business Areas (PUMBAs) for Innovation

Information

  • NSF Award
  • 2430732
Owner
  • Award Id
    2430732
  • Award Effective Date
    10/1/2024 - 10 months ago
  • Award Expiration Date
    9/30/2026 - a year from now
  • Award Amount
    $ 299,999.00
  • Award Instrument
    Standard Grant

EAGER: PBI: Designing Public Use Microdata Business Areas (PUMBAs) for Innovation

Currently, there is unprecedented investment in regional and place-based innovation ecosystems through programs such as the National Science Foundation’s Regional Innovation Engines (hereafter, NSF Engines), the Economic Development Administration’s Tech Hubs, and the U.S. Department of Energy’s Hydrogen Hubs. A key constraint from a program evaluation perspective is the lack of regional and place-based baseline data against which to measure societal and economic impact (e.g., number and share of companies that currently conduct research and development (R&D); number and share of companies that report innovation activity; ownership of firms by sex, ethnicity, race, and veteran status). This project seeks to develop and validate a process for generating new geographic subdivisions well-suited for business microdata such as the Annual Business Survey (ABS). This would be analogous to the Public Use Microdata Areas (PUMAs) developed by the Census Bureau for the American Community Survey (ACS). Release of geographic definitions of these regions and related microdata or summary statistics will allow researchers and policymakers access to these data for place-based innovation.<br/><br/>Geographic clustering is an open-ended problem. In general, finding an optimal solution is infeasible, because it requires investigating all possible cluster sizes and memberships. Several reasonable and efficient approaches exist in the literature and in open-source code; however, different algorithms may lead to different use cases being either strongly or poorly supported. Use of synthetic microdata is also an open-ended problem. Similarly, many general approaches exist but need to be evaluated for appropriateness of use cases. Finally, research on both methods has tended to focus on data for individuals rather than establishments (businesses). Adapting evaluation metrics for utility and disclosure risk to these units of analysis will expand the methodology in this area. This work will use a motivating application based on demographics of business ownership and innovation rates for businesses to further our understanding in each of these three areas: clustering algorithms, synthetic data methods, and evaluation metrology.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Rebecca Shearmanrshearma@nsf.gov7032927403
  • Min Amd Letter Date
    7/9/2024 - a year ago
  • Max Amd Letter Date
    7/9/2024 - a year ago
  • ARRA Amount

Institutions

  • Name
    Research Triangle Institute
  • City
    DURHAM
  • State
    NC
  • Country
    United States
  • Address
    3040 CORNWALLIS RD
  • Postal Code
    277090155
  • Phone Number
    9195416000

Investigators

  • First Name
    Christine
  • Last Name
    Task
  • Email Address
    christine.task@knexusresearch.com
  • Start Date
    7/9/2024 12:00:00 AM
  • First Name
    Jennifer
  • Last Name
    Ozawa
  • Email Address
    jozawa@rti.org
  • Start Date
    7/9/2024 12:00:00 AM
  • First Name
    Matthew
  • Last Name
    Williams
  • Email Address
    mrwilliams@rti.org
  • Start Date
    7/9/2024 12:00:00 AM

Program Element

  • Text
    NSF Engines - Type 2

Program Reference

  • Text
    EAGER
  • Code
    7916