Currently, there is unprecedented investment in regional and place-based innovation ecosystems through programs such as the National Science Foundation’s Regional Innovation Engines (hereafter, NSF Engines), the Economic Development Administration’s Tech Hubs, and the U.S. Department of Energy’s Hydrogen Hubs. A key constraint from a program evaluation perspective is the lack of regional and place-based baseline data against which to measure societal and economic impact (e.g., number and share of companies that currently conduct research and development (R&D); number and share of companies that report innovation activity; ownership of firms by sex, ethnicity, race, and veteran status). This project seeks to develop and validate a process for generating new geographic subdivisions well-suited for business microdata such as the Annual Business Survey (ABS). This would be analogous to the Public Use Microdata Areas (PUMAs) developed by the Census Bureau for the American Community Survey (ACS). Release of geographic definitions of these regions and related microdata or summary statistics will allow researchers and policymakers access to these data for place-based innovation.<br/><br/>Geographic clustering is an open-ended problem. In general, finding an optimal solution is infeasible, because it requires investigating all possible cluster sizes and memberships. Several reasonable and efficient approaches exist in the literature and in open-source code; however, different algorithms may lead to different use cases being either strongly or poorly supported. Use of synthetic microdata is also an open-ended problem. Similarly, many general approaches exist but need to be evaluated for appropriateness of use cases. Finally, research on both methods has tended to focus on data for individuals rather than establishments (businesses). Adapting evaluation metrics for utility and disclosure risk to these units of analysis will expand the methodology in this area. This work will use a motivating application based on demographics of business ownership and innovation rates for businesses to further our understanding in each of these three areas: clustering algorithms, synthetic data methods, and evaluation metrology.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.