The broader impact of this I-Corps project is based on the development of software to generate synthetic data for use in the healthcare, consulting, and insurance industries. Synthetic data is artificially generated data that is statistically similar to real-world datasets used by businesses. Synthetic data can be used for analytics and machine learning when access to real data is limited and may have uses in augmenting minority representation in real-world datasets thereby aiding in more equitable outcomes. Overall, the broad applicability of synthetic datasets has the potential to drive innovation in healthcare and other industries by allowing businesses to share synthetic versions of proprietary data with strategic partners, such as data analytics companies, and remain in full compliance with data privacy laws. This ability can lead to an increase in data-driven decision-making in the private sector and effective policy formulation in the public sector. For instance, applications of synthetic medical data may help healthcare researchers and administrators to better model patient activity, including representative data of understudied populations, and ultimately improve human health.<br/> <br/>This I-Corps project utilizes experiential learning coupled with a first-hand investigation of the industry ecosystem to assess the translation potential of the technology. The solution is based on the prior development of a non-deep learning technique to generate synthetic datasets using features of real data. Synthetic data is artificially generated data that is statistically similar to real datasets and can be used for analytics and machine learning when access to real data is limited. This innovative solution allows significantly faster generation of tabular and timeseries synthetic data without the need for training or optimization processes, while internally using linear algebra-based techniques. Although this solution was initially created to generate synthetic timeseries data, it can be modified to generate synthetic tabular data. This solution is completely non-parametric and does not involve the additional steps associated with training and optimization, making it 300x faster than state-of-the-art deep learning generation methods for tabular data. Thus, this approach can generate richly structured datasets using significantly less computing time relative to deep-learning methods.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.