III: Small: Strategically Transforming Code Across SQL and User-Defined Function Boundaries to Enable Effective Optimizations

Information

NSF Award
2404373

Owner

CARNEGIE-MELLON UNIVERSITY

Award Id
2404373
Award Effective Date
10/1/2024 - 9 months ago
Award Expiration Date
9/30/2027 - 2 years from now
Award Amount
$ 599,639.00
Award Instrument
Standard Grant

Information

III: Small: Strategically Transforming Code Across SQL and User-Defined Function Boundaries to Enable Effective Optimizations

Modern software applications in all facets of society, including commercial, scientific, and non-profit enterprises, rely on databases to store information. These organizations often want to use their data in ways they cannot easily express with existing database query languages, especially in the context of artificial intelligence and data science applications. This mismatch means such applications wait longer for answers about their data, inhibiting them from reacting to changes as quickly as possible and impeding their goals. This research addresses this problem and develops foundational techniques that automatically removes such inefficiencies without requiring organizations to perform costly rewrites of their application code. It enables organizations to ask more complex questions about their data and extrapolate new knowledge from it, all while using less computing and energy resources than today’s systems.<br/><br/>Many database management systems (DBMSs) extend the query language SQL to support user-defined functions (UDFs) written in procedural programming languages. Despite their software engineering advantages, UDFs are notoriously difficult to optimize within database systems, and DBMSs often resort to executing them iteratively (row-by-row). This project focuses on developing optimization approaches to overcome SQL and UDF boundaries via automatic code transformations that pass critical information between them to enable more effective query planning and compilation. These strategies include methods for programmatically deconstructing UDFs into smaller pieces, manipulating them individually, and reconstructing them into the calling query to optimize performance. This project will address three fundamental research challenges: (1) improving the performance of UDFs without requiring modifications to the application code, (2) optimizing external language UDFs (e.g., Python) that rely on dynamic types and library calls, and (3) generating new optimizations that leverage information about UDFs across the entire lifecycle of a query and multiple invocations. By eliminating performance penalties associated with UDFs, this research will enable organizations to improve the efficiency of applications and support more complex workloads, including leveraging machine learning and data science libraries.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Program Officer
Judith Cushingjcushing@nsf.gov3607016450
Min Amd Letter Date
8/26/2024 - 10 months ago
Max Amd Letter Date
8/26/2024 - 10 months ago
ARRA Amount

Institutions

Name
Carnegie-Mellon University
City
PITTSBURGH
State
PA
Country
United States
Address
5000 FORBES AVE
Postal Code
152133815
Phone Number
4122688746

Investigators

First Name
Todd
Last Name
Mowry
Email Address
tcm@cs.cmu.edu
Start Date
8/26/2024 12:00:00 AM

First Name
Andrew
Last Name
Pavlo
Email Address
pavlo@cs.cmu.edu
Start Date
8/26/2024 12:00:00 AM

Program Element

Text
Info Integration & Informatics
Code
736400

Program Reference

Text
INFO INTEGRATION & INFORMATICS
Code
7364

Text
SMALL PROJECT
Code
7923

III: Small: Strategically Transforming Code Across SQL and User-Defined Function Boundaries to Enable Effective Optimizations

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

III: Small: Strategically Transforming Code Across SQL and User-Defined Function Boundaries to Enable Effective Optimizations

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

First Name

Last Name

Email Address

Start Date

Program Element

Text

Code

Program Reference

Text

Code

Text

Code