FMitF : Track I: Aligning Code-Generating Models with Formal Specifications

Information

NSF Award
2422214

Owner

University of California

Award Id
2422214
Award Effective Date
10/1/2024 - 6 months ago
Award Expiration Date
9/30/2028 - 3 years from now
Award Amount
$ 875,000.00
Award Instrument
Standard Grant

Information

FMitF : Track I: Aligning Code-Generating Models with Formal Specifications

Large language models (LLMs) are an artificial intelligence (AI) technology that promises to revolutionize programming by translating a user's informal intent expressed in natural language into computer code. This technology has the potential to democratize programming and allow anyone, regardless of their skills, to generate code from a simple task description. However, LLMs do not offer any guarantees about the quality of the code they generate, or whether the generated code actually does what the user intended. With LLMs becoming popular, it is thus crucial to build formal techniques that can produce code that provably matches the user's intent and convince the user that the code will do what is expected of it. Recent work has proposed grammar-constrained decoding as a way to enforce that the output generated by large language models belongs to the language of a user-provided formal grammar. This project will contribute new grammar decoding techniques that can align LLMs with formal specifications and enable efficient generation of high-quality code. <br/><br/>To this end, this project will integrate program analysis and synthesis techniques from formal methods with structured prediction methods from natural-language processing. Concretely, the project will (1) develop grammar-aligned decoding, a suite of decoding algorithms that more faithfully capture the LLM's underlying distribution than existing grammar-constrained decoding methods; (2) adapt program analysis and synthesis techniques to encode a variety of formal specifications as grammars that grammar-aligned decoding can handle; and (3) develop interactive techniques that help users formalize their intent as specifications.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Program Officer
Sorin Draghicisdraghic@nsf.gov7032922232
Min Amd Letter Date
8/5/2024 - 8 months ago
Max Amd Letter Date
8/5/2024 - 8 months ago
ARRA Amount

Institutions

Name
University of California-San Diego
City
LA JOLLA
State
CA
Country
United States
Address
9500 GILMAN DR
Postal Code
920930021
Phone Number
8585344896

Investigators

First Name
Loris
Last Name
DAntoni
Email Address
ldantoni@ucsd.edu
Start Date
8/5/2024 12:00:00 AM

First Name
Nadia
Last Name
Polikarpova
Email Address
nadia.polikarpova@gmail.com
Start Date
8/5/2024 12:00:00 AM

First Name
Taylor
Last Name
Berg-Kirkpatrick
Email Address
tberg@eng.ucsd.edu
Start Date
8/5/2024 12:00:00 AM

Program Element

Text
FMitF: Formal Methods in the F

Program Reference

Text
FMitF-Formal Methods in the Field

Text
EXP PROG TO STIM COMP RES
Code
9150

FMitF : Track I: Aligning Code-Generating Models with Formal Specifications

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

FMitF : Track I: Aligning Code-Generating Models with Formal Specifications

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

First Name

Last Name

Email Address

Start Date

First Name

Last Name

Email Address

Start Date

Program Element

Text

Program Reference

Text

Text

Code