FMitF : Track I: Aligning Code-Generating Models with Formal Specifications

Information

  • NSF Award
  • 2422214
Owner
  • Award Id
    2422214
  • Award Effective Date
    10/1/2024 - 6 months ago
  • Award Expiration Date
    9/30/2028 - 3 years from now
  • Award Amount
    $ 875,000.00
  • Award Instrument
    Standard Grant

FMitF : Track I: Aligning Code-Generating Models with Formal Specifications

Large language models (LLMs) are an artificial intelligence (AI) technology that promises to revolutionize programming by translating a user's informal intent expressed in natural language into computer code. This technology has the potential to democratize programming and allow anyone, regardless of their skills, to generate code from a simple task description. However, LLMs do not offer any guarantees about the quality of the code they generate, or whether the generated code actually does what the user intended. With LLMs becoming popular, it is thus crucial to build formal techniques that can produce code that provably matches the user's intent and convince the user that the code will do what is expected of it. Recent work has proposed grammar-constrained decoding as a way to enforce that the output generated by large language models belongs to the language of a user-provided formal grammar. This project will contribute new grammar decoding techniques that can align LLMs with formal specifications and enable efficient generation of high-quality code. <br/><br/>To this end, this project will integrate program analysis and synthesis techniques from formal methods with structured prediction methods from natural-language processing. Concretely, the project will (1) develop grammar-aligned decoding, a suite of decoding algorithms that more faithfully capture the LLM's underlying distribution than existing grammar-constrained decoding methods; (2) adapt program analysis and synthesis techniques to encode a variety of formal specifications as grammars that grammar-aligned decoding can handle; and (3) develop interactive techniques that help users formalize their intent as specifications.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Sorin Draghicisdraghic@nsf.gov7032922232
  • Min Amd Letter Date
    8/5/2024 - 8 months ago
  • Max Amd Letter Date
    8/5/2024 - 8 months ago
  • ARRA Amount

Institutions

  • Name
    University of California-San Diego
  • City
    LA JOLLA
  • State
    CA
  • Country
    United States
  • Address
    9500 GILMAN DR
  • Postal Code
    920930021
  • Phone Number
    8585344896

Investigators

  • First Name
    Loris
  • Last Name
    DAntoni
  • Email Address
    ldantoni@ucsd.edu
  • Start Date
    8/5/2024 12:00:00 AM
  • First Name
    Nadia
  • Last Name
    Polikarpova
  • Email Address
    nadia.polikarpova@gmail.com
  • Start Date
    8/5/2024 12:00:00 AM
  • First Name
    Taylor
  • Last Name
    Berg-Kirkpatrick
  • Email Address
    tberg@eng.ucsd.edu
  • Start Date
    8/5/2024 12:00:00 AM

Program Element

  • Text
    FMitF: Formal Methods in the F

Program Reference

  • Text
    FMitF-Formal Methods in the Field
  • Text
    EXP PROG TO STIM COMP RES
  • Code
    9150