Collaborative Research: SHF: Small: RUI: Context-aware Models of Source Code Summarization

Information

  • NSF Award
  • 2100050
Owner
  • Award Id
    2100050
  • Award Effective Date
    10/1/2021 - 3 years ago
  • Award Expiration Date
    9/30/2024 - 8 months ago
  • Award Amount
    $ 50,009.00
  • Award Instrument
    Standard Grant

Collaborative Research: SHF: Small: RUI: Context-aware Models of Source Code Summarization

The objective of this research project is to address key barriers towards automatic documentation generation for software source code. Programmers create software by writing instructions in source code. That source code is often very difficult to understand, and programmers often must spend significant time writing and updating natural language descriptions of the code to serve as a guide to other programmers. But programmers tend to avoid this task, leading to difficult-to-understand legacy code, bugs, struggles for novice programmers, and other problems. The process of writing these natural language descriptions is called "source code summarization" and this project aims to automate this process. The long-term goal of the project is that automatic documentation generation will improve productivity for software engineers, and increase the quality of software generally.<br/><br/>The two key barriers that this project targets are: 1) context-aware models of code summarization, and 2) improved optimization and evaluation procedures for those models. The research towards item (1) centers on novel neural network-based algorithms for reading and understanding source code. The "context" of a section of source code includes the surrounding source code, dependencies and dependents, programmer communications, bug reports, architecture documentation, and many other software artifacts. This proposal aims to build new neural models of code that include this context such as attentional graph neural networks and dynamic memory networks. The research towards item (2) centers on improving the metrics used to evaluate models of source code summarization, as well as optimization functions used to train these models. This project includes both design of these metrics and functions, and experiments to evaluate them.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Sol Greenspansgreensp@nsf.gov7032927841
  • Min Amd Letter Date
    9/1/2021 - 3 years ago
  • Max Amd Letter Date
    9/1/2021 - 3 years ago
  • ARRA Amount

Institutions

  • Name
    Eastern Michigan University
  • City
    YPSILANTI
  • State
    MI
  • Country
    United States
  • Address
    Office of Research Development
  • Postal Code
    481972212
  • Phone Number
    7344873090

Investigators

  • First Name
    Siyuan
  • Last Name
    Jiang
  • Email Address
    sjiang1@emich.edu
  • Start Date
    9/1/2021 12:00:00 AM

Program Element

  • Text
    Software & Hardware Foundation
  • Code
    7798

Program Reference

  • Text
    SMALL PROJECT
  • Code
    7923
  • Text
    SOFTWARE ENG & FORMAL METHODS
  • Code
    7944