Collaborative Research: Frameworks: hpcGPT: Enhancing Computing Center User Support with HPC-enriched Generative AI

Information

  • NSF Award
  • 2411299
Owner
  • Award Id
    2411299
  • Award Effective Date
    8/1/2024 - 2 months from now
  • Award Expiration Date
    7/31/2027 - 3 years from now
  • Award Amount
    $ 360,003.00
  • Award Instrument
    Standard Grant

Collaborative Research: Frameworks: hpcGPT: Enhancing Computing Center User Support with HPC-enriched Generative AI

hpcGPT is a question answering service for academic computing centers such as the National Center for Supercomputing Applications, Ohio Supercomputer Center, San Diego Supercomputer Center, and Texas Advanced Computing Center. These Centers provide high-performance computing (HPC) platforms to tens of thousands of users for science and engineering research. In collaboration with Princeton University and Rutgers University, hpcGPT uses generative artificial intelligence (AI) and integrates heterogeneous data sources with different update frequencies to enhance the user support service quality and efficiency, decrease the response time, and improve precision of the support. With hpcGPT, user support teams can leverage the historical knowledge, real-time system status, and external technical expertise to better support the HPC users. With the high-quality and timely answers from hpcGPT, HPC users can resolve many technical issues, thus reducing the workload of the user support teams. This will allow the support teams to focus more on new and novel support issues. hpcGPT will significantly enhance the user support service quality, capacity, and efficiency without increasing the human effort.<br/><br/>hpcGPT combines the fine-tuning and Retrieval Augmented Generation (RAG) techniques to incorporate recent knowledge, past experience, domain expertise, documentations, and real-time system status of versatile computing. By building upon existing and recognized capabilities in large language model fine-tuning and hosting, retrieval augmentation generation, and external data source integration, hpcGPT reduces the complexity and effort required to align information and identify dependencies between questions, answers, and the supporting information. This is particularly beneficial for research groups and computing centers with diverse application requirements and limited staff. hpcGPT extends and translates a suite of Cyberinfrastructure building blocks and technologies such as large language model training and inference service hosting.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Varun Chandolavchandol@nsf.gov7032922656
  • Min Amd Letter Date
    4/2/2024 - 2 months ago
  • Max Amd Letter Date
    4/2/2024 - 2 months ago
  • ARRA Amount

Institutions

  • Name
    Princeton University
  • City
    PRINCETON
  • State
    NJ
  • Country
    United States
  • Address
    1 NASSAU HALL
  • Postal Code
    085442001
  • Phone Number
    6092583090

Investigators

  • First Name
    Chi
  • Last Name
    Jin
  • Email Address
    chij@princeton.edu
  • Start Date
    4/2/2024 12:00:00 AM

Program Element

  • Text
    Software Institutes
  • Code
    8004

Program Reference

  • Text
    CSSI-1: Cyberinfr for Sustained Scientif
  • Text
    INTERDISCIPLINARY PROPOSALS
  • Code
    4444
  • Text
    Software Institutes
  • Code
    8004