Collaborative Research: EAGER: Automating CI Configuration Troubleshooting with Bayesian Group Testing

Information

  • NSF Award
  • 2333325
Owner
  • Award Id
    2333325
  • Award Effective Date
    9/1/2023 - a year ago
  • Award Expiration Date
    8/31/2025 - 9 months from now
  • Award Amount
    $ 75,000.00
  • Award Instrument
    Standard Grant

Collaborative Research: EAGER: Automating CI Configuration Troubleshooting with Bayesian Group Testing

Configuration troubleshooting in large-scale cyberinfrastructure (CI) software systems is a complex and costly task due to numerous configurable parameters. Existing methods like log mining and machine learning analysis face challenges in such environments. To address this, we present BGT4AutoCI (Automating CI Configuration Troubleshooting with Bayesian Group Testing), a groundbreaking solution that leverages Bayesian Group Testing, ensuring accurate results even with minimal prior knowledge and testing errors. Experienced CI operators can expedite the process with approximated prior knowledge. This research aims to revolutionize CI configuration troubleshooting, introducing a highly precise and efficient approach that will optimize the utilization of current and future large-scale CI systems.<br/><br/>The primary focus of this research is to address critical challenges in automated configuration troubleshooting within large-scale CI systems. The proposed three-fold approach encompasses: (1) Formulating Bayesian Group Testing for CI configuration troubleshooting, which employs lattice models to accurately identify risks at the individual configuration parameter level, taking uncertainty into account during troubleshooting. (2) A multinomial paradigm for Bayesian Group Testing, which introduces multinomial responses to simultaneously observe multiple aspects of CI systems, enabling efficient test selection algorithms for jointly testing configuration parameters that impact various aspects of CIs. (3) Automated configuration troubleshooting, which involves the designs of several key components to establish BGT4AutoCI as an automated configuration troubleshooting framework that minimizes the need for human intervention. The outcomes of this project hold the potential to significantly enhance the efficiency and accuracy of CI configuration troubleshooting, benefiting current and future large-scale CI systems.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Varun Chandolavchandol@nsf.gov7032922656
  • Min Amd Letter Date
    8/17/2023 - a year ago
  • Max Amd Letter Date
    8/17/2023 - a year ago
  • ARRA Amount

Institutions

  • Name
    Case Western Reserve University
  • City
    CLEVELAND
  • State
    OH
  • Country
    United States
  • Address
    10900 EUCLID AVE
  • Postal Code
    441061712
  • Phone Number
    2163684510

Investigators

  • First Name
    Vipin
  • Last Name
    Chaudhary
  • Email Address
    vipin@case.edu
  • Start Date
    8/17/2023 12:00:00 AM

Program Element

  • Text
    Software Institutes
  • Code
    8004

Program Reference

  • Text
    EAGER
  • Code
    7916
  • Text
    Software Institutes
  • Code
    8004