CDS&E-MSS: Topological Learning with Multiparameter Persistent Homology

Information

  • NSF Award
  • 2324353
Owner
  • Award Id
    2324353
  • Award Effective Date
    10/1/2023 - 12 months ago
  • Award Expiration Date
    9/30/2026 - 2 years from now
  • Award Amount
    $ 75,369.00
  • Award Instrument
    Continuing Grant

CDS&E-MSS: Topological Learning with Multiparameter Persistent Homology

Experimental data may have an underlying shape whose determination can be crucial for scientific advances. In many cases, the data is sufficiently complex that state-of-the-art methods do not provide an adequate summary of its shape. The subject of this project, topological data analysis, uses ideas from theoretical mathematics to address this challenge. Topological data analysis has been successful in settings where the shape varies as a single parameter changes, but is in need of further research in settings where multiple parameters vary. The goal of this project is to develop mathematical summaries of the shape of data in the multiple parameter setting that may be easily combined with other tools in data science. In summary, this project will use ideas in mathematics, computer science, and statistics to advance theory and develop tools that are of broad use to scientists and engineers. A conference at the University of Florida will be organized in order to advance careers of young STEM researchers. <br/><br/>A main tool of topological data analysis, persistent homology is now well established. The subject of this project concerns a more advanced variant called multiparameter persistent homology. The goal of this project is to develop new tools for easily combining multiparameter persistent homology with statistics and machine learning. Several of the computational approaches to multiparameter persistent homology produce summaries which may be viewed as signed formal sums on a pointed metric space. The investigator has shown that these summaries, called generalized persistence diagrams, have a Wasserstein distance, and may be viewed as elements of the free Banach space on a pointed metric space. This project will produce continuous linear functionals for generalized persistence diagrams and develop a corresponding theory of functional analysis and optimal transport. The investigator’s persistence landscape is a nonlinear functional for persistence diagrams. This project will extend the persistence landscape to generalized persistence diagrams. When a topological signal is detected or a topological classification is constructed, researchers would like to use this to learn more about their data. This project will develop methods for selecting a sub-population responsible for a detected topological signal. It will also use deep neural network software to produce stable visualizations of the parts of the data responsible for a learned classification.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Jodi Meadjmead@nsf.gov7032927212
  • Min Amd Letter Date
    8/2/2023 - a year ago
  • Max Amd Letter Date
    8/2/2023 - a year ago
  • ARRA Amount

Institutions

  • Name
    University of Florida
  • City
    GAINESVILLE
  • State
    FL
  • Country
    United States
  • Address
    1523 UNION RD RM 207
  • Postal Code
    326111941
  • Phone Number
    3523923516

Investigators

  • First Name
    Peter
  • Last Name
    Bubenik
  • Email Address
    peter.bubenik@ufl.edu
  • Start Date
    8/2/2023 12:00:00 AM

Program Element

  • Text
    CDS&E-MSS
  • Code
    8069

Program Reference

  • Text
    Machine Learning Theory
  • Text
    COMPUTATIONAL SCIENCE & ENGING
  • Code
    9263