SaTC: CORE: Small: Robust Speaker and Speech Recognition Under AI-Driven Physical and Digital Attacks

Information

  • NSF Award
  • 2310207
Owner
  • Award Id
    2310207
  • Award Effective Date
    10/1/2023 - 7 months ago
  • Award Expiration Date
    9/30/2026 - 2 years from now
  • Award Amount
    $ 193,587.00
  • Award Instrument
    Continuing Grant

SaTC: CORE: Small: Robust Speaker and Speech Recognition Under AI-Driven Physical and Digital Attacks

Recent advances in artificial intelligence (AI) technologies have driven the widespread popularity and acceptance of commercial voice assistants, transforming the way in which we interact with the digital ecosystem and physical devices. These voice assistants have diverse applications across many critical sectors, ranging from smart homes and healthcare to e-commerce and self-driving vehicles. However, this rapidly evolving technology has introduced new cybersecurity threats that could significantly undermine voice system security. For example, the criminal exploitation of AI technologies has led to the advent of audio deepfakes that can be used to circumvent speaker verification protocols in banking systems. This project addresses AI-driven attacks through a comprehensive exploration of audio attack strategies and the development of a defensive framework to strengthen recognition performance and counteract these attacks. The project's broader significance and importance are underpinned by its deep understanding of AI-driven voice attacks and the methodologies to deploy defenses against them, thereby enhancing the safety of voice-controlled systems across various sectors. The project also provides research training opportunities for underrepresented students, both undergraduate and graduate, as well as K-12 students through well-crafted curriculum design and engaging outreach activities.<br/><br/>The goal of this project is to scrutinize the robustness of automatic speaker and speech recognition (ASSR) against AI-driven digital and physical voice attacks, and further develop a defensive framework for enhancing the resilience of voice signals and voice models. The research activities include: (i) characterizing physically-realizable attacks targeting speaker recognition and measuring the impact of stealthy attacks on speech recognition; (ii) designing robust learning-based defenses by shielding the voice signals from threats posed by voice synthesis techniques, and analyzing high-frequency voice components to reveal attack signals; and (iii) developing a robustness enhancement framework that incorporates vocal styles and facial expression features in a multi-modality speaker and speech representation scheme. The insights and technologies generated through this project significantly advance our understanding of voice assistant security, foster secure next-generation voice applications, and enable voice systems to fully leverage emerging sensing and learning technologies.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Karen Karavanickkaravan@nsf.gov7032922594
  • Min Amd Letter Date
    8/9/2023 - 9 months ago
  • Max Amd Letter Date
    8/9/2023 - 9 months ago
  • ARRA Amount

Institutions

  • Name
    Michigan State University
  • City
    EAST LANSING
  • State
    MI
  • Country
    United States
  • Address
    426 AUDITORIUM RD RM 2
  • Postal Code
    488242600
  • Phone Number
    5173555040

Investigators

  • First Name
    Qiben
  • Last Name
    Yan
  • Email Address
    qyan@msu.edu
  • Start Date
    8/9/2023 12:00:00 AM
  • First Name
    Li
  • Last Name
    Xiao
  • Email Address
    lxiao@cse.msu.edu
  • Start Date
    8/9/2023 12:00:00 AM
  • First Name
    Arun
  • Last Name
    Ross
  • Email Address
    rossarun@cse.msu.edu
  • Start Date
    8/9/2023 12:00:00 AM

Program Element

  • Text
    Secure &Trustworthy Cyberspace
  • Code
    8060

Program Reference

  • Text
    SaTC: Secure and Trustworthy Cyberspace
  • Text
    SMALL PROJECT
  • Code
    7923