SHF: small: On-device Large Language Model Personalization with Algorithm-Hardware Co-design for Healthcare Applications

Information

  • NSF Award
  • 2426639
Owner
  • Award Id
    2426639
  • Award Effective Date
    10/1/2024 - 8 months ago
  • Award Expiration Date
    9/30/2027 - 2 years from now
  • Award Amount
    $ 569,268.00
  • Award Instrument
    Standard Grant

SHF: small: On-device Large Language Model Personalization with Algorithm-Hardware Co-design for Healthcare Applications

Personalized healthcare that considers individual differences in genetics, lifestyle, and medical history is more effective than one-size-fits-all solutions. This project utilizes advanced wearable and portable devices with large language models (LLMs) to enhance personalized healthcare by addressing the patient variability often overlooked by current methods. It focuses on real-time healthcare personalization through fast and accurate data searches in ever-growing personal databases, using novel memory semiconductor devices, advanced circuits, and custom architecture. This enables quick, personalized interactions on devices, potentially saving lives with timely interventions for emergencies like suicide attempts and strokes, thus advancing precision medicine and national health. Additionally, the project will support activities to enhance healthcare education for K-12 students, tech briefings on semiconductor technology for undergraduate students, and mentoring support for underrepresented students. <br/><br/>The project addresses on-device LLM personalization through Retrieval Augmented Generation (RAG) for healthcare applications, aiming to significantly reduce latency and hardware overhead via algorithm-hardware co-design. It will define healthcare scenarios for LLM applications, generate user prompt input datasets for benchmarking, and create personalized healthcare datasets for LLM personalization. Efficient RAG-based personalization will be explored, focusing on unsupervised data selection and optimal embedding dimension/bit-width selection. To mitigate computation-storage data transfer bottlenecks, custom compute-in-memory architectures and data search frameworks using Ferroelectric Field-Effect Transistors will be investigated. This approach aims for a 1000-fold latency reduction and a 100-fold increase in energy efficiency compared to state-of-the-art edge LLM embedded systems, setting new benchmarks for edge computing performance and sustainability. Successful implementation will enhance personalized healthcare interventions and advance AI-assisted personalized healthcare. <br/><br/>This project is jointly funded by the Software and Hardware Foundation (SHF) core research program and the Advancing Informal STEM Learning (AISL) program.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

  • Program Officer
    Almadena Chtchelkanovaachtchel@nsf.gov7032927498
  • Min Amd Letter Date
    8/5/2024 - 10 months ago
  • Max Amd Letter Date
    8/5/2024 - 10 months ago
  • ARRA Amount

Institutions

  • Name
    University of Notre Dame
  • City
    NOTRE DAME
  • State
    IN
  • Country
    United States
  • Address
    940 GRACE HALL
  • Postal Code
    465565708
  • Phone Number
    5746317432

Investigators

  • First Name
    Zhi
  • Last Name
    Zheng
  • Email Address
    zzheng3@nd.edu
  • Start Date
    8/5/2024 12:00:00 AM
  • First Name
    Ningyuan
  • Last Name
    Cao
  • Email Address
    ncao@nd.edu
  • Start Date
    8/5/2024 12:00:00 AM
  • First Name
    Yiyu
  • Last Name
    Shi
  • Email Address
    yshi4@nd.edu
  • Start Date
    8/5/2024 12:00:00 AM

Program Element

  • Text
    AISL
  • Code
    725900
  • Text
    Software & Hardware Foundation
  • Code
    779800

Program Reference

  • Text
    Microelectronics and Semiconductors
  • Text
    SMALL PROJECT
  • Code
    7923
  • Text
    COMPUTER ARCHITECTURE
  • Code
    7941
  • Text
    Broaden Particip STEM Resrch
  • Code
    8212