COINSTAC 2.0: decentralized, scalable analysis of loosely coupled data

Information

  • Research Project
  • 10269008
  • ApplicationId
    10269008
  • Core Project Number
    R01DA040487
  • Full Project Number
    5R01DA040487-07
  • Serial Number
    040487
  • FOA Number
    PA-19-056
  • Sub Project Id
  • Project Start Date
    7/1/2015 - 8 years ago
  • Project End Date
    6/30/2025 - a year from now
  • Program Officer Name
    PARIYADATH, VANI
  • Budget Start Date
    7/1/2021 - 2 years ago
  • Budget End Date
    6/30/2022 - a year ago
  • Fiscal Year
    2021
  • Support Year
    07
  • Suffix
  • Award Notice Date
    7/2/2021 - 2 years ago
Organizations

COINSTAC 2.0: decentralized, scalable analysis of loosely coupled data

Project Summary/Abstract The brain imaging community is greatly benefiting from extensive data sharing efforts currently underway. However, there is still a major gap in that much data is still not openly shareable, which we propose to address. In addition, current approaches to data sharing often include significant logistical hurdles both for the investigator sharing the data (e.g. often times multiple data sharing agreements and approvals are required from US and international institutions) as well as for the individual requesting the data (e.g. substantial computational re- sources and time is needed to pool data from large studies with local study data). This needs to change, so that the scientific community can create a venue where data can be collected, managed, widely shared and analyzed while also opening up access to the (many) data sets which are not currently available (see overview on this from our group7). The large amount of existing data requires an approach that can analyze data in a distributed way while (if required) leaving control of the source data with the individual investigator or the data host; this motivates a dynamic, decentralized way of approaching large scale analyses. During the previous funding period, we developed a peer-to-peer system called the Collaborative Informatics and Neuroimaging Suite Toolkit for Anonymous Computation (COINSTAC). Our system provides an independent, open, no-strings-attached tool that performs analysis on datasets distributed across different locations. Thus, the step of actually aggregating data is avoided, while the strength of large-scale analyses can be retained. During this new phase we respond to the need for advanced algorithms such as linear mixed effects models and deep learning, by proposing to develop decentralized models for these approaches and also implement a fully scalable cloud-based framework with enhanced security features. To achieve this, in Aim 1, we will incorporate the necessary functionality to scale up analyses via the ability to work with either local or commercial private cloud environments, together with advanced visualization, quality control, and privacy and security features. This suite of new functions will open the floodgates for the use of COINSTAC by the larger neuroscience community to enable new discovery and analysis of unprecedented amounts of brain imaging data located throughout the world. We will also improve usability, training materials, engage the community in contributing to the open source code base, and ultimately facilitate the use of COINSTAC's tools for additional science and discovery in a broad range of applications. In Aim 2 we will extend the framework to handle powerful algorithms such as linear mixed effects models and deep learning, and to perform meta-learning for leveraging and updating fit models. And finally, in Aim 3, we will test this new functionality through a partnership with the worldwide ENIGMA addiction group, which is currently not able to perform advanced machine learning analyses on data that cannot be centrally located. We will evaluate the impact of 6 main classes of substances of abuse (e.g. methamphetamines, cocaine, cannabis, nicotine, opiates, alcohol and their combinations) using the new developed functionality. 3

IC Name
NATIONAL INSTITUTE ON DRUG ABUSE
  • Activity
    R01
  • Administering IC
    DA
  • Application Type
    5
  • Direct Cost Amount
    437947
  • Indirect Cost Amount
    179964
  • Total Cost
    617911
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    279
  • Ed Inst. Type
    ORGANIZED RESEARCH UNITS
  • Funding ICs
    NIDA:617911\
  • Funding Mechanism
    Non-SBIR/STTR RPGs
  • Study Section
    BCHI
  • Study Section Name
    Biomedical Computing and Health Informatics Study Section
  • Organization Name
    GEORGIA STATE UNIVERSITY
  • Organization Department
    MISCELLANEOUS
  • Organization DUNS
    837322494
  • Organization City
    ATLANTA
  • Organization State
    GA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    303023999
  • Organization District
    UNITED STATES