BIGDATA: Causal Inference in Large-Scale Time Series

Information

Research Project
10123124

ApplicationId
10123124
Core Project Number
R01LM011826
Full Project Number
2R01LM011826-08
Serial Number
011826
FOA Number
PAR-18-896
Sub Project Id

Project Start Date
6/1/2013 - 12 years ago
Project End Date
2/28/2025 - 7 months ago
Program Officer Name
YE, JANE
Budget Start Date
6/1/2021 - 4 years ago
Budget End Date
2/28/2022 - 3 years ago
Fiscal Year
2021
Support Year
08
Suffix
Award Notice Date
6/1/2021 - 4 years ago

Organizations

Stevens Institute of Technology, LLC

Information

BIGDATA: Causal Inference in Large-Scale Time Series

Project summary Large datasets generated by hospitals could have a transformative effect on medical knowledge and patient care. Yet currently the volume of data is more likely to overwhelm clinicians and the challenges of the data can overwhelm machine learning algorithms. Intensive care units (ICUs) generate data at a resolution of seconds, for the entirety of a patient's stay. Our long-term goal is to turn these data into actionable knowledge, like risk factors for a disease, early intervention targets, and real-time information to support clinical decisions. This is a broad problem, but particularly important in ICUs, which involve high stakes decisions being made in a complex environment under time pressure. We focus in particular on understanding consciousness in adults, and neurologic status in neonates. While 7% of ICU admissions are due to loss of consciousness, and degree of consciousness is critical to evaluating prognosis, making difficult choices such as when to withdraw care, and providing early interventions to improve quality of life, there are no objective or automated assessments for consciousness (adults) or neurologic status (neonates). We have shown that unresponsive patients with brain activation were twice as likely to regain the ability to follow commands compared to unresponsive patients without such activation, yet these assessments are too time consuming for regular clinical use. However we also showed that physiological data routinely collected in ICUs can be used as a proxy to classify consciousness. It is still not known why it changes and we must be sure that the patterns we find are in fact causal to avoid treating symptoms instead of a disease or launching unsuccessful clinical trials. There have been two key barriers preventing a causal understanding of consciousness. First, variables measured for each ICU patient differ, and can differ within a patient over the course of their admission. This leads to confounding when attempting to infer causal models, and has prevented learning a single model for all patients, which limits generalizability. Second, while the challenges of medical data require new methods, researchers are rarely able to rigorously evaluate and compare them, since real-world data lacks ground truth and often cannot be shared for privacy reasons. To address these challenges, we aim 1) to develop methods that learn generalizable causal models with latent variables (by intelligently sharing and combining information across patients), 2) to develop data driven simulations methods for testing machine learning algorithms while preserving privacy, and 3) to apply these methods to neonatal and neurological ICU data. We aim to create better indicators for consciousness and to uncover causes of both neurological status in ICU and its link to long-term functional outcomes. Our work turns potential weaknesses of medical data (different variables measured across individuals) into a strength, and will enable better use of large-scale observational biomedical data for real-time treatment decisions.

IC Name

NATIONAL LIBRARY OF MEDICINE

Activity
R01
Administering IC
LM
Application Type
2

Direct Cost Amount
238581
Indirect Cost Amount
56425
Total Cost
295006
Sub Project Total Cost

ARRA Funded
False
CFDA Code
879
Ed Inst. Type
BIOMED ENGR/COL ENGR/ENGR STA
Funding ICs
NLM:295006\
Funding Mechanism
Non-SBIR/STTR RPGs
Study Section
ZLM1
Study Section Name
Special Emphasis Panel

Organization Name
STEVENS INSTITUTE OF TECHNOLOGY
Organization Department
BIOSTATISTICS & OTHER MATH SCI
Organization DUNS
064271570
Organization City
HOBOKEN
Organization State
NJ
Organization Country
UNITED STATES
Organization Zip Code
070305906
Organization District
UNITED STATES

BIGDATA: Causal Inference in Large-Scale Time Series

Information

ApplicationId

Core Project Number

Full Project Number

Serial Number

FOA Number

Sub Project Id

Project Start Date

Project End Date

Program Officer Name

Budget Start Date

Budget End Date

Fiscal Year

Support Year

Suffix

Award Notice Date

Organizations

BIGDATA: Causal Inference in Large-Scale Time Series

IC Name

Activity

Administering IC

Application Type

Direct Cost Amount

Indirect Cost Amount

Total Cost

Sub Project Total Cost

ARRA Funded

CFDA Code

Ed Inst. Type

Funding ICs

Funding Mechanism

Study Section

Study Section Name

Organization Name

Organization Department

Organization DUNS

Organization City

Organization State

Organization Country

Organization Zip Code

Organization District