Least Angle Regression

Information

  • Research Project
  • 6933500
  • ApplicationId
    6933500
  • Core Project Number
    R43GM074313
  • Full Project Number
    1R43GM074313-01
  • Serial Number
    74313
  • FOA Number
  • Sub Project Id
  • Project Start Date
    5/15/2005 - 19 years ago
  • Project End Date
    5/14/2006 - 18 years ago
  • Program Officer Name
    HEATH, ANNE K
  • Budget Start Date
    5/15/2005 - 19 years ago
  • Budget End Date
    5/14/2006 - 18 years ago
  • Fiscal Year
    2005
  • Support Year
    1
  • Suffix
  • Award Notice Date
    5/13/2005 - 19 years ago
Organizations

Least Angle Regression

DESCRIPTION (provided by applicant): This SBIR project aims to produce superior methods and software for classification and regression when there are many potential predictor variables to choose from. The methods should (1) produce stable results, where small changes in the data do to produce major changes in the variables selected or in model predictions, (2) produce accurate predictions, (3) facilitate scientific interpretation, by selecting a smaller subset of predictors which provide the best predictions, (4) allow continuous and categorical variables, and (5) support linear regression, logistic regression (predicting a binary outcome), survival analysis, and other types of regression. This project is based on least angle regression, which unifies and provides a fast implementation for a number of modern regression techniques. Least angle regression has great potential, but the state of the art is limited to linear regression with continuous or binary variables, and uses numerically-unstable calculations. The outcome of this project should be software which is more robust and widely applicable. This software would apply broadly, including to medical diagnosis, detecting cancer, feature selection in microarrays, and modeling patient characteristics like blood pressure. Phase I work will demonstrate feasibility by extending least angle work in three key directions-categorical predictors, logistic regression, and a numerically-accurate implementation. Phase II will extend the work to other types of explanatory variables (e.g. polynomial or spline functions, and interactions between variables), and to survival and other additional regression models. This proposed software will enable medical researchers to obtain high prediction accuracy, and obtain stable and interpretable results, in high-dimensional situations.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R43
  • Administering IC
    GM
  • Application Type
    1
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    99685
  • Sub Project Total Cost
  • ARRA Funded
  • CFDA Code
    859
  • Ed Inst. Type
  • Funding ICs
    NIGMS:99685\
  • Funding Mechanism
  • Study Section
    ZRG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    INSIGHTFUL CORPORATION
  • Organization Department
  • Organization DUNS
    150683779
  • Organization City
    SEATTLE
  • Organization State
    WA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    98109
  • Organization District
    UNITED STATES