Theory and Applications of Random Forests

Information

  • NSF Award
  • 1104830
Owner
  • Award Id
    1104830
  • Award Effective Date
    7/1/2011
  • Award Expiration Date
    10/31/2011
  • Award Amount
    $ 63,874.00
  • Award Instrument
    Continuing grant

Theory and Applications of Random Forests

This research develops theory for random forests with the specific aim of facilitating its use in practical settings. Theoretical considerations include balancedness, subtrees, node distributions, node splitting, depth of variables, and other novel tree concepts. These concepts are used to improve prediction and variable selection for random forests in both high- and low-dimensional problems.

One of the simplest techniques for improving the performance of a statistical method such as a tree is to take its average over multiple instances of the data. This averaging process is often referred to as ensemble learning, and it has attracted considerable attention because combining elementary learners has been widely observed to yield a predictor with superior prediction performance. One of the most successful tree ensemble learners is random forests. Random forests has met with considerable empirical success, yet much is still unknown about it. This research seeks to improve our understanding of random forests and to use this knowledge to enhance its application in practical settings. The applications in focus are cardiovascular disease, the number one cause of death in the developed world; cancer staging and prognostication for cancer patients; and identifying and developing genotype signatures for myelodysplastic syndromes, a heterogeneous group of diseases of blood stem cells with no current curative medical therapy.
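The averaging idea described in the abstract can be illustrated with a minimal sketch. This is not the investigators' method. It assumes plain bootstrap aggregation (bagging) of one-split regression trees on one-dimensional data, and it omits the random feature selection that a full random forest adds. The names `fit_stump` and `bagged_predictor` are hypothetical and chosen only for illustration.

```python
import random
from statistics import mean


def fit_stump(xs, ys):
    """Fit a one-split regression tree on 1-D data by minimizing squared error."""
    best = None
    order = sorted(set(xs))
    # Candidate thresholds are midpoints between consecutive distinct x values.
    for lo, hi in zip(order, order[1:]):
        t = (lo + hi) / 2
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        ml, mr = mean(left), mean(right)
        sse = sum((y - ml) ** 2 for y in left) + sum((y - mr) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, t, ml, mr)
    if best is None:
        # Only one distinct x value: no split is possible, so predict the mean.
        m = mean(ys)
        return lambda x: m
    _, t, ml, mr = best
    return lambda x: ml if x <= t else mr


def bagged_predictor(xs, ys, n_trees=50, seed=0):
    """Average stumps fitted to bootstrap resamples of the data (bagging)."""
    rng = random.Random(seed)
    n = len(xs)
    trees = []
    for _ in range(n_trees):
        # Draw n indices with replacement: one "instance of the data".
        idx = [rng.randrange(n) for _ in range(n)]
        trees.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    # The ensemble prediction is the average of the individual tree predictions.
    return lambda x: mean(tree(x) for tree in trees)
```

Calling `bagged_predictor(xs, ys)` returns a function that maps a new `x` to the averaged prediction. The fixed `seed` is an assumption made here so that repeated runs give the same ensemble.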

  • Program Officer
    Gabor J. Szekely
  • Min Amd Letter Date
    5/20/2011
  • Max Amd Letter Date
    5/20/2011
  • ARRA Amount

Institutions

  • Name
    Cleveland Clinic Foundation
  • City
    Cleveland
  • State
    OH
  • Country
    United States
  • Address
    9500 Euclid Avenue
  • Postal Code
    441950001
  • Phone Number
    2164456440

Investigators

  • First Name
    Hemant
  • Last Name
    Ishwaran
  • Email Address
    hemant.ishwaran@gmail.com
  • Start Date
    5/20/2011 12:00:00 AM

Program Element

  • Text
    STATISTICS
  • Code
    1269