New Methods to Reduce Bias and Mean Square Error of Maximum Likelihood Estimators

Information

  • Research Project
  • 7161282
  • ApplicationId
    7161282
  • Core Project Number
    R43RR023228
  • Full Project Number
    1R43RR023228-01
  • Serial Number
    23228
  • FOA Number
    PA-06-006
  • Sub Project Id
  • Project Start Date
    7/1/2009
  • Project End Date
    12/31/2009
  • Program Officer Name
    SWAIN, AMY L
  • Budget Start Date
    7/1/2009
  • Budget End Date
    12/31/2009
  • Fiscal Year
    2009
  • Support Year
    1
  • Suffix
  • Award Notice Date
    6/29/2009

DESCRIPTION (provided by applicant): Logistic regression is the most frequently used model for binary data and has widespread applicability in the health, behavioral, and physical sciences. Over two thousand research papers were published in 1999 in which "logistic regression" was in the title of the paper or among the keywords. Maximum likelihood is the nearly universal method for computing estimates of regression coefficients in logistic regression models. These estimates are reliable for problems with large samples and when the proportion of responses is neither too small nor too large. However, it has been known for several years that maximum likelihood estimates can have high bias and mean square error for small, sparse, or unbalanced datasets, where "unbalanced" refers to a considerable difference between the number of responses and non-responses.

Exact logistic regression is a method invented by D. R. Cox that is often useful in such situations. However, exact logistic regression is computationally intensive, and in practice it is limited in the size of datasets and the number of covariates it can handle before running out of memory or taking an inordinate amount of computing time. D. Firth has developed a less computationally demanding method for reducing bias and mean square error in logistic regression as well as in other generalized regression models. Studies in the literature have shown that the method often improves on maximum likelihood. Firth's method is not available in any commercial software package today. We propose to incorporate Firth's method into LogXact, Cytel's regression package, as well as into PROC LOGXACT, a module that runs seamlessly as part of the SAS software system. In addition to incorporating Firth's method for logistic regression, we intend to extend it to conditional logistic regression, ordered and unordered polytomous regression, Poisson regression, and negative binomial regression.

Firth's method does not perform well over certain ranges of model parameters in moderate-sized samples in logistic regression. There are instances when it is worse than maximum likelihood. We have created a novel method that generalizes Firth's method to overcome this shortcoming. We propose to implement this method in LogXact and PROC LOGXACT. Under certain unusual conditions, both maximum likelihood and Firth's method produce poor estimates for logistic regression. We have developed a diagnostic measure that identifies this situation, and we will incorporate it as part of our generalization of Firth's method. We will also investigate a Bayesian estimator and the target estimator suggested by Cabrera and Fernholz, both of which show promise of performing well in this situation.
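
For readers unfamiliar with the baseline technique named in the description, the following is a minimal sketch of Firth-type bias-reduced logistic regression as it is commonly presented in the statistical literature: the usual score is adjusted by the leverages of the weighted hat matrix, which is equivalent to penalizing the likelihood with the Jeffreys prior. It is not Cytel's implementation, and it does not include the generalization or the diagnostic measure proposed by the applicant. The function name `firth_logistic`, its parameters, and the step cap of 5 are illustrative assumptions.

```python
import numpy as np


def firth_logistic(X, y, max_iter=100, tol=1e-8):
    """Illustrative Firth-type penalized logistic regression (hypothetical helper).

    X is an (n, k) design matrix that already contains any intercept column;
    y is a length-n vector of 0/1 responses. Returns the coefficient vector.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    beta = np.zeros(X.shape[1])

    for _ in range(max_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))   # fitted probabilities
        w = p * (1.0 - p)                        # logistic variance weights
        info = X.T @ (w[:, None] * X)            # Fisher information X'WX
        info_inv = np.linalg.inv(info)

        # Leverages h_i = w_i * x_i' (X'WX)^{-1} x_i (diagonal of the hat matrix).
        h = w * np.einsum("ij,jk,ik->i", X, info_inv, X)

        # Modified score: the ordinary score X'(y - p) plus the Firth adjustment.
        score = X.T @ (y - p + h * (0.5 - p))
        step = info_inv @ score

        # Assumed safeguard: cap very large steps to keep the iteration stable.
        largest = np.max(np.abs(step))
        if largest > 5.0:
            step *= 5.0 / largest

        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break

    return beta


if __name__ == "__main__":
    # Perfectly separated toy data: ordinary maximum likelihood has no finite
    # slope estimate here, while the penalized estimate stays finite.
    X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]])
    y = np.array([0.0, 0.0, 1.0, 1.0])
    print(firth_logistic(X, y))
```

In the intercept-only case the penalized estimate has a closed form: the fitted probability equals (number of responses + 0.5) / (n + 1). This gives a simple sanity check for a sketch like this one.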

IC Name
NATIONAL CENTER FOR RESEARCH RESOURCES
  • Activity
    R43
  • Administering IC
    RR
  • Application Type
    1
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    106212
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    389
  • Ed Inst. Type
  • Funding ICs
    NCRR:106212
  • Funding Mechanism
    SBIR-STTR
  • Study Section
    BCHI
  • Study Section Name
    Biomedical Computing and Health Informatics Study Section
  • Organization Name
    CYTEL, INC
  • Organization Department
  • Organization DUNS
    183012277
  • Organization City
    CAMBRIDGE
  • Organization State
    MA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    02139
  • Organization District
    UNITED STATES