New Methods to Reduce Bias and Mean Square Error of Maximum Likelihood Estimators

Information

Research Project
7161282

ApplicationId
7161282
Core Project Number
R43RR023228
Full Project Number
1R43RR023228-01
Serial Number
23228
FOA Number
PA-06-006
Sub Project Id

Project Start Date
7/1/2009
Project End Date
12/31/2009
Program Officer Name
SWAIN, AMY L
Budget Start Date
7/1/2009
Budget End Date
12/31/2009
Fiscal Year
2009
Support Year
1
Suffix
Award Notice Date
6/29/2009

Organizations

CYTEL INC

Information

DESCRIPTION (provided by applicant): Logistic regression is the most frequently used model for binary data and has widespread applicability in the health, behavioral, and physical sciences. Over two thousand research papers published in 1999 had "logistic regression" in the title or among the keywords. Maximum likelihood is the nearly universal method for estimating the regression coefficients in logistic regression models. These estimates are reliable for problems with large samples and when the proportion of responses is neither too small nor too large. However, it has been known for several years that maximum likelihood estimates can have high bias and mean square error for small, sparse, or unbalanced datasets, where unbalanced means that the numbers of responses and non-responses differ considerably. Exact logistic regression, a method invented by D. R. Cox, is often useful in such situations. However, exact logistic regression is computationally intensive, and in practice the size of the dataset and the number of covariates it can handle are limited: it may run out of memory or take an inordinate amount of computing time. D. Firth has developed a method for reducing bias and mean square error in logistic regression and other generalized regression models that is less computationally demanding. Studies in the literature have shown that the method often improves on maximum likelihood. Firth's method is not available in any commercial software package today. We propose to incorporate Firth's method into LogXact, Cytel's regression package, and into PROC LOGXACT, a module that runs seamlessly as part of the SAS software system. In addition to incorporating Firth's method for logistic regression, we intend to extend it to conditional logistic regression, ordered and unordered polytomous regression, Poisson regression, and negative binomial regression.
Firth's method does not perform well over certain ranges of model parameters in moderate-sized samples in logistic regression, and there are instances when it is worse than maximum likelihood. We have created a novel method that generalizes Firth's method to overcome this shortcoming, and we propose to implement it in LogXact and PROC LOGXACT. Under certain unusual conditions, both maximum likelihood and Firth's method produce poor estimates for logistic regression. We have developed a diagnostic measure that identifies this situation, and we will incorporate it into our generalization of Firth's method. We will also investigate a Bayesian estimator and the target estimator suggested by Cabrera and Fernholz, both of which show promise of performing well in this situation.
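The contrast between maximum likelihood and Firth's method described above can be illustrated with a minimal sketch. It is not Cytel's implementation and not the proposed generalization. It assumes the commonly published form of Firth's modified score for logistic regression, in which each residual y_i - p_i is adjusted by h_i (1/2 - p_i), with h_i the leverage of observation i. The function name `fit_logistic`, the toy dataset, and the iteration limits are hypothetical choices made for illustration.

```python
import numpy as np

def fit_logistic(X, y, firth=False, max_iter=200, tol=1e-8):
    """Fisher-scoring iterations for logistic regression (illustrative sketch).

    firth=False solves the ordinary maximum likelihood score equations.
    firth=True solves Firth's modified score equations, which add
    h_i * (0.5 - p_i) to each residual, where h_i is the leverage.
    Returns (beta, converged). No step-halving is used, so this is not robust.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    beta = np.zeros(X.shape[1])
    for _ in range(max_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))    # fitted probabilities
        w = p * (1.0 - p)                        # binomial variance weights
        info_inv = np.linalg.inv(X.T @ (X * w[:, None]))  # inverse Fisher information
        resid = y - p
        if firth:
            # Leverages: h_i = w_i * x_i^T (X^T W X)^{-1} x_i
            h = w * np.einsum("ij,jk,ik->i", X, info_inv, X)
            resid = resid + h * (0.5 - p)
        step = info_inv @ (X.T @ resid)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            return beta, True
    return beta, False

# Hypothetical sparse dataset: 2 of 10 responses when x = 0, 10 of 10 when x = 1.
# Every x = 1 observation is a response, so the ML slope estimate is infinite.
x = np.repeat([0.0, 1.0], 10)
y = np.concatenate([[1.0, 1.0], np.zeros(8), np.ones(10)])
X = np.column_stack([np.ones_like(x), x])        # intercept and slope columns

# ML is capped at 25 iterations because the slope grows without bound.
beta_ml, converged_ml = fit_logistic(X, y, firth=False, max_iter=25)
beta_firth, converged_firth = fit_logistic(X, y, firth=True)

# For a single binary covariate, Firth's estimate equals the log odds ratio
# computed after adding 1/2 to each cell of the 2x2 table.
expected_slope = np.log(10.5 / 0.5) - np.log(2.5 / 8.5)

print("ML slope after 25 iterations:", beta_ml[1], "converged:", converged_ml)
print("Firth slope:", beta_firth[1], "half-cell-corrected value:", expected_slope)
```

In this example the maximum likelihood slope keeps growing with every iteration and never converges, while the Firth estimate is finite. A production implementation would need step-halving and safeguards against a singular information matrix, which this sketch omits.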

IC Name

NATIONAL CENTER FOR RESEARCH RESOURCES

Activity
R43
Administering IC
RR
Application Type
1

Direct Cost Amount
Indirect Cost Amount
Total Cost
106212
Sub Project Total Cost

ARRA Funded
False
CFDA Code
389
Ed Inst. Type
Funding ICs
NCRR:106212
Funding Mechanism
SBIR-STTR
Study Section
BCHI
Study Section Name
Biomedical Computing and Health Informatics Study Section

Organization Name
CYTEL, INC
Organization Department
Organization DUNS
183012277
Organization City
CAMBRIDGE
Organization State
MA
Organization Country
UNITED STATES
Organization Zip Code
02139
Organization District
UNITED STATES
