|
Sign In to gain access to subscriptions and/or personal tools.
|
The Effect of Errors in Estimating Ability on Goodness-of-Fit Tests for Irt Models
Clement A. Stone
Mary A. Hansen
University of Pittsburgh
Assessing goodness of fit of item response theory models typically involves evaluating differences between observed and expected score response distributions using a chi-square test statistic. When these methods are applied to assessments that are shorter in length, uncertainty with which ability is estimated greatly affects the approximation to the null chi-square distribution. Results from a Monte Carlo study indicated serious departures between null theoretical distributions and empirically derived sampling distributions for the chi-square statistic for tests with 8 and 16 constructed response items. This article also describes a fit statistic that attempts to account for the uncertainty in estimating ability and that could therefore be applied to testing situations in which ability is not precisely estimated. This method employs more information from the same distribution used to obtain Bayesian point estimates of ability and reflects probabilities that examinees have ability equal to a range of values rather than restricting expectations to single values.
Educational and Psychological Measurement, Vol. 60, No. 6,
974-991 (2000)
DOI: 10.1177/00131640021970907

CiteULike Complore Connotea Del.icio.us Digg Reddit Technorati Twitter What's this?
This article has been cited by other articles:

|
 |

|
 |
 
T. Liang and C. S. Wells
A Model Fit Statistic for Generalized Partial Credit Model
Educational and Psychological Measurement,
December 1, 2009;
69(6):
913 - 928.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
F. J. Abad, J. Olea, and V. Ponsoda
The Multiple-Choice Model: Some Solutions for Estimation of Parameters in the Presence of Omitted Responses
Applied Psychological Measurement,
May 1, 2009;
33(3):
200 - 221.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
Bo Zhang and C. A. Stone
Evaluating Item Fit for Multidimensional Item Response Models
Educational and Psychological Measurement,
April 1, 2008;
68(2):
181 - 196.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. E. Demars
Type I Error Rates for Parscale's Fit Index
Educational and Psychological Measurement,
February 1, 2005;
65(1):
42 - 50.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. A. Stone
IRTFIT-RESAMPLE: A Computer Program for Assessing Goodness of Fit of Item Response Theory Models Based on Posterior Expectations
Applied Psychological Measurement,
March 1, 2004;
28(2):
143 - 144.
[PDF]
|
 |
|

|
 |

|
 |
 
C. E. DeMars
Type I Error Rates for Generalized Graded Unfolding Model Fit Indices
Applied Psychological Measurement,
January 1, 2004;
28(1):
48 - 71.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. A. Stone
Empirical Power and Type I Error Rates for an IRT Fit Statistic that Considers the Precision of Ability Estimates
Educational and Psychological Measurement,
August 1, 2003;
63(4):
566 - 583.
[Abstract]
[PDF]
|
 |
|
|
|