Educational and Psychological Measurement

Empirical Power and Type I Error Rates for an IRT Fit Statistic that Considers the Precision of Ability Estimates

Clement A. Stone

University of Pittsburgh, cas+{at}pitt.edu

Model-data fit of item response theory (IRT) models is generally assessed by comparing examinees' observed performance on individual items with the performance predicted under the chosen IRT model. However, traditional chi-square methods are not appropriate for evaluating the goodness-of-fit of IRT models when the underlying trait or ability is estimated imprecisely (e.g., on shorter assessments). This article describes a goodness-of-fit statistic that directly accounts for the uncertainty with which ability is estimated, together with a resampling-based hypothesis testing procedure. A simulation study was conducted to evaluate the empirical power and Type I error rates of the proposed procedure. The results indicated that the procedure should be useful for evaluating goodness-of-fit of IRT models in most testing applications where uncertainty in ability estimation is an issue.

Key Words: goodness-of-fit • IRT • posterior probabilities • performance assessments

Educational and Psychological Measurement, Vol. 63, No. 4, 566-583 (2003)
DOI: 10.1177/0013164402251034
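
The abstract gives no formulas, so the following is only a rough sketch of the general idea it describes, not the article's actual statistic or the IRTFIT-RESAMPLE implementation. It assumes a two-parameter logistic model, a fixed grid of ability points with a standard normal prior, and a Pearson-type discrepancy; all function names, the grid, and the item parameters are hypothetical. Each examinee's response is spread over the ability grid in proportion to the posterior probability of each point, rather than being assigned to a single point estimate, and the reference distribution is obtained by simulating data from the fitted model.

```python
import math
import random

def p_correct(theta, a, b):
    """2PL item response function (an assumed model for this sketch)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def make_grid(n_points=11, lo=-3.0, hi=3.0):
    """Evenly spaced ability points with normalized standard-normal prior weights."""
    step = (hi - lo) / (n_points - 1)
    points = [lo + i * step for i in range(n_points)]
    weights = [math.exp(-0.5 * t * t) for t in points]
    total = sum(weights)
    return points, [w / total for w in weights]

def posterior(responses, items, points, prior):
    """Posterior probability of each grid point given one response vector."""
    post = []
    for t, w in zip(points, prior):
        like = w
        for u, (a, b) in zip(responses, items):
            p = p_correct(t, a, b)
            like *= p if u == 1 else 1.0 - p
        post.append(like)
    total = sum(post)
    return [x / total for x in post]

def fit_statistic(data, items, item_index, points, prior):
    """Pearson-type discrepancy for one item from posterior-weighted pseudo-counts."""
    n_k = [0.0] * len(points)  # expected number of examinees at each point
    r_k = [0.0] * len(points)  # expected number of correct responses at each point
    for responses in data:
        for k, h in enumerate(posterior(responses, items, points, prior)):
            n_k[k] += h
            r_k[k] += h * responses[item_index]
    a, b = items[item_index]
    stat = 0.0
    for k, t in enumerate(points):
        if n_k[k] > 1e-8:
            expected = p_correct(t, a, b)
            observed = r_k[k] / n_k[k]
            stat += n_k[k] * (observed - expected) ** 2 / (expected * (1.0 - expected))
    return stat

def simulate(n_examinees, items, rng):
    """Generate 0/1 responses from the assumed model with N(0, 1) abilities."""
    data = []
    for _ in range(n_examinees):
        theta = rng.gauss(0.0, 1.0)
        data.append([1 if rng.random() < p_correct(theta, a, b) else 0
                     for a, b in items])
    return data

def resampling_p_value(data, items, item_index, n_resamples=50, seed=0):
    """Compare the observed statistic with statistics from model-simulated data."""
    rng = random.Random(seed)
    points, prior = make_grid()
    observed = fit_statistic(data, items, item_index, points, prior)
    exceed = 0
    for _ in range(n_resamples):
        fake = simulate(len(data), items, rng)
        if fit_statistic(fake, items, item_index, points, prior) >= observed:
            exceed += 1
    return observed, (exceed + 1) / (n_resamples + 1)

# Hypothetical (discrimination, difficulty) pairs for a short four-item test.
items = [(1.0, -1.0), (1.2, 0.0), (0.8, 1.0), (1.5, 0.5)]
data = simulate(200, items, random.Random(1))
stat, p_value = resampling_p_value(data, items, item_index=0)
print(f"statistic = {stat:.3f}, resampling p-value = {p_value:.3f}")
```

Because the same examinee contributes to every grid point, the pseudo-counts are not independent, which is one reason a sketch like this leans on resampling rather than a chi-square table. A real application would use estimated item parameters and many more resamples than the 50 used here.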




This article has been cited by other articles:


F. J. Abad, J. Olea, and V. Ponsoda
The Multiple-Choice Model: Some Solutions for Estimation of Parameters in the Presence of Omitted Responses
Applied Psychological Measurement, May 1, 2009; 33(3): 200 - 221.


Bo Zhang and C. A. Stone
Evaluating Item Fit for Multidimensional Item Response Models
Educational and Psychological Measurement, April 1, 2008; 68(2): 181 - 196.


C. E. DeMars
Type I Error Rates for Parscale's Fit Index
Educational and Psychological Measurement, February 1, 2005; 65(1): 42 - 50.


C. A. Stone
IRTFIT-RESAMPLE: A Computer Program for Assessing Goodness of Fit of Item Response Theory Models Based on Posterior Expectations
Applied Psychological Measurement, March 1, 2004; 28(2): 143 - 144.


C. E. DeMars
Type I Error Rates for Generalized Graded Unfolding Model Fit Indices
Applied Psychological Measurement, January 1, 2004; 28(1): 48 - 71.