Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here for more information on Research and Evaluation in Education and Psychology, 3e

CiteULike is a free service for managing and discovering scholarly references - click here to get started.

Sign In to gain access to subscriptions and/or personal tools.
Educational and Psychological Measurement
This Article
Right arrow Abstract Freely available
Right arrow Free Full Text (Free PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (1)
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Sheng, Y.
Right arrow Articles by Wikle, C. K.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Comparing Multiunidimensional and Unidimensional Item Response Theory Models

Yanyan Sheng

Southern Illinois University Carbondale, ysheng{at}siu.edu

Christopher K. Wikle

University of Missouri-Columbia

For tests consisting of multiple subtests, unidimensional item response theory (IRT) models apply when the subtests are known to measure a common underlying ability. However, in many instances, due to the lack of a satisfactory index for assessing the dimensionality assumption, the test structure is not clear. A more general IRT model, the multiunidimensional model, is more flexible and efficient in various test situations. This article compares these two classes of normal ogive two-parameter models and shows that the multiunidimensional model offers a better way to represent test situations not realized in unidimensional models.

Key Words: item response theory • unidimensional model • multiunidimensional model • Markov chain Monte Carlo • Bayesian model choice

References

  • Ackerman, T.A. (1989). Unidimensional IRT calibration of compensatory and noncompensatory items. Applied Psychological Measurement, 13, 113-127.[Medline] [Order article via Infotrieve]
  • Ackerman, T.A. (1993). Insuring the validity of the reported score scale by reporting multiple scores. Paper presented at the North American Meeting of the Psychometric Society, Berkeley, CA.
  • Albert, J.H. (1992). Bayesian estimation of normal ogive item response curves using Gibbs sampling. Journal of Educational Statistics, 17, 251-269.[CrossRef][Web of Science]
  • Albert, J.H., & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association, 88, 669-679.[CrossRef][Web of Science]
  • Béguin, A.A., & Glas, C.A.W. (2001). MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika, 66, 541-562.[CrossRef]
  • Birnbaum, A. (1968). The logistic test model. In F. Lord & M. Novick (Eds.), Statistical theories of mental test scores (pp. 397-423). Reading, MA: Addison-Wesley.
  • Bock, R.D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443-459.[CrossRef][Web of Science]
  • Brooks, S.P., & Gelman, A. (1998). General methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics, 7, 434-455.[CrossRef]
  • Carlin, B.P., & Louis, T.A. (2000). Bayes and empirical Bayes methods for data analysis (2nd ed.). London: Chapman & Hall.
  • De Champlain, A.F., & Gessaroli, M.E. (1998). Assessing the dimensionality of item response matrices with small sample sizes and short test lengths. Applied Measurement in Education, 11, 231-253.[CrossRef][Web of Science]
  • Folk, V.G., & Green, B.F. (1989). Adaptive estimation when the unidimensionality assumption of IRT is violated. Applied Psychological Measurement, 13, 373-389.[Abstract]
  • Gelman, A., Carlin, J.B., Stern, H.S., & Rubin, D.B. (2003). Bayesian data analysis. Boca Raton, FL: Chapman & Hall/CRC.
  • Gessaroli, M.E., & De Champlain, A.F. (1996). Using an approximate chi-square statistic to test the number of dimensions underlying the responses to a set of items. Journal of Educational Measurement, 33, 157-179.[CrossRef][Web of Science]
  • Hambleton, R.K., & Han, N. (2004). Assessing the fit of IRT models: Some approaches and graphical displays. Paper presented at the annual meeting of the National Council on Measurement, San Diego, CA.
  • Hambleton, R.K., Swaminathan, H., & Rogers, H.J. (1991). Fundamentals of item response theory. London: Sage.
  • Hattie, J. (1985). Methodology review: Assessing unidimensionality of tests and items. Applied Psychological Measurement, 9, 139-164.[Abstract]
  • Hattie, J., Krakowski, K., Rogers, H.J., & Swaminathan, H. (1996). An assessment of Stout's index of essential unidimensionality. Applied Psychological Measurement, 20, 1-14.[Abstract]
  • Hoijtink, H., & Molenaar, I.W. (1997). A multidimensional item response model: Constrained latent class analysis using the Gibbs sampler and posterior predictive checks. Psychometrika, 62, 171-189.[CrossRef][Web of Science]
  • Lee, H. ( 1995). Markov chain Monte Carlo methods for estimating multidimensional ability in item response analysis. Unpublished doctoral dissertation, University of Missouri, Columbia.
  • Lord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum.
  • McDonald, R.P. (1985). Factor analysis and related methods. Hillside, NJ: Lawrence Erlbaum.
  • McNemar, Q. (1946). Opinion-attitude methodology. Psychological Bulletin, 43, 289-374.[CrossRef][Web of Science]
  • Mislevy, R.J. (1986). Bayes modal estimation in item response models. Psychometrika, 51, 177-195.[CrossRef][Web of Science]
  • Patz, R.J., & Junker, B.W. (1999). A straightforward approach to Markov chain Monte Carlo methods for item response models. Journal of Educational and Behavioral Statistics, 24, 146-178.[Abstract/Free Full Text]
  • Reckase, M.D. (1997). The past and future of multidimensional item response theory. Applied Psychological Measurement, 21, 25-36.[Abstract/Free Full Text]
  • Segall, D.O. (2002). Confirmatory item factor analysis using Markov chain Monte Carlo estimation with applications to online calibration in CAT. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans, LA.
  • Sinharay, S. (2005). Assessing fit of unidimensional item response models using a Bayesian approach. Journal of Educational Measurement, 42, 375-394.[CrossRef][Web of Science]
  • Sinharay, S., & Johnson, M.S. (2003). Simulation studies applying posterior predictive model checking for assessing fit of the common item response theory models (No. ETS RR-03-28). Princeton, NJ: Educational Testing Service.
  • Sinharay, S., & Stern, H.S. (2003). Posterior predictive model checking in hierarchical models. Journal of Statistical Planning and Inference, 111, 209-221.[CrossRef]
  • Spiegelhalter, D.J., Best, N., & Carlin, B.P. (1998). Bayesian deviance, the e fective number of parameters, and the comparison of arbitrarily complex models (Research Report 98-009). Minneapolis: Division of Biostatistics, University of Minnesota.
  • Walker, C.M., & Beretvas, S.N. (2000). Using multidimensional versus unidimensional ability estimates to determine student proficiency in mathematics. Paper presented at the annual meeting of the American Educational Research Association, New Orleans, LA.

This version was published on December 1, 2007

Educational and Psychological Measurement, Vol. 67, No. 6, 899-919 (2007)
DOI: 10.1177/0013164406296977


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Applied Psychological MeasurementHome page
J. de la Torre and H. Song
Simultaneous Estimation of Overall and Domain Abilities: A Higher-Order IRT Model Approach
Applied Psychological Measurement, November 1, 2009; 33(8): 620 - 639.
[Abstract] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Free Full Text (Free PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (1)
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Sheng, Y.
Right arrow Articles by Wikle, C. K.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?