Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to submit your manuscript to SPPS

Click here for more information on Research and Evaluation in Education and Psychology, 3e

Sign In to gain access to subscriptions and/or personal tools.
Educational and Psychological Measurement
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Roznowski, M.
Right arrow Articles by Reith, J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Examining the Measurement Quality of Tests Containing Differentially Functioning Items: Do Biased Items Result in Poor Measurement?

Mary Roznowski

Janet Reith

Ohio State University

This study investigated effects of retaining test items manifesting differential item functioning (DIF) on aspects of the measurement quality and validity of that test’s scores. DIF was evaluated using the Mantel-Haenszel procedure, which allows one to detect items that function differently in two groups of examinees at constant levels of the trait. Multiple composites of DIF-and non-DIF-containing items were created to examine the impact of DIF on the measurement, validity, and predictive relations involving those composites. Criteria used were the American College Testing composite, the Scholastic Aptitude Test (SAT) verbal (SATV), quantitative (SATQ), composite (SATC), and grade point average rank percentile. Results indicate measurement quality of tests is not seriously degraded when items manifesting DIF are retained, even when number of items in the compared composites has been controlled. Implications of results are discussed within the framework of multiple determinants of item responses.

Educational and Psychological Measurement, Vol. 59, No. 2, 248-269 (1999)
DOI: 10.1177/00131649921969839


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Educational and Psychological MeasurementHome page
J. C. Immekus and S. J. Maller
Item Parameter Invariance of the Kaufman Adolescent and Adult Intelligence Test Across Male and Female Samples
Educational and Psychological Measurement, December 1, 2009; 69(6): 994 - 1012.
[Abstract] [PDF]


Home page
AssessmentHome page
R. Sheppard, K. Han, S. M. Colarelli, G. Dai, and D. W. King
Differential Item Functioning by Sex and Race in the Hogan Personality Inventory
Assessment, December 1, 2006; 13(4): 442 - 453.
[Abstract] [PDF]


Home page
Language TestingHome page
T.-I. Pae and G.-P. Park
Examining the relationship between differential item functioning and differential test functioning
Language Testing, October 1, 2006; 23(4): 475 - 496.
[Abstract] [PDF]


Home page
Language TestingHome page
T.-I. Pae
DIF for examinees with different academic backgrounds
Language Testing, January 1, 2004; 21(1): 53 - 73.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
B. B. Ellis and A. D. Mead
Assessment of the Measurement Equivalence of a Spanish Translation of the 16PF Questionnaire
Educational and Psychological Measurement, October 1, 2000; 60(5): 787 - 807.
[Abstract] [PDF]


Home page
Language TestingHome page
S. Takala and F. Kaftandjieva
Test fairness: a DIF analysis of an L2 vocabulary test
Language Testing, July 1, 2000; 17(3): 323 - 340.
[Abstract] [PDF]