|
Sign In to gain access to subscriptions and/or personal tools.
|
Examining the Measurement Quality of Tests Containing Differentially Functioning Items: Do Biased Items Result in Poor Measurement?
Mary Roznowski
Janet Reith
Ohio State University
This study investigated effects of retaining test items manifesting differential item functioning (DIF) on aspects of the measurement quality and validity of that tests scores. DIF was evaluated using the Mantel-Haenszel procedure, which allows one to detect items that function differently in two groups of examinees at constant levels of the trait. Multiple composites of DIF-and non-DIF-containing items were created to examine the impact of DIF on the measurement, validity, and predictive relations involving those composites. Criteria used were the American College Testing composite, the Scholastic Aptitude Test (SAT) verbal (SATV), quantitative (SATQ), composite (SATC), and grade point average rank percentile. Results indicate measurement quality of tests is not seriously degraded when items manifesting DIF are retained, even when number of items in the compared composites has been controlled. Implications of results are discussed within the framework of multiple determinants of item responses.
Educational and Psychological Measurement, Vol. 59, No. 2,
248-269 (1999)
DOI: 10.1177/00131649921969839

CiteULike Complore Connotea Del.icio.us Digg Reddit Technorati Twitter What's this?
This article has been cited by other articles:

|
 |

|
 |
 
J. C. Immekus and S. J. Maller
Item Parameter Invariance of the Kaufman Adolescent and Adult Intelligence Test Across Male and Female Samples
Educational and Psychological Measurement,
December 1, 2009;
69(6):
994 - 1012.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Sheppard, K. Han, S. M. Colarelli, G. Dai, and D. W. King
Differential Item Functioning by Sex and Race in the Hogan Personality Inventory
Assessment,
December 1, 2006;
13(4):
442 - 453.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
T.-I. Pae and G.-P. Park
Examining the relationship between differential item functioning and differential test functioning
Language Testing,
October 1, 2006;
23(4):
475 - 496.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
T.-I. Pae
DIF for examinees with different academic backgrounds
Language Testing,
January 1, 2004;
21(1):
53 - 73.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
B. B. Ellis and A. D. Mead
Assessment of the Measurement Equivalence of a Spanish Translation of the 16PF Questionnaire
Educational and Psychological Measurement,
October 1, 2000;
60(5):
787 - 807.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Takala and F. Kaftandjieva
Test fairness: a DIF analysis of an L2 vocabulary test
Language Testing,
July 1, 2000;
17(3):
323 - 340.
[Abstract]
[PDF]
|
 |
|
|
|