|
Sign In to gain access to subscriptions and/or personal tools.
|
Analysis of the Gender Variable in the Eysenck Personality Questionnaire–Revised Scales Using Differential Item Functioning Techniques
Sergio Escorial
Centro de Estudios Superiores Cardenal Cisneros, sergio.escorial{at}uam.es
Maria J. Navas
Universidad Nacional de Educación a Distancia
Studies in the field of personality have systematically found gender differences in two of the three dimensions of the Eysenck model: neuroticism and psychoticism. This study aims to analyze these differences in the Eysenck Personality Questionnaire—Revised (EPQ-R) scales using differential item functioning (DIF) techniques to determine whether these differences are the result of a differential functioning of the items between males and females or if, on the contrary, they may be reflecting true differences in the assessed dimensions. To this end, 794 participants within a wide age range were evaluated using the EPQ-R test. The following detection methods were used in order to examine DIF: standardization, the simultaneous item bias test, logistic regression, Lord's 2 test, and the differential functioning of items and tests framework. According to the results, the gender differences observed do not seem to be the result of any flaw of the measuring instrument used.
Key Words: EPQ-R gender di ferences DIF standardization logistic regression Lord's 2 test DFIT framework SIBTEST
References
- Barrett, P., & Eysenck, S. (1984). The assessment of personality factors across 25 countries. Personality and Individual Di ferences, 5, 615-632.[CrossRef]
- Borsboom, D., Mellenbergh, G., & Van Heerden, J. (2002). Different kinds of DIF: A distinction between absolute and relative forms of measurement invariance and bias. Applied Psychological Measurement, 26, 433-450.[Abstract/Free Full Text]
- Cattell, R., & Scheier, I. (1961). Handbook for the Neuroticism Scale Questionnaire: The NSQ. Champaign, IL: IPAT.
- Collins, W., Raju, N., & Edwards, J. (2000). Assessing differential functioning in a Satisfaction scale. Journal of Applied Psychology, 85, 451-461.[CrossRef][Web of Science][Medline]
[Order article via Infotrieve]
- Colom, R., & Jayme-Zaro, M. (2004). La psicología de las diferencias de sex [Psychology of sex differences]. Madrid: Biblioteca Nueva.
- Costa, P.T., & McCrae, R.R. (1992). Revised NEO Personality Inventory and NEO Five-Factor Inventory (NEO-FFI). Obessa, FL: Psychological Assessment Resources.
- Delgado, C. (1995). Sesgo de género en la medición del neuroticismo [Gender bias in neuroticism measurement]. Ciencias Sociales, 69, 51-66.
- Dorans, N.J., & Holland, P.W. (1993). DIF detection and description: Mantel-Haenszel and standardization. In P. W. Holland & H. Wainer (Eds.), Di ferential item functioning (pp. 35-66). Hillsdale, NJ: Lawrence Erlbaum.
- Ellis, B., & Mead, A. (2000). Assessment of the measurement equivalence of a Spanish translation of the 16PF questionnaire. Educational and Psychological Measurement, 60, 787-807.[Abstract/Free Full Text]
- Eysenck, H.J., & Eysenck, S.B.G. (1975). Manual of the Eysenck Personality Questionnaire. London: Hodder & Stoughton.
- Eysenck, H.J., & Eysenck, S.B.G. (1997). Cuestionario revisado de personalidad de Eysenck (EPQ-R) [Manual of the Eysenck Personality Questionnaire-Revised].Madrid: TEA Ediciones.
- Fan, X., & Thompson, B. (2001). Confidence intervals about score reliability coefficients, please: An EPM guidelines editorial. Educational and Psychological Measurement, 61, 517-531.[Abstract/Free Full Text]
- Feingold, A. (1994). Gender differences in personality: A meta-analysis. Psychological Bulletin, 116, 429-456.[CrossRef][Web of Science][Medline]
[Order article via Infotrieve]
- Francis, L. (1993). The dual nature of the Eysenckian Neuroticism scales: A question of sex differences? Personality and Individual Di ferences, 15, 43-59.[CrossRef]
- Gelin, M., & Zumbo, B. (2003). Differential item functioning results may change depending on how an item is scored: An illustration with the Center for Epidemiologic Studies Depression Scale. Educational and Psychological Measurement, 63, 65-74.[Abstract/Free Full Text]
- Hambleton, R., & Swaminathan, H. (1985). Item response theory: Principles and applications. Boston: Kluwer-Nijhoff.
- Henson, R.K. (2001). Understanding internal consistency reliability estimates: A conceptual primer on coefficient alpha. Measurement and Evaluation in Counseling and Development, 34, 177-189.[Web of Science]
- Jensen, A. (1998). The g factor. London: Praeger.
- Jodoin, M., & Gierl, M. (2001). Evaluating Type I error and power rates using an effect size measure with the logistic regression procedure for DIF detection. Applied Measurement in Education, 14, 329-349.[CrossRef][Web of Science]
- Jorm, A. (1987). Sex differences in neuroticism: A quantitative synthesis of published research. Australian and New Zealand Journal of Psychiatry, 21, 501-506.[Web of Science][Medline]
[Order article via Infotrieve]
- Lange, R., Irwin, H., & Houran, J. (2000). Top-down purification of Tobacyk's revised Paranormal Belief scale. Personality and Individual Di ferences, 29, 131-156.[CrossRef]
- Lord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum.
- Nunnally, J.C., & Bernstein, I.H. (1994). Psychometric theory (3rd ed.). New York: McGraw-Hill.
- Ortet, G., Ibánñez, M., Moro, M., Silva, F., & Boyle, G. (1999). Psychometric appraisal of Eysenck's Psychoticism scale: A cross cultural study. Personality and Individual Di ferences, 27, 1209-1219.[CrossRef]
- Raju, N. (1988). The area between two item characteristic curves. Psychometrika, 53, 495-502.[CrossRef][Web of Science]
- Raju, N. (1990). Determining the significance of estimated signed and unsigned areas between two item response functions. Applied Psychological Measurement, 14, 197-207.[Abstract]
- Raju, N., van der Linden, W., & Fleer, P. (1995). IRT-based internal measures of differential functioning of items and tests. Applied Psychological Measurement, 19, 353-368.[Abstract]
- Reise, S., Smith, L., & Furr, M. (2001). Invariance on the NEO PI-R Neuroticism scale. Multivariate Behavioral Research, 36, 83-110.[CrossRef][Web of Science]
- Shealy, R.T., & Stout, W.F. (1993). An item response theory model for test bias and differential test functioning. In P. Holland & H. Wainer (Eds.), Di ferential item functioning (pp. 197-240). Hillsdale, NJ: Lawrence Erlbaum.
- Smith, L. (2002). On the usefulness of item bias analysis to personality psychology. Personality and Social Psychology Bulletin, 28, 754-763.[Abstract/Free Full Text]
- Stout, W., & Roussos, L. (1999). Dimensionality-based DIF/DBF package [Computer software]. Urbana-Champaign: William Stout Institute for Measurement, University of Illinois.
- Swaminathan, H., & Rogers, H.J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27, 361-370.[CrossRef][Web of Science]
- Thissen, D. (1991). MULTILOG: Multiple, categorical item analysis and tests scoring using item response theory. Chicago: Scientific Software.
- Waller, N. (1998). LINKDIF: Linking item parameters and calculating IRT measures of differential item functioning of items and tests. Applied Psychological Measurement, 22, 392.[Free Full Text]
- Zumbo, B., & Thomas, D. (1997). A measure of e fect size for a model-based approach for studying DIF. Prince George, BC, Canada: Edgeworth Laboratory for Quantitative Behavioral Science, University of Northern British Columbia.
This version was published on December
1, 2007
Educational and Psychological Measurement, Vol. 67, No. 6,
990-1001 (2007)
DOI: 10.1177/0013164406299108

CiteULike Complore Connotea Del.icio.us Digg Reddit Technorati Twitter What's this?
|
|