Analysis of Mid-Semester Examination Item Quality Using the Item Response Theory Approach

Authors

  • Norina Liranti Pellokila, Institut Agama Kristen Negeri Kupang, Indonesia
  • Agnes Demarci Nuban, Institut Agama Kristen Negeri Kupang, Indonesia
  • Frengki Lado, Institut Agama Kristen Negeri Kupang, Indonesia

Keywords:

Conceptual Review, Item Response Theory, Three-Parameter Logistic Model, Item Quality, Educational Assessment

Abstract

This article presents a conceptual and theoretical review of the application of Item Response Theory (IRT), specifically the three-parameter logistic (3PL) model, as a framework for analyzing the quality of Mid-Semester Examination (MSE) items at the senior high school level. Unlike conventional empirical articles, this review does not rely on directly collected examinee response data; instead, it explores the theoretical foundations, mathematical structure, and practical implications of the 3PL model through a comprehensive synthesis of the psychometric literature. The review examines the three core parameters of the 3PL model, namely discrimination (a), difficulty (b), and pseudo-guessing (c), along with their interpretive criteria, procedures for verifying model assumptions, and a contextually grounded IRT implementation framework for educational institutions in Indonesia, particularly in East Nusa Tenggara (NTT). The article also critically compares the IRT framework with Classical Test Theory (CTT) to clarify the conceptual advantages and practical limitations of each approach. The findings of this review are intended to serve as a conceptual foundation for teachers, evaluators, and education policymakers in adopting IRT systematically to enhance the validity and reliability of assessment instruments at the secondary school level.
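
To make the roles of the three parameters concrete, the following is a minimal sketch of the 3PL item characteristic curve in its standard logistic form. The function name, the example item values, and the choice to omit the scaling constant D = 1.7 are assumptions made for illustration; they are not taken from the article itself.

```python
import math


def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL model.

    theta: examinee ability
    a: discrimination, b: difficulty, c: pseudo-guessing (lower asymptote)
    Assumption: logistic metric, i.e. no D = 1.7 scaling constant.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item parameters, chosen only for illustration.
item = {"a": 1.2, "b": 0.5, "c": 0.2}

# Evaluate the curve at a low, a medium (theta == b), and a high ability.
for theta in (-2.0, 0.5, 3.0):
    print(theta, round(p_correct_3pl(theta, **item), 3))
```

In this form, an examinee whose ability equals the item difficulty (theta = b) has a success probability of (1 + c) / 2, and the probability approaches c for very low abilities, which is why c is read as a pseudo-guessing parameter.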


 


Published

2026-06-13