Evaluasi Karakteristik Butir Soal Tes Numerikal Differential Aptitude Test melalui Analisis Psikometrik

Authors

  • Lira Erwinda Universitas Bina Bangsa
  • Mira Marlina Universitas Bina Bangsa
  • Riris Miladul Idaini Universitas Bina Bangsa
  • Destri Astrianingsih Universitas Bina Bangsa
  • Yuda Syahputra Universitas Indraprasta PGRI

Keywords:

Psychometric evaluation, Rasch model, Differential aptitude test, Numerical ability, Item analysis, difdiferential item functioning

Abstract

Studi ini mengevaluasi karakteristik psikometrik subtes numerik dari Tes Bakat Diferensial menggunakan Model Rasch untuk menentukan kualitas dan kelayakan instrumen dalam mengukur kemampuan numerik. Desain evaluasi psikometrik kuantitatif digunakan yang melibatkan siswa sekolah menengah pertama yang dipilih melalui pengambilan sampel acak sederhana. Data dikumpulkan menggunakan tes bakat numerik dikotomis 40 item dan dianalisis menggunakan Winsteps versi 5.6. Temuan menunjukkan bahwa instrumen tersebut menunjukkan kualitas psikometrik keseluruhan yang baik, dengan reliabilitas yang memuaskan, indeks pemisahan yang dapat diterima, dan konsistensi internal yang memadai. Sebagian besar item sesuai dengan model Rasch, menunjukkan bahwa sebagian besar item tes berfungsi dengan tepat dalam mengukur kemampuan numerik. Distribusi kesulitan item berkisar dari sangat mudah hingga sangat sulit, menunjukkan bahwa instrumen tersebut mencakup berbagai tingkat kemampuan. Peta item-orang menunjukkan bahwa tes tersebut umumnya selaras dengan tingkat kemampuan sebagian besar responden, meskipun jumlah item yang sangat sulit terbatas. Analisis Fungsi Item Diferensial mengungkapkan bahwa sebagian besar item adil di seluruh kelompok, dengan hanya sejumlah kecil yang menunjukkan potensi bias. Secara keseluruhan, subtes numerik dari Tes Bakat Diferensial dianggap cukup layak untuk penilaian pendidikan, meskipun beberapa item memerlukan revisi dan evaluasi lebih lanjut.

References

Ajmi, M. Al, Mustakim, S. S., Roslan, S., & Almehrizi, R. (2024). Psychometric characteristics of the numerical ability test for Gulf students. International Journal of Evaluation and Research in Education , 13(4), 2552–2561. https://doi.org/10.11591/ijere.v13i4.28917

Alkhatib, H. S., Brazeau, G., Akour, A., & Almuhaissen, S. A. (2020). Evaluation of the effect of items’ format and type on psychometric properties of sixth year pharmacy students clinical clerkship assessment items. BMC Medical Education, 20(1). https://doi.org/10.1186/s12909-020-02107-3

Andrich, D., & Marais, I. (2019). A Course in Rasch Measurement Theory : Measuring in the Educational, Social and Health Sciences. Springer Texts in Education. https://doi.org/https://doi.org/10.1007/978-981-13-7496-8

Bagaskara, R. S., Iman, H. I., Assa, N. A., & Yudiana, W. (2025). Properti Psikometri Indonesia Desirable Responding Scale. Journal of Psychological Science and Profession, 9(3), 208–221. https://doi.org/10.24198/jpsp.v9i3.68135

Baluku, M. M., Mugabi, E. N., Nansamba, J., Matagi, L., Onderi, P., & Otto, K. (2021). Psychological Capital and Career Outcomes among Final Year University Students: the Mediating Role of Career Engagement and Perceived Employability. International Journal of Applied Positive Psychology, 6(1), 55–80. https://doi.org/10.1007/s41042-020-00040-w

Bennett, G. K., Seashore, H. G., & Wesman, A. G. (1990). Differential Aptitude Tests (5th ed.) (San Antoni). TX: Psychological Corporation.

Bond, T. G., Yan, Z., & Heene, M. (2020). Applying the rasch model: Fundamental measurement in the human sciences. In Applying the Rasch Model: Fundamental Measurement in the Human Sciences. Psychology Press. https://doi.org/10.4324/9780429030499

Bond, T. G., Yan, Z., & Heene, M. (2021). Applying the Rasch Model: Fundamental Measurement in the Human Sciences (4th ed.). Routledge.

Boone, W. J., & Staver, J. R. (2020). Point Measure Correlation. Advances in Rasch Analyses in the Human Sciences, 25–38. https://doi.org/10.1007/978-3-030-43420-5_3

Boone, W. J., Staver, J. R., & Yale, M. S. (2013). Rasch analysis in the human sciences. Springer.

Boone, W. J., Stever, J. R., & Yale, M. S. (2014). Rasch Analysis in the Human Science. Springer.

Cupani, M., & Cortez, F. D. (2016). Análisis psicométricos del Subtest de Razonamiento Numérico utilizando el Modelo de Rasch. Revista de Psicología, 25(2), 1–16. https://doi.org/10.5354/0719-0581.2016.44558

Donoso, O. A., Hernandez, B., & Horin, E. V. (2010). Use of psychological tests within vocational rehabilitation. Journal of Vocational Rehabilitation, 32(3), 191–200. https://doi.org/10.3233/JVR-2010-0509

Fährmann, K., Köhler, C., Hartig, J., & Heine, J. H. (2022). Practical significance of item misfit and its manifestations in constructs assessed in large-scale studies. Large-Scale Assessments in Education, 10(1). https://doi.org/10.1186/s40536-022-00124-w

Goble, D. (1998). Using the Differential Aptitude Tests for Selection and Prediction in Vocational Education and Training. Australian Journal of Career Development, 7(1), 20–23. https://doi.org/10.1177/103841629800700107

Gökçe, S., Berberoğlu, G., Wells, C. S., & Sireci, S. G. (2021). Linguistic Distance and Translation Differential Item Functioning on Trends in International Mathematics and Science Study Mathematics Assessment Items. Journal of Psychoeducational Assessment, 39(6), 728–745. https://doi.org/10.1177/07342829211010537

Goldstein, H., Pulakos, E., Passmore, J., & Semedo, C. (2020). The prevalence of cognitive tests in personnel selection. In The Wiley Blackwell handbook of the psychology of recruitment, selection and employee retention. https://ebrary.net/106495/psychology/prevalence_cognitive_tests_personnel_selection

Gonzalez, O., Georgeson, A. R., & Pelham, W. E. (2023). How Accurate and Consistent Are Score-Based Assessment Decisions? A Procedure Using the Linear Factor Model. Assessment, 30(5), 1640–1650. https://doi.org/10.1177/10731911221113568

Hamid, A. (2025). Hubungan antara Kemampuan Numerik dan Efektivitas Pemecahan Masalah Matematis Mahasiswa. Jurnal Ilmiah Matematika (JIMAT), 6(2), 602–612. https://doi.org/10.63976/jimat.v6i2.1077

Hapsari, A. D., & Hidayat, R. (2024). Assessing the Predictive Power of Aptitude Tests on Academic Achievement of Students in Science and Technology Majors. Gadjah Mada Journal of Psychology, 10(2), 118–130. https://doi.org/10.22146/gamajop.83491

Hernández, A., Hidalgo, M. D., Hambleton, R. K., & Gómez-Benito, J. (2020). International test commission guidelines for test adaptation: A criterion checklist. Psicothema, 32(3), 390–398. https://doi.org/10.7334/psicothema2019.306

Hsu, C. L., Jin, K. Y., & Chiu, M. M. (2020). Cognitive Diagnostic Models for Random Guessing Behaviors. Frontiers in Psychology, 11. https://doi.org/10.3389/fpsyg.2020.570365

Huang, C. D., Church, A. T., & Katigbak, M. S. (1997). Identifying cultural differences in items and traits: Differential item functioning in the NEO Personality Inventory. Journal of Cross-Cultural Psychology, 28(2), 192–218. https://doi.org/10.1177/0022022197282004

Kilgus, S. P., Eklund, K., von der Embse, N. P., Weist, M., Barber, A. J., Kaul, M., & Dodge, S. (2021). Structural Validity and Reliability of Social, Academic, and Emotional Behavior Risk Screener–Student Rating Scale Scores: A Replication Study. Assessment for Effective Intervention, 46(4), 259–269. https://doi.org/10.1177/1534508420909527

Kumar, D., Jaipurkar, R., Shekhar, A., Sikri, G., & Srinivas, V. (2021). Item analysis of multiple choice questions: A quality assurance test for an assessment tool. Medical Journal Armed Forces India, 77, S85–S89. https://doi.org/https://doi.org/10.1016/j.mjafi.2020.11.007

Linacre, J. M. (2022). A User’s Guide to WINSTEPS MINISTEP Rasch-Model Computer Programs. In winsteps.com.

Linacre, J. M. (2024). Winsteps Rasch Measurement Computer Program Version 5.6. Winsteps.com.

Nalbandyan, R., Gilbert, J. B., Franco, V. R., & Domingue, B. W. (2026). Signposts on the Path From Nominal to Ordinal Scales: Moving From a Discrete to a Continuous View. Educational and Psychological, 1(1). https://doi.org/10.1177/00131644261440556

Novieany, E., Satiadarma, M. P., & Idulfilastri, R. M. (2021). Pengujian Validitas Konstruk, Reliabilitas Internal, Dan Analisis Butir (Studi Adaptasi Alat Ukur Skrining Gangguan Bipolar Di Indonesia). Jurnal Muara Ilmu Sosial, Humaniora, Dan Seni, 5(1), 39–46. https://doi.org/10.24912/jmishumsen.v5i1.9500.2021

Palermo, C. (2022). Rater characteristics, response content, and scoring contexts: Decomposing the determinates of scoring accuracy. Frontiers in Psychology, 13. https://doi.org/10.3389/fpsyg.2022.937097

Peixoto, E. M., Zanini, D. S., & de Andrade, J. M. (2021). Cross-cultural adaptation and psychometric properties of the Kessler Distress Scale (K10): an application of the rating scale model. Psicologia: Reflexao e Critica, 34(1). https://doi.org/10.1186/s41155-021-00186-9

Reinhold, F., Hofer, S., Berkowitz, M., Strohmaier, A., Scheuerer, S., Loch, F., Vogel-Heuser, B., & Reiss, K. (2020). The role of spatial, verbal, numerical, and general reasoning abilities in complex word problem solving for young female and male adults. Mathematics Education Research Journal, 32(2), 189–211. https://doi.org/10.1007/s13394-020-00331-0

Robitzsch, A. (2022). Four-Parameter Guessing Model and Related Item Response Models. Mathematical and Computational Applications, 27(6), 95. https://doi.org/10.3390/mca27060095

Safitri, Z., & Baihaqi, M. (2026). Quality Profile of Arabic Final Semester Assessment Items : A Psychometric Analysis. Al-Lisan: Jurnal Bahasa, 11(1), 87–102. https://doi.org/10.30603/al.v11i1.7322

Santoso, A. P. Y., Nanditya, A. D., Rahmawati, A. N., & Al Hasna, A. S. (2022). Efektivitas Penggunaan Tes Dat (Differential Aptitude Test) Pada Pendidikan Di Indonesia. Flourishing Journal, 2(2), 137–145. https://doi.org/10.17977/um070v2i22022p137-145

Setiawati, F. A., Izzaty, R. E., & Hidayat, V. (2018). Evaluasi Karakteristik Psikometrik Tes Bakat Differensial dengan Teori Klasik. Humanitas, 15(1), 46. https://doi.org/10.26555/humanitas.v15i1.7249

Solichin, M. (2017). Mujianto Solichin Universitas Pesantren Tinggi Darul Ulum ( Unipdu ) Jombang Pendahuluan Kegiatan evaluasi dalam dunia pendidikan merupakan komponen integral dalam program pembelajaran di samping rencana pembelajaran ( kurikulum ), tujuan pembelajaran , b. DIRASAT: Jurnal Manajemen & Pendidikan Islam, 2(2), 192–213. https://journal.unipdu.ac.id/index.php/dirasat/article/view/879/637

Sugiyono. (2023). Metode Penelitian Kuatitatif Kualitatif & R&D (Sutopo, Ed.; 5th ed.). A.

Sumintono, B., & Widhiarso, W. (2015). Aplikasi Model Rasch untuk Penelitian I lmu-ilmu sosial (edisi revisi). Trim Komunikata.

Suwartono, C., & Santoso, J. B. (2016). Attitudes Toward Psychological Test Use in Indonesia. ANIMA Indonesian Psychological Journal, 31(4), 160–169. https://doi.org/10.24123/aipj.v31i4.575

Syahputra, Y., Rahmat, C. P., & Erwinda, L. (2025). Instrumentasi Tes dalam Bimbingan dan Konseling. CV Eureka Media Aksara.

Tarigan, M., & Fadillah. (2021). Properti Psikometri Struktur Intelegensi IST Subtes Verbal (Satzergaenzung, Wortauswahl, dan Analogien) berbahasa Indonesia. Jurnal Muar Ilmu Sosial, Humaniora, Dan Seni, 5(1), 63–72.

Tarigan, M., & Fadillah, F. (2022). Properti Psikometrik Intelligenz Struktur Test Subtes Kemampuan Numerik (Rechenaufgaben dan Zahlen Reihen). Intuisi : Jurnal Psikologi Ilmiah, 13(2), 155–170. https://doi.org/10.15294/intuisi.v13i2.31839

Utari, D., & Lestari, R. (2023). Metode Adaptasi Lintas Budaya Instrumen Kidscreen-27 Di Asia: Integrative Review. Jambura Journal of Health Sciences and Research, 5(2), 474–484. https://doi.org/10.35971/jjhsr.v5i2.18195

von Davier, M., & Bezirhan, U. (2023). A Robust Method for Detecting Item Misfit in Large-Scale Assessments. Educational and Psychological Measurement, 83(4), 740–765. https://doi.org/10.1177/00131644221105819

Zumbo, B. D. (2007). Three generations of DIF analyses.

Downloads

Published

2026-05-28

How to Cite

Erwinda, L., Marlina, M., Idaini, R. M., Astrianingsih, D., & Syahputra, Y. (2026). Evaluasi Karakteristik Butir Soal Tes Numerikal Differential Aptitude Test melalui Analisis Psikometrik. Journal of Mathematical Psychometrics and Measurement Science, 1(1), 10–25. Retrieved from https://journal.aapbk.org/index.php/jmpms/article/view/565