Opportunities and Challeges of Using Artificial Intelligence in Assessment

Supianto Supianto


Abstract: The application of Artificial Intelligence (AI) in learning assessment has attracted the attention of many educational experts, researchers and practitioners. This study discusses the opportunities and challenges of using AI in learning assessment. Traditional assessment has weaknesses in terms of misjudgment, inability to measure individual abilities that are not measured in certain forms of assessment, significant cost and time, slow feedback, and inability to be adjusted individually. Several studies have shown that the use of AI in assessments can improve the accuracy, validity and reliability of assessments, reduce human rater bias, enable adaptive assessments, increase time and cost efficiency, provide faster and more timely feedback, and assist in identifying individual needs and improve the quality of learning. However, the use of AI technology can only be a tool, and the final decision must still be made by humans. Therefore, the use of AI in assessment requires special attention in terms of ethics and the development of human capabilities to understand and use AI technology wisely.

Keywords: Artificial Intelligence, Assessment


Abstrak: Penerapan Artificial Intelligence (AI) dalam penilaian pembelajaran telah menarik perhatian banyak ahli pendidikan, peneliti, dan praktisi. Penelitian ini membahas peluang dan tantangan penggunaan AI dalam asesmen pembelajaran. Asesmen tradisional memiliki kelemahan dalam hal kesalahan penilaian, ketidakmampuan mengukur kemampuan individu yang tidak terukur dalam bentuk asesmen tertentu, biaya dan waktu yang signifikan, umpan balik yang lambat, dan ketidakmampuan untuk disesuaikan secara individual. Beberapa penelitian menunjukkan bahwa penggunaan AI dalam asesmen dapat meningkatkan akurasi, validitas, dan reliabilitas asesmen, mengurangi bias penilai manusia, memungkinkan asesmen adaptif, meningkatkan efisiensi waktu dan biaya, memberikan umpan balik yang lebih cepat dan tepat waktu, serta membantu dalam mengidentifikasi kebutuhan individu dan meningkatkan kualitas pembelajaran. Namun, penggunaan teknologi AI hanya dapat menjadi alat bantu, dan keputusan akhir tetap harus dilakukan oleh manusia. Oleh karena itu, penggunaan AI dalam asesmen memerlukan perhatian khusus dalam hal etika dan pengembangan kemampuan manusia dalam memahami dan memanfaatkan teknologi AI dengan bijak.

Kata  Kunci: Kecerdasan Buatan, Asesmen



Artificial Intelligence; Assessment

Full Text:



Adedokun, AO, & Adeyemo, OI (2021). Enhancing Assessment and Evaluation with Artificial Intelligence. International Journal of Emerging Technologies in Learning, 16(4), 134-148.

Aggarwal, A., Singla, S., & Kaur, S. (2019). Machine learning based automatic assessment systems: A review. International Journal of Computer Applications, 181(47), 15-22.

Alshehri, S., Drew, S., Alghamdi, R., Alsolami, R., & Aljohani, N. (2019). The impact of using artificial intelligence in assessments. Education and Information Technologies, 24(2), 1619-1638.

Arikunto, S. (2013). The research procedure is a practice approach (revision VIII). Jakarta: Rineka Cipta.

Beede, P., Julian, J., Langdon, G., McKittrick, G., Khan, B., & Doms, M. (2011). Women in STEM: A gender gap to innovation. US Department of Commerce.

Bennett, RE (2011). Formative assessment: A critical review. Assessment in Education: Principles, Policy & Practice, 18(1), 5-25.

Black, P., & William, D. (1998). Inside the black box: Raising standards through classroom assessment. Phi Delta Kappan, 80(2), 139-148.

Bolukbasi, T., Chang, KW, Zou, JY, Saligrama, V., & Kalai, AT (2016). Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In Advances in neural information processing systems (pp. 4349-4357).

Bostrom, N., & Yudkowsky, E. (2014). The ethics of artificial intelligence. The Cambridge Handbook of Artificial Intelligence, 316-334.

Brookhart, SM (2013). How to create and use rubrics for formative assessment and grading. ASCD.

Buolamwini, J., & Gebru, T. (2018). Gender shades: Intersectional accuracy disparities in commercial gender classification. Proceedings of the 1st Conference on Fairness, Accountability and Transparency, PMLR 81:77-91.

Chen, G., Gao, Y., Chen, X., & Yang, Y. (2020). Adaptive learning and assessment based on learning styles using deep learning. Journal of Educational Computing Research, 57(6), 1447-1466.

Chen, LC, Chen, YH, & Huang, YM (2019). The effects of web-based formative assessment on self-regulated learning and learning performance in a mathematics course. Computers & Education, 133, 43-55.

Chen, X., Gao, J., & Wang, J. (2020). A Review of Artificial Intelligence Applications in Educational Assessment. IEEE Access, 8, 89916-89929.

Darling-Hammond, L., & Adamson, F. (2010). Beyond basic skills: The role of performance assessment in achieving 21st century standards of learning. Stanford Center for Opportunity Policy in Education.

Davis, RE, Nichols, RL, & Grant, JF (2019). Using artificial intelligence to develop and evaluate a competency-based assessment program in family medicine. Academic Medicine, 94(4), 557-563.

Foltz, PW (2013). Automated essay scoring: applications to educational technology. Handbook of Research on Educational Communications and Technology, 2, 169-181.

Gao, T., et al. (2020). "A review of artificial intelligence applications in educational assessment." Journal of Educational Evaluation for Health Professions, 17: 27.

Han, S. (2018). Exploring the role of artificial intelligence in language assessment. Language Testing, 35(1), 37-55.

Harlen, W. (2005). Teachers' summative practices and assessment for learning—tensions and synergies. The Curriculum Journal, 16(2), 207-223.

Hattie, J. (2009). Visible learning: A synthesis of over 800 meta-analyses relating to achievement. Routledge.

Hedayati, A., & Navimipour, NJ (2019). A systematic review of automated essay scoring systems in the educational domain. Journal of Educational Computing Research, 57(2), 361-386.

Hendry, GD, Harper, BD, & Rahman, FM (2019). Using machine learning to detect cheating in online assessments. Assessment & Evaluation in Higher Education, 44(3), 360-372.

Hong, H., Choi, J., & Park, J. (2020). The effectiveness of artificial intelligence in improving the consistency of evaluation in medical education. BMC Medical Education, 20(1), 1-7.

Hoque, R., Sorwar, G., & Alzoubi, M. (2021). A Comprehensive Review of the Use of Artificial Intelligence in Education: Opportunities and Challenges. Journal of Educational Technology & Society, 24(2), 110-123.

Jiao, H., Liu, Y., & Yan, H. (2020). AI technology-supported English writing assessment: Opportunities and challenges. Educational Assessment, Evaluation and Accountability, 32(2), 215-230.

Kahng, J., & Cho, K. (2019). "The applications of artificial intelligence in educational assessment." Journal of Educational Evaluation for Health Professions, 16: 31.

Kaul, V., & Lal, M. (2018). Assessment of reliability and consistency of grading in online examinations with artificial intelligence. Education and Information Technologies, 23(2), 819-832.

Khaled, AM, Al-Nashwan, H., & Al-Shehari, T. (2021). An overview of artificial intelligence in education: Benefits, challenges, and risks. Journal of Educational Technology & Society, 24(1), 168-180.

Kim, J. (2021). Algorithmic bias in educational assessment: A review of the literature. Journal of Educational Measurement, 58(2), 171-186.

Kovanović, V., Joksimović, S., Gašević, D., & Hatala, M. (2015). Analyzing and predicting learning achievements in online courses with symbolic and subsymbolic methods. Journal of Computer Assisted Learning, 31(3), 268-286.

Kunnath, SR, Gupta, S., & Srivastava, S. (2020). Automated essay scoring using natural language processing techniques: A systematic review. IEEE Access, 8, 200322-200335.

Leask, M., & Yuan, X. (2019). Assessment design using artificial intelligence to detect and prevent plagiarism. Innovations in Education and Teaching International, 56(6), 677-685.

Ma, J., Zhou, M., & Weng, Y. (2020). An AI-based online testing system for reducing academic dishonesty. Education and Information Technologies, 25(5), 4285-4300.

Mancheno-Smoak, L., Conradi, K., & Tarnoff, A. (2021). Machine Learning and Artificial Intelligence in Assessment: Benefits, Limitations, and Future Directions. In Handbook of Research on Assessment Technologies, Methods, and Applications in Higher Education (pp. 20-38). IGI Global.

Martin, F., Wang, C., & Sadaf, A. (2019). Student perception of helpfulness of facilitation strategies that enhance the instructor's presence, connectedness, engagement and learning in online courses. The Internet and Higher Education, 43, 52-65.

Mazouzi, A., Zellagui, M., & Belbachir, AH (2020). Using artificial intelligence to improve the evaluation of free text answers: The case of Moodle quizzes. Education and Information Technologies, 25(5), 3693-3711.

Mishra, P., & Pandey, AK (2021). Artificial Intelligence in Education: Opportunities and Challenges. In Digital Transformation and the Future of Society (pp. 163-175). Springers, Singapore.

Nitko, AJ (2001). Educational assessment of students (2nd ed.). Upper Saddle River, NJ: Merrill.

Norris, SP (2019). Moving Beyond Multiple-Choice Items: What Else Is Possible?. Educational Measurement: Issues and Practice, 38(2), 15-24.

Rauh, C., Heyder, A., & Maier, R. (2018). The potential of adaptive educational technologies: An empirical study of personalized e-learning. Journal of Educational Technology & Society, 21(3), 1-13.

Riduwan. (2015). The scale of measurement of research variables. Alphabet.

Salvia, J., & Ysseldyke, J. (2007). Assessment in special education: A practical approach. Boston, MA: Houghton Mifflin.

Sanchez, A., & Huang, YM (2017). Applying learning analytics and artificial intelligence for adaptive learning. Journal of Educational Technology & Society, 20(3), 142-154.

Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural networks, 61, 85-117.

Schneider, EF, Lang, A., Shin, M., & Bradley, SD (2019). Investigating the use of artificial intelligence in standardized medical assessments. Academic Medicine, 94(11S), S74-S81.

Shabani, M., & Borry, P. (2018). Rules for the ethical use of digital data in human research. In Ethical Aspects of Research with Human Subjects (pp. 113-129). Springer, Cham.

Shaw, SD, & Anderson, KM (2019). Artificial intelligence in educational assessment: Practices, opportunities, and challenges. Journal of Educational Technology Development and Exchange (JETDE), 12(1), 1-14.

Shieh, JC, Chen, TC, & Chang, HF (2020). Automatic Essay Scoring and Feedback Generation with Machine Learning Techniques. Journal of Educational Technology & Society, 23(3), 158-169.

Spataro, W., & Ciminello, A. (2021). The Ethics of Artificial Intelligence in Education: Challenges for Human and Social Development. Frontiers in Psychology, 12, 579944.

Stiggins, R. (2005). From formative assessment to assessment for learning: A path to success in standards-based schools. Phi Delta Kappan, 87(4), 324-328.

Stiggins, R. (2007). Assessment through the student's eyes. Educational Leadership, 64(8), 22-26.

Stowe, R., Sammons, M., Sibert, JL, & Vincent, R. (2020). Remote Proctoring: An Examination of Utilizing Artificial Intelligence and Assessment Literacy to Ensure Academic Integrity in Online Assessments. Journal of Educators Online, 17(2), n2.

Tanes, Z., & Martin, S. (2020). Use of artificial intelligence to enhance and evaluate students' critical thinking skills. International Journal of Educational Technology in Higher Education, 17(1), 1-19.

Vongkulluksn, VW, Xie, K., & Bowman, MA (2018). Preparing pre-service teachers to use formative assessment practices. Assessment in Education: Principles, Policy & Practice, 25(2), 127-142.

Walker, AE, Grunwald, A., & Doherty, D. (2019). Evaluating the reliability and validity of an automated scoring system for a problem-based learning activity. Journal of Educational Computing Research, 57(1), 191-214.

Wang, X., et al. (2020). "Artificial intelligence and education assessment: Current status and prospects." Journal of Educational Technology Development and Exchange, 13(1): 81-96.

Wang, Z., et al. (2021). "Artificial intelligence in education: A systematic review." Journal of Educational Computing Research, 59(6): 1426-1458.

Xi, N., Chen, Y., & Liang, JC (2018). Students' perceptions of formative assessment in EFL writing: A longitudinal inquiry. Language Testing, 35(3), 333-354.

Zhang, J., Wang, L., Li, Y., & Liang, J. (2020). Integrating artificial intelligence into English language assessment: Opportunities and challenges. Educational Assessment, Evaluation and Accountability, 32(2), 189-205.

Zhang, Y., Lu, H., Liu, Z., & Zou, X. (2021). The application of AI in personalized language learning and assessment. Computer Assisted Language Learning, 34(4), 334-358.

Zhao, Y., & Zhou, Y. (2021). "The application of artificial intelligence in education." Open Journal of Social Sciences, 9: 208-215.

Zouaq, A., & Eltagory, A. (2021). Intelligent tutoring system-based cheating detection in online exams. Computers in Human Behavior, 114, 106565.

DOI: http://dx.doi.org/10.30734/jpe.v10i2.3199


  • There are currently no refbacks.

Copyright (c) 2023 Jurnal Pendidikan Edutama

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Creative Commons License
JURNAL PENDIDIKAN EDUTAMA(JPE) by http://ejurnal.ikippgribojonegoro.ac.id/index.php/JPE/ is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


View My Stats