Song, H. and Flach, P. (2021) “Efficient and Robust Model Benchmarks with Item Response Theory and Adaptive Testing”., International Journal of Interactive Multimedia and Artificial Intelligence, 6(5), pp. 110–118. doi: 10.9781/ijimai.2021.02.009.