Song, H. and Flach, P. (2021). Efficient and Robust Model Benchmarks with Item Response Theory and Adaptive Testing. International Journal of Interactive Multimedia and Artificial Intelligence, 6(5), 110–118. https://doi.org/10.9781/ijimai.2021.02.009