Quantifying Metric and Model Agreement in Bias Evaluation of Large Language Models
Asgari, A., Wu, H., Naziri, A., Kolahdouzi, M., & Seyyed-Kalantari, L. (2026). "Quantifying Metric and Model Agreement in Bias Evaluation of Large Language Models." The 64th Annual Meeting of the Association for Computational Linguistics.
