Abstract: The evaluation of question answering systems plays a crucial role in assessing their performance and effectiveness. Existing evaluation metrics often focus on aspects such as recall, ...