HI-TOM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
He Yinghui, Yufan Wu, Yilin Jia, Rada Mihalcea, Yulong Chen and Naihao Deng. “HI-TOM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models.” ArXiv abs/2310.16755 (2023): n. pag.