MT-RAIG: Novel Benchmark and Evaluation Framework for Retrieval-Augmented Insight Generation over Multiple Tables

Yonsei University - DLI Lab
ACL 2025
* indicates equal contribution, † indicates corresponding author

Abstract

Recent advancements in table-based reasoning have expanded beyond factoid-level QA to address insight-level tasks, where systems should synthesize implicit knowledge in the table to provide explainable analyses. Although effective, existing studies remain confined to scenarios where a single gold table is given alongside the user query, failing to address cases where users seek comprehensive insights from multiple unknown tables. To bridge these gaps, we propose MT-RAIG Bench, designed to evaluate systems on Retrieval-Augmented Insight Generation over Multiple Tables. Additionally, to tackle the suboptimality of existing automatic evaluation methods in the table domain, we further introduce a fine-grained evaluation framework, MT-RAIG Eval, which achieves better alignment with human quality judgments on the generated insights. We conduct extensive experiments and reveal that even frontier LLMs still struggle with complex multi-table reasoning, establishing our MT-RAIG Bench as a challenging testbed for future research.

BibTeX

@misc{seo2025mtraignovelbenchmarkevaluation,
  title={MT-RAIG: Novel Benchmark and Evaluation Framework for Retrieval-Augmented Insight Generation over Multiple Tables},
  author={Kwangwook Seo and Donguk Kwon and Dongha Lee},
  year={2025},
  eprint={2502.11735},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2502.11735},
}

MT-RAIG Bench is the first large-scale benchmark for retrieval-augmented insight generation over multiple tables.

MT-RAIG Eval is a novel decomposition-based evaluation framework that enables finer distinctions in assessing the quality of long-form outputs.

Comparison of various baseline generators.

Leaderboard
