Extracting Summary Knowledge Graphs from Long Documents
Published in Pre-print, 2020
Abstract:
Knowledge graphs capture entities and relations from long documents and can facilitate reasoning in many downstream applications. Extracting compact knowledge graphs containing only salient entities and relations is important but challenging for understanding and summarizing long documents. We introduce a new text-to-graph task of predicting summarized knowledge graphs from long documents. We develop a dataset of 200k document/graph pairs using automatic and human annotations. We also develop strong baselines for this task based on graph learning and text summarization, and provide quantitative and qualitative studies of their effect.
Bibtex:
@misc{wu2020extracting,
title={Extracting Summary Knowledge Graphs from Long Documents},
author={Zeqiu Wu and Rik Koncel-Kedziorski and Mari Ostendorf and Hannaneh Hajishirzi},
year={2020},
eprint={2009.09162},
archivePrefix={arXiv},
primaryClass={cs.CL}
}