ACL2021

WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation

Nachshon Cohen, Oren Kalinsky, Yftah Ziser, Alessandro Moschitti

Abstract

Recent works have made significant advances on summarization tasks, facilitated by summarization datasets. Several existing datasets have the form of coherent-paragraph summaries. However, these datasets were curated from academic documents written for experts, making the essential step of assessing the summarization output through human-evaluation very demanding. To overcome these limitations, we present a dataset 1 based on article summaries appearing on the WikiHow website, composed of howto articles and coherent-paragraph summaries written in plain language. We compare our dataset attributes to existing ones, including readability and world-knowledge, showing our dataset makes human evaluation significantly more manageable and effective. A human evaluation conducted on PubMed and the proposed dataset reinforces our findings.