ACL2023

Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarization

Hou Pong Chan, Qi Zeng, Heng Ji

被引用 7 次

摘要

Existing factual consistency evaluation approaches for text summarization provide binary predictions and limited insights into the weakness of summarization systems. Therefore, we propose the task of fine-grained inconsistency detection, the goal of which is to predict the fine-grained types of factual errors in a summary. Motivated by how humans inspect factual inconsistency in summaries, we propose an interpretable fine-grained inconsistency detection model, FINEGRAINFACT, which explicitly represents the facts in the documents and summaries with semantic frames extracted by semantic role labeling, and highlights the related semantic frames to predict inconsistency. The highlighted semantic frames help verify predicted error types and correct inconsistent summaries. Experiment results demonstrate that our model outperforms strong baselines and provides evidence to support or refute the summary. 1 Source text Marcy Smith was woken up by her son David to find their house in Glovertown, Newfoundland and Labrador, completely engulfed in flames ... Mrs Smith said if it wasn't for her son, she and her daughter probably wouldn't have survived. David was on FaceTime to his father at the time, so was the only one awake and saw the flames out of the corner of his eye ... Error type Example summary Extrinsic noun phrase error: Errors that add new object(s), subject(s), or prepositional object(s) that cannot be inferred from the source article. David was using FaceTime with Maggie Smith and saw the flames. Intrinsic noun phrase error: Errors that misrepresent object(s), subject(s), or prepositional object(s) from the source article. David was using FaceTime with Marcy Smith and saw the flames.