MemSum-DQA: Adapting An Efficient Long Document Extractive Summarizer for Document Question Answering


METADATA ONLY
Loading...

Date

2023-10-10

Publication Type

Working Paper

ETH Bibliography

yes

Citations

Altmetric
METADATA ONLY

Data

Rights / License

Abstract

We introduce MemSum-DQA, an efficient system for document question answering (DQA) that leverages MemSum, a long document extractive summarizer. By prefixing each text block in the parsed document with the provided question and question type, MemSum-DQA selectively extracts text blocks as answers from documents. On full-document answering tasks, this approach yields a 9% improvement in exact match accuracy over prior state-of-the-art baselines. Notably, MemSum-DQA excels in addressing questions related to child-relationship understanding, underscoring the potential of extractive summarization techniques for DQA tasks.

Permanent link

Publication status

published

Editor

Book title

Journal / series

Volume

Pages / Article No.

2310.06436

Publisher

Cornell University

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Computation and Language (cs.CL); FOS: Computer and information sciences; Document understanding; Question answering

Organisational unit

03774 - Hahnloser, Richard H.R. / Hahnloser, Richard H.R. check_circle

Notes

Funding

Related publications and datasets