Journal: Journal of Data and Information Quality
Loading...
Abbreviation
Publisher
Association for Computing Machinery
2 results
Search Results
Publications 1 - 2 of 2
- Augmenting data quality through high-precision gender categorizationItem type: Journal Article
Journal of Data and Information QualityMüller, Daniel; Jain, Pratiksha; Te, Yieh-Funk (2019) - The Choice of Textual Knowledge Base in Automated Claim CheckingItem type: Journal Article
Journal of Data and Information QualityStammbach, Dominik; Zhang, Boya; Ash, Elliott (2023)Automated claim checking is the task of determining the veracity of a claim given evidence retrieved from a textual knowledge base of trustworthy facts. While previous work has taken the knowledge base as given and optimized the claim-checking pipeline, we take the opposite approach - taking the pipeline as given, we explore the choice of the knowledge base. Our first insight is that a claim-checking pipeline can be transferred to a new domain of claims with access to a knowledge base from the new domain. Second, we do not find a "universally best"knowledge base - higher domain overlap of a task dataset and a knowledge base tends to produce better label accuracy. Third, combining multiple knowledge bases does not tend to improve performance beyond using the closest-domain knowledge base. Finally, we show that the claim-checking pipeline's confidence score for selecting evidence can be used to assess whether a knowledge base will perform well for a new set of claims, even in the absence of ground-truth labels.
Publications 1 - 2 of 2