TY - JOUR
T1 - Using Knowledge Graphs and Agentic LLMs for Factuality Text Assessment and Improvement
AU - Kwan, Linda
AU - Omran, Pouya G.
AU - Taylor, Kerry
N1 - Publisher Copyright:
© 2024 Copyright for this paper by its authors.
PY - 2024
Y1 - 2024
N2 - This paper addresses the challenge of assessing and enhancing the factual accuracy of texts generated by large language models (LLMs). Existing methods often rely on self-reflection or external knowledge sources, validating statements individually and rigidly, thus missing a holistic view. We propose a novel approach utilizing a comprehensive knowledge graph (KG), such as Wikidata, to assess and improve the factuality of generated texts. Our method dynamically retrieves and integrates relevant facts during the assessment process, providing a more interconnected and accurate evaluation. Integrating KG with LLM capabilities enhances the overall factual integrity, leading to more reliable AI-generated content. Our results demonstrate improvements in factual accuracy, highlighting the effectiveness of our approach.
AB - This paper addresses the challenge of assessing and enhancing the factual accuracy of texts generated by large language models (LLMs). Existing methods often rely on self-reflection or external knowledge sources, validating statements individually and rigidly, thus missing a holistic view. We propose a novel approach utilizing a comprehensive knowledge graph (KG), such as Wikidata, to assess and improve the factuality of generated texts. Our method dynamically retrieves and integrates relevant facts during the assessment process, providing a more interconnected and accurate evaluation. Integrating KG with LLM capabilities enhances the overall factual integrity, leading to more reliable AI-generated content. Our results demonstrate improvements in factual accuracy, highlighting the effectiveness of our approach.
KW - Agentic LLM
KW - Knowledge Graph
KW - Large Language Model
KW - LLM Evaluation
UR - http://www.scopus.com/inward/record.url?scp=85210233104&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85210233104
SN - 1613-0073
VL - 3828
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - ISWC 2024 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice, ISWC-Posters-Demos-Industry 2024
Y2 - 11 November 2024 through 15 November 2024
ER -