Highlighting Case Studies in LLM Literature Review of Interdisciplinary System Science

Lachlan McGinness, Peter Baumgartner*, Esther Onyango, Zelalem Lema

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference Paperpeer-review

2 Citations (Scopus)

Abstract

Large Language Models (LLMs) were used to assist four Commonwealth Scientific and Industrial Research Organisation (CSIRO) researchers to perform systematic literature reviews (SLR). We evaluate the performance of LLMs for SLR tasks in these case studies. In each, we explore the impact of changing parameters on the accuracy of LLM responses. The LLM was tasked with extracting evidence from chosen academic papers to answer specific research questions. We evaluate the models’ performance in faithfully reproducing quotes from the literature and subject experts were asked to assess the model performance in answering the research questions. We developed a semantic text highlighting tool to facilitate expert review of LLM responses. We found that state of the art LLMs were able to reproduce quotes from texts with greater than 95% accuracy and answer research questions with an accuracy of approximately 83%. We use two methods to determine the correctness of LLM responses; expert review and the cosine similarity of transformer embeddings of LLM and expert answers. The correlation between these methods ranged from 0.48 to 0.77, providing evidence that the latter is a valid metric for measuring semantic similarity.

Original languageEnglish
Title of host publicationAI 2024
Subtitle of host publicationAdvances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings
EditorsMingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
PublisherSpringer Science+Business Media B.V.
Pages29-43
Number of pages15
ISBN (Print)9789819603473
DOIs
Publication statusPublished - 2025
Event37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024 - Melbourne, Australia
Duration: 25 Nov 202429 Nov 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15442 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024
Country/TerritoryAustralia
CityMelbourne
Period25/11/2429/11/24

Fingerprint

Dive into the research topics of 'Highlighting Case Studies in LLM Literature Review of Interdisciplinary System Science'. Together they form a unique fingerprint.

Cite this