An AI System Evaluation Framework for Advancing AI Safety: Terminology, Taxonomy, Lifecycle Mapping

Boming Xia*, Qinghua Lu, Liming Zhu, Zhenchang Xing

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

The advent of advanced AI underscores the urgent need for comprehensive safety evaluations, necessitating collaboration across communities (i.e., AI, software engineering, and governance). However, divergent practices and terminologies across these communities, combined with the complexity of AI systems - of which models are only a part - and environmental affordances (e.g., access to tools), obstruct effective communication and comprehensive evaluation. This paper proposes a framework for AI system evaluation comprising three components: 1) harmonised terminology to facilitate communication across communities involved in AI safety evaluation; 2) a taxonomy identifying essential elements for AI system evaluation; 3) a mapping between AI lifecycle, stakeholders, and requisite evaluations for accountable AI supply chain. This framework catalyses a deeper discourse on AI system evaluation beyond model-centric approaches.

Original languageEnglish
Title of host publicationAIware 2024 - Proceedings of the 1st ACM International Conference on AI-Powered Software, Co-located with
Subtitle of host publicationESEC/FSE 2024
EditorsBram Adams, Thomas Zimmermann, Ipek Ozkaya, Dayi Lin, Jie M. Zhang
PublisherAssociation for Computing Machinery (ACM)
Pages74-78
Number of pages5
ISBN (Electronic)9798400706851
DOIs
Publication statusPublished - 10 Jul 2024
Event1st ACM International Conference on AI-Powered Software, AIware 2024, co-located with the ACM International Conference on the Foundations of Software Engineering, FSE 2024 - Porto de Galinhas, Brazil
Duration: 15 Jul 202416 Jul 2024

Publication series

NameAIware 2024 - Proceedings of the 1st ACM International Conference on AI-Powered Software, Co-located with: ESEC/FSE 2024

Conference

Conference1st ACM International Conference on AI-Powered Software, AIware 2024, co-located with the ACM International Conference on the Foundations of Software Engineering, FSE 2024
Country/TerritoryBrazil
CityPorto de Galinhas
Period15/07/2416/07/24

Fingerprint

Dive into the research topics of 'An AI System Evaluation Framework for Advancing AI Safety: Terminology, Taxonomy, Lifecycle Mapping'. Together they form a unique fingerprint.

Cite this