TNNT: The Named Entity Recognition Toolkit

Sandaru Seneviratne, Sergio J. Rodríguez Méndez, Xuecheng Zhang, Pouya G. Omran, Kerry Taylor, Armin Haller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Extraction of categorised named entities from text is a complex task given the availability of a variety of Named Entity Recognition (NER) models and the unstructured information encoded in different source document formats. Processing the documents to extract text, identifying suitable NER models for a task, and obtaining statistical information is important in data analysis to make informed decisions. This paper presents\footnoteThe manuscript follows guidelines to showcase a demonstration that introduces an overview of how the toolkit works: input document set, initial settings, processing, and output set. The input document set is artificial in order to show various toolkit capabilities. TNNT, a toolkit that automates the extraction of categorised named entities from unstructured information encoded in source documents, using diverse state-of-the-art (SOTA) Natural Language Processing (NLP) tools and NER models.TNNT integrates 21 different NER models as part of a Knowledge Graph Construction Pipeline (KGCP) that takes a document set as input and processes it based on the defined settings, applying the selected blocks of NER models to output the results. The toolkit generates all results with an integrated summary of the extracted entities, enabling enhanced data analysis to support the KGCP, and also, to aid further NLP tasks.

Original languageEnglish
Title of host publicationK-CAP 2021 - Proceedings of the 11th Knowledge Capture Conference
PublisherAssociation for Computing Machinery (ACM)
Pages249-252
Number of pages4
ISBN (Electronic)9781450384575
DOIs
Publication statusPublished - 2 Dec 2021
Event11th ACM International Conference on Knowledge Capture, K-CAP 2021 - Virtual, Online, United States
Duration: 2 Dec 20213 Dec 2021

Publication series

NameK-CAP 2021 - Proceedings of the 11th Knowledge Capture Conference

Conference

Conference11th ACM International Conference on Knowledge Capture, K-CAP 2021
Country/TerritoryUnited States
CityVirtual, Online
Period2/12/213/12/21

Fingerprint

Dive into the research topics of 'TNNT: The Named Entity Recognition Toolkit'. Together they form a unique fingerprint.

Cite this