TY - GEN
T1 - TextSimplifier
T2 - 2nd Workshop on Text Simplification, Accessibility and Readability, TSAR 2023
AU - Seneviratne, Sandaru
AU - Daskalaki, Elena
AU - Suominen, Hanna
N1 - Publisher Copyright:
© 2023 TSAR 2023 - 2nd Workshop on Text Simplification, Accessibility and Readability, associated with the 14th International Conference on Recent Advances in Natural Language Processing 2023, RANLP 2023 - Proceedings of the Workshop. All rights reserved.
PY - 2023
Y1 - 2023
N2 - Natural language understanding is fundamental to knowledge acquisition in today's information society. However, natural language is often ambiguous with frequent occurrences of complex terms, acronyms, and abbreviations that require substitution and disambiguation, for example, by “translation” from complex to simpler text for better understanding. These tasks are usually difficult for people with limited reading skills, second language learners, and non-native speakers. Hence, the development of text simplification systems that are capable of simplifying complex text is of paramount importance. Thus, we conducted a user study to identify which components are essential in a text simplification system. Based on our findings, we proposed an improved text simplification framework, covering a broader range of aspects related to lexical simplification - from complexity identification to lexical substitution and disambiguation - while supplementing the simplified outputs with additional information for better understandability. Based on the improved framework, we developed TextSimplifier, a modularised, context-sensitive, end-to-end simplification framework, and engineered its web implementation. This system targets lexical simplification that identifies complex terms and acronyms followed by their simplification through substitution and disambiguation for better understanding of complex language.
AB - Natural language understanding is fundamental to knowledge acquisition in today's information society. However, natural language is often ambiguous with frequent occurrences of complex terms, acronyms, and abbreviations that require substitution and disambiguation, for example, by “translation” from complex to simpler text for better understanding. These tasks are usually difficult for people with limited reading skills, second language learners, and non-native speakers. Hence, the development of text simplification systems that are capable of simplifying complex text is of paramount importance. Thus, we conducted a user study to identify which components are essential in a text simplification system. Based on our findings, we proposed an improved text simplification framework, covering a broader range of aspects related to lexical simplification - from complexity identification to lexical substitution and disambiguation - while supplementing the simplified outputs with additional information for better understandability. Based on the improved framework, we developed TextSimplifier, a modularised, context-sensitive, end-to-end simplification framework, and engineered its web implementation. This system targets lexical simplification that identifies complex terms and acronyms followed by their simplification through substitution and disambiguation for better understanding of complex language.
UR - https://www.scopus.com/pages/publications/85184663736
U2 - 10.26615/978-954-452-086-1_003
DO - 10.26615/978-954-452-086-1_003
M3 - Conference Paper
AN - SCOPUS:85184663736
T3 - TSAR 2023 - 2nd Workshop on Text Simplification, Accessibility and Readability, associated with the 14th International Conference on Recent Advances in Natural Language Processing 2023, RANLP 2023 - Proceedings of the Workshop
SP - 21
EP - 32
BT - TSAR 2023 - 2nd Workshop on Text Simplification, Accessibility and Readability, associated with the 14th International Conference on Recent Advances in Natural Language Processing 2023, RANLP 2023 - Proceedings of the Workshop
A2 - Stajner, Sanja
A2 - Saggion, Horacio
A2 - Shardlow, Matthew
A2 - Alva-Manchego, Fernando
PB - Incoma Ltd
Y2 - 7 September 2023
ER -