A first approach to the automatic detection of zero subjects and impersonal constructions in Portuguese

Luz Rello*, Gabriela Ferraro, Iria Gayo

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

In this paper we present a first approximation to the automatic detection of zero subjects and impersonal constructions in Brazilian Portuguese. To the best of our knowledge, this is the first attempt of approaching such task using machine learning in Portuguese. We compiled a corpus containing more than 5,600 instances annotated with the classes to be identified: explicit subjects, zero subjects or pronouns and impersonal constructions. We applied machine learning using linguistically motivated features to classify the instances. The results are modest but promising and provide guidance for future work.

Original languageEnglish
Pages (from-to)163-170
Number of pages8
JournalProcesamiento del Lenguaje Natural
Volume49
Publication statusPublished - Sept 2012
Externally publishedYes

Fingerprint

Dive into the research topics of 'A first approach to the automatic detection of zero subjects and impersonal constructions in Portuguese'. Together they form a unique fingerprint.

Cite this