Abstract
This paper presents an analysis of the errors of a machine learning method that allow us to propose changes to improve it in future developments. The evaluated system detects Spanish subject ellipsis and yields an accuracy of 85.3%. We extract the erroneously classified instances of our training data (1,001) and classify the errors. We perform an analysis of these instances taking into account the features and the linguistic patterns involved, which motivate the inclusion of new features and rules in the system.
Original language | English |
---|---|
Pages (from-to) | 223-230 |
Number of pages | 8 |
Journal | Procesamiento del Lenguaje Natural |
Volume | 47 |
Publication status | Published - 2011 |
Externally published | Yes |