TY - JOUR
T1 - Towards distributional semantics-based classification of collocations for collocation dictionaries
AU - Wanner, Leo
AU - Ferraro, Gabriela
AU - Moreno, Pol
N1 - Publisher Copyright:
© 2016 Oxford University Press. All rights reserved.
PY - 2017
Y1 - 2017
N2 - Automatic acquisition of raw source material is of great aid for the compilation of dictionaries, and, in particular, of specialized dictionaries such as collocation dictionaries. The extraction of collocations from corpora has been actively worked on since the late eighties. The quality of the state-of-the-art extraction algorithms allows the lexicographers to obtain lists of collocations they can work with. However, mere lists of collocations are not sufficient. In collocation dictionaries, collocations are grouped se-mantically, which also presupposes a semantic classification of collocations. In this article, a distributional semantics-based model is proposed that classifies collocations with respect to broad semantic categories as encountered in dictionaries. In experiments with Spanish verb-noun and noun-adjective collocations from the lexicographic field of emotion nouns, it is shown that the use of features extracted from the context of collocations is decisive for retrieval of draft entries for collocation dictionaries.
AB - Automatic acquisition of raw source material is of great aid for the compilation of dictionaries, and, in particular, of specialized dictionaries such as collocation dictionaries. The extraction of collocations from corpora has been actively worked on since the late eighties. The quality of the state-of-the-art extraction algorithms allows the lexicographers to obtain lists of collocations they can work with. However, mere lists of collocations are not sufficient. In collocation dictionaries, collocations are grouped se-mantically, which also presupposes a semantic classification of collocations. In this article, a distributional semantics-based model is proposed that classifies collocations with respect to broad semantic categories as encountered in dictionaries. In experiments with Spanish verb-noun and noun-adjective collocations from the lexicographic field of emotion nouns, it is shown that the use of features extracted from the context of collocations is decisive for retrieval of draft entries for collocation dictionaries.
UR - http://www.scopus.com/inward/record.url?scp=85021817442&partnerID=8YFLogxK
U2 - 10.1093/ijl/ecw002
DO - 10.1093/ijl/ecw002
M3 - Article
SN - 0950-3846
VL - 30
SP - 167
EP - 186
JO - International Journal of Lexicography
JF - International Journal of Lexicography
IS - 2
ER -