ParGramBank: The ParGram parallel treebank

Sebastian Sulger, Miriam Butt, Tracy Holloway King, Paul Meurer, Tibor Laczkó, György Rákosi, Cheikh Bamba Dione, Helge Dyvik, Victoria Rosén, Koenraad De Smedt, Agnieszka Patejuk, Özlem Çetinoǧlu, I. Wayan Arka, Meladel Mistica

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    18 Citations (Scopus)


    This paper discusses the construction of a parallel treebank currently involving ten languages from six language families. The treebank is based on deep LFG (Lexical-Functional Grammar) grammars that were developed within the framework of the ParGram (Parallel Grammar) effort. The grammars produce output that is maximally parallelized across languages and language families. This output forms the basis of a parallel treebank covering a diverse set of phenomena. The treebank is publicly available via the INESS treebanking environment, which also allows for the alignment of language pairs. We thus present a unique, multilayered parallel treebank that represents more and different types of languages than are available in other treebanks, that represents deep linguistic knowledge and that allows for the alignment of sentences at several levels: dependency structures, constituency structures and POS information.

    Original languageEnglish
    Title of host publicationLong Papers
    PublisherAssociation for Computational Linguistics (ACL)
    Number of pages11
    ISBN (Print)9781937284503
    Publication statusPublished - 2013
    Event51st Annual Meeting of the Association for Computational Linguistics, ACL 2013 - Sofia, Bulgaria
    Duration: 4 Aug 20139 Aug 2013

    Publication series

    NameACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference


    Conference51st Annual Meeting of the Association for Computational Linguistics, ACL 2013


    Dive into the research topics of 'ParGramBank: The ParGram parallel treebank'. Together they form a unique fingerprint.

    Cite this