Grammars

From DELPH-IN

Revision as of 04:20, 12 February 2013 by EmilyBender (Talk | contribs)
Jump to: navigation, search

DELPH-IN members share a commitment to re-usable, multi-purpose resources and active exchange. Based on contributions from several members and joint development over many years, an open-source repository of software and linguistic resources has been created that has wide usage in education, research, and application building.

At the core of the DELPH-IN repository is agreement among partners on a shared set of linguistic assumptions (grounded in HPSG and Minimal Recursion Semantics) and on a common formalism (i.e. logic) for linguistic description in typed feature structures. The formalism is implemented in several development and processing environments (that can serve differing purposes) and enables the exchange of grammars and lexicons across platforms. Formalism continuity, on the other hand, has allowed DELPH-IN researchers to develop several comprehensive, wide-coverage grammars of diverse languages that can be processed by a variety of software tools.

Linguistic resources that are available as part of the DELPH-IN open-source repository include broad-coverage grammars for English, German, and Japanese, as well as a set of ‘emerging’ grammars for other languages. Here is some background on a few of the most extensive grammars:

LinGO English Resource Grammar (ERG)

Being developed at the Center for the Study of Language and Information (CSLI) at Stanford University since 1993. The ERG was originally developed within the Verbmobil machine translation effort, but over the past few years has been ported to additional domains and significantly extended. The grammar includes a hand-built lexicon of over 35,000 lexemes and allows interfacing to external lexical resources (like COMLEX). The main grammar developer is Dan Flickinger, with contributions by (among others) Emily Bender, Rob Malouf, and Jeff Smith.

JaCY Japanese Grammar

Jacy is a large scale grammar of Japanese, currently mainly being used in the JaEn machine translaiton sytem. The grammar builds on the ChaSen package for word segmentation, morphological analysis, and a treatment of unknown words. It has been developed at multiple sites. It was originally developed at the German National Research Center in AI (DFKI GmbH) and Saarland University (both in Saarbrücken, Germany) then through cooperation with YY Technologies, later NTT Communications Research Laboratories and the National Institutue for Information Technologies, Japan and now Nanyang Technological University in Singapore. Melanie Siegel, Emily Bender and Francis Bond are the main developers.

Spanish Resource Grammar (SRG)

A computational grammar for Spanish currently being developed at Institut Universitari de Lingüística Aplicada of Universitat Pompeu Fabra. MSG development is currently funded by the Juan de la Cierva program (MEC, Spain) within the TEXTERM-II project (BFF2003-2111). Montserrat Marimon and Núria Bel are the main developers of the SRG.
Personal tools