6th International Workshop on

Linguistically Interpreted Corpora (LINC-2005)

A workshop to be held at

The Second International Joint Conference on Natural Language Processing (IJCNLP-05)

Jeju Island, 15 October 2005

Submission Deadline: June 7, 2005

Organized by:

Kyonghee Paik (ATR Spoken Language Translation Research Laboratories),
Francis Bond (NTT Communication Science Laboratories), and
Stephan Oepen (University of Oslo & Stanford University)

Topic and Motivation

Large linguistically interpreted corpora play an increasingly important role for machine learning, evaluation, psycholinguistics as well as theoretical linguistics. Many research groups are engaged in the creation of corpus resources annotated with morphological, syntactic, semantic and discourse information for a variety of languages. We aim to bring together these activities in order to identify and disseminate best practice in the development and utilization of linguistically interpreted corpora.

The aim of the workshop is to exchange and propagate research results with respect to the annotation, conversion and exploitation of corpora taking into account different applications and theoretical investigations in the field of language technology and research. We invite submissions of papers constituting substantial, original, and unpublished work on all aspects of linguistically interpreted corpora, including, but not limited to:

  • creation of practical annotation schemes;
  • efficient annotation techniques;
  • automation of corpus annotation;
  • tools supporting corpus conversions;
  • validation including consistency checking of corpora;
  • browsing corpora and searching for instances of linguistic phenomena;
  • relating actual annotation to contemporary linguistic theory;
  • interpretation of quantitative results; and
  • use of annotated corpora in the automated induction of linguistic knowledge.

As this is the first time the workshop will be held outside Europe, we particularly welcome work on non-European languages and the problems associated with them --- segmentation, spelling variation, different encodings and so forth.

Workshop Programme

9:40 – 9:55 Welcome & Introduction
Kyonghee Paik, Francis Bond, and Stephan Oepen
09:45 – 10:15 The TIGER 700 RMRS Bank: RMRS Construction from Dependencies
Kathrin Spreyer and Anette Frank
10:15 – 10:45 Obtaining Japanese Lexical Units for Semantic Frames from
Berkeley FrameNet using a Bilingual Corpus

Toshiyuki Kanamaru, Masaki Murata, Kow Kuroda, and Hitoshi Isahara
10:45 – 11:00 Coffee Break
11:00 – 11:30 A Study of Applying BTM Model on the Chinese Chunk Bracketing
Jia-Lin Tsai
11:30 – 12:00 Integration of a Lexical Type Database with a Linguistically Interpreted Corpus
Chikara Hashimoto, Francis Bond, Takaaki Tanaka, and Melanie Siegel
12:00 – 13:15 Lunch Break
13:15 – 14:15 Building Dialogue Corpora for Nursing Activity Analysis
Hiromi itoh Ozaku, Akinori Abe, Noriaki Kuwahara,
Futoshi Naya, Kiyoshi Kogure, and Kaoru Sagara
The Syntactically Annotated ICE Corpus and the
Automatic Induction of a Formal Grammar

Alex Chengyu Fang
Syntactic Identification of Attribution in the RST Treebank
Peter Rossen Skadhauge and Daniel Hardt
Linguistically Enriched Corpora for Establishing
Variation in Support Verb Constructions

Begoña Villada Moirón
14:15 – 14:45 Error Annotation for Corpus of Japanese Learner English
Emi Izumi, Kiyotaka Uchimoto, and Hitoshi Isahara
14:45 – 15:15 Some remarks on the Annotation of Quantifying Noun Groups in Treebanks
Kristina Spranger
15:15 – 15:30 Coffee Break
15:30 – 16:00 Annotating Honorifics Denoting Social Ranking of Referents
Shigeko Nariyama, Hiromi Nakaiwa, and Melanie Siegel
16:00 – 17:30 Panel and General Discussion


Paper submission deadline: June 7, 2005
Notification of acceptance: July 18, 2005
Camera ready manuscripts due: August 5, 2005 (instructions for authors)
Workshop date: October 15, 2005


Please refer to the main conference web pages for registration details.


Paper submissions must be anonymous and are limited to at most 8 pages including references, figures etc. Authors are required to follow the guidelines of IJCNLP-05 conference workshop style, by using either the LaTeX style file or the MS Word document template shown in the IJCNLP-05 style file page). Only electronic submissions will be accepted. Please email your submission in PDF (preferred), PostScript, or MS Word to the following address:

Each submission should also specify the author's name, affiliation, postal address, email address and title in the body of the email message. For more information, please make contact with the workshop co-chairs by using the same e-mail address above.

We are currently investigating subsequent publication as a journal special issue.

Programme Committee

  • Anne Abeille, Paris
  • Olga Babko-Malaya, Pittsburgh
  • Francis Bond (co-chair), Keihanna
  • Colin Baker, Berkely
  • Pierrette Bouillon, Geneva
  • Thorsten Brants, Palo Alto
  • John Carroll, Sussex
  • Huang Chu-Ren, Taipei
  • Montserrat Civit Torruella, Barcelona
  • Tomaz Erjavec, Ljubljana
  • Jan Hajic, Prague
  • Chung-hye Han, British Columbia
  • Silvia Hansen, Saarbrücken
  • Erhard Hinrichs, Tübingen
  • Beom-mo Kang, Seoul
  • Sadao Kurohashi, Tokyo
  • Frank Keller, Edinburgh
  • Brigitte Krenn, Vienna
  • Joakim Nivre, Vaxjö
  • Stephan Oepen (co-chair), Oslo and Stanford
  • Kyonghee Paik (co-chair), Keihanna
  • Laurent Romary, Nancy
  • Kiril Simov, Sofia
  • Kiyotaka Uchimoto, Keihanna
  • Hans Uszkoreit, Saarbrücken
  • Atro Voutilainen, Helsinki
  • Nianwen Xue, Pittsburgh

Previous Workshops

last modified: 2005-03-23 (oe@csli.stanford.edu)