An exercise in reuse of resources: Adapting general discourse coreference resolution for detecting lexical chains in patent documentation

Nadjet Bouayad-Agha, Alicia Burga, Gerard Casamayor, Joan Codina, Rogelio Antonio Nazar, Leo Wanner

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

5 Scopus citations

Abstract

The Stanford Coreference Resolution System (StCR) is a multi-pass, rule-based system that scored best in the CoNLL 2011 shared task on general discourse coreference resolution. We describe how the StCR has been adapted to the specific domain of patents and give some cues on how it can be adapted to other domains. We present a linguistic analysis of the patent domain and how we were able to adapt the rules to the domain and to expand coreferences with some lexical chains. A comparative evaluation shows an improvement of the coreference resolution system, denoting that (i) StCR is a valuable tool across different text genres; (ii) specialized discourse NLP may significantly benefit from general discourse NLP research.

Original language: English
Title of host publication: Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
Editors: Nicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Publisher: European Language Resources Association (ELRA)
Pages: 3214-3221
Number of pages: 8
ISBN (Electronic): 9782951740884
State: Published - 2014
Externally published: Yes
Event: 9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
Duration: 26 May 2014 to 31 May 2014

Publication series

Name: Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

Conference

Conference: 9th International Conference on Language Resources and Evaluation, LREC 2014
Country: Iceland
City: Reykjavik
Period: 26/05/14 to 31/05/14

Keywords

  • Coreference resolution
  • Domain adaptation
  • Lexical chain
  • Patents
  • Stanford coreference resolution system
