Distributional analysis applied to terminology extraction: First results in the domain of psychiatry in Spanish

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

This paper presents the first results of a new method for terminology extraction based on distributional analysis. The intuition behind the algorithm is that single or multi-word lexical units that refer to specialised concepts will show a characteristic co-occurrence pattern, described as a tendency to appear in the same contexts with other conceptually related terms. E.g. the term fluoxetine will systematically appear in the same sentences with other related terms such as depression, serotonin reuptake inhibitor, obsessive-compulsive disorder and others. Of course, terms will co-occur with general vocabulary units as well, but not with a characteristic pattern as when a conceptual relation holds. Experimental evaluation of this method was conducted in a corpus of psychiatry journals from Spain and Latin America, and concluded that the results are significantly better than other methods.

Original languageEnglish
Pages (from-to)141-170
Number of pages30
JournalTerminology
Volume22
Issue number2
DOIs
StatePublished - 2016
Externally publishedYes

Keywords

  • Co-occurrence
  • Distributional semantics
  • Terminology extraction
  • Text-mining
  • Topic signatures

Fingerprint

Dive into the research topics of 'Distributional analysis applied to terminology extraction: First results in the domain of psychiatry in Spanish'. Together they form a unique fingerprint.

Cite this