TY - JOUR
T1 - Distributional analysis applied to terminology extraction
T2 - First results in the domain of psychiatry in Spanish
AU - Nazar, Rogelio
N1 - Publisher Copyright:
© 2016 John Benjamins Publishing Company.
PY - 2016
Y1 - 2016
N2 - This paper presents the first results of a new method for terminology extraction based on distributional analysis. The intuition behind the algorithm is that single or multi-word lexical units that refer to specialised concepts will show a characteristic co-occurrence pattern, described as a tendency to appear in the same contexts with other conceptually related terms. E.g. the term fluoxetine will systematically appear in the same sentences with other related terms such as depression, serotonin reuptake inhibitor, obsessive-compulsive disorder and others. Of course, terms will co-occur with general vocabulary units as well, but not with a characteristic pattern as when a conceptual relation holds. Experimental evaluation of this method was conducted in a corpus of psychiatry journals from Spain and Latin America, and concluded that the results are significantly better than other methods.
AB - This paper presents the first results of a new method for terminology extraction based on distributional analysis. The intuition behind the algorithm is that single or multi-word lexical units that refer to specialised concepts will show a characteristic co-occurrence pattern, described as a tendency to appear in the same contexts with other conceptually related terms. E.g. the term fluoxetine will systematically appear in the same sentences with other related terms such as depression, serotonin reuptake inhibitor, obsessive-compulsive disorder and others. Of course, terms will co-occur with general vocabulary units as well, but not with a characteristic pattern as when a conceptual relation holds. Experimental evaluation of this method was conducted in a corpus of psychiatry journals from Spain and Latin America, and concluded that the results are significantly better than other methods.
KW - Co-occurrence
KW - Distributional semantics
KW - Terminology extraction
KW - Text-mining
KW - Topic signatures
UR - http://www.scopus.com/inward/record.url?scp=85012157972&partnerID=8YFLogxK
U2 - 10.1075/term.22.2.01naz
DO - 10.1075/term.22.2.01naz
M3 - Article
AN - SCOPUS:85012157972
VL - 22
SP - 141
EP - 170
JO - Terminology
JF - Terminology
SN - 0929-9971
IS - 2
ER -