Automatic hypernymy discovery continues to present challenges for natural language processing. Polysemous nouns are linked to more than one hypernym and can therefore cause structural damage on a lexical taxonomy. For instance, the Spanish noun tarántula ('tarantula') is a hyponym of araña ('spider'), but this is also a polysemous noun, as it means 'chandelier' as well. It is thus necessary to determine the next hypernym in the chain, that is animal ('animal') or artefacto ('artifact'). In this paper we explore methods to solve this problem using a similarity measure that uses verb-noun co-occurrence as a predictor variable. Best results (84 % success) are obtained with a simple method that only measures co-occurrence, irrespective of any syntactic information.
|Translated title of the contribution||Tarantula -> spider -> animal: Second level hypernymy discovery based on distributional similarity methods|
|Number of pages||8|
|Journal||Procesamiento de Lenguaje Natural|
|State||Published - Mar 2020|