Combining resources: Taxonomy extraction from multiple dictionaries

Rogelio Nazar, Maarten Janssen

Resultado de la investigación: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

5 Citas (Scopus)

Resumen

The idea that dictionaries are a good source for (computational) information has been around for a long while, and the extraction of taxonomic information from them is something that has been attempted several times. However, such information extraction was typically based on the systematic analysis of the text of a single dictionary. In this paper, we demonstrate how it is possible to extract taxonomic information without any analysis of the specific text, by comparing the same lexical entry in a number of different dictionaries. Counting word frequencies in the dictionary entry for the same word in different dictionaries leads to a surprisingly good recovery of taxonomic information, without the need for any syntactic analysis of the entries in question nor any kind of language-specific treatment. As a case in point, we will show in this paper an experiment extracting hyperonymy relations from several Spanish dictionaries, measuring the effect that the different number of dictionaries have on the results.

Idioma originalInglés
Título de la publicación alojadaProceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010
EditoresDaniel Tapias, Irene Russo, Olivier Hamon, Stelios Piperidis, Nicoletta Calzolari, Khalid Choukri, Joseph Mariani, Helene Mazo, Bente Maegaard, Jan Odijk, Mike Rosner
EditorialEuropean Language Resources Association (ELRA)
Páginas1055-1061
Número de páginas7
ISBN (versión digital)2951740867, 9782951740860
EstadoPublicada - 2010
Publicado de forma externa
Evento7th International Conference on Language Resources and Evaluation, LREC 2010 - Valletta, Malta
Duración: 17 may. 201023 may. 2010

Serie de la publicación

NombreProceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010

Conferencia

Conferencia7th International Conference on Language Resources and Evaluation, LREC 2010
País/TerritorioMalta
CiudadValletta
Período17/05/1023/05/10

Huella

Profundice en los temas de investigación de 'Combining resources: Taxonomy extraction from multiple dictionaries'. En conjunto forman una huella única.

Citar esto