Inducción automática de una taxonomía multilingüe de marcadores discursivos: primeros resultados en castellano, inglés, francés, alemán y catalán

Translated title of the contribution: Automatic induction of a multilingual taxonomy of discourse markers: first results in Spanish, English, French, German and Catalan

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

This paper presents a methodological proposal por the automatic induction of a multilingual taxonomy of discourse markers which, in the case of English, correspond to units such as however, therefore, by the way, etc. First, a method is proposed to separate such units from the rest of the vocabulary using a measure of information, followed by a method to group them using a parallel corpus. Finally, this categorization is used as the basis for the extraction and classification of new units. Apart from the method, the first results are described, which consist of a database that currently surpasses 2600 units.

Translated title of the contributionAutomatic induction of a multilingual taxonomy of discourse markers: first results in Spanish, English, French, German and Catalan
Original languageSpanish
Pages (from-to)127-138
Number of pages12
JournalProcesamiento de Lenguaje Natural
Volume67
DOIs
StatePublished - Sep 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'Automatic induction of a multilingual taxonomy of discourse markers: first results in Spanish, English, French, German and Catalan'. Together they form a unique fingerprint.

Cite this