Towards advanced collocation error correction in Spanish learner corpora

Gabriela Ferraro, Rogelio Nazar, Margarita Alonso Ramos, Leo Wanner

Research output: Contribution to journal › Article › peer-review

12 Citations (Scopus)


Collocations, in the sense of idiosyncratic binary lexical co-occurrences, are one of the biggest challenges for any language learner. Even advanced learners make collocation mistakes: they literally translate collocation elements from their native tongue, create new words as collocation elements, choose a wrong subcategorization for one of the elements, etc. Therefore, automatic collocation error detection and correction is increasingly in demand. However, while state-of-the-art models predict with reasonable accuracy whether a given co-occurrence is a valid collocation or not, only a few of them manage to suggest appropriate corrections with an acceptable hit rate. Most often, a ranked list of correction options is offered, from which the learner then has to choose. This is clearly unsatisfactory. Our proposal focuses on this critical part of the problem in the context of the acquisition of Spanish as a second language. For collocation error detection, we use a frequency-based technique. To improve on collocation error correction, we discuss three different metrics with respect to their capability to select the most appropriate correction of miscollocations found in our learner corpus.
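As a rough illustration of what a frequency-based detection step could look like, the sketch below flags a learner's word pair as a suspected miscollocation when it is rare in a reference list of pairs. This is not the authors' implementation: the function name `is_suspect`, the threshold `min_count`, and the toy reference pairs are all assumptions made for illustration only.

```python
from collections import Counter

# Toy reference data (invented for illustration): (base verb, noun) pairs
# as they might be extracted from a native-speaker corpus.
reference_pairs = [
    ("tomar", "decisión"),
    ("tomar", "decisión"),
    ("tomar", "decisión"),
    ("dar", "paseo"),
    ("dar", "paseo"),
]

# Frequency table of co-occurrences in the reference data.
counts = Counter(reference_pairs)


def is_suspect(pair, counts, min_count=2):
    """Return True if the pair occurs fewer than min_count times.

    min_count is a hypothetical threshold; a real system would tune it
    against annotated learner data.
    """
    return counts[pair] < min_count


# A literal translation of "make a decision" is unseen in the reference data.
print(is_suspect(("hacer", "decisión"), counts))
# The attested collocation passes the frequency check.
print(is_suspect(("tomar", "decisión"), counts))
```

A check of this kind only decides whether a pair looks valid; choosing the best replacement among candidate corrections is the separate step that the abstract's three metrics address.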

Original language: English
Pages (from-to): 45-64
Number of pages: 20
Journal: Language Resources and Evaluation
Status: Published - Mar 2014
Externally published

