Data mining technique for fast retrieval of similar waveforms in Fusion massive databases

J. Vega, A. Pereira, A. Portas, S. Dormido-Canto, G. Farias, R. Dormido, J. Sánchez, N. Duro, M. Santos, E. Sánchez, G. Pajares

Research output: Contribution to journal, Article, peer-reviewed

15 Citations (Scopus)

Abstract

Fusion measurement systems generate similar waveforms when plasma behaviour is reproducible. A major difficulty in data analysis is the rapid, automated identification of a set of discharges with comparable behaviour, i.e. discharges with "similar" waveforms. Here we introduce a new technique for rapid searching and retrieval of "similar" signals. The approach consists of building a classification system that avoids traversing the whole database looking for similarities. The classification system reduces the dimensionality of the problem (by means of waveform feature extraction) and restricts the search space to just the most probable "similar" waveforms (by means of clustering techniques). In the search procedure, the input waveform is first classified into one of the existing clusters. Then, a similarity measure is computed between the input signal and every element of that cluster in order to identify the most similar waveforms. The inner product of normalized vectors is used as the similarity measure because it makes the search independent of signal gain and polarity. This development has recently been applied to the TJ-II stellarator databases and has been integrated into the TJ-II remote participation system.
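
The search procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the feature extractor or the clustering method, so `extract_features` (block averaging), the precomputed cluster centroids, and the cluster layout are hypothetical stand-ins. Taking the absolute value of the normalized inner product is likewise an assumption about how polarity independence is obtained.

```python
import math


def normalize(v):
    """Scale a vector to unit Euclidean length, which removes signal gain."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize an all-zero vector")
    return [x / norm for x in v]


def similarity(a, b):
    """Inner product of the normalized vectors.

    Assumption: the absolute value is taken so that an inverted signal
    scores the same as the original (polarity independence).
    """
    na, nb = normalize(a), normalize(b)
    return abs(sum(x * y for x, y in zip(na, nb)))


def extract_features(waveform, n_features=4):
    """Hypothetical feature extraction: the mean of equal-length blocks."""
    block = len(waveform) // n_features
    if block == 0:
        raise ValueError("waveform is shorter than n_features")
    return [
        sum(waveform[i * block:(i + 1) * block]) / block
        for i in range(n_features)
    ]


def search(query_waveform, clusters, top_k=3):
    """Return the top_k (score, shot_id) pairs from the best-matching cluster.

    `clusters` is a list of dicts, each with a precomputed "centroid"
    feature vector and "members", a list of (shot_id, feature_vector).
    """
    q = extract_features(query_waveform)
    # Step 1: classify the input into one existing cluster.
    best = max(clusters, key=lambda c: similarity(q, c["centroid"]))
    # Step 2: compare the input only with the members of that cluster.
    scored = [(similarity(q, feats), shot) for shot, feats in best["members"]]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]
```

Only the members of the selected cluster are scored, so the cost of a query depends on the size of one cluster rather than on the size of the whole database.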

Original language: English
Pages (from-to): 132-139
Number of pages: 8
Journal: Fusion Engineering and Design
Volume: 83
Issue number: 1
DOI
Status: Published - Jan 2008
Externally published
