TY - JOUR
T1 - Structural pattern recognition methods based on string comparison for fusion databases
AU - Dormido-Canto, S.
AU - Farias, G.
AU - Dormido, R.
AU - Vega, J.
AU - Sánchez, J.
AU - Duro, N.
AU - Vargas, H.
AU - Rattá, G.
AU - Pereira, A.
AU - Portas, A.
PY - 2008/4
Y1 - 2008/4
N2 - Databases for fusion experiments are designed to store several million waveforms. Temporal evolution signals show the same patterns under the same plasma conditions and, therefore, pattern recognition techniques allow the identification of similar plasma behaviours. This article is focused on the comparison of structural pattern recognition methods. A pattern can be composed of simpler sub-patterns, where the most elementary sub-patterns are known as primitives. Selection of primitives is an essential issue in structural pattern recognition methods, because they determine what types of structural components can be constructed. However, it should be noted that there is not a general solution to extract structural features (primitives) from data. So, four different ways to compute the primitives of plasma waveforms are compared: (1) constant length primitives, (2) adaptive length primitives, (3) concavity method and (4) concavity method for noisy signals. Each method defines a code alphabet and, in this way, the pattern recognition problem is carried out via string comparisons. Results of the four methods with the TJ-II stellarator databases will be discussed.
AB - Databases for fusion experiments are designed to store several million waveforms. Temporal evolution signals show the same patterns under the same plasma conditions and, therefore, pattern recognition techniques allow the identification of similar plasma behaviours. This article is focused on the comparison of structural pattern recognition methods. A pattern can be composed of simpler sub-patterns, where the most elementary sub-patterns are known as primitives. Selection of primitives is an essential issue in structural pattern recognition methods, because they determine what types of structural components can be constructed. However, it should be noted that there is not a general solution to extract structural features (primitives) from data. So, four different ways to compute the primitives of plasma waveforms are compared: (1) constant length primitives, (2) adaptive length primitives, (3) concavity method and (4) concavity method for noisy signals. Each method defines a code alphabet and, in this way, the pattern recognition problem is carried out via string comparisons. Results of the four methods with the TJ-II stellarator databases will be discussed.
KW - Data mining
KW - Information retrieval
KW - Nuclear fusion
KW - Structural pattern recognition
UR - http://www.scopus.com/inward/record.url?scp=42249086824&partnerID=8YFLogxK
U2 - 10.1016/j.fusengdes.2007.11.009
DO - 10.1016/j.fusengdes.2007.11.009
M3 - Article
AN - SCOPUS:42249086824
SN - 0920-3796
VL - 83
SP - 421
EP - 424
JO - Fusion Engineering and Design
JF - Fusion Engineering and Design
IS - 2-3
ER -