TY - JOUR
T1 - Fine-grained parallelization of fitness functions in bioinformatics optimization problems
T2 - Gene selection for cancer classification and biclustering of gene expression data
AU - Gomez-Pulido, Juan A.
AU - Cerrada-Barrios, Jose L.
AU - Trinidad-Amado, Sebastian
AU - Lanza-Gutierrez, Jose M.
AU - Fernandez-Diaz, Ramon A.
AU - Crawford, Broderick
AU - Soto, Ricardo
N1 - Publisher Copyright:
© 2016 The Author(s).
PY - 2016/8/31
Y1 - 2016/8/31
N2 - Background: Metaheuristics are widely used to solve large combinatorial optimization problems in bioinformatics because of the huge set of possible solutions. Two representative problems are gene selection for cancer classification and biclustering of gene expression data. In most cases, these metaheuristics, as well as other non-linear techniques, apply a fitness function to each possible solution with a size-limited population, and that step involves higher latencies than other parts of the algorithms, which is the reason why the execution time of the applications will mainly depend on the execution time of the fitness function. In addition, it is usual to find floating-point arithmetic formulations for the fitness functions. This way, a careful parallelization of these functions using the reconfigurable hardware technology will accelerate the computation, specially if they are applied in parallel to several solutions of the population. Results: A fine-grained parallelization of two floating-point fitness functions of different complexities and features involved in biclustering of gene expression data and gene selection for cancer classification allowed for obtaining higher speedups and power-reduced computation with regard to usual microprocessors. Conclusions: The results show better performances using reconfigurable hardware technology instead of usual microprocessors, in computing time and power consumption terms, not only because of the parallelization of the arithmetic operations, but also thanks to the concurrent fitness evaluation for several individuals of the population in the metaheuristic. This is a good basis for building accelerated and low-energy solutions for intensive computing scenarios.
AB - Background: Metaheuristics are widely used to solve large combinatorial optimization problems in bioinformatics because of the huge set of possible solutions. Two representative problems are gene selection for cancer classification and biclustering of gene expression data. In most cases, these metaheuristics, as well as other non-linear techniques, apply a fitness function to each possible solution with a size-limited population, and that step involves higher latencies than other parts of the algorithms, which is the reason why the execution time of the applications will mainly depend on the execution time of the fitness function. In addition, it is usual to find floating-point arithmetic formulations for the fitness functions. This way, a careful parallelization of these functions using the reconfigurable hardware technology will accelerate the computation, specially if they are applied in parallel to several solutions of the population. Results: A fine-grained parallelization of two floating-point fitness functions of different complexities and features involved in biclustering of gene expression data and gene selection for cancer classification allowed for obtaining higher speedups and power-reduced computation with regard to usual microprocessors. Conclusions: The results show better performances using reconfigurable hardware technology instead of usual microprocessors, in computing time and power consumption terms, not only because of the parallelization of the arithmetic operations, but also thanks to the concurrent fitness evaluation for several individuals of the population in the metaheuristic. This is a good basis for building accelerated and low-energy solutions for intensive computing scenarios.
KW - Biclustering
KW - Cancer classification
KW - FPGA
KW - Fitness function
KW - Floating-point arithmetic
KW - Metaheuristics
KW - Parallelism
UR - http://www.scopus.com/inward/record.url?scp=84984671970&partnerID=8YFLogxK
U2 - 10.1186/s12859-016-1200-9
DO - 10.1186/s12859-016-1200-9
M3 - Article
C2 - 27581798
AN - SCOPUS:84984671970
SN - 1471-2105
VL - 17
JO - BMC Bioinformatics
JF - BMC Bioinformatics
IS - 1
M1 - 330
ER -