Reinforcement Learning Based Whale Optimizer

Marcelo Becerra-Rozas, José Lemus-Romani, Broderick Crawford, Ricardo Soto, Felipe Cisternas-Caneo, Andrés Trujillo Embry, Máximo Arnao Molina, Diego Tapia, Mauricio Castillo, Sanjay Misra, José Miguel Rubio

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations


This work proposes a Reinforcement Learning based optimizer integrating SARSA and Whale Optimization Algorithm. SARSA determines the binarization operator required during the metaheuristic process. The hybrid instance is applied to solve benchmarks of the Set Covering Problem and it is compared with a Q-learning version, showing good results in terms of fitness, specifically, SARSA beats its Q-Learning version in 44 out of 45 instances evaluated. It is worth mentioning that the only instance where it does not win is a tie. Finally, thanks to graphs presented in our results analysis we can observe that not only does it obtain good results, it also obtains a correct exploration and exploitation balance as presented in the referenced literature.

Original languageEnglish
Title of host publicationComputational Science and Its Applications – ICCSA 2021 - 21st International Conference, Proceedings
EditorsOsvaldo Gervasi, Beniamino Murgante, Sanjay Misra, Chiara Garau, Ivan Blečić, David Taniar, Bernady O. Apduhan, Ana Maria Rocha, Eufemia Tarantino, Carmelo Maria Torre
PublisherSpringer Science and Business Media Deutschland GmbH
Number of pages15
ISBN (Print)9783030870126
StatePublished - 2021
Event21st International Conference on Computational Science and Its Applications, ICCSA 2021 - Virtual, Online
Duration: 13 Sep 202116 Sep 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12957 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference21st International Conference on Computational Science and Its Applications, ICCSA 2021
CityVirtual, Online


  • Combinatorial optimization
  • Metaheuristic
  • Q-Learning
  • Swarm intelligence
  • Whale optimization algorithm


Dive into the research topics of 'Reinforcement Learning Based Whale Optimizer'. Together they form a unique fingerprint.

Cite this