A Comparison of Learnheuristics Using Different Reward Functions to Solve the Set Covering Problem

Broderick Crawford, Ricardo Soto, Felipe Cisternas-Caneo, Diego Tapia, Hanns de la Fuente-Mella, Wenceslao Palma, José Lemus-Romani, Mauricio Castillo, Marcelo Becerra-Rozas

Resultado de la investigación: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

The high computational capacity that we have thanks to the new technologies allows us to communicate two great worlds such as optimization methods and machine learning. The concept behind the hybridization of both worlds is called Learnheuristics which allows to improve optimization methods through machine learning techniques where the input data for learning is the data produced by the optimization methods during the search process. Among the most outstanding machine learning techniques is Q-Learning whose learning process is based on rewarding or punishing the agents according to the consequences of their actions and this reward or punishment is carried out by means of a reward function. This work seeks to compare different Learnheuristics instances composed by Sine Cosine Algorithm and Q-Learning whose different lies in the reward function applied. Preliminary results indicate that there is an influence on the quality of the solutions based on the reward function applied.

Idioma originalInglés
Título de la publicación alojadaOptimization and Learning - 4th International Conference, OLA 2021, Proceedings
EditoresBernabé Dorronsoro, Patricia Ruiz, Lionel Amodeo, Mario Pavone
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas74-85
Número de páginas12
ISBN (versión impresa)9783030856717
DOI
EstadoPublicada - 2021
Publicado de forma externa
Evento4th International Conference on Optimization and Learning, OLA 2021 - Virtual, Online
Duración: 21 jun. 202123 jun. 2021

Serie de la publicación

NombreCommunications in Computer and Information Science
Volumen1443
ISSN (versión impresa)1865-0929
ISSN (versión digital)1865-0937

Conferencia

Conferencia4th International Conference on Optimization and Learning, OLA 2021
CiudadVirtual, Online
Período21/06/2123/06/21

Huella

Profundice en los temas de investigación de 'A Comparison of Learnheuristics Using Different Reward Functions to Solve the Set Covering Problem'. En conjunto forman una huella única.

Citar esto