TY - GEN
T1 - A Comparison of Learnheuristics Using Different Reward Functions to Solve the Set Covering Problem
AU - Crawford, Broderick
AU - Soto, Ricardo
AU - Cisternas-Caneo, Felipe
AU - Tapia, Diego
AU - de la Fuente-Mella, Hanns
AU - Palma, Wenceslao
AU - Lemus-Romani, José
AU - Castillo, Mauricio
AU - Becerra-Rozas, Marcelo
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - The high computational capacity that we have thanks to the new technologies allows us to communicate two great worlds such as optimization methods and machine learning. The concept behind the hybridization of both worlds is called Learnheuristics which allows to improve optimization methods through machine learning techniques where the input data for learning is the data produced by the optimization methods during the search process. Among the most outstanding machine learning techniques is Q-Learning whose learning process is based on rewarding or punishing the agents according to the consequences of their actions and this reward or punishment is carried out by means of a reward function. This work seeks to compare different Learnheuristics instances composed by Sine Cosine Algorithm and Q-Learning whose different lies in the reward function applied. Preliminary results indicate that there is an influence on the quality of the solutions based on the reward function applied.
AB - The high computational capacity that we have thanks to the new technologies allows us to communicate two great worlds such as optimization methods and machine learning. The concept behind the hybridization of both worlds is called Learnheuristics which allows to improve optimization methods through machine learning techniques where the input data for learning is the data produced by the optimization methods during the search process. Among the most outstanding machine learning techniques is Q-Learning whose learning process is based on rewarding or punishing the agents according to the consequences of their actions and this reward or punishment is carried out by means of a reward function. This work seeks to compare different Learnheuristics instances composed by Sine Cosine Algorithm and Q-Learning whose different lies in the reward function applied. Preliminary results indicate that there is an influence on the quality of the solutions based on the reward function applied.
KW - Learnheuristic
KW - Q-Learning
KW - Reinforcement learning
KW - Reward function
KW - Sine cosine algorithm
UR - http://www.scopus.com/inward/record.url?scp=85115138840&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-85672-4_6
DO - 10.1007/978-3-030-85672-4_6
M3 - Conference contribution
AN - SCOPUS:85115138840
SN - 9783030856717
T3 - Communications in Computer and Information Science
SP - 74
EP - 85
BT - Optimization and Learning - 4th International Conference, OLA 2021, Proceedings
A2 - Dorronsoro, Bernabé
A2 - Ruiz, Patricia
A2 - Amodeo, Lionel
A2 - Pavone, Mario
PB - Springer Science and Business Media Deutschland GmbH
T2 - 4th International Conference on Optimization and Learning, OLA 2021
Y2 - 21 June 2021 through 23 June 2021
ER -