TY - GEN
T1 - Classification tools for carotenoid content estimation in manihot esculenta via metabolomics and machine learning
AU - Moresco, Rodolfo
AU - Afonso, Telma
AU - Uarrota, Virgílio G.
AU - Navarro, Bruno Bachiega
AU - Nunes, Eduardo da C.
AU - Rocha, Miguel
AU - Maraschin, Marcelo
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2017
Y1 - 2017
N2 - Cassava genotypes (Manihot esculenta Crantz) with high pro-vitamin A activity have been identified as a strategy to reduce the prevalence of deficiency of this vitamin. The color variability of cassava roots, which can vary from white to red, is related to the presence of several carotenoid pigments. The present study has shown how CIELAB color measurement on cassava roots tissue can be used as a non-destructive and very fast technique to quantify the levels of carotenoids in cassava root samples, avoiding the use of more expensive analytical techniques for compound quantification, such as UV-visible spectrophotometry and the HPLC. For this, we used machine learning techniques, associating the colorimetric data (CIELAB) with the data obtained by UV-vis and HPLC, to obtain models of prediction of carotenoids for this type of biomass. Best values of R2 (above 90%) were observed for the predictive variable TCC determined by UV-vis spectrophotometry. When we tested the machine learning models using the CIELAB values as inputs, for the total carotenoids contents quantified by HPLC, the Partial Least Squares (PLS), Support Vector Machines, and Elastic Net models presented the best values of R2 (above 40%) and Root-Mean-Square Error (RMSE). For the carotenoid quantification by UV-vis spectrophotometry, R2 (around 60%) and RMSE values (around 6.5) are more satisfactory. Ridge regression and Elastic Network showed the best results. It can be concluded that the use colorimetric technique (CIELAB) associated with UV-vis/HPLC and statistical techniques of prognostic analysis through machine learning can predict the content of total carotenoids in these samples, with good precision and accuracy.
AB - Cassava genotypes (Manihot esculenta Crantz) with high pro-vitamin A activity have been identified as a strategy to reduce the prevalence of deficiency of this vitamin. The color variability of cassava roots, which can vary from white to red, is related to the presence of several carotenoid pigments. The present study has shown how CIELAB color measurement on cassava roots tissue can be used as a non-destructive and very fast technique to quantify the levels of carotenoids in cassava root samples, avoiding the use of more expensive analytical techniques for compound quantification, such as UV-visible spectrophotometry and the HPLC. For this, we used machine learning techniques, associating the colorimetric data (CIELAB) with the data obtained by UV-vis and HPLC, to obtain models of prediction of carotenoids for this type of biomass. Best values of R2 (above 90%) were observed for the predictive variable TCC determined by UV-vis spectrophotometry. When we tested the machine learning models using the CIELAB values as inputs, for the total carotenoids contents quantified by HPLC, the Partial Least Squares (PLS), Support Vector Machines, and Elastic Net models presented the best values of R2 (above 40%) and Root-Mean-Square Error (RMSE). For the carotenoid quantification by UV-vis spectrophotometry, R2 (around 60%) and RMSE values (around 6.5) are more satisfactory. Ridge regression and Elastic Network showed the best results. It can be concluded that the use colorimetric technique (CIELAB) associated with UV-vis/HPLC and statistical techniques of prognostic analysis through machine learning can predict the content of total carotenoids in these samples, with good precision and accuracy.
KW - Carotenoids
KW - Cassava genotypes
KW - Chemometrics
KW - Descriptive models
KW - HPLC
KW - Machine learning
KW - UV-vis
UR - http://www.scopus.com/inward/record.url?scp=85025160764&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-60816-7_34
DO - 10.1007/978-3-319-60816-7_34
M3 - Conference contribution
AN - SCOPUS:85025160764
SN - 9783319608150
T3 - Advances in Intelligent Systems and Computing
SP - 280
EP - 288
BT - 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, 2017
A2 - Rocha, Miguel
A2 - De Paz, Juan F.
A2 - Pinto, Tiago
A2 - Fdez-Riverola, Florentino
A2 - Mohamad, Mohd Saberi
PB - Springer Verlag
T2 - 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, PACBB 2017
Y2 - 21 June 2017 through 23 June 2017
ER -