EBSCO Logo
Connecting you to content on EBSCOhost
Title

Smart Coffee: Machine Learning Techniques for Estimating Arabica Coffee Yield.

Authors

Freitas, Cleverson Henrique de; Coelho, Rubens Duarte; Costa, Jéfferson de Oliveira; Sentelhas, Paulo Cesar

Abstract

Coffee is a global commodity, with Brazil being a major producer, particularly in the Minas Gerais state. This study applied machine learning to predict the Arabica coffee yield in the region, analyzing two groups of cultivars (G1 and G2) using data from 1993 to 2020. The Factor Analysis of Mixed Data (FAMD) was employed to explore the relationships between climatic factors, management practices, and the coffee yield. Four machine learning models, such as Multiple Linear Regression (MLR), Random Forest (RF), XGBoost (XGB), and Support Vector Machines (SVM) were calibrated and evaluated for yield prediction. The FAMD revealed complex interactions among variables, requiring four principal components to explain approximately 64.6% of the total variance. Management practices, such as the planting density and pruning, had a stronger influence on G1 cultivars, while G2 cultivars were more sensitive to climatic conditions, particularly the air temperature. Among the machine learning models, RF and XGB performed best in the yield estimation, whereas MLR and SVM were less effective, particularly for values above 60 bags ha−1 (1 bag = 60 kg). These findings underscore the variability in the yield across cultivars and demonstrate the potential of machine learning to guide tailored management strategies for different coffee cultivars.

Subjects

SUPPORT vector machines; COMPUTERS; ESTIMATION theory; AGRICULTURE; COFFEE

Publication

AgriEngineering, 2024, Vol 6, Issue 4, p4925

ISSN

2624-7402

Publication type

Academic Journal

DOI

10.3390/agriengineering6040281

EBSCO Connect | Privacy policy | Terms of use | Copyright | Manage my cookies
Journals | Subjects | Sitemap
© 2025 EBSCO Industries, Inc. All rights reserved