Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/31755
Full metadata record
DC Field | Value | Language
dc.contributor.author | Sam, S | -
dc.contributor.author | Dabreo, SM | -
dc.date.accessioned | 2025-08-18T09:03:12Z | -
dc.date.available | 2025-08-18T09:03:12Z | -
dc.date.issued | 2025-05-27 | -
dc.identifier | ORCiD: Steven Sam https://orcid.org/0000-0002-4353-6118 | -
dc.identifier.citation | Sam, S. and Dabreo, S.M. (2025) 'Crop recommendation with machine learning: leveraging environmental and economic factors for optimal crop selection', arXiv preprint arXiv:2505.21201, pp. 1 - x. doi: 10.48550/arXiv.2505.21201. | en_US
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/31755 | -
dc.description | Data availability – The data that support the findings of this study are openly available from three sources: Kaggle (https://www.kaggle.com/datasets/vihith12/crop-yield recommendationdataset) for environmental parameters, and the India Directorate of Economics and Statistics (https://eands.dacnet.nic.in/Cost_of_Cultivation.htm) and the Farmer’s Portal (https://farmer.gov.in/mspstatements.aspx) for the economic parameters of cost and price. | en_US
dc.description.abstract | Agriculture constitutes a primary source of food production, economic growth and employment in India, but the sector is confronted with low farm productivity and yields, aggravated by increased pressure on natural resources and adverse climate change variability. Efforts involving the green revolution, land irrigation, improved seeds and organic farming have yielded suboptimal outcomes. The adoption of computational tools like crop recommendation systems offers a new way to provide insights and help farmers tackle low productivity. However, most agricultural recommendation systems in India focus narrowly on environmental factors and regions, limiting accurate predictions of high-yield, profitable crops. This study uses environmental and economic factors with 19 crops across 15 states to develop and evaluate Random Forest and SVM models using 10-fold Cross Validation, Time-series Split, and Lag Variables. The 10-fold cross validation showed high accuracy (RF: 99.96%, SVM: 94.71%) but raised overfitting concerns. Introducing temporal order, which better reflects real-world conditions, reduced performance (RF: 78.55%, SVM: 71.18%) in the Time-series Split. To further increase model accuracy while maintaining the temporal order, the Lag Variables approach was employed, which resulted in improved performance (RF: 83.62%, SVM: 74.38%) compared to the Time-series Split approach. Overall, the models in the Time-series Split and Lag Variables approaches offer practical insights by handling temporal dependencies and enhancing their adaptability to changing agricultural conditions over time. Consequently, the study identifies the Random Forest model built on the Lag Variables as the preferred algorithm for optimal crop recommendation in the Indian context. | en_US
dc.description.sponsorship | ... | en_US
dc.format.extent | 1 - 22 | -
dc.format.medium | Electronic | -
dc.language.iso | en_US | en_US
dc.publisher | Cornell University | en_US
dc.relation.uri | https://www.kaggle.com/datasets/vihith12/crop-yield recommendationdataset | -
dc.relation.uri | https://eands.dacnet.nic.in/Cost_of_Cultivation.htm | -
dc.rights | Creative Commons Attribution 4.0 International | -
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | -
dc.subject | crop recommendation model | en_US
dc.subject | random forest | en_US
dc.subject | support vector machines | en_US
dc.subject | Indian agriculture | en_US
dc.subject | exploratory data analysis | en_US
dc.title | Crop recommendation with machine learning: leveraging environmental and economic factors for optimal crop selection | en_US
dc.type | Article | en_US
dc.identifier.doi | https://doi.org/10.48550/arXiv.2505.21201 | -
dc.relation.isPartOf | arXiv preprint arXiv:2505.21201 | -
pubs.publication-status | Published | -
dc.identifier.eissn | 2331-8422 | -
dc.rights.license | https://creativecommons.org/licenses/by/4.0/legalcode.en | -
dc.rights.holder | The Author(s) | -
Appears in Collections: Dept of Computer Science Research Papers

Files in This Item:
File | Description | Size | Format
Preprint.pdf | Copyright © 2025 The Author(s). This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/). | 1.62 MB | Adobe PDF


This item is licensed under a Creative Commons License.