A wide range of statistical methods are used for the various analyses done in agriculture sector to make today’s agriculture smarter. Technology comprising models has proved to be helpful in getting more production in agriculture and it also can easily handle some of unexpected problems such as floods, draughts and demands during food shortages. Table 2.2 shows the various processes carried out in agriculture sector through analytics and machine learning techniques.

AuthorsWork regarding agricultureDescription of techniques
Poudel and ShawRelationship between climate variability and crop yieldRegression model
Rohitashw Kumar and Harendar Raj GowthamImpact on climate change and crop productivitySummary of data
Ibn Musah et al. (2018)Climate change and variability in temperature in GhanaContinuous distribution
Kumar et al. (2018)Efficient crop yield estimation of sugarcaneSVM, KNN, least square SVM
Liakos et al. (2019)Analysis of soil, water, livestock managementANN, SVM
Priya et al.Prediction of yield of cropRandom forest
RenukaAnalysis of sugarcane crop data of Karnataka stateKNN, SVM, decision trees
Zingade et al. (2017)Android application for suggesting the best profitable crops for the given weather conditionMultiple linear regression
Chourasiya et al. (2019)Seed classification for crop productionANN, SVM, MLR
Veenadhari et al.Prediction to control climatic parameters for Madhya PradeshC4.5 algorithm
AbihRainfall variability on EthiopiaRegression, correlation
Kumar et al. (2019)Comparison of weather forecasting using ML algorithmsDecision trees, KNN
Kagita et al.Optimum agriculture land allocation of Krishna deltaFuzzy membership functions, GA
Palanivel and Surianarayanan (2019)Comparison of crop yield with ML algorithmsLinear regression, ANN
Kyei-​Mensah et al. (2019)Analysis of rainfall distribution and crop production for GhanaPearson correlation
Ndamani and Watanabe (2013)Inter and intra-​annual rainfall variability for crop production in northern GhanaCoefficient of variation, correlation analysis
Kyada et al.Sensitivity analysis of rainfall forecasting of GujaratANN, adaptive neuro-​fuzzy inference system
Sumi et al.Machine learning methods for forecasting daily and monthly rainfall of Fukuoka in JapanPrincipal component analysis (PCA), artificial neural network (ANN), multivariate adaptive regression splines (MARS),
Peprah (2014)Correlation among temperature and rainfall of Asunafo forest, Ghana.Correlation analysis
Bewket (2009)Food grain production and rainfall variability for Amhara region in Ethiopia.Spearman correlation analysis, coefficient of variation.
Shinde and Khadke (2017)Maharastra’s rainfall on crop productionStandard deviation, Pearson correlation
Varsha and PaiRainfall prediction of IndiaFuzzy C-​means clustering, FRBCS
Parmar et al. (2017)Comparison of machine learning algorithms on rainfall predictionANN, SOM, CFBP, BPNN, SVM
Zaman (2018)Machine learning model on rainfall for BangladeshRegression, decision trees, random forest, KNN
Yousif et al. (2018)Implications of rainfall variability for agricultural production in Eastern SudanCoefficient of variation, simple linear regression, correlation
Bala Sai Tarun et al. (2019)Prediction of rainfall with machine learning algorithmsSVM, CART, GA
Refonaa et al.Rainfall forecast of ChennaiLinear regression
Arvind et al. (2017)Analysis of rainfall data for Trichy, TamilnaduStandard deviation, CV, probability distribution
Lobell et al.Effects of climate change on crop yieldRegression model
Wenjiao et al.Analysis of crop yields based on climatic contributionsTime series model, cross-​section model, panel model
TailorCrop production of sugarcane in Olpad region of GujaratRegression, ANOVA
Khatri (2013)Water irrigation systemFuzzy logic and artificial intelligence
Tzimopoulos et al.(2018)Relationship between rainfall and altitude of different meteorological stations in KeralaFuzzy linear regression
Archontoulis et al.Applications and regression models in agricultureNonlinear regression model
Menaka and YuvarajCrop yield prediction modelsAdaptive Neuro-​Fuzzy Inference System, MLR

Regression is a model that demonstrates the liaison between the dependent and independent variables. Most of the researchers used regression model in their studies in agriculture, while many authors concentrated on data of rainfall and crop production of various regions all around the globe. Regression is one of the supervised machine learning technique which is very flexible and provides high-​quality results (Palanivel and Surianarayanan 2019). The rainfall data of different areas of Eastern Sudan have been analysed and it provided reliable R2 values for each station. In Bangladesh, a study of various other algorithms such as naive Bayes, decision trees and regression with 77% of accuracy of results (Zaman 2018) was conducted.

Correlation is another technique of machine learning which is used to evaluate the strapping of two or more variables. In the study of rainfall variability and crop production conducted in Ghana (Ndamani and Watanabe 2013) correlation technique was used to calculate the effect of rainfall (independent variable) on crop yield (dependent variable). In the study of rainfall variability in Ethiopia (Abih 2011), correlation technique was used to state the association between the spring and summer rainfall. Peprah (2014), in the study of Asunafo forest of Ghana, shown the relationship between the climatic conditions of various crops like rice, maize, cassava, cococyam, plantain and yam and rainfall with the help of correlation.

Coefficient of variation is another technique which is used to determine of dispersion of data points in the data sequence around the mean. Ndamani and Watanabe (2013) used coefficient of variation for determining the relationship between the annual and seasonal rainfall of Ghana. Annual rainfall shows coefficient of variation of 0.18, which is taken as moderate rainfall based on correlation and mean. In the study conducted in Ethiopia, Bewket (2009) also used coefficient of variation for comparing the annual, seasonal and daily rainfall levels. Spearman correlation was applied on rainfall data to get its significance. Least square regression was used to best fit the parameter of rainfall.

Continuous distribution is another statistical method which can handle continuous data. When there is a wide range of values, continuous distribution is applied. Some functions such as Weibull, Gumbel and Frechet are the part of extreme value distribution. These functions are used to maximize the maxima value of random variable. GEVD is also a function of continuous distribution which was used in a study done by (Ibn Musah et al. 2018) to get the maximum likelihood values for temperature and rainfall variables.

Probability distribution depicts various possible effects of an incident. It is a mathematical based statistical method. It is divided into continuous probability distribution and discrete probability distribution. This method was used in the study of Tiruchirappalli rainfall data, for evaluating the distribution of rainfall over the region. Mean and standard deviation values are calculated and based on that chi-​square values are calculated with various distribution functions. Chi-​square values are used to rank the best fit of the distribution (Arvind et al. 2017).

Multiple linear regression (MLR) is modified form of linear regression and it is an effective machine learning technique. Linear regression can handle one independent variable to a dependent variable. But in MLR, more than one independent variables can be assigned to a response variable. Seed classification is done based on soil dataset. Parameters of soil were assigned as independent variables and seed type as a decisive factor in the work of Chourasiya et al. (2019). An android application has been developed by Zingade (2017) with the machine learning technique of MLR. In this application crop prediction as done according to the environmental conditions.

SVM, artificial neural network (ANN), decision trees, random forest and KNN are some supervised machine learning algorithms. Comparative analysis was made with the algorithms such as SVM, ANN, GA and CART on rainfall data. In comparison with other algorithms, ANN provided better performance with 86% accuracy (Bala Sai Tarun et al. 2019). Other algorithms such as SOM, SVM, BPNN, CFBPN and ANN are discussed and reviewed for their benefits in agricultural scenario. The author reviewed various kinds of works regarding agriculture (Parmar et al. 2017).


Comments

Leave a Reply

Your email address will not be published. Required fields are marked *