The Digital Agricultural Revolution. Группа авторов

Чтение книги онлайн.

Читать онлайн книгу The Digital Agricultural Revolution - Группа авторов страница 14

The Digital Agricultural Revolution - Группа авторов

Скачать книгу

The training data is a set of data that represent the data that the ML will consume to answer the problem it was created to tackle. In certain circumstances, the training data have been labeled—that is, it has been “tagged” with features and classification labels that the model will need to recognize. The model will have to extract such features and group them based on their similarity if the data is unlabeled. To improve the generalization capability of the model, the data set can be divided into three sets according to their standard deviation: training sets, validation sets, and testing sets. The validation set is used to verify the network’s performance during the training phase, which in turn is useful to determine the best network setup and related parameters. Furthermore, a validation error is useful to avoid overfitting by determining the ideal point to stop the learning process.

       1.3.1.4 Model Development

      The ultimate goal of this stage is to create, train, and test the ML model. The learning process is continued until it provides an appropriate degree of accuracy on the training data. A set of statistical processing processes is referred to as an algorithm. The type of algorithm used is determined by the kind (labeled or unlabeled) and quantity of data in the training data set, as well as the problem to be solved. Different ML algorithms are used concerning labeled data. The ML algorithm adjusts weights and biases to give accurate results.

       i. Support Vector Machine

      Support vector machine finds out an optimum decision boundary to divide the linear data into different classes. It is also useful to classify nonlinear data by employing the concept of kernels to transform the input data into higher dimension data. The nonlinear data will be categorized into different classes in the new higher-dimensional space by finding out an optimum decision surface.

       ii. Regression Algorithm

       iii. Decision Tree

       iv. K-means Clustering

Schematic illustration of cotton leaf disease using DT algorithm.

       v. Association Algorithm

      Association algorithms look for patterns and links in data, as well as frequently occurring “if-then” correlations known as association rules. These restrictions are comparable to data mining rules.

       1.3.1.5 Improving the Model With New Data

      The final stage is to apply the model to new data and, in the best-case scenario, see how accurate and effective it becomes over time. The source of the new data will be determined by the problem to be solved.

      1.3.2 Artificial Neural Network

      ANNs resembles the human brain based on the principle that:

       Information is processed by basic units known as neurons.

       Signals are transmitted from one neuron to the next via connecting links.

       Each connecting link has a weight associated with it, which amplifies the signal transmitted in a conventional neural network.

       To determine its output signal, each neuron’s net input passes through the activation function.

       1.3.2.1 ANN in Agriculture

      The major advantage of neural networks is their ability to predict and anticipate via parallel thinking. Artificial Neural Network can be taught instead of being extensively programmed. Artificial Neural Network was employed by Gliever and Slaughter [30] to distinguish weeds from crops. Maier and Dandy [31] used ANNs to forecast water resources factors. Song and He [32] combined expert systems and

Скачать книгу