Machine Learning for Healthcare Applications. Группа авторов
Чтение книги онлайн.
Читать онлайн книгу Machine Learning for Healthcare Applications - Группа авторов страница 26
The study [20] is an experiment on ECG signals of 26 participants where approximate entropy method is implemented for examining the concentration. Approximation entropy window was taken less for intra-patient comparing to inter-patient and for filtering the noisy signals S-Golay method was implemented.
They have innovatively preprocessed the ECG signal using S–Golay filter technique [21]. With both quadratic degrees of smoothing and differentiation filter methods combinedly has processed ECG signals having sampling rate 500 Hz with seventeen points length.
A very unique “double-class motor imaginary Brain Computer Interface” was implemented with Recurrent Quantum Neural Network model for filtering EEG signals [22].
In the paper [23] using the S-Golay filter, the artifacts due to blinking of eyes are found out and it is eliminated adapting a noise removal method.
3.3 Methodology
3.3.1 Bagging Decision Tree Classifier
Among the many Machine Learning algorithms, this method forms a group of algorithms where several instances are created of black-box estimators on variable subsets from the base training set after which we aggregate their solo predictions to form a resultant prediction. This process is used as a path to minimize the variance of the foundation estimator i.e. a decision tree by including randomization within its creation process and building an ensemble from it. In multiple scenarios, this method consists a simple path to improve with regard to a single model, avoids making it a necessity to acclimatize to a foundation algorithm. It works best with fully developed decision trees as it reduces overfitting in comparison to boosting methods which generally work best in shallow decision trees. This classifier comes in many flavors but majorly differ from each other by the path that they draw variable subsets of the training set. In our case samples were extracted with replacement called as Bagging.
3.3.2 Gaussian Naïve Bayes Classifier
This classifier is based on probability which is combined within a Machine Learning model. Hence, it is based on “Bayes Theorem” which states that, we can derive the probability of an event1 given that a retrospective event2 has happened. Here, event2 is the witness and event1 is the hypothesis. The assumption here is that the features are non-dependent which means that the existence of one feature does not affect the other which is why it’s called Naïve. When predictions allocate a continuous value without being discrete, we can ascertain that those values are derived from gaussian distribution. Following is the general formula for Bayes theorem (3.1).
Since our case has a different set or input, our formula for this implementation changes to Equation (3.2).
3.3.3 Kernel Support Vector Machine (Sigmoid)
The separable data with non-linear attributes cannot be tackled by a simple Support Vector Machine algorithm due to which we use a modified version of it called Kernel Support Vector Machine. Essentially in K-SVM it presents the data from a non-linear lower dimension to a linear higher dimension form as such that the attributes belonging to variable classes are assigned to different dimensions. We use a simple Python-SciKit Learn Library to implement and use K-SVM.
For training purposes, we use the SVC class of the library. The difference is in the values for the Kernel parameters of SVC class. In simple SVM’s we use “Linear” for Kernel parameters but in K-SVM we use Gaussian, Sigmoid, Polynomial, etc. wherein we have used Sigmoid.
The only limitation observed in our case is that though this method achieves the highest accuracy but not up to the mark. Hence more advanced models like Deep Learning may be applied in near future for more concrete results.
3.3.4 Random Decision Forest Classifier
It is a variant of supervised machine learning algorithm founded on the schematic of ensembled learning. Ensemble learning is an algorithm where you join multiple or single algorithm into multiple types of algorithms of multiple or same variant to create a complex and advanced prediction model. It also combines many algorithms of same variant as decision trees, forest trees, etc. so the name “Random Forest”. It is used for regression and classification tasks.
The way it works is it picks a part of the dataset and builds a decision tree on these records, and after selection of number of trees you want this process is repeated. Each tree represents the prediction in that category for which the new record belongs. The only limitation here is that there forte lies in their complexity and for that we need substantial computing resources when huge number of decision trees can be brought together which in turn will better train themselves.
3.4 System Setup & Design
We have used an Emotiv EPOC+ biosensor device for capturing Neuro-Signals in the following manner. Figure 3.1 represents the channels on the brain from signals collected and the equipment used for collection. The signals are collected from 14 electrodes positioned at “AF3, AF4, F3, F4, F7, F8, FC5, FC6, O1, O2, P7, P8 T7 and T8” according to International 10–20 system viewed in the figure below. There are reference electrodes positioned above ears at CMS and DRL. By default, the device has a sampling frequency of 2,048 Hz which we have down-sampled to 128 Hz per channel. The data acquired is transmitted using Bluetooth connectivity to a system. Before every sample collection sensor felt pads are rubbed with saline, connected via the Bluetooth USB and charged after with a USB cable as shown in the figure below.
The device is placed on the participants and then showed a particular set of common usage items for the purpose of our experiment, during which all the EEG activity is recorded and later on they are asked to label their choice of purchase amongst each set of products i.e. 1 among 3 items of from each set of products. The process diagram can be seen below in Figure 3.2.
After the data collection, the signals are preprocessed, and some features are extracted using wavelet transformation method and then the classification models were run on the resultant as mentioned before. A part of the data was is preprocessed and decomposed to test the training model. The labeling was done majorly into Like/Dislike.
Figure 3.1 Brain map structure and Equipment used.