- Open Access
- Authors : Anila M , Dr. G. Pradeepini
- Paper ID : IJERTV9IS060241
- Volume & Issue : Volume 09, Issue 06 (June 2020)
- Published (First Online): 15-06-2020
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License: This work is licensed under a Creative Commons Attribution 4.0 International License
A Review on Parkinson’s Disease Diagnosis using Machine Learning Techniques
Department of CSE,
Koneru Lakshmaiah Education Foundation Vaddeswaram,India
Dr. G. Pradeepini
Department of CSE
Koneru Lakshmaiah Education Foundation Vaddeswaram,India
Abstract Parkinson disease is a neurodegenerative disorder that affects nervous system and the root cause of it is falling rates of dopamine levels in the forebrain. It is a chronic degenerative disease with progressive illness, which means it develops new symptoms over time. This happens with progressive neuronal loss in the substantia nigra of brain. People with PD cannot do their works as a normal human. Though clinical assessments considered ample amount of data that include various features, sometimes it is hard to decide whether a person is suffering from PD or not based on the type of data, feature selection methods help to solve this issue. Various methods are developed, proposed, and analyzed to detect the Parkinson disease, given the required data. This paper is a survey of predicting Parkinson disease using machine learning algorithms, various new technologies applied, and their accuracies achieved.
Keywords PD (Parkinson Disease), dopamine, SVM (Support Vector Machine), KNN (K Nearest Neighbor), ANN (Artificial Neural Network).
Parkinson disease mainly effects central nervous system and is observed to be affected on many people globally. Most of the people suffering with PD are observed to be physically and emotionally draining. They even feel depressed, trouble concentrating on things, painful spasms etc. PD has a large spectrum of clinical features ranging from motor to nonmotor symptoms. Some of the motor symptoms are hypophonic speech, rigidity, resting tremor. Non-motor symptoms are as hallucinations, depression, constipation, sleeping disorders, cognitive impairment, and impulse control disorders. Non motor symptoms show sickness than motor symptoms [1,3]. Most of the cases, physicians find it difficult to envisage whether a given patient is already affected by the disease or is expected to develop the Parkinson's disease. To conquer this, development of some computing model must be done that evaluates and summarizes the data of a given patient and predicts with adequate accuracy where he/she will have development of PD. Most of the PD patients are observed with symptoms called voice impairment which is known as dysphonia. There are several measures related to dysphonia, out of which voice related problem can be used to assess the patients at various stages.
This paper is a survey of prediction of PD using Machine learning and Deep learning techniques that generated good models and potency of those algorithms in terms of accuracies achieved, also about different methodologies applied.
Importance of Voice data:
Speech or voice data is assumed to be 90% helpful to diagnose a person for identifying presence of disease. In general, Person with PD suffer from speech problems, which can be categorized into two: hypophonia and dysarthria. Hypophonia indicates very soft and weak voice from a person and dysarthria indicate slow speech or voice , that can hardly be understood at one time and this causes because of damage to central nervous system. So, most of the clinicians who treat PD patients observe dysarthria and try to rehabilitate with specific treatments to improvise vocal intensity.
Survey carried out for the diagnosis of PD with different algorithms and approaches:
Several strategies are recorded for early detection of PD based on the different ML techniques. But accuracy in detection and classifying within the time is very important or else, it causes development of more symptoms. There are different kinds of data, brain MRI images, Voice data, posture images, senor captured data, handwritten data, using which we can predict whether person is having PD or not. Out of all those , speech or voice data helps in identifying PD accurately.
Eduardo Tolosa et al proposed a twofold fully automatic approach with 3D images has shown promising results in their experimentation .
Max A. Little et al presented a new dysphonia measure, pitch period entropy (PPE) and used a kernel support vector machine and has achieved classification accuracy of 91%.
RAINER SCHOÂ¨ NWEILER et al identified a different approach which used voice analysis with ANN and got good results but observed that cost-effectiveness remains to be a challenge.
Marius Ene et al suggested NN based approach with three types of internal methods and discriminated persons having PD with healthy persons.
DAVID GIL A, MAGNUS JOHNSON B found that with a smaller number of neurons at hidden layer both training set and test sets performed poorly. With higher number of neurons, the training set performed well with high risk of
over fitting. The ideal solution for this layer was found to be 13 neurons.
Ipsita Bhattacharya et al identified the ROC curve variation and identified that values of TP and FP rates show changes while increase in the CV folds.
Freddie Ã…strÃ¶m et al proposed unique approach of parallel neural networks and then outcome of each neural network is assessed by using a rule-based system for the decision. During the training process, data that is not yet learned of each neural network is collected and applied in the training set of the later neural network. This helped to increase prediction accuracy.
Athanasios Tsanas et al developed a novel algorithm based on speech signals but its questionable as most of the features are not considered here, only 10 features are used.
Hui-Ling Chen et al proposed FKNN centered system using a 10-fold cross validation method.
Mohammad S Islam et al has compared various ML techniques based on their performance accuracies in determining whether person is having PD or not and mentioned that new classifier may be built to get better accuracies.
Bo Penga et al suggested Computer Aided Analysis with image data and used BrainLab software for processing the images and calculate thickness of the cortex, volume of gray matter, and surface area of the cortex on each region of interest (ROI). Use of Multilevel ROI-based features improved the classification performance.
Derya Avci and Akif Dogantekin proposed another approach using Genetic Algorithm-Wavelet Kernel-Extreme Learning and achieved good accuracy results.
R Prashanth identified that multimodal features can be used to predict PD in earlier stage.
Satyabrata Aich proposed a unique approach by using Genetic algorithm and PCA as feature selection methods and applied seven ML algorithms for classification, that saved time and productivity while doing pattern classification with two categories such as PD and not PD.
Leandro A. Passos compared ResNet-50 , Optimum-Path Forest (OPF) classier with Support Vector Machines (SVM) and Bayes and achieved 96% of identication rate.
Deepak Gupta followed a different approach cuttlefish algorithm and used for feature selection,
different fitness functions approximations are used to improve cuttlefish algorithm and is termed as Optimized cuttlefish algorithm (OCFA).Decision tree and K-Nearest Neighbor classifiers are applied and achieved 94% of accuracy in detecting PD effected patients .
Salama A. Mostafa proposed
Multiple Feature Evaluation Approach (MFEA) of a multi-agent system (ii) Implementation of five clasification schemas which are Decision Tree, Random Forests, Neural Network, NaÃ¯ve Bayes and Support Vector Machine on the Parkinsons diagnosis before and after applying their approach, and (iii)Author approach witnessed the following average rate of accuracies : Decision Tree achieved accuracy of 10.51%, NaÃ¯ve Bayes shown 15.22%, Neural Network is found with 9.19%, Random Forests and SVM performed with 12.75% and 9.13% respectively.
Ali H. Al-Fatlawi et al
Deep belief network, Restricted Boltzmann Machines ,Back propagation algorithm
Marius Ene et al
Probabilistic neural network (PNN)
Accuracies ranging between 79% and 81%
David Gil A, Magnus Johnson B
ANNs and SVMs
Chien-Wen Cho et al
Principal component analysis with linear discriminant analysis.
Max A. Little et al
classication performance of 91.4%
Resul Das et al
Neural Networks, DMneural, Decision Tree and Regression
Score of 92.9% is achieved
C. Okan Sakar & Olcay Kursun
Zachary C.Lipton et al
Long Short-Term Memory (LSTM-RNN) with forget gate, MLP
Several accuracies are compared.
Ipsita Bhattacharya et al
Used LibSVM for classifying along with random split of the dataset, and determine accuracy for the different kernel functions
Improved average accuracy achieved.
Freddie Ã…strÃ¶m et al
Used a different neural network to minimize the probability of outcome with error
Total nine parallel neural networks are arranged and achieved development of 8.4%
for the prediction of PD compared to single network
Athanasios Tsanas et al
Speech signal processing algorithms, RF,SVM
Indrajit Mandal et al
Multinomial logistic regression, rotation forest together with SVM and PCA, ANN, boosting methods
100% accuracy achieved with sparse multinomial logistic regression and linear logistic regression, observed sensitivity:0.983 and specificity: 0.996
Hui-Ling Chen et al
96.07% obtained by the FKNN dependent system using a 10- fold CV method
Tarigoppula V.S Sriram et al
Random Forest shown better accuracy
Mohammad S Islam et al,2014
SVM, Random Tree and Feedforward Back- propagation built Artificial Neural Network.
90% recognition accuracy
Oana Geman et al
90% accuracy achieved
Bo Penga et al
t-test, SVM, and Minimum Redundancy and Maximum Relevance.
Speech impairment data
Proposed method used multilevel ROI-based features and is observed better classification accuracy..
Othman Ibrahim , Mehrbakhsh Nilashi, & Ali Ahani
PCA is used for feature selection, EM, ANFIS and Support Vector Regression (SVR).
Hui-Ling Chen et al
Extreme learning machine and kernel ELM
10- fold cross validation through 10 runs achieved 96.47% accuracy
Derya Avci and Akif Dogantekin et al
Genetic Algorithm, wavelet kernel and Extreme Learning Machines(ELM).
Thomas J. Hirschauer
EPNN (Enhanced Probabilistic Neural Network
LÃgia Sousa et al
DNN, KNN,PCA (for optimizing feature set)
93.4% for the binary classication,84.7% for multiclass classication.
Leandro A. Passos
ResNet-50 , Optimum-Path Forest (OPF) classier
HandPD dataset, speech
96% of identication rate using speech samples.
Optimized cuttlefish algorithm ,Decision tree, KNN
Speech data and Handwritten data are used to
evaluate the proposed model.
Along with advanced machine learning methods, Neuroimaging modalities also used
speech, ,MRI, EEG
(Various implementations are discussed)
Hariharan et al
Gaussian mixture with PCA and LDA. SVM classifier
Zhang et al
Stacked autoencoders, KNN
In the range of 94-98%
Oung et al
Classifiers used are KNN, PNN, ELM classifiers.
Motion and Speech
KNN:93.26% PNN: 95.22%
Hlavnicka et al
Zero-crossing rate, variance of autocorrelation function.
Salama A. Mostafa
Decision Tree, Random Forests, NaÃ¯ve Bayes, Support Vector Machine and Neural Network.
Avg rate of improved accuracies achieved are: Decision Tree: 10.51%,
Random Forests: 12.75%
NaÃ¯ve Bayes:15.22%, Support Vector Machine: 9.13%, Neural Network: 9.19%
Rainer schonweiler et al
Artificial neural networks, Regression tree
Varius combinations of methods applied and achieves improved accuracies.
Table 1: Summary of the survey of various methodologies and their performances
It is important to note that , out of all ML techniques, ANN and SVM classifier are used most of the proposed algorithms to aid faster and accurate the prediction.
As per the survey, we observed that most of the models used voice/speech data for efficient diagnosis of the disease and because it is preferred by most of the therapists to consider voice data as relevant feature.
ARCHITECTURE OF ANN
The following figure represents architecture of Artificial Neural network with an input layer, hidden layer(s) and an output layer. Number of hidden layers for each network varies from one another.
Machine Learning techniques has got prominent role as they are applied in variety of domains especially in the healthcare. Unlike traditional methods, the models generated by applying ML techniques show dynamic outputs as data is fed into it. One shall make note that significant and narrow research is needed to obtain knowledge in diagnosing the disease. Various machine learning algorithms and techniques are being proposed rapidly, out of which some are observed to be promising with the results and few demonstrated their usage in different fields. Advantage with the ML generated models is that when more data is used, the precision values gets increased and the much accuracy in predictions can be gained.
Every circle in the above network represents a neuron at which the inputs and corresponding weights are processed layer by layer.
An input layer accepts large volumes of data as input to build the neural network. The data can be in the form of text, image, audio, etc. In general, the input layer contains features of the dataset, each node of input layer in the above architecture represents one feature
Every hidden layer receives the input feature along with their weights, where weights of every feature indicates their contribution towards the decision or prediction. Hidden layer processes the data at each node by performing complex computations and helps in feature extraction. Nodes at first hidden layer receives product of input feature with its weights value and is passed as input
to next hidden layer and so on. Choosing number of hidden layers and number of nodes for every hidden layer varies with the problem as well as dataset.
At output layer Processing of nodes are determined by the functions called as activation functions like tanh, sigmoid, ReLU. Depending on the kind of dataset and criteria , one can decide suitable activation function. Output layer receives the output generated by last hidden layer as input and generates an output in the desired form.
This paper is an effort to present broad review about Parkinson disease diagnosis system that have applied various machine learning techniques. The summary of results obtained by different researchers is made available in literature survey table , almost all the authors/researchers made great efforts to predict the Parkinson disease with novel approaches. It can be identified that maximum of all ML techniques used by various authors worked better but developing a very faster classifier using novel architecture of neural network combined with specific approach may work better. To achieve this, we try to implement artificial neural network with different number of hidden layers and number of nodes in future and compare all the accuracies.
Claas Ahlrichs ae al, Parkinson's Disease Motor Symptoms In Machine Learning: A Review, Health Informatics- An International Journal (HIIJ) Vol.2,No.4,November 2013.
ERIKA ROVINI et al, Comparative Motor Pre-clinical Assessment in Parkinsons Disease Using Supervised Machine Learning Approaches, Annals of Biomedical Engineering (2018) https://doi.org/10.1007/s10439-018-2104-9.
Pratibha Surathi et al Research in Parkinson's disease in India: A review, Ann Indian Acad Neurol. 2016 Jan-Mar; 19(1): 9 20.doi: 10.4103/0972-2327.167713
Eduardo Tolosa et al, Decision Support System for the Diagnosis of Parkinsons Disease, SCIA 2005, LNCS 3540, pp. 740749, 2005.
Rainer SchoÂ¨ Nweiler et al, Novel Approach to Acoustical Voice Analysis Using Artificial Neural Networks ,JARO 01: 270282 (2000) DOI: 10.1007/s101620010020.
Gert Cauwenberghs et al, Incremental and Decremental Support Vector Machine Learning, Proceedings of NIPS 2000,Pages 384-394.
Marius Ene et al, Neural network-based approach to discriminate healthy people from those with Parkinsons disease, Annals of the University of Craiova, Math. Comp. Sci. Ser. Volume 35, 2008, Pages 112116 ISSN: 1223-6934
DAVID GIL A, MAGNUS JOHNSON B, Diagnosing Parkinson by using Artificial Neural Networks and Support Vector Machines, Global Journal of Computer Science and Technology, 2009.
Chien-Wen Cho et al, A vision-based analysis system for gait recognition in patients with Parkinsons disease, Expert Systems with Applications 36 (2009) 70337039
Max A. Little et al, Suitability of Dysphonia Measurements for Telemonitoring of Parkinsons Disease, IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 56, NO. 4, APRIL 2009.
Resul Das et al, A comparison of multiple classification methods for diagnosis of Parkinson disease, IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 13, NO. 2, MARCH 2002.
C. Okan Sakar & Olcay Kursun, Telediagnosis of Parkinsons Disease Using Measurements of Dysphonia, J Med Syst (2010) 34:591599 DOI 10.1007/s10916-009-9272-y.
Ipsita Bhattacharya et al, SVM Classification to Distinguish Parkinson Disease Patients, Conference: Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India, September 16-17, 2010, Tamilnadu, India.
Freddie Ã…strÃ¶m et al, A parallel neural network approach to prediction of Parkinsons Disease, 2011 Elsevier, doi:10.1016/j.eswa.2011.04.028
Athanasios Tsanas et al, Novel Speech Signal Processing Algorithms for High-Accuracy Classification of Parkinsons Disease, IEEE Transactions On Biomedical Engineering, Vol. 59, No. 5, May 2012.
Indrajit Mandal et al, New machine-learning algorithms for prediction of Parkinson's disease, International Journal of Systems Science, DOI:10.1080/00207721.2012.724114.
Hui-Ling Chen et al, An efficient diagnosis system for detection of Parkinsons disease using fuzzy k-nearest neighbor approach, Expert Systems with Applications 40 (2013) 263271.
Mohammad S Islam et al, Performance Comparison of Heterogeneous Classifiers for Detection of Parkinsons Disease Using Voice Disorder (Dysphonia), 3rd International Conference On Informatics, Electronics & Vision 2014.
Bo Penga et al, Computer Aided Analysis of Cognitive Disorder in Patients with Parkinsonism using Machine Learning Method with Multilevel ROI-based Features, J Med Syst (2015) 39:179 DOI 10.1007/s10916-015-0353-9.
Mehrbakhsh Nilashi, Othman Ibrahim & Ali Ahani, Accuracy Improvement for Predicting Parkinsons Disease Progression, Scientific Reports,2016 | 6:34181 | DOI: 10.1038/srep34181.
Hui-Ling Chen et al, An efficient hybrid kernel extreme learning machine approach for early diagnosis of Parkinsons disease, Neuro Computing (Elsevier) Volume 184, 5 April 2016, Pages 131-144.
Derya Avci and Akif Dogantekin et al, An Expert Diagnosis System for Parkinson Disease Based on Genetic Algorithm-Wavelet Kernel- Extreme Learning Machine, Hindawi Publishing Corporation Parkinsons Disease Volume 2016, Article ID 5264743, 9 pages http://dx.doi.org/10.1155/2016/526474.
R Prashanth, High-Accuracy Detection of Erly Parkinson's Disease through Multimodal Features and Machine Learning, ,Volume 90, June 2016, Pages 13-21
Freddie Astrom, A parallel neural network approach to prediction of Parkinsons Disease, Expert Systems with Applications, Volume 38,
Issue 10, 15 September 2011, Pages 12470-12474
Satyabrata Aich, A Supervised Machine Learning Approach using Different Feature Selection Techniques on Voice Datasets for Prediction of Parkinsons Disease, ICACT Transactions on Advanced Communications Technology (TACT) Vol. 7, Issue 3, May 2018.
O. Faust, Y. Hagiwara, J. H. Tan, S. L. Oh and U. R. Acharya, "Deep learning for healthcare applications based on physiological signals: a review," Computer Methods and Programs in Biomedicine, vol. 161, pp. 1-13, 2018.
M. Hariharan, K. Polat and R. Sindhu, "A new hybrid intelligent system for accurate detection of Parkinson's disease," Computer Methods and Programs in Medicine, vol. 113, pp. 904-913, 2014
Q. W. Oung, M. H, S. N. Basah, H. Lee and V. Vijean, "Empirical wavelet transform based features for classification of Parkinson's disease severity," Journal of Medical Systems, vol. 42, p. 29 , 2017
Y. Zeinalia and B. Story, "Competitive probabilistic neural network," Integrated Computer-Aided Engineering, vol. 24, no. 2, pp. 105-118, 2017
T. Hirschauer, H. Adeli and T. Buford, "Computer-aided diagnosis of Parkinson's disease using an enhanced probabilistic neural network," Journal of Medical Systems, vol. 39, no. 179, p. (12 pages), 2015.
M. Ahmadlou and H. Adeli, "Enhanced probabilistic neural network with local decision circles: a robust classifier," Integrated Computer- Aided Engineering, vol. 17, no. 3, pp. 197-210, 2010.
M. Abrahams, "Diagnostic markers in the early detection of Parkinson's disease," 2012.
F. L. Pagan, "Improving outcomes through early diagnosis of Parkinson's disease," American Journal of Managed Care, vol. 18, no. 7, pp. 176-182, 2012.
S. A. Mostafa, A. Mustapha, M. A. Mohammed et al., Examining multiple feature evaluation and classication methods for improving the diagnosis of Parkinsons disease, Cognitive Systems Research , 2018.
National Parkinson Foundation (a), "What is Parkinsons?" [Online]. Available: http://www.parkinson.org/understanding-parkinsons/what- is-parkinsons. [Accessed 08 03 2017].
Deepak Gupta, Optimized cuttlefish algorithm for diagnosis of Parkinsons disease, Cognitive Systems Research, Volume 52, December 2018, Pages 36- 48https://doi.org/10.1016/j.cogsys.2018.06.006
Leandro A. Passos, Parkinson Disease Identication using Residual Networks and Optimum-Path Forest, SACI 2018, IEEE 12th International Symposium on Applied Computational Intelligence and Informatics, May 17-19, TimiÃºoara, Romania