A Performance Comparison of Various Machine Learning Methods on Diagnosing MCI and AD using Multimodal Neuroimaging

DOI : 10.17577/IJERTCONV4IS14026

Download Full-Text PDF Cite this Publication

Text Only Version

A Performance Comparison of Various Machine Learning Methods on Diagnosing MCI and AD using Multimodal Neuroimaging

R. Ramya1, Mr. S. P. Sivagnana Subramanian2, and Adni3

2Assistant Professor,

Sri Venkateswara College of Engineering, Sriperumbudur, (TN)

Abstract: Neuroimaging has played an important role in non- invasive diagnosis and differentiation of neurodegen-erative disorders, such as Alzheimers disease and Mild Cognitive Impairment. To identify different stages of Alzheimers disease and efficient analysis system has been developed for magnetic resonance Imaging (MRI) and positron emission tomography (PET) Neuroimages. Various features have been extracted from the neuroimaging data to characterize the disorders using machine learning methods. We proposed a Support vector machine (SVM) and Naive Bayes (NB) classifiers are used to characterize Normal, MCI and AD patients. Finally, the performances of the different classifiers are compared and results of the classifiers clearly indicate that gives good performance for Normal, MCI, and AD detection.

Index Terms Image Registration, Feature Extraction, Support vector machine, Naïve Bayes, performance evaluation.

  1. INTRODUCTION

    Neuroimaging data are a rich source of information on brain anatomy and physiology. Neuroimaging has been a fundamental component of the neurological disorder diagnosis, and also plays an important role in therapy assessment and disease progression monitoring.

    Alzheimer's disease (AD) is a neurodegenerative disorder. Early diagnosis of this disease is very important to improve patients life quality and slow down the disease progression.AD is considered to be one of the most common causes of dementia among elderly persons. Around 6080% of age-related dementia is caused due to AD. Alzheimers disease (AD) is a progressive brain disorder that slowly leads to memory loss, confusion, disorientation and inability to communicate.AD is a degenerative brain disorder that is characterized by progressive dementia that is characterized by the degeneration of specific nerve cells.

    Mild Cognitive Impairment (MCI) is a represents has early-Stage of AD.MCI is an Intermediate state between normal and AD. Memory problems are typically one of the first symptoms of cognitive impairment related to AD. Peoples are affected with more memory problems than normal for their age, have a condition called mild Cognitive impairment (MCI).Symptoms of MCI do not interfere with their everyday lives.

    Another way of detecting AD and MCI is by using Neuroimaging or brain imaging techniques.A Normal, MCI and AD patient classifies using Neuroimaging and machine learning algorithms are a promising area of research. Neuroimaging datasets are collected from Alzheimers disease Neuroimaging Initiative (ADNI) for 300 patients. In Multimodal Neuroimages such as MRI and PET are subjected to pre-processing initially where the image Registration technique is used to geometrically align two Neuroimages. Classification of Normal, MCI, and AD patients are difficult because of high dimensional features.

    The Gray level co-occurrence matrix(GLCM) features are the widely used for dimensionality reduction in multimodal Neuroimages classification. GLCM feature extraction technique is applied to pre-processed images and features of images are the input of the classification system. Finally, images are classified into the responsive classes by the suitable techniques such as Support vector machine (SVM) and Naive Bayes (NB) classifier and performance are evaluated.

    The process of proposed system includes image acquisition, pre-processing, feature extraction, classification and performance evaluation.

  2. EXISTING ALGORITHM

    Carlos Cabral, Pedro M.Morgado, Durval Campos Costa and Margarida Silveira constructed an Elastic net (EN) and Two-One-Sided-Test (TOST) model by supervised learning to classify DICOM Neuroimages. The idea behind EN and TOST model it classify two groups of class, highly correlated data and it is capable of using a minimum number of prediction classes but it cannot take a large number of data sets which is the major drawback of this method.

  3. MATERIALS AND METHODS

A.SYSTEM ARCHITECTURE OVERVIEW

The design of this system architecture is shown in Fig. 1. We first obtained the Neuroimaging data of 300 patients with Alzheimers disease Neuroimaging Initiative (ADNI).

In pre-processing the image Registration technique is used for Multimodal Neuroimages that is PET and MRI images. The Pre-processed image will be applied to feature extraction techniques to reduce dimensionality problems for the large set of data. We used the extracted features were applied as an input to different classifiers to characterize Normal, MCI, and AD patients, and compare its performance.

IMAGE ACQUISITION

PRE-PROCESSING

FEATURE EXTRACTION

CLASSIFIERS

NORMAL

MCI

AD

PERFORMANCE

EVALUATION

Fig.1. System architecture

  1. IMAGE ACQUISITION

    Alzheimers disease Neuroimaging Initiative (ADNI) was designed to detect Alzheimers disease at earlier stages. ADNI studies include Neuroimaging or brain imaging techniques, such as positron emission tomography (PET) and Magnetic Resonance Imaging (MRI). PET is a non- invasive medical imaging modality that provides 3D maps modeling the glucose consumption rate of the brain and MRI scanners use magnetic fields and radio waves to form images of the Body.The Neuroimaging data of 300 patients are obtained from ADNI,100 normal subjects, 100 subjects with MCI and 100 subjects with AD and were consequently selected in this system.

  2. STEPS OF SYSTEM ARCHITECTURE

    1. Obtain a DICOM Neuroimages such as MRI and PET

      1. In Pre-Processing stage is applied for multimodal Neuroimages for geometrically to align two DICOM images using Image Registration.

      2. Pre-processed images are applied as an input to extract GLCM features.

      3. Finally extracted features are applied as an input to efficient Machine learning methods are applied and performance is evaluated.

  3. PRE-PROCESSING

Image registration was performed for Multimodal brain images , that is MRI and PET images. In clinical diagnosis using medical images, integration of useful data obtained from separate images is often desired. Image registration is the process of overlaying two or more images of the same scene taken at different times, from different sensors .The images need to be geometrically aligns two images properly, typically on the image is consider as a target image and other is consider as a source image. It is a process of that source image is transformed to match the target image.

E.FEATURE EXTRACTION

Feature extraction technique mainly used for dimensionality reduction for a large dataset and it describes the dataset with sufficient accuracy. It starts from an initial set of measured data and builds derived values intended to be informative and non-redundant, facilitating the subsequent learning and generalization steps, and in some cases leading to better human interpretations. When the input dataset is too large to be processed and it will be transformed into a reduced representation features set. The extracted features are expected to contain more relevant information from the input datasets.

EXTRACTION OF GLCM FEATURES

Gray Level Co-Occurrence Matrix (GLCM) is also called as Gray level Dependency Matrix and it is a popular method of extracting features fom pre-processed Neuroimages. It is defined as a two dimensional histogram of gray levels for a pair of pixels, which are separated by a fixed spatial relationship. The dimension of a GLCM is determined by the maximum gray value of the pixel. Number of gray levels is an important factor in GLCM computation. The extracted features are contrast, correlation, Energy, homogeneity, mean and standard deviation are computed using MATLAB.

CONTRAST

Contrast measures an amount of local changes in an image. It measures of intensity contrast between a pixel and its neighborhood in an image.

from ADNI site.

Contrast = 1 2 .

(i, j) (1)

2. About 300 DICOM Neuroimages are taken and it equally divided the images into three categories as

CORRELATION

=0

=

Normal, MCI, and AD patients.

Correlation measures how the connection between pixels is to its neighborhood. It is the measure of gray tone linear dependencies in the image.

( )

MCI and AD patients. If the number of features is applied and it is much greater than the number of data points, then

Correlation =

ENERGY

=1 =1 . (,)

(2)

the SVM method gives very poor performances. SVM gives a less overfitting in a data points and it is a computationally expensive.

The energy is also known as Uniformity or Angular second moment (ASM). It provides the sum of squared elements in GLCM. Energy is 1 for a constant image.

=1 =1

Energy = 2 (i, j) (3)

HOMOGENEITY

Homogeneity measures the correspondence of pixels.

B.NAIVE BAYES CLASSIFIER

Naive Bayes classifier is fully based on applying Bayes theorem with an assumption of independence between each and every pair of features. Naïve Bayesian classifiers assume that there are no dependencies amongst attributes. This assumption is called class conditional independence. It is made to simplify the computations involved and, hence is called "naive". This classifier is also called idiot Bayes, simple Bayes, or independent Bayes. NB is very easy to build and it is mainly used for very large datasets.

Homogeneity =

(,)

(4)

Extracting features of GLCM is applied to NB classifier to

Mean

=1

=1 1+||

characterize of normal, MCI and AD patients. Naive Bayes classifiers provide a greater amount of information about

The mean takes the average level of intensity of the image.

the Alzheimers disease stages because of the independence assumption.

=0

Mean = 1 . ()

STANDARD DEVIATION

(5)

V.EXPERIMENTAL AND RESULTS

The complete ADNI data set consists of 300 samples.

Standard deviation is the measurement of the dispersivity of the gray scale from the Mean.

The three machine learning methods are trained and tested such as Normal, MCI, and AD based on the different datasets which are collected from ADNI. Here nearly 300

Standard deviation = 1 (

). ( )

(6)

DICOM Neuroimages are taken and 6 GLCM features are

=0

extracted for 50% total samples for training and the

IV.CLASSIFICATION BASED ON EFFICIENT MACHINE LEARNING METHODS

AD and MCI detection using Neuroimaging and machine learning algorithms .Machine learning algorithms are widely used in Neuroimaging to classify AD, MCI and normal persons. AD and MCI classification is difficult because of high dimensional features and hence feature extraction techniques have been increasingly used for dimensionality reduction in Neuroimaging classification. The extracted features are given as an input to the machine learning methods such as Support vector machine (SVM) and Naive Bayes (NB) classifier to characterize Normal, MCI and AD patients.

remaining for testing and for having a comparison between different classifiers. By using six extracted features for finding efficient classification which improves the class prediction accuracy from different classifiers. So that, the computation time will be reduced and improve classification stages such as Normal, MCI, and AD. The Machine Learning approach provides efficient classification and gives good accuracy on the GLCM features set for DICOM Neuroimages analysis.

ACCURACY

Accuracy is able to measure the true amount of samples are how good and efficient. It is the proportion of correctly diagnosed disease stage from the total number of disease stage.

    1. UPPORT VECTOR MACHINE CLASSIFIER

      Support vector machine is a machine learning method that analyzes data points used for classification and it is formally separating by a hyperplane. SVM conceptually

      Accuracy = (+)

      (+++)

      SENSITIVITY

      (7)

      implements the idea that vectors are nonlinearly mapped to a very high dimension feature space. In this feature space, a linear separation surface is created to separate the training data by minimizing the margin between the vectors of the

      Sensitivity measures the proportion of positives disease that is correctly identified. Sensitivity measures the ability to identify disease stage.

      two classes.The training ends with the definition of a decision surface that divides the space into two subspaces. Each subspace corresponds to one class of the training data. Once the training is completed, the test data are mapped to

      Sensitivity =

      (+)

      SPECIFICITY

      (8)

      the feature space.. After extracting GLCM features, it applied as an input to SVM classifiers and it classifies different stages of Alzheimers disease such as normal,

      Specificity measures the proportion of without disease

      can be correctly identified. Specificity measures the ability to identify normal stage.

      Specificity =

      (+)

      (9)

      most important stage. But it is not a simple way for finest identification of early detection of diseases such as MCI and AD. Generally, the Multimodality DICOM

      The effectiveness of the Machine Learning methods has been calculated using the following measures:

      True Positive (TP) – people which have disease are correctly identified

      False Positive (FP) – people have without disease are incorrectly identified

      True Negative (TN) – people have without disease are correctly identified

      False Negative (FN) – people which have disease are incorrectly identified

      Classifiers

      Accuracy (%)

      Sensitivity(%

      )

      Specificity(%

      )

      NB

      82

      81

      80

      SVM

      69

      64

      68

      Table.1. Parametric Results for Classification

      Table.1. shows the comparison of three different classifiers. The Accuracy of SVM 55% and an accuracy of NB 80% . NB gives better performance because it identifies three stages correctly than SVM method. To improve more accuracy by applying large dataset and increase more features set.

      Fig.2.Performance comparison of different classifiers

      Fig.2. shows the extracted six features are given to different classifiers. Support vector machine (SVM) and Naive Bayes (NB) classifiers are performed for identification mechanism of Normal, MCI, and AD.

      VI.CONCLUSION AND FUTURE WORK

      This system was developed to diagnose of disease analysis by using DICOM Neuroimages classification is a

      Neuroimages is a more valuable and most reliable method used for detection of disease in different stages. DICOM is useful for brain scans and most effective technique to detect the dissimilarity in Neuroimages. In this paper different classifier approach is proposed for DICOM image classification and performance is evaluated. The approach consists of Pre-Processing, feature extraction, classification and performance Evaluation. The classification consists of Spport Vector Machine (SVM) and Naive Baye(NB) technique and it is fully based on parameters extracted by GLCM a feature extraction technique. Classifiers mainly used to the characterization of the Normal, MCI and AD patients. In SVM it optimally separates two groups and less overfitting datasets problems are overcome by NB algorithm is its capability. The NB gives results better and quick as compared to SVM classifiers it is beneficial where large datasets have to compute rapidly.

      FUTURE WORK

      In future, we plan to use hybrid classifier and to increase the data sets thereby investigating the performance.

      REFERENCES

      1. Carlos Cabral , Pedro M.Morgado , Durval Campos Costa , Margarida Silveira,Predicting conversion from MCI to AD with FDG-PET brain images at different prodromal stages , Computers in Biology and Medicine,elseveir ,Vol. 58,pp. 101 109,2015.

      2. Pedro M.Morgado , Margarida Silveira ,Minimal neighborhood redundancy maximal relevance: Application to the diagnosis of Alzheimer's disease , Neurocomputing ,elseveir,Vol. 155,pp. 295308,2014.

      3. Sidong Liua, Weidong Caia, Lingfeng Wena, David Dagan Fenga, Sonia Pujolb,Ron Kikinisb, Michael J. Fulhama, Stefan Eberla,Multi-Channel neurodegenerative pattern analysis and its application in Alzheimers disease characterization , Computerized Medical Imaging and Graphics ,elseveir,Vol. 38,pp. 436444,2014.

      4. Ben Rabeh Amira, Benzarti faouzi, Amiri Hamid, Mouna Bouaziz

        ,Computer-assisted diagnosis of Alzheimer's disease, International Image Processing Applications And Systems Conference, 2013.

      5. Biswajit Pathak, Debajyoti Barooah, Texture Analysis Based On The Gray-Level Co-Occurrence Matrix Considering Possible Orientations, International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, Vol. 2, Issue 9, September 2013.

      6. Ahmad Ashari, Iman Paryudi and A Min Tjoa, Performance Comparison between Naïve Bayes, Decision Tree and k-Nearest Neighbor in Searching Alternative Design in an Energy Simulation Tool, International Journal of Advanced Computer Science and Applications, Vol. 4, No. 11, pp. 33-39,2013.

      7. Eduardo Bicacro, Margarida Silveira, Jorge S. Marques and Durval C. Costa, 3d Brain Image-Based Diagnosis Of Alzheimers Disease:Bringing Medical Vision Into Feature Selection, IEEE,pp. 134-138,2012.

      8. Qian Y, Gao X, Loomes M, Comley R, Barn B, Hui R, Tian z,Content-based retrieval of 3D medical images, International Conference on eHealth, Telemedicine, and Social Medicine (eTELEMED),Vol. 99,pp 7-12,2011.

      9. Pradipta Maji and Sankar K. Pal, FuzzyRough Sets for Information Measures andSelection of

        Relevant Genes From Microarray Data, IEEE, Vol. 40, No. 3, P.741-752, 2010.

      10. Serkawt Khola , Feature Weighting and Selection A Novel Genetic Evolutionary Approach, World Academy of Science, Engineering and Technology 73, P.1007-1012, 2011. Ping Yao, Fuzzy Rough

        Set and Information Entropy Based Feature Selection for Credit Scoring, IEEE , P.247-251, 2009.

      11. El-sayed a. El-dahshan, Abdel-badeeh and Tamer H. Younis, A Hybrid Technique for Automatic MRI brain Images Classification, Studia Univ. Babes_Bolyai, Informatica, Vol. LIV, No.1, P.55-66,2009

      12. Cai W, Kim J, Feng D,Content-based medical image retrieval, Biomedical information technology, Elsevier,Vol. 66, pp 83 113,2008.

      13. Muller H, Michoux N, Bandon D, Geissbuhler A, A review of content-based image retrieval systems in medical applications clinical benefits and future directions, IJRRC,pp 123,2004.

      14. T. Rohlfing, D. B. Russakoff, R. Brandt, R. Menzel, and C. R. Maurer, Performance-Based Multi-Classifier Decision Fusion for ATLAS-Based Segmentation of Biomedical Images, IEEE International Symposium, 404- 407 Vol. 1,2004.

      15. Free borough P.A., Fox N.C., MR image texture analysis applied to the diagnosis and tracking of Alzheimers disease , IEEE Transactions on Medical Imaging, Vol. 17, No.3, P. 475-479, 1998.

Leave a Reply