An Automated SVM Based Skin Lesion Segmentation and Classification System for Tumor Diagnosis

DOI : 10.17577/IJERTCONV5IS13027

Download Full-Text PDF Cite this Publication

Text Only Version

An Automated SVM Based Skin Lesion Segmentation and Classification System for Tumor Diagnosis

N. Rajesh Kumar* Assistant Professor, Dept. of ECE

SACS MAVMM Engineering College Madurai, Tamilnadu, India

M. Vignesh Babu Assistant Professor, Dept. of ECE

SACS MAVMM Engineering College Madurai, Tamilnadu, India

AbstractMedical image processing plays an important role in computer aided automatic cancer detection and diagnosis process. This paper proposes an efficient screening methodology for skin cancer segmentation using Support Vector Machine (SVM) classifier. The proposed method consists of the following stages as preprocessing, feature extraction, feature training and classification. Median filter is used as preprocessing stage to smooth the regions in skin image. Further, texture features are extracted from the preprocessed image and then these features are trained and classified using SVM classifier. The performance of the proposed skin cancer segmentation methodology is compared with ground truth images to analyze its performance in terms of sensitivity, specificity and accuracy.

Keywordsskin cancer; median filter; feature extraction; classifier; texture features

  1. INTRODUCTION

    Skin cancer is the second most common cancer type in women and it is the third most cancer type in men, as per Global Cancer Statistics Report (2015) [1].Skin cancer is more common in people with light coloured skin who have spent a lot of time in the sunlight. Skin cancer can occur anywhere on our body, but it is most common in places that have been exposed to more sunlight, such as our face, neck, hands, and arms.

    Skin cancer can look many different ways. The most common sign of skin cancer is a change on the skin, such as a growth or a sore that won't heal. Sometime there may be a small lump. This lump can be smooth, shiny and waxy looking, or it can be red or reddish brown. Skin cancer may also appear as a flat red spot that is rough or scaly. Not all changes in the skin are cancer, but we should see a doctor if we notice changes in our skin.

    Melanoma is a disease of the skin in which cancer (malignant) cells are found in the cells that colour the skin (melanocytes). Melanoma usually occurs in adults, but it may occasionally be found in children and adolescents. The skin protects our body against heat, light, infection, and injury. It is made up of two main layers: the epidermis (the top layer) and dermis (the inner layer). Melanocytes are found in the epidermis and they contain melanin, which gives the skin its colour. Melanoma is sometimes called cutaneous melanoma or malignant melanoma.

  2. PROPOSED METHODOLOGY

    The proposed skin cancer detection system uses SVM classifier to detect the cancer lesions. The various stages in this system include preprocessing, feature extraction, segmentation by k-means and classification of lesions. The flow of proposed method is depicted in Fig. 1.

    Fig. 1. Overall flow of proposed skin cancer detection system.

    A. Preprocessing

    In the preprocessing stage, the skin images are enhanced sing the median filters. The median filters preserve the edges of the image. Images commonly involve removing low- frequency background noise, normalizing the intensity of the individual particles images, removing reflections, and masking portions of images. Image preprocessing is the technique of enhancing data images prior to computational processing. The procedure done before processing by correcting image from different errors is preprocessing. This has to be done before image enhancement .Image processing in electrical engineering and computer science, image processing is any form of signal processing for which the input is an image, such as photographs or frames of video; the output of image processing can be either an image or a set of characteristics or parameters related to the image.

  3. FEATURE EXTRACTION

    Features are able to differentiate the normal brain image from abnormal brain image. In this paper, the features as GLCM, Gabor and DWT are extracted from the approximate sub band of the NSCT transform.

    1. GLCM features

      The gray level co-occurrence matrix (GLCM) is a method of extracting second order statistical texture features depending on the number of gray levels used. The GLCM matrix is constructed in any one of the orientations as 00,450,900 and 1800. In this paper, the GLCM matrix is constructed in the orientation of 450. GLCM features are the spatial features which are extracted from the co-occurrence matrix. The co-occurrence matrix is constructed in 45° from the enhanced skin image. The features as Contrast, Dissimilarity, Homogeneity, Entropy, Mean, Variance, Correlation and Angular Second Moment are extracted to increase the classification accuracy which is used to classify the given brain image into either normal or abnormal, as stated below.

      ,=1

      = ( )2 (1)

    2. DWT features

      A discrete wavelet transform (DWT) is based on wavelets. DWT analyzes the image on different resolution scales and splits the image into various frequency components, i.e. multi-resolution image. This permits to view the spatial and frequency attributes of the image simultaneously. Wavelets are small waves and are mathematical functions that represent scaled and shifted copies of a finite-length waveform called the mother wavelet.

      ,

      () = 1 () (8)

      where, is the scaling parameter and is the shifting parameter.

      Firstly, wavelet transform is applied to each row and secondly to each column of the resulting image of the first operation. The resulting image is decomposed into four sub- bands: LL, HL, LH, and HH sub-bands. (L=Low, H=High). The LL-sub-band contains an approximation of the original image while the other sub-bands contain the missing details. The LL-sub-band output from any stage can be decomposed further. When using a wavelet transform to describe an image, an average of the coefficients-in this case, pixels-is taken. Then the detail coefficients are calculated .Another average is taken, and more detail coefficients are calculated. This process continues until the image is completely described or the level of detail necessary to represent the image is achieved. As more detail coefficients are described, the image becomes clearer and less blocky. Once the wavelet transform is complete, a picture can be displayed at any resolution by recursively adding and subtracting the detail coefficients from a lower-resolution version. This technique is used by iterated systems.

    3. Moment Invariant Features

      Moment invariants have been frequently used as features for image processing, remote sensing, shape recognition and classification. Moments can provide characteristics of an

      =

      (2)

      object that uniquely represent its shape. Invariant shape

      =

      Mean: =

      ,=1 1+||

      ,=1

      ( ln ) (3)

      ()

      recognition is performed by classification in the multidimensional moment invariant feature space. Several

      techniques have been developed that derive invariant features from moments for object recognition and representation.

      =

      ,=1

      ,=1

      ()

      (4)

      These techniques are distinguished by their moment definition, such as the type of data exploited and the method for deriving invariant values from the image moments. It was

      ,=1

      Variance: =

      =

      ,=1

      ( )2

      ( )2 (5)

      ()()

      Hu (Hu, 1962), that first set out the mathematical foundation for two-dimensional moment invariants and demonstrated their applications to shape recognition. They were first applied to aircraft shapes and were shown to be quick and

      =

      ,=1

      (6)

      reliable (Dudani, Breeding and McGhee, 1977).

      ,=1

      =

      (7)

      These moment invariant values are invariant with respect to translation, scale and rotation of the shape. Hu defines

      Where, i, j – Co-ordinates in the co-occurrence matrix,

      Pd – Co-occurrence matrix value at the coordinates i,

      j and

      N – Dimension of the Co-occurrence matrix.

      seven of these shape descriptor values computed from central moments through order three that are independent to object translation, scale and orientation. Translation invariance is achieved by computing moments that are normalized with respect to the centre of gravity so that the centre of mass of

      the distribution is at the origin (central moments). Size invariant moments are derived from algebraic invariants but these can be shown to be the result of a simple size normalization. From the second and third order values of the normalized central moments a set of seven invariant moments can be computed which are independent of rotation

  4. K-MEANS SEGMENTATION

    The tumor regions are segmented from the abnormal skin image using K-means method, which is an unsupervised algorithm that characterizes the input information into various classes in view of their inherent separation from one another. A cluster is a gathering of objects which are similar between them and are not at all like the objects having a place with different groups. The points are clustered around centroids.

    K-means clustering is an algorithm which gathers objects

  5. SVM CLASSIFICATION

    It is otherwise called as binary classification algorithm. The binary SVM algorithm has been classified as linear and non-linear SVM. The training set for the case of normal linear SVM classifier is represented as,

    = {1, 2, 3, . . }

    where, represents the extracted features and i represents the number of features extracted.

    It should satisfy the following conditions as,

    =1

    {1 ||2 + } (12)

    2

    Where, W is the optimal hyperplane parameter, represents slack variable which has the value greater than 0 and represents the penalty value,

    in view of attributes into k number of gatherings where k is a

    { 1

    } (13)

    positive integer. The gathering (clustering) is finished by minimizing the Euclidean separation between the information and the comparing cluster centroid. For a given image, the cluster means m is computed by,

    =1 2 =1

    Where, and represents the kernel function of the SVM classifier, and 0

    = .()=

    ; = 1, (9)

    The SVM classification is mapped into,

    { 1 . exp( 2)} and 0

    The steps involved in k-means segmentation are described as follows:

      1. Choose the number of clusters k with initial cluster centroid ; i =1,2, …, k.

      2. Split the input data points into k clusters by assigning each data point to the closest cluster centroid using the Euclidean distance ,

        = (10)

        Where, X={x1, x2, .xn} is the input dataset.

      3. Calculate a cluster assignment matrix U to define the data points partition with the binary membership value of the jth data point to the ith cluster, such that

        = ||, where is {0, 1} for all i, j.

        =1 2 =1

        1 (14)

        Where, is the smoothness parameter.

  6. RESULTS AND DISCUSSION

In this paper, 50 skin tumor affected images (Normal case: 10, abnormal case: 20) from the open access dataset is used for evaluation. These skin images are categorized into training and testing set. The training set contains 10 normal images and 20 abnormal images. The testing dataset contains 10 normal images and 20 abnormal images. MATLAB R2013 is used in this paper for simulation. The proposed system classifies 9 normal images as normal and 19 abnormal images as abnormal. Hence, this proposed skin tumor detection

=1

= 1for all j and 0 <

< for all i.

system achieves classification accuracy rate of 90% and 95%

=1

    1. Re-calculate the centroid using the membership values using,

      for both normal and abnormal skin images, respectively. The average classification accuracy rate of the proposed system is

      about 92.5%.

      = =1 for all i. (11)

      =1

    2. If there is no change in cluster centroid or assignment matrix from the previous iteration, stop the process or go to step 2.

      1. (b)

( c )

(d)

Fig. 2. (a) Original skin image, (b) Moment invariant output, (c) Gray level features, (d) SVM classified image.

VI. CONCLUSION

A computer aided fully automatic detection system for the detection of skin cancer has been proposed in this paper. The method is used to classify the preprocessed skin lesion images into normal and abnormal images. And it also defines what type of cancer it is. This method is proposed to reduce the manual segmentation time and increases the accuracy than that segmented by a physician.

REFERENCES

  1. American Cancer Society. Cancer facts & Fig.s 2013. Available: http://www.cancer.org/acs/groups/content/@epidemiologysurveilance/d ocuments/document/acsp-036845.pdf.

  2. Q. Abbas, M. E. Celebi, and I. Fondon, Skin tumor area extraction using an improved dynamic programming approach, Skin Res. Technol., vol. 18, no. 2, pp: 133-142, 2012.

  3. Skin Cancer Foundation, Skin Cancer Facts, http://www.skincancer.org/Skin-Cancer-Facts/, Accessed: July 4, 2010.

  4. A. Jemal, R. Siegel, J. Xu, and E. Ward, Cancer statistics, 2010, CA. Cancer J. Clin., vol. 60, no. 5, pp. 288-296, 2010.

  5. M. Silveira, J. Nascimento, J. Marques, A. Marcal, T. Mendonca, S. Yamauchi, J. Maeda, and J. Rozeira, Comparison of Segmentation Methods for Melanoma Diagnosis in Dermoscopy Images, IEEE Journal of Selected Topics in Signal Processing, vol. 3, pp. 35-45, 2009.

  6. M. Yuksel and M. Borlu, Accurate segmentation of dermoscopic images by image thresholding based on type-2 fuzzy logic, IEEE Transactions on Fuzzy Systems, vol. 17, pp. 976-981, 2009.

  7. M. Silveira, J. Nascimento, and J. Marques, A. Marcal, T. Mendonca,

    S. Yamauchi, J. Maeda, and J. Rozeira, Comparison of segmentation methods for melanoma diagnosis in dermoscopy images, IEEE Journal of Selected Topics in Signal Processing , vol. 3, pp. 35-45, 2009.

  8. H. Zhou, X. Li, G. Schaefer, M. E. Celebi, and P. Miller, Mean shift based gradient vector flow for image segmentation, Comput. Vis. Image Und., vol. 117, no. 9, pp. 10041016, 2013.

  9. Q. Abbas, M. E. Celebi, and I. F. Garcia, A novel perceptually- oriented approach for skin tumor segmentation, Int. J. Innov. Comput. I., vol. 8, no. 3, pp. 18371848, 2012.

  10. M. Mete and N. M. Sirakov, Lesion detection in dermoscopy images with novel density-based and active contour approaches, BMC Bioinformatics, vol. 11, no. Suppl. 6, pp. S23, 2010.

  11. H. Zhou, G. Schaefer, M. E. Celebi, F. Lin, and T. Liu, Gradient vector flow with mean shift for skin lesion segmentation, Comput. Med. Imag. Grap., vol. 35, no. 2, pp. 121127, 2011.

Leave a Reply