Sandalwood Tree Identification using Probabilistic Neural Network as a Classifiers

DOI : 10.17577/IJERTCONV3IS05003

Download Full-Text PDF Cite this Publication

Text Only Version

Sandalwood Tree Identification using Probabilistic Neural Network as a Classifiers

1Nakul Dev, 2 Vivek C,3 Benaka Santhosh. S

1Dept.of ECE,CIT,Ponnampet,Karnataka

2 Dept.of ECE,CIT,Ponnampet,Karnataka,

3Assistant Professor, Dept. of ECE,CIT, Ponnampet, Karnataka,

AbstractSandalwood (Santalum Album L) is a valuable tree associated with Indian culture. It is the second most expensive wood in the world. Because of which it is at its quick need to be reserved for the future. In order to achieve this, continuous senses of sandalwood trees is required. As an approach, the identification of sandalwood trees using the Probabilistic Neural Network (PNN) as a classifier is implemented here. Several methods to identify plants have been proposed by several researchers. Commonly, the methods did not capture color information because color was not recognized as an important aspect to the identification. The shape, vein, color, and texture features were used here to do the classification of leaf then tree. The result shows that the method of classification gives accuracy of 90%.

KeywordsColor features, Sandalwood, PNN, Texture features

  1. INTRODUCTION

    Santalum album or sandalwood is a small tropical tree and is the most commonly known source of Sandalwood. This species has historically been cultivated, processed and traded since ancient times. Certain cultures place great significance on its fragrant and medicinal qualities. The high value of the species has caused its past exploitation, to the point where the wild population is vulnerable to extinction. So it is at its quick need to be saved. As an approach in this research, the real time sensus of sandalwood trees time to time has been proposed.

    Most of the plant identification systems have used the concept of aspect ratio, leaf dent, leaf vein and invariant m . ,

    been recognized as an alternative in real time classification problems.

    In this system, we tried to capture the shape, vein, and texture of the leaf of Sandalwood. In implementation, we used Fourier descriptions of PFT, three kinds of geometric features, color moments vein features and texture features based on lacunarity. Then, those features were inputted into the identification system that uses a PNN classifier. Testing was done by using Flavia data set. The result shows that performance of identification system have been improved.

  2. FEATURE EXTRACTION

    The features of Sandalwood tree leaves are extracted from shape, color, vein and texture. All the features are utilized in the identification system.

    1. Shape

      If we consider the shape of Santalum album for identification purpose, two features of shape we are going to utilize, they are geometric features and Fourier descriptions of PFT. Geometric features that commonly used in leaf recognition are Slimness and Roundness. Slimness is also called aspect ratio and is defined as,

      1

      Slimness = 2 (1)

      Here 1 is the width of the leaf and 2 is the length of the leaf and is as shown in figure 1

      oment to identify plants However all those plant

      identification systems did not incorporate color information. Color, shape and texture are the common features involved in several applications. But researchers used part of those features only.

      Texture is the powerful regional description that helps in retrieval process. Texture, on its own does not have the capability of finding similar images but it can be used to classify textured images from non-textured ones and then be combined with another visual attribute like color to make the retrieval more effective.

      Neural networks are attracting the researchers in area of pattern recognition because of its power to learn from training data sets. So it is used as classifier on most of plant identification systems. For example, back propagation was used in adaptive route. Selection policy in mobile adhoc networks. PNN is another neural network that has been used in several applications. PNN has proven to be more time efficient than conventional back-propagation based networks and has

      L2 L1

      Fig1. Leaf of S.ALBUM

      Roundness or compactness is another geometric feature we are going to consider here.

      Where R is the maximum radius from the centre of the shape. The origin of polar space becomes as centre of space to get translation invariant. The centroid (XC, Yc) calculated by

      2

      2

      Roundness = 4

      (2)

      using formula

      Here, A is the area of the leaf image

      = 1 1 ,

      = 1 1

      (4)

      P is the perimeter of the leaf contour

      =0

      =0

      Polar Fourier Transform (PFT) is very useful to capture shape of a leaf. The descriptions extracted from PFT are invariant under the actions of translation, scaling and rotation as illustrated in figure 2.

      In this case (r, ) is calculated by using

      r = ( )2 + ( )2 , = arctan

      (5)

      Rotation invariance is estimated by ignoring the phase information in the co-efficient are retained meanwhile, to get the scale invariance the first magnitude value is normalized by the area of the circle and all the magnitude values are normalized by the magnitude of the first coefficient. So the Fourier descriptions.

      (a)

      (b)

      (c)

      (d)

      (a)

      (b)

      (c)

      (d)

      FDs= {(0,0) , (0,1) , . . , (0,) , . , (,)}

      22

      (0,0)

      (0,0)

      (0,0)

      (6)

      Where m is the maximum number of the radial frequencies and n is the maximum number of angular frequencies.

      Figure 4 shows the S.Album leave and Table 1 lists the Fourier description of leaf using m=4 & n=6.

      Fig. 2 Translation, scaling, and rotation invariants (a) leaf, (b) change of size,

      (c) Change of position, (d) change of orientation

      Polar Fourier Transform that is used in this research is defined as

    2. Color Features

    Feature

    S.Album leaf

    1

    0.5590

    2

    0.0024

    3

    0.1800

    4

    0.0190

    5

    0.0770

    6

    0.4150

    Feature

    S.Album leaf

    1

    0.5590

    2

    0.0024

    3

    0.1800

    4

    0.0190

    5

    0.0770

    6

    0.4150

    Table I

    PF2(, ) = (, ) [2 ( + 2 )]

    Color moments of the leaf represent color features to

    Where,

    (3)

    characterize a color image .Features can involved are mean (µ), standard deviation (), skewness () and kurtosis ().

    1. Mean The simple mathematical average of a set of two or more numbers. The mean for a given set of numbers can be

      0r<R tan = i(2/T) (0i<T) ;0 <R, 0 <T;

      • R is the radial frequency resolution

      • T is the angular frequency resolution Computation of PFT is described as follows,

        For example there is an image I = {f(x, y); 0x<M, 0y<N/p>

        In the first step, image is converted from Cartesian space to polar space. = {(, ); 0 < , 0 < < 2}

        computed in more than one way, including the arithmetic mean method, which uses the sum of the numbers in the series, and the geometric mean method

    2. Standard Deviation A quantity expressing by how much the members of a group differ from the mean value for the group.

    3. Skewness It is a measure of symmetry, or more precisely, the lack of symmetry. A distribution, or data set, is symmetric if it looks the same to the left and right of the center point.

    4. KurtosisIt is a measure of whether the data are peaked or flat relative to a normal distribution.

    For RGB color space the three features are extracted from each plane R, G and B.The formulas to capture those moments are

    C Vein Features

    Vein features can be extracted by using the morphological features. That operation is performed on the gray scale image with flat disk shaped structuring element. Three features of vein are calculated as follows

    V1=A1/A, V2=A2/A, V3=A3/A (11)

    µ = 1

    1 1

    (7)

    = 1

    =

    =

    (

    µ)2

    (8)

    In this case, V1, V2 and V3 represents features of the vein, A1, A2 and A3 represents total pixels of the vein, and A denotes the total pixels on the part of the leaf.

    =1

    =1

    D Texture Features

    In this part, we are going to use fractional measure called

    (µ)

    Lucunarity. The word lacunarityrefers to a gap or pool as

    3

    3

    3

    3

    = =1 =1

    (9)

    derived from the word for lake, but in morphological analysis it has been variously defined as gappiness, visual

    (µ)

    texture, inhomogeneity, translational and rotational invariance

    4

    4

    4

    4

    = =1 =1

    (10)

    etc.

    Lucunarity will help to distinguish between two fractals

    M and N are the dimensions of image .Pij is value of color on i th coloumn and j th row. The S.Album leaf has the mean, standard deviation, Skewness and kurtosis as given in Table II.

    with the same fractal dimension. Definitions of lacunarity are shown as follows

    1

    Feature

    S.Album

    R

    µ=72

    =22.4763

    =5.2656e-005

    =-2.9997

    G

    µ=161

    =15.1716

    7.8869e-005

    =-2.9997

    B

    µ=31

    =21.8473

    4.2772e-005

    =-2.9998

    Feature

    S.Album

    R

    µ=72

    =22.4763

    =5.2656e-005

    =-2.9997

    G

    µ=161

    =15.1716

    7.8869e-005

    =-2.9997

    B

    µ=31

    =21.8473

    4.2772e-005

    =-2.9998

    L = MN

    Mm=1 N 1 Pmn2

    1 (12)

    (

    (

    s 1

    MN

    M

    k=1

    N

    l=1

    2

    n=

    n=

    Pkl)

    L = 1 M

    N | Pmn

    1|

    (13)

    a MN

    m=1

    n=1

    1 M

    MN k=1

    N

    l=1

    Pkl

    L = ( 1 M

    N ( Pmn

    1

    p p

    1) )

    p MN

    m=1

    n=1

    1 M

    MN k=1

    N

    l=1

    Pkl

    (14)

    Originally these formulae are applied to gray scale images, where Pmn is a gray value at co-ordinate (min). However here only the last formulae or used as texture features. With P having the values 2, 4 and 6. Pmn represents values of color R, G, B of RGB image and intensity in grey scale image. Therefore twelve features are used to represent texture features.

    E Data Normalization

    Data normalization is a useful step often adopted prior to the use of classifier as a precaution when the feature values vary in different dynamic ranges. In absence of normalization, features with large values have a stronger influence on the cost function in designing the classifier. By normalizing the data, values of all features will be in pre determined ranges. Normalization can be done by using following formulae.

    Table II

    Xi= (X1Xmin)

    (XmaxXmin)

    (15)

    In this case x^I represents the new value of the features, x1 represents original value of the feature, Xmin is the smallest value of original feature, and Xmax is the largest value of original feature.

  3. PROPOSED SYSTEM

    The proposed algorithm for S. Album leaf classification followed by tree classification is shown in fig5. In the first step image of leaf is acquired. That leaf image is inputted in to the system for classification. The features contained on the leaf are extracted by Feature Extraction. Then, the features are use by a PNN. The result is an index that represents a plant. Then, plant information getter translates the index into the name of the tree. Before the start of classification, the PNN has been trained once.

    Segmentation is used to separate leaf from its background. It can be accomplished by using adaptive threshold .Firstly, an intensity histogram of image is built with 20 bits.Secondly,two major peaks in the histogram that represent the leaf and its background respectively are obtained. Third, find a bit with smallest value that lies between the two major peaks. Then the median of the bits is used as a threshold to separate leaf and its background.

    Image Segmentation

    Image Segmentation

    Image Acquisition

    Figure. below shows the process to get area of the leaf. As a first step, the image of the leaf is converted to gray scale image. Then conversion from gray scale to binary form is done by using adaptive threshold. After that several morphological operations are performed to remove holes in the leaf caused by previous thresholding. The leaf is obtained by using operation AND between RGB image and binary image.

    Convert RGB to Convert gray scale gray scale image to binary image

    AND

    Feature Extraction

    Shape Feature

    Texture Feature

    Shape Feature

    Texture Feature

    Color Feature

    Color Feature

    Database of

    S.album

    Leaf Features

    Database of

    S.album

    Leaf Features

    PNN as a classifier

    PNN as a classifier

    Tree Information Getter

    Tree Information Getter

    Information about the leaf

    Information about the leaf

    The other important part of the identification system is PNN as a classifier.PNN is actually a kind of Radius Basis Function (RBF).Basically PNN classifier adopts Bayer classification rule and density estimation based on Gaussian functions, Figure shows the architecture.

    The input layer accepts an input vector. The pattern layer processed the input vector by using weight vector came from training dataset. This layer compute the distances from the input layer to the training input. As a result, a vector that indicate how close the input is to a training input. Then in the summation layer, a vector contains probabilities is found by summing up the contribution for each class. This vector of probabilities is to the output layer. The last layer in PNN structure produces a classification decision in which a class with maximum proabilities will be assigned by 1 and other classes will be assigned by 0.

    Mathematically, the probabilities found by summing up the contribution for each class is calculated by using parzen method as follows

    Based on the results, we can see that all kinds of features have important contributions except the kurtoris.

    P (x| ) = 1

    ( ()2)

    (16)

    (2)2

    =1

    22

    Where P(x|) represents the conditional probabilities X to class , x is input vector, is training data set, d is the number of input vector, is the number of samples for class j, is the smoothing factor that its value is input heuristically.

    Based on fact that x has class j, if P(x| ) > P(x|) , ij

    then P(x|) can be calculated as follows,

    Features

    Perfomance

    PFT

    74.6875%

    PFT + 3 geometric features

    77.5000%

    PFT + 3 geometric features + mean of colors

    82.5000%

    PFT + 3 geometric features + mean of colors + standard deviation of colors

    88.1250%

    PFT + 3 geometric features + mean of colors + standard deviation of colors + skewness of colors

    88.7500%

    PFT + 3 geometric features + mean of colors + standard deviation of colors + skewness of colors

    +kurtosis of color

    87.8125%

    Features

    Perfomance

    PFT

    74.6875%

    PFT + 3 geometric features

    77.5000%

    PFT + 3 geometric features + mean of colors

    82.5000%

    PFT + 3 geometric features + mean of colors + standard deviation of colors

    88.1250%

    PFT + 3 geometric features + mean of colors + standard deviation of colors + skewness of colors

    88.7500%

    PFT + 3 geometric features + mean of colors + standard deviation of colors + skewness of colors

    +kurtosis of color

    87.8125%

    P(x| ) = 1

    exp [ ()2]

    (17)

    =1

    22

    V. CONCLUSIONS

    An algorithm for sandalwood tree identification has been

  4. RESULTS

The image of the purticular region having the sandalwood trees have been captured using high definition camera. From the acquired image, each individual tree, followed by some 10 samples of leaves in that tree are extracted using feature extraction of image segmentation. Each leaf is compared in all the characteristics provided in last section. With the database of S.Album leaf provided. Depending on this the tree type is detected. In this case,PNN classifier is adjusted by using smoothing factor as equal 0.05

In order to get the perfomance of the system, the following formula is used

developed. This method in corporates shape, vein, color and texture features, and uses PNN as a classifier. Fourier descriptors, slimness ratio, roundness ratio and dispersion are used to represent shape features. Color moments that consists of mean, standard deviation, and skewness are used to represent color. Twelve textures. The result gives up to 90% of accuracy. In the future work, we are planning to implement the system using some more advanced classifiers.

REFERENCES

  1. Q. Wu, C. Zhou, & C. Wang, Feature Extraction and Automatic Recognition of Plant Leaf Using Artificial Neural Network, Avances en Ciencias de la Computacion, pp. 5-12, 2006.

  2. S. G. Wu, F. S. Bao, E. Y Xu, Y-X. Wang, Y-F. Chang, & Q-L. Xiang,

    Perfomance =

    (18)

    A Leaf Recognition Algorithm for Plant Classification Using Probabilistic Neural Network, IEEE 7th Interantional Symposium on Signal Processing and Information Technology, Cairo, 2007.

  3. K. Singh, I. Gupta, & S. Gupta, SVM-BDT PNN and Fourier Moment

    where is relevant number of images and is the total number of query.

    Table below shows the results. As given in the table, combination of shape, color (without kurtoris), vein, and texture features gives the best result with accuracy of 90%.

    Technique for Classification of Leaf Shape. International Journal of Signal Processing, Image Processing and Pattern Recognition, vol. 3 (4), pp. 67-78, 2010.

  4. J-X. Du, D-S. Huang, X-F. Wang, & X. Gu, Computer-aided Plant Species Identification (CAPSI) Based on Leaf Shape Matching Technique, Transactions of the Institute of Measurement and Control , vol. 28 (3), pp. 275-284, 2006.

  5. P. Hiremath, & J. Pujari, Content based Image Retrieval based on Color, Texture and Shape Features Using Image and Its Complement, International Journal of Computer Science and Security , vol. 1 (4), pp. 44-50, 2011.

  6. B. Jyothi, Y. M. Latha, & V. Reddy, Medical Image Retrieval using Multiple Features, Advances in Computational Sciences and Technology , vol. 3 (3), pp. 387-396, 2010.

  7. S. Theodoridis, & K. Koutroumbas, An Introduction Pattern Recognition, Burlington: Academic Press, 2009.

  8. A. Kulkarni, Artificial Neural Networks for Image Understanding,

    New York: Van Nostrand Reinhold, 1994.

  9. M. Mercimek, K. Gulez, & T. V. Mumcu, Real Object Recognition Using Moment Invariants, Sadhana , vol. 30 (6), pp. 765-775, 2005

  10. Z. Zulkifli, Plant Leaf Identification Using Moment Invariants & General Regression Neural Network, Master Thesis, Universiti Teknologi Malaysia, 2009.

  11. A. Kadir, L.E. Nugroho, A. Susanto, & P.I. Santosa, A Comparative Experiment of Several Shape Methods in Recognizing Plants, International Journal of Computer Science & Information Technology (IJSIT) , vol. 3 (3), pp. 256-263.

  12. D. Zhang, Image Retrieval Based on Shape, Unpublished Dissertation, Monash University, 2002.

  13. I. Kunttu, L. Lepisto, & A. Visa, Image Correlogram in Image Database Indexing and Retrieval, Proceedings of 4th European Workshop on Image Analysis for Multimedia Interaction Services, pp. 88-91, London, 2003.

  14. Q-K. Man, C-H. Zheng, X-F. Wang, & F-Y.Lin,Recognition of Plant Leaves Using Support Vector, International Conference on Intelligent Computing, pp. 192-199, Shanghai, 2008.

  15. R. S. Choras, Image Feature Extraction Techniques and Their Application for CBIR and Biometrics Systems, International Journal of Biology and Biomedical Engineering , vol. 1 (1), pp. 6-16, 2007.

  16. T. Acharya, & A.K. Ray, Image Processingg Principles and Applications, New Jersey: John Wiley & Sons, Inc, 2005.

  17. Y.A.O. Min, Y.I. Win-Sheng,S. Bin, &D.A.I. Hong-Hua, An Image Retrieval System Based on Fractal Dimension, Journal Zheijang University Science, vol. 4(4), p. 421-425, 2003.

Leave a Reply