Accuracy Assessment of Supervised and Unsupervised Classification using NOAA Data in Andhra Pradesh Region

Abstract: The objective of this study is to classify NOAA satellite data using NDVI thresholds. The Normalized Difference Vegetation Index (NDVI) image is first derived from the visible and near-infrared bands of the NOAA satellite. Different areas such as vegetation, non-vegetation and water bodies are carefully observed, and the thresholds for classifying them are formulated with the help of ground truth information for the study area. The image is separated into different land covers using density slicing. The classification process is completed by color mapping and class labelling. A confusion matrix is used to determine the accuracy of the classified image by calculating the overall classification accuracy and the Kappa coefficient. NDVI-based classification is one of the best methods for classifying NOAA satellite data with high accuracy.


Ⅰ. INTRODUCTION
Remote sensing data have a wide range of applications, among which land cover mapping has its own significance. The physical condition of the ground surface can be detected from land cover. In the study of land cover dynamics, remote sensing is mostly carried out by satellites. Over the last few decades, advances in remote sensing technology have made it possible to acquire large volumes of geographical data. Sensors provide accurate and timely information about the location, spatial configuration and growth rates of land covers. Image classification is the process used for land cover mapping from remote sensing data. The major breakthrough in satellite remote sensing is better spatial and spectral resolution, made possible by advanced sensors. For mapping and monitoring forest and vegetation, both visual and digital analysis techniques have been used. Land use classes such as forest and vegetation are interpreted by adopting a proper methodology. The analysis of remotely sensed imagery through interpretation is distinguished by three factors: (1) it provides a panoramic overview of the scene, (2) it uses the visible and infrared regions of the electromagnetic spectrum, and (3) it portrays the Earth's surface at different scales and resolutions. The necessary spectral and spatial features of the various objects can be obtained through multispectral remote sensing. The technique used for classifying objects is spectral analysis of the radiant energy reflected or emitted by the target. In this paper, multispectral images are used to compute Normalized Difference Vegetation Index (NDVI) values and to classify different land cover types over the selected study area. NDVI is given by the normalized difference between the sensor spectral radiance of the red band (band4) and the near-infrared band (band5) of the satellite image.
Generally, NDVI values are positive for soil and vegetation, and theoretically the values of NDVI vary between -1.0 and +1.0. This paper is divided into five sections. Section II describes the study area and data, and Section III gives the methodology. Section IV deals with the experimental results obtained with the proposed approach. Section V contains concluding remarks.
Ⅱ. DESCRIPTION OF STUDY AREA AND DATA
The Andhra Pradesh region displays a discontinuity of stratigraphic significance, which indicates a period of remarkable tranquility. This is an example of the Eparchaean Unconformity [Source: Tirupati Urban Development Authority]. The monsoon brings moderate rainfall. In summer, temperatures rise to between 35 and 41 degrees Celsius, and in winter the city experiences low temperatures ranging from 16 to 20 degrees Celsius. The summer season starts in March and may last until June, followed by the rainy season from July to December. The city then experiences winter until the end of February. In this paper, the rainy season is considered; the study area experiences maximum rainfall from October to November, which falls under the northeast monsoon season. For the study, images issued by the Centre of Excellence on "Atmospheric Remote Sensing and Advanced Signal Processing" were collected and used. The images are multispectral remote sensing images of the Andhra Pradesh region for the three seasons listed above. The satellite used here is NOAA, which images the entire region.
$$\mathrm{NDVI} = \frac{\mathrm{NIR} - \mathrm{RED}}{\mathrm{NIR} + \mathrm{RED}} \quad (1)$$
where NIR is the near-infrared band value of a pixel and RED is the red band value of the same pixel.
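As an illustration of the NDVI definition above, the following minimal sketch computes the index pixel by pixel in plain Python. The function names `ndvi` and `ndvi_image`, the sample band values, and the choice to return 0.0 when both bands are zero are assumptions made for this example and are not taken from the study.

```python
def ndvi(nir, red):
    """Return NDVI = (NIR - RED) / (NIR + RED) for a single pixel."""
    denom = nir + red
    if denom == 0:
        # Assumption: a pixel with no signal in either band is mapped to 0.0
        # instead of raising a division-by-zero error.
        return 0.0
    return (nir - red) / denom


def ndvi_image(nir_band, red_band):
    """Apply ndvi() to two equally sized 2-D lists of band values."""
    return [
        [ndvi(n, r) for n, r in zip(nir_row, red_row)]
        for nir_row, red_row in zip(nir_band, red_band)
    ]


# Hypothetical 2 x 2 band values, for illustration only.
nir = [[0.50, 0.30], [0.05, 0.20]]
red = [[0.10, 0.25], [0.10, 0.20]]
result = ndvi_image(nir, red)

for row in result:
    print([round(v, 3) for v in row])
```

In this toy grid, the pixel whose near-infrared value exceeds its red value gets a positive NDVI, the pixel with the stronger red value gets a negative NDVI, and equal band values give zero.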
Maximum Likelihood Classification is used for image classification and is carried out for each dataset; training sets were collected from false color composite (FCC) imagery and supervised imagery. Statistical NDVI values such as the minimum, maximum, mean and standard deviation are compared for the various land cover types in different seasons. The same category of land use/cover will have the same or similar NDVI curves, and NDVI time series capture seasonal changes. Land cover change detection and the derivation of phenological parameters are supported by time-series NDVI data.
The classification of the image based on NDVI values is shown in Figure 1, and the NDVI statistics are estimated as follows. Areas with low NDVI values are usually water bodies (ranging from -0.0175 to -0.328), built-up areas (ranging from -0.019 to 0.060) and bare soil (ranging from -0.001 to 0.166). Throughout the three seasons, the NDVI value of dense vegetation ranges from 0.500 to 0.575 and remains almost constant, which indicates the forest region of the Eastern Ghats. On the other hand, across the three seasons, the NDVI of the vegetation in and around the city varies from 0.244 to 0.44.
In this paper, the study area is classified on the basis of NDVI using multi-season, multispectral satellite data. Dense, thick forest areas are observed in the Andhra Pradesh region, which is part of the Eastern Ghats division of the Indian mountain ranges. Many water bodies are situated in the eastern part of Andhra Pradesh. Vegetated land extends over the eastern and western regions. The central part consists mostly of built-up area and settlements. The results of the study show that no significant NDVI changes occur in dense vegetation and its dynamics throughout the three seasons. The same methodology can be used to retrieve non-spatial parameters that can affect the vegetation through climatic changes over the study area. The Land Surface Emissivity (LSE) and Land Surface Temperature (LST) can be estimated from NDVI, and the correlation between them can be calculated.

Ⅲ. MATERIALS AND METHODS
The flowchart of the proposed work for obtaining the classified NDVI image is shown in Figure 1. This technique can process only NOAA data. In this study, Channel 1 and Channel 2 are used to calculate NDVI.
Step 1: Before analysis, the images were radiometrically and geometrically corrected. During geometric correction, control points were identified on the topographic maps and the satellite images, with RMS errors of less than two pixels. After that, the images of 2016 and 2018 were registered. A subset image was created from each NOAA image for subsequent analysis.
Step 2: The Normalized Difference Vegetation Index (NDVI) was used to identify the different land cover types of the study area using equation (1). NDVI was calculated from the NOAA data for the years 2016 and 2018. The results were also analyzed and show significant changes in land cover over time in the study area. NDVI ranges between -1.0 and +1.0. Cloud, water and snow reflect more in the visible channel than in the near-infrared channel, so they take negative NDVI values. Rock, bare soil and man-made structures have NDVI values around zero. Vegetation, on the other hand, has strong reflectance in the near-infrared, thereby giving NDVI values close to +1.
Step 3: In the present work, the NOAA data of both dates were classified independently based on NDVI values, which range from -1 to +1. Based on these NDVI values, the NDVI images of the two dates were classified into four classes using NDVI threshold ranges.

A. CLASSIFICATION
Before classification, the image is first enhanced using a decorrelation stretch for more effective visualization. The flowchart in Figure 1 represents the classification and accuracy assessment process. The experiment then proceeds by calculating the NDVI thresholds. The 5-4-3 band combination (NIR, Red, and Green) is assigned to red, green and blue (RGB) respectively. The resulting RGB image is a standard color-infrared (CIR) image.
The experiment continues by locating the vegetation through thresholding of the NDVI image. Based on the NDVI threshold values, the training data are classified into three major classes: Water, Non-vegetation and Vegetation. The results have been keenly observed and analyzed. The NDVI image is converted into a modified gray-level image by the process called density slicing. The next step involves color mapping and labelling of the satellite image for the three classes. This concludes the classification based on NDVI thresholds, and the study then compares the classified image with the ground truth.
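The density slicing, color mapping and labelling steps described above can be sketched as follows. The breakpoints 0.0 and 0.2 and the RGB colors are hypothetical values chosen only for illustration; they are not the thresholds formulated in this study, and the helper names are likewise assumptions.

```python
from bisect import bisect_right

# Hypothetical breakpoints (NOT the thresholds used in the study):
#   NDVI < 0.0          -> class 0 (Water)
#   0.0 <= NDVI < 0.2   -> class 1 (Non-vegetation)
#   NDVI >= 0.2         -> class 2 (Vegetation)
BREAKPOINTS = [0.0, 0.2]
LABELS = ["Water", "Non-vegetation", "Vegetation"]
# Illustrative display colors as (R, G, B): blue, brown, green.
COLORS = [(0, 0, 255), (165, 42, 42), (0, 128, 0)]


def slice_ndvi(value):
    """Density slicing: return the index of the NDVI interval containing value."""
    return bisect_right(BREAKPOINTS, value)


def classify(ndvi_rows):
    """Turn a 2-D list of NDVI values into a 2-D list of class indices."""
    return [[slice_ndvi(v) for v in row] for row in ndvi_rows]


def color_map(class_rows):
    """Replace each class index by its display color."""
    return [[COLORS[c] for c in row] for row in class_rows]


# Hypothetical NDVI values for a 2 x 2 image.
sample = [[-0.33, 0.67], [0.09, 0.20]]
classes = classify(sample)
labels = [[LABELS[c] for c in row] for row in classes]
rgb = color_map(classes)
print(labels)
```

Keeping the breakpoints in a single sorted list means that adding a class only requires one more breakpoint, one more label and one more color.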

B. ACCURACY ASSESSMENT (CONFUSION MATRIX)
The classified image obtained from the experiment is now compared with the ground truth data to assess its accuracy. Both user's and producer's accuracy were measured, together with the overall classification accuracy. The producer's accuracy of a class is the number of correctly classified pixels of that class divided by the total number of ground truth pixels of that class. Pixels that were misclassified into another class are also recorded. The user's accuracy, on the other hand, is the number of correctly classified pixels of a class divided by the total number of pixels assigned to that class in the classified image. The confusion matrix is formulated to calculate the overall accuracy along with the user's and producer's accuracy. The overall accuracy is obtained by dividing the total number of correctly classified pixels by the total number of pixels, as shown below.
$$\text{Overall Accuracy} = \frac{\text{Total no. of correctly classified pixels}}{\text{Total no. of pixels}} \times 100 \quad (3)$$
The agreement between the classified training pixels and the ground truth data is measured using the Kappa coefficient.
The Kappa values range from -1.0 to +1.0; a positive value indicates agreement with the ground truth, and values close to +1 indicate high accuracy. A Kappa coefficient of zero indicates no correlation in the classification.
$$\kappa = \frac{N \sum_{i=1}^{p} x_{ii} - \sum_{i=1}^{p} x_{i+}\, x_{+i}}{N^{2} - \sum_{i=1}^{p} x_{i+}\, x_{+i}}$$
Where:
$N$ = total number of pixels
$p$ = total number of classes
$\sum x_{ii}$ = sum of the diagonal elements of the confusion matrix
$x_{i+}$ = sum of row $i$
$x_{+i}$ = sum of column $i$
The results of the classification based on the accuracy assessment were obtained and are recorded in the next section.
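To make the two summary measures concrete, the sketch below computes the overall accuracy and the Kappa coefficient from a confusion matrix, assuming the standard Cohen's Kappa form based on row and column totals. The 3 x 3 matrix is hypothetical and does not reproduce the counts of this study; the function names are also assumptions.

```python
def overall_accuracy(matrix):
    """Percentage of pixels on the diagonal of a square confusion matrix."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return 100.0 * correct / total


def kappa(matrix):
    """Cohen's Kappa from a square confusion matrix of pixel counts."""
    p = len(matrix)
    total = sum(sum(row) for row in matrix)
    diagonal = sum(matrix[i][i] for i in range(p))
    row_totals = [sum(matrix[i]) for i in range(p)]
    col_totals = [sum(matrix[i][j] for i in range(p)) for j in range(p)]
    # Sum over classes of (row total * column total): the chance-agreement term.
    chance = sum(row_totals[i] * col_totals[i] for i in range(p))
    return (total * diagonal - chance) / (total ** 2 - chance)


# Hypothetical confusion matrix (rows: classified image, columns: ground truth).
# Class order: Water, Non-vegetation, Vegetation.
example = [
    [40, 5, 5],
    [0, 45, 5],
    [0, 10, 40],
]

print(round(overall_accuracy(example), 2))  # 83.33
print(round(kappa(example), 2))             # 0.75
```

For this matrix, 125 of 150 pixels lie on the diagonal, giving 83.33 percent overall accuracy, while the chance-corrected Kappa is lower at 0.75.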

Ⅳ. RESULTS AND DISCUSSION
The accuracy of any classification obtained from satellite data must be determined quantitatively. The NDVI image was compared with ground truth information for pixels that had already been categorized. The NDVI ratio is used to evaluate the supervised NDVI thresholds for classifying an image area of 349 × 329 pixels. The grayscale NDVI image is shown in Figure 2.
The classification process is completed by color mapping and labelling of the three classes. Table 2 presents the related confusion matrix together with the classification results (overall accuracy and Kappa coefficient). The results show that, for every class, the number of correctly classified pixels is far higher than the number of misclassified pixels. For the water class, only 65 pixels were misclassified, all into the non-vegetation class; the misclassified pixels of the other classes were also tabulated. Misclassified vegetation pixels fell into the other two classes, water and non-vegetation, with 446 and 927 pixels respectively. Misclassified non-vegetation pixels likewise fell into the other two classes, water and vegetation, with 2880 and 778 pixels respectively. The user's accuracy and producer's accuracy can be obtained from each row and each column of the confusion matrix: the correctly classified pixels of each row and column are divided by the total number of pixels in that row or column. The result is highly accurate, with an overall accuracy of 92.55%. The Kappa coefficient is positive, at 0.906, which indicates a highly accurate classification. The classification of the NOAA satellite data was thus assessed quantitatively using the confusion matrix. As mentioned earlier, in supervised classification the NDVI images are categorized into classes based on their NDVI values. The user has the flexibility to choose the training samples for classification based on their own knowledge.
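The row-wise and column-wise division described above can be sketched as follows. The sketch assumes that rows hold the classified image and columns hold the ground truth, so that user's accuracy comes from row totals and producer's accuracy from column totals. The matrix is hypothetical and is not Table 2 of this study.

```python
def per_class_accuracy(matrix):
    """Return (user's, producer's) accuracy lists for a square confusion matrix.

    Assumed layout: matrix[i][j] counts pixels classified as class i whose
    ground truth class is j.
    """
    p = len(matrix)
    row_totals = [sum(matrix[i]) for i in range(p)]
    col_totals = [sum(matrix[i][j] for i in range(p)) for j in range(p)]
    # User's accuracy: correct pixels of a class / all pixels classified as it.
    users = [
        matrix[i][i] / row_totals[i] if row_totals[i] else 0.0
        for i in range(p)
    ]
    # Producer's accuracy: correct pixels of a class / all ground truth pixels of it.
    producers = [
        matrix[j][j] / col_totals[j] if col_totals[j] else 0.0
        for j in range(p)
    ]
    return users, producers


# Hypothetical confusion matrix; class order: Water, Non-vegetation, Vegetation.
example = [
    [40, 5, 5],
    [0, 45, 5],
    [0, 10, 40],
]

users, producers = per_class_accuracy(example)
for name, u, pr in zip(["Water", "Non-vegetation", "Vegetation"], users, producers):
    print(f"{name}: user's = {u:.2f}, producer's = {pr:.2f}")
```

In this example the water class has a producer's accuracy of 1.00 but a user's accuracy of 0.80, which shows why both measures are reported: every ground truth water pixel was found, yet one in five pixels labelled as water belongs to another class.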