Embedded Product Label Reading System using SIFT and Line Detection Algorithm

V. Ramya; M. Kokila Sundari; C. Gayathri Devi

doi:10.17577/IJERTCONV3IS16034

TITCON - 2015 (Volume 3 - Issue 16)

Embedded Product Label Reading System using SIFT and Line Detection Algorithm

DOI : 10.17577/IJERTCONV3IS16034

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 131
Total Downloads : 19
Authors : V. Ramya, M. Kokila Sundari, C. Gayathri Devi
Paper ID : IJERTCONV3IS16034
Volume & Issue : TITCON – 2015 (Volume 3 – Issue 16)
Published (First Online): 30-07-2018
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Embedded Product Label Reading System using SIFT and Line Detection Algorithm

V. Ramya

Department of Computer Science and Engineering

Annamalai University, Annamalainagar, India

M. Kokila Sundari

Department of Computer Science and Engineering,

Annamalai University, Annamalainagar, India

Gayathri Devi

Department of Computer Science and Engineering,

Annamalai University, Annamalainagar, India

Abstract- In order to help the physically challenged visually impaired people to read text labels on product packaging, a product label reading system is proposed. To increase the clear visibility of the text with low time complexity, Scale Invariant Feature Transform (SIFT) and Line detection algorithm is applied. The proposed work is carried out at different levels, such as, preprocessing, segmentation, feature extraction, text extraction, and text recognition, Audio output. The input image is preprocessed to improve the quality of that picture. Median filter is applied to enhance the image, borders of the image are observed using the Sobel edge detection method. Segmentation is performed to segment the images. To extract the particular text region of the image scale invariant feature transform is performed and text extraction is employed to draw out parts by one another one. The extracted product label text is recognized by optical character recognition. The audio output of the recognized product label is provided forthe benefit of the visually impaired.

Key words- SIFT Line detection, Median filter, Sobel edge detection, and optical character recognition.
1. INRODUCTION
  
  The visually impaired people worldwide are around 314 million, were 45 million are blind [1]. In todays developing word reading is really important, printed text is everywhere in day to day life, such as getting a product package, to find a manufacture date and expiry date of the product, receipts, bank statement and so on. There are several systems designed to help the visually impaired people, but to handle product labeling is found difficult. The existing system to find the product is by using bar code reader, where the user identifies the product [2] from the large database using speech. The drawback is that the visually impaired people find it hard to position the barcode [3].
2. PROPOSED SYSTEM
  
  The overview of the proposed system is designed in Figure
  1. The image of the product is captured using a camera, in proposed work the laptop camera is used. The input picture is captured lively, it is then preprocessed to improve quality of the image and by using a median filter image is
    
    enhanced. The Sobel edge detection method is applied to detect the edges. The segmentation is performed to segment the image. To extract the particular text region of the image scale invariant feature transform is performed and text extraction is employed to draw out parts by one another one. Optical character recognition is used to binarize and recognize the text regions that are localized by the text characters.
    
    Input image
    
    Input image
    
    Preprocessi ng
    
    Preprocessi ng
    
    Segmentat ion
    
    Segmentat ion
    
    USB
    
    Camera
    
    Text Recogniti on
    
    Text Extraction
    
    Text localizatio
    
    Text Recogniti on
    
    Text Extraction
    
    Text localizatio
    
    Audio output
    
    Audio output
    
    Fig 1. Block diagram of the proposed system
3. SYSTEM DESIGN
  The given product is recognized by using SIFT and Line detection algorithm. The recognized product is embedded unit which generated by P89C51, APR9600 and speaker. In micro controller is a program with the 10 product name and corresponding voice is stored in APR9600.A micro controller receive the signal from the pc through the RS232.Based on that received signal audio output played through Speaker under the control of microcontroller.
  
  Fig 3 Image of APR9600
  
  P89c51
  
  Microcontroller
  
  P89c51
  
  Microcontroller
  
  Power Supply
  
  3) Speaker
  
  APR9600
  
  APR9600
  
  Speakers are one of the common devices. The purpose of speakers is to produce audio output that can be heard by the listener. Speakers are transducers that convert electromagnetic waves into sound waves. The speakers receive audio input from a device such as an audio receiver.
  
  Speaker
  
  Speaker
  
  PC
  
  PC
  
  Fig 2. Block diagram of vision based Assistive System
4. HARDWARE DESCRIPTION
  1. Component description
    - P89C51 Micro Controller
      
      APR9600
    - Speaker
    1. P89C51 Micro Controller
  A Single-Chip 8-Bit Microcontroller manufactured in advanced CMOS process. On-chip Flash Program Memory with In-System Programming (ISP) and In-Application Programming (IAP) capability. The device also has four 8- bit I/O ports, three 16-bit timer/event counters. The added features of the P89C51 make it a powerful microcontroller for applications that require pulse width modulation, high- speed I/O and up/down counting capabilities such as motor control.
  
  2) APR9600
  
  APR device is a single chip voice recording. The playback capability for 40 to 60 seconds. The device is support both random and sequential access of multiple messages. The sample rates are user selectable, allowing designer to customize their design for unique quality. The device is ideal for use in portable voice recorders, many other consumer and industrial application.
  
  Fig 4 .Speaker
5. RESULT AND DISCUSSION

Step 1:

This is the first stage of the proposed work and it is used for identifying the input image. This phase consists of three modules, first one is a camera, it is employed to check that the camera will be acting right or not, and second stage is to capture the product image, and third stage is used to browse the image that is already stored.

Step 2:

In step 2, the original image will be input to the system for preprocessing,

Step 3:

The original picture is converted into a grayscale picture for preprocessing, because the color image requires a lot of computational time.

Step 4:

The text is filtered from the edge mask image

Step 5:

The region based segmentation is used for segmenting the image. This method detects region directly because it is used to extract the image region from the location of starting point and also segmented the image pixel by pixel

Step 6:

This step is used to detect the text.

Step 7:

After text localization the text is displayed in the message text box.

Step 7:

After text recognition the text codes are send to micro controller through RS232. corresponding voice is stored in APR9600.A micro controller receive the signal from the pc through the RS232.Based on that received signal audio output played through Speaker under the control of microcontroller.

CONCLUSION

The proposed system to read printed text on hand held objects for assisting visually impaired people. The input image is taken lively and it is preprocessed to improve the quality of the image and to enhance the image using median filter, and then detect the edges using the Sobel edge detection method. At the next stage, segmentation is performed to the segment the image and then scale invariant feature transform algorithm is used to localize text in the camera based image, and then text extraction is used to extract the character one by one. The optical character recognition is used to perform word recognition on the localized text regions and transform into audio output to blind users in speech.

REFERENCES

[1]. C. Yi and Y. Tian, Assistive text reading from complex background for blind persons, in Proc. Int. workshop Camera-Based Document Anal. Recognit., 2011, vol. LNCS-7139, pp. 1528 .

[2]. X. Chen, J. Yang, J. Zhang, and A. Waibel, Automatic detection and recognition of signs from natural scenes, IEEE Trans. Image Process.,vol. 13, no. 1, pp. 8799, Jan. 2004..

[3]. S. Shoval, J. Borenstein, and Y. Koren, Auditory guidance with the Nav-belt: A computerized travel for the blind, IEEE Trans. Syst., Man, Cybern. C. Appl. Rev., vol. 28, no. 3, pp. 459467, Aug. 1998.

[4]. J. Zhang and R. Kasturi, Extraction of Text Objects in Video Documents: Recent Progress, In IAPR Workshop on Document Analysis Systems, 2008.

[5]. N. Nikolaou and N. Papamarkos, Color Reduction for Complex Document Images, International journal of Imagining system and Technology, Vol.19, pp.14-26, 2009.

[6]. N. Otsu, A threshold selection method from gray-level histograms,In IEEE Tran.s on system, man and cybernetics, pp. 62-66, 1979.

[7]. L. Ma, C. Wang, B. Xiao, Text detection in natural images based on multi-scale edge detection and classification, In the Int. Congress on Image and Signal Processing (CISP), 2010.

[8]. B. Epshtein, E. Ofek and Y. Wexler, Detecting text in natural scenes with stroke width transform, In CVPR, pp. 2963-2970, 2010.

[9]. S. Kumar, R. Gupta, N. Khanna, S. Chaudhury, and S. D. Joshi, Text Extraction and Document Image Segmentation Using Matched Wavelets and MRF Model, IEEE Trans on Image Processing, Vol. 16, No. 8, pp. 2117-2128, 2007.

[10]. C. Stauffer and W. E. L. Grimson, Adaptive background mixture models for real-time tracking, presented at the IEEE Comput. Soc. Conf. Comput.Vision Pattern Recognit, Fort Collins, CO, USA, 1999

[11]. X. Chen and A. L. Yuille, Detecting and reading text in natural scenes, In CVPR, Vol. 2, pp. II-366 II-373, 2004.

[12]. X. Chen, J. Yang, J. Zhang and A. Waibel, Automatic detection and recognition of signs from natural scenes, In IEEE Transactions on image processing, Vol. 13, No. 1, pp. 87-99, 2004.

Embedded Product Label Reading System using SIFT and Line Detection Algorithm

Leave a Reply