Drushti- A Smart Reader for Visually Impaired People

Shobha Sharma; Dhanush.R.Bhat; Neerendra.R.Hegde; Yajnesh A.P; Mythri. D

doi:10.17577/IJERTCONV6IS15008

ICRTT - 2018 (Volume 06 - Issue 15)

Drushti- A Smart Reader for Visually Impaired People

DOI : 10.17577/IJERTCONV6IS15008

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 248
Total Downloads : 0
Authors : Shobha Sharma, Dhanush.R.Bhat, Neerendra.R.Hegde, Yajnesh A.P, Mythri. D
Paper ID : IJERTCONV6IS15008
Volume & Issue : ICRTT – 2018 (Volume 06 – Issue 15)
Published (First Online): 05-01-2019
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Drushti- A Smart Reader for Visually Impaired People

Shobha Sharma

1

, Dhanush.R.Bhat

2

, Neerendra.R.Hegde

3

, Yajnesh A.P

4 5

, Mythri. D

Assistant Professor

1

, UG Student

2,3,4,5

Department of Information Science and Engineering SDM Institute of Technology, Ujire, Karnataka, India

Abstract:- According to the World Health organization (WHO), 285 million people are estimated to be visually impaired worldwide, among which 90% live in developing countries and forty five million are blind individuals worldwide. Though there are many existing solutions to the problem of assisting individuals who are blind to read .In particular, there is a need for a portable text reader that is affordable and readily available to the blind community. This project proposes a smart reader for visually challenged people using Raspberry Pi. This paper addresses the integration of a complete Text Read-out system designed for the visually challenged. A camera will be used to take input, speaker and LCD to give output. The system consists of a webcam interfaced with Raspberry Pi which accepts a page of printed text. The OCR (Optical Character Recognition) package installed in Raspberry Pi scans it into a digital document. Once it is scanned, the text is read out by a text to speech conversion unit (TTS engine) installed in Raspberry Pi. The output is fed to an audio amplifier before it is read out. The image to text conversion and text to speech conversion is done by the OCR software installed in Raspberry Pi. The system finds its interesting applications in libraries, auditoriums, offices where instructions and notices are to be read and also assists in filling of application forms.

Keywords:- Raspberry Pi, OCR(Optical Character Recognition), TTS(Text to Speech) Engine, Web Camera.

INTRODUCTION

An Embedded System is a combination of computer hardware and software, perhaps additional mechanical parts, designed to perform a specific function. An embedded system is a microcontroller-based, software driven, reliable, real-time control system, autonomous, human / network interactive, operating on diverse physical variables in diverse environments sold into a competitive and cost conscious market.

We present a smart device that assists the visually impaired which effectively and efficiently reads paper- printed text. The proposed project uses the methodology of a camera based assistive device that can be used by people to read Text document. The framework is on implementing image capturing technique in an embedded system based on Raspberry Pi board. The design is motivated as it is small-scale and mobile, which enables a more manageable operation with minimal setup. In this project we have proposed a text read out system for the visually challenged. The

proposed integrated system has a camera as an input device to feed the printed text document for its conversion into a gray scale image and the scanned document is processed by a software module known as OCR (optical character recognition engine). As part of the software development, the Open CV (Open source Computer Vision) libraries is utilized to do image capture of text for character recognition. Most of the access technology tools built for people with blindness and limited vision are built on two basic building blocks of OCR software and Text-to-Speech (TTS) engines. Optical character recognition (OCR) is the translation of captured images of printed text into machine encoded text. It is defined as the process of converting scanned images of machine printed into a computer processable format. The final recognized text document is fed to the output device depending on the choice of the user. The output device can be a headset connected to the Raspberry Pi board or a speaker which can spell out the text document aloud.

Figure 1. Prevalence of Blindness as per estimate of 2017

According to Figure 1 , In our planet of 7.4 billion humans, 40% are visually impaired out of which 6% people are completely blind, i.e. have no vision at all, and 35% have mild or severe visual impairment (WHO, 2017). It has been predicted that by the year 2020, these numbers will rise to 75 million blind and 200 million people with visual impairment. There have been numerous efforts in this area to help visually impaired to read without difficulties. By this project, we would be able to detect the text effectively and efficiently which would work towards the benefit of these people.
LITERATURE SURVEY

According to Bindu Philip and R. D. Sudhaker Samuel

paper on Human Machine Interface-A Smart OCR for the visually challenged. The integration of a complete Malayalam Text Read-out system was designed for the visually challenged. The system accepts a page of printed Malayalam text with English numerals, scans it into a digital document which is then subjected to skew correction, segmentation, before feature extraction to perform classification. Once classified, the text in Malayalam is read out by a text to speech conversion unit.

A paper by V. Ajantha Devi, Dr. S Santhosh Baboo on Embedded Optical Character Recognition on Tamil Text Image using Raspberry Pi. Optical Character recognition is used to digitize and reproduce texts that have been produced with non-computerized system. Digitizing texts also helps reduce storage space.

A paper by J.N. Balaramkrishna , J.Geetha on The Smart Reader from Image using OCR and Open CV with Raspberry Pi 3 This kind of system helps visually impaired people to interact with computers effectively through vocal interface. Text-to-Speech is a device that scans and reads English alphabets and numbers that are in the image using OCR technique and changing it to voices.

A paper by Asha G. Hagargund, Sharsha Vanria Thota, Mitadru Bera, Eram Fatima Shaik on Image to speech conversion for visually Impaired The device that proposed aims to help people with visual impairment. A device that converts an image text to speech. The basic framework is an embedded system that captures an image, extracts only the region of interest (i.e. region of the image that contains text) and converts that text to speech. It is implemented using a Raspberry Pi and a Raspberry Pi camera Two tools are used convert the new image (which contains only the text) to speech. They are OCR (Optical Character Recognition) software and TTS (Text-to-Speech) engines. The audio output is heard through the raspberry pis audio jack using speakers or earphones.
BLOCK DIAGRAM

Figure 2. Block Diagram of Proposed System

The Figure 2 illustrates the block diagram of our proposed system. The framework for the proposed system is the Raspberry Pi board. The Raspberry Pi 3 B+ is a single board computer which has 4 USB ports, an Ethernet port for internet connection, 40 GPIO pins for input/output, CSI camera interface, HDMI port, DSI display interface, SOC (system on a chip), LAN controller, SD card slot and an audio jack. The power supply is given to the 5V micro USB connector of Raspberry Pi through the Switched Mode Power Supply (SMPS). The SMPS converts the 230V AC supply to 5V DC. Web Camera is connected to the USB port of Raspberry Pi. Raspberry Pi has an OS named RASPBIAN which process the conversions. The audio output is taken from the audio jack of the Raspberry Pi. The converted speech output is amplified using an audio amplifier. A Power Supply Unit is a device that supplies electrical energy to the output loads.

A Power Supply is also given to the CD Display for Display Purpose. The Capacitors, Resistors and Voltage Regulators are all embedded on a PCB Board and is then connected to a Power Source for functioning of the LCD Display. The PCB Board is connected to the Raspberry Pi through GPIO Pins in order to bring about collaboration between Raspberry Pi and Printed Circuit Board.

The Document to be read is placed on a base and the camera is focused to capture the image. The captured image is processed by the OCR software installed in Raspberry Pi. The captured image is converted to text by the software. The text is converted into speech by the TTS engine. The final output is given to the audio amplifier from which it is connected to the speaker. Speaker can also be replaced by a headphone .
SOFTWARE REQUIREMENTS
- Raspbian OS It is an Operating System to get the Raspberry Pi started.
- IDLE- Python Integrated Development and Learning Environment.
- OpenCV- Open source Computer Vision libraries is utilized to do image capture of text
- OCR- Optical character recognition is the translation of captured images of printed text into machine-encoded text.
- TTS Engine – A Text-To-Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud, whether it was directly introduced in the computer by an operator or scanned image.
METHODOLOGY

Figure 3. The Methodology Stages

According to Figure 3 our Proposed Project has been divided into the Following Stages:

Input Image
Image Pre-Processing
Image To Text Converter
Text to Audio Converter
Audio Output

Figure 4. Project Setup

The Figure 4 illustrates the Project Setup of our Device Drushti on the basic external connections done according to the Block Diagram.
ADVANTAGES
DISADVANTAGES
CONCLUSION

We have implemented an image to speech conversion technique using Raspberry Pi. Our algorithm successfully processes the image and reads it out clearly. This is an economical as well as efficient device for the visually impaired people. We have applied our algorithm on many images which has succeeded. The device is compact and helpful to the society.
FUTURE SCOPE

In the future we can use more Robust and Efficient algorithms to read the image and separate the text from the images. The Captured Image was blur and there is a need to de-blur the Image in less time so that we can separate the data efficiently and convert them into speech. By considering all these aspects our proposed project is going to work towards the benefit of the society and would benefit the visually impaired as well as Blind.
REFERENCES

[1] Bindu Philip and r. d. Sudhaker Samuel 2009 Human machine interface a smart ocr for the visually challenged International journal of recent trends in engineering, vol no.3,November .

[2]. K Nirmala Kumari, Meghana Reddy J [2016]. Image Text to Speech Conversion Using OCR Technique inRaspberry Pi. International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering Vol. 5, Issue 5, May 2016.

[3] V. Ajantha devi, dr. Santhosh baboo Embedded optical character recognition on tamil text image using raspberry pi international journal of computer science trends and technology (ijcst) volume 2 issue 4, jul-aug 2014

[4] Jaiprakash verma, khushali desai Image to sound conversion International journal of advance research.

[5] R. Mithe, S. Indalkar and N. Divekar. Optical Character Recognition" International Journal of Recent Technology and Engineering (IJRTE), ISSN: 2277- 3878,Volume-2, Issue-1, March 2013.

[6] Character Detection and Recognition System for Visually Impaired People by Akhilesh A. Panchal, Shrugal Varde, M.S. Panse .

Drushti- A Smart Reader for Visually Impaired People

Leave a Reply