Implementation of Driver Vigilance System using Deep Learning and Advance Computer Vision

Harshvardhan Patil; Aishwarya C Kuratti; Disha Bhanushali; Sandhya Belgaonkar; Praveen.Y.Chitti

doi:10.17577/IJERTCONV8IS11030

IETE - 2020 (Volume 8 - Issue 11)

Implementation of Driver Vigilance System using Deep Learning and Advance Computer Vision

DOI : 10.17577/IJERTCONV8IS11030

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 463
Authors : Harshvardhan Patil, Aishwarya C Kuratti, Disha Bhanushali, Sandhya Belgaonkar, Praveen.Y.Chitti
Paper ID : IJERTCONV8IS11030
Volume & Issue : IETE – 2020 (Volume 8 – Issue 11)
Published (First Online): 04-08-2020
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Implementation of Driver Vigilance System using Deep Learning and Advance Computer Vision

Harshvardhan Patil Department of Computer Science and Engineering,

Jain college of Engineering, Belagavi, India.

Sandhya Belgaonkar

Aishwarya C Kuratti Department of Computer Science and Engineering,

Jain college of Engineering, Belagavi, India.

Disha Bhanushali

Department of Computer Science and Engineering,

Jain college of Engineering, Belagavi, India.

Prof. Praveen.Y.Chitti

Department of Computer Science and Engineering, Jain college of Engineering,

Belagavi, India.

Department of Computer Science and Engineering, Jain college of Engineering, Belagavi, India.

AbstractDistracted driving is an established cause of motor vehicle crashes for all ages. With the rapidly growing elderly population and more adults embracing technology, distracted driving is also increasing in prevalence within that populationparticularly cell phone usage behind the wheel. This research explores the behaviors and attitudes of senior drivers regarding cellphone use while driving as well as the prevalence of the mode of cell phone use behind the car such as, talking, texting, emailing, browsing the internet and navigating. It also explores possible characteristics that would predict the frequency of distracted driving. Distracted driving is an established cause of motor vehicle crashes, for all ages. Nearly 60% of crashes involving younger drivers are linked to distraction (AAAFTS, 2015). This research brief provides evidence from a recent survey that as more older adults embrace technology, distracted drivingin particular, using cell phones behind the wheelis prevalent among them as well. According to a recent survey conducted by AAA Foundation for Traffic Safety and the University of California San Diego, the majority of drivers aged 65 and oldernearly 60%have used their cell phone in some capacity (i.e., texting, making calls, and answering calls) while driving. More than a quarter of these older drivers have engaged in distracting behaviors while driving with a minor in the car. Among those, 32% have talked on the phoneeither with hands-free or hand-held deviceswith younger children (under age 11) in the car, while 42% have done so when accompanied by older children (12- to 17 year-olds). While distracted driving encompasses a wide range of risky behaviors including but not limited to eating, talking with passengers, reaching for belongings, etc., this survey focused solely on cell phone use while operating a vehicle. The findings suggest the need for interventions to reduce distracted driving behaviors among older adults, especially given the rapidly growing older adult population, with their age-associated physiologic changes, such as slower reflexes, reduced contrast sensitivity, and other driving-impairing conditions.

INTRODUCTION

In recent years, the increasing number of vehicles on roads leads to an increase in traffic accidents. In 2015, the National Highway Traffic Safety Administration, part of

U.S. Department of Transportation, reported that 35,092 people died in traffic accidents on the U.S. roads, a 7.2%

increase in fatalities from 2014. Distracted driving was responsible for 391,000 injuries and 3477 fatalities in 2015. It is found that distracted driving was related to one-tenth of fatal crashes. Distracted driving fatalities have increased more rapidly than those caused by drunk driving, speeding and failing to wear a seatbelt. A driver is considered to be distracted when there is an activity that attracts his/her attention away from the task of driving.

There are three types of driving distractions
- Manual distraction: The driver takes his/her hands off the wheel, e.g. drinking, eating etc.
- Visual distraction: The driver looks away from the road,
  
  e.g. reading, watching the phone etc.
- Cognitive distraction: The driver's mind is not fully focused on the driving task, e.g. talking, thinking etc.
  
  It is important to note that although driving distractions are categorized into three different types they do not always occur separately. For example, in the event of talking on the phone, two types of distractions occur at the same time: manual distraction and cognitive distraction. There are many sources that can lead to distraction. However, the most possible distractions usually come from inside the vehicle. Major motor companies such as Toyota, Nissan, Ford, and Mercedes-Benz have been introducing advanced infotainment, control panels, and display systems. Adjusting those in-vehicle devices while driving could cause a considerable distraction that may lead to traffic accidents. Another source that can influence the driving performance is phone use. Conversation on phones while driving consumes a significant amount of brain power. When doing both, the human brain activity dedicated to driving can be reduced by 37%. Text messaging while driving can cause even more distraction because it keeps not only the driver's thought but also his/her hands and eyes out of the driving task for an average time of 4.6s. A recent study pointed out that ~78% of drivers use cell phones behind the wheel which significantly increases the possibility of traffic accidents on the Indian roads. To reduce vehicle accidents and improve transportation safety, a system that can classify distracted driving is highly desirable and has attracted much research interest in recent
  
  years. This study is motivated by developing such a distraction detection system that has the potential to be implemented in real vehicles. Therefore, the goal of this work is to develop an assisted driving system that can detect distracted driving behaviors and alert the driver to focus on the driving task. The main contributions of this study are: (i) proposing a real time distraction detection system which is developed using deep learning. (ii) implementing four types of convolutional neural networks (CNNs) for the detection system in order to determine the most suitable architecture for distraction detection; (iii) collecting our own distracted driving image dataset and (iv) developing a voice-alert system which reminds the driver to focus on the driving task when he/she gets distracted. The proposed work focuses on driver distraction activities detection via images using different kinds of machine learning techniques. The input of our model is videos of driver taken in the car. We first preprocess these videos to get input vectors, then use different classifiers (linear SVM, softmax, naive bayes, decision tree, and 2-layer neural network) to output a predicted type of distraction activity that drivers are conducting.
  
  Figure 1.1: Software setup of the embedded computing system for distraction detection and alert
LITERATURE SURVEY

Literature survey or a literature study includes the current knowledge including substantive findings as well as theoretical and methodological contributions to a particular topic. In [1] the authors Mahbub Hussain, Jordan J. Bird and Diego R. Faria states that Image classification is one of the core problems in Computer Vision field with a large variety of practical applications. This paper proposes the study and investigation of a CNN architecture model(i.e. Inception-v3) to establish whether it would work best in terms of accuracy and efficiency with new image datasets via Transfer Learning. The retrained model is evaluated and the results are compared to some state-of-the-art approaches. Deep Learning has emerged as a new area in machine earning and is applied to a number of signal and image applications. Author used Deep Learning Algorithm namely Convolutional neural networks(CNN) in image classification [2]. Texted-based retrieval system is used to retrieve video or images from database but this is not efficient approach so to address this problems associated with Traditional system Content Based Image Retrieval and Content Based Video Retrieval were introduced. They proposed the methodology for CBIR based on image classification using Support Vector Machine [3] classifier

is introduces and CBIR used C4.5 classifier. To detect an object in an image or a video the system needs to have few components in order to complete the task of detecting the object. The various techniques that are used to detect an object, localize an object, categorize an object, extract features, appearance information and many more[4]. In
[5] the authors Zhong-Qiu Zhao, Peng Zheng, Shou-tao Xu and Xindong Wu states that due to object detections close relationship with video analysis and image understanding, it has attracted much research attention in recent years. This paper provides a review on deep learning bases object detection frameworks. The Supervised Machine Learning is the search for algorithms that reason from externally supplied instances to produce general hypothesis, which then make predictions about future instances. In [6] describes various Supervised Machine Learning classification techniques, compares various supervised learning algorithms as well as determines the most efficient classification algorithm based on dataset, the number of instances and variables.

The Convolutional Neural Networks for human action recognition in videos have proposed different solutions for incorporating the appearance and motion information. In
[7] a new ConvNet architecture for spatiotemporal fusion of video snippets, and evaluate its performance on standard benchmarks where this architecture achieves state-of-the- art results. In [8] the authors Fernando Moya Rueda, Rene Grzeszick, Gernot A. Fink , Sascha Feldhorst and Michael ten Hompel state that methods of HAR have been developed for classifying human movements. HAR uses as inputs signals from videos or from multichannel time- series. This paper focuses on HAR from multichannel time- series. Capturing, evaluating and analyzing signal series for recognizing human actions are critical for many applications. The video retrieval can be used for multiuser systems for video search and browsing which are useful in web applications. This paper takes the information needs and retrieval data already present in the archive, and that retrieval performance can be significantly improved when content-based image retrieval (CBIR) algorithm[9] are applied to search. With the development of multimedia data types and available bandwidth there is huge demand of video retrieval systems, as users shift from text based retrieval systems to content based retrieval systems. In [10] the authors Byeong-Ho KANG, states that Image Processing is any form of signal processing for which the input is an image, such as photographs or frames or videos .This paper presents Image and Video processing elements and current technologies related to that.
PROBLEM IDENTIFICATION According to the motor vehicle safety division, one in

five car accidents is caused by a distracted driver. The World Health Organization(WHO) reported 1.25 million deaths yearly due to road traffic accidents worldwide and the number is continuously increasing, Nearly fifth of these accidents are caused by distracted drivers. Therefore, our project aims to alarm the driver whenever he/she gets distracted. It mainly focuses on the driver when he/she is

texting on phone, talking on phone, drinking and operating radio.
OBJECTIVES The main objective of the proposed work is
- To reduce the number of accidents causing due to distracted drivers.
- To provide an alert system when the driver is involved in other activities apart from driving.
- To provide a system for the safety measures for drivers.
METHODOLOGY
The final step is we test our model with a real world data and find the accuracy rate and the results are been recorded for different actions and people.
PROPOSED WORK PLAN

Figure 7.1:Architecture of proposed system

Figure 6.2:Architecture of proposed system

Training phase:

In this phase video will be converted into frames and then the victim images will be extracted through key frame extraction. Then the images will be resized and converted into the smallest pixel and the CNN filters will be applied. From these images the key features will be extracted. These features of image will be sent for classification.

Testing phase:

In this phase images will be extracted through key frame extraction. Then the images will be resized and converted into the smallest pixel and the CNN filters will be applied. From these images the key features will be extracted. These features of image will be tested with the training data and the desired part will be retrieved.

REFERENCES

Mahbub Hussain,Jordan J. Bird and Diego R. Faria A Study on CNN Transfer Learning for Image Classification, School of Engineering and Applied Science Aston University,UK, june 2018.
Deepika Jaswal, Sowmya. V, K.P.Soman Image Classification Using Convolutional Neural Networks,International Journal of Advancements in Research and Technology, Volume 3, June-2014.
Milan R. Shetake, Sanjay. B. Waikar, Content Based Image and Video Retrieval,International Journal of Advances in Electronics and Computer Science, Volume-2, Sept-2015.
Karthik Umesh Sharma and Nileshsingh V. Thakur A review and an approach for object detection in images, International Journal of Computational Vision and Robotics, Volume 7, Nov-2017.
Zhong-Qiu Zhao, Peng Zheng, Shou-tao Xu and Xindong Wu Object Detection with Deep Learning, IEEE Transactions on neural networks and Learning Systems, 16 Apr 2019. [6] Osisanwo F.Y. ,Akinsola J.E.T. Awodele O, Hinmikaiye J.O Supervised Machine Learning Algorithms: Classification and Comparison,International Journal of Computer Trends and Technology, Volume 48, June 2017.
Christoph Feichtenhofer, Axel Pinz and Andrew Zisserman, Convolutional Two-Stream Network Fusion for Video Action Recognition, 26 Sept 2016.
Fernando Moya Rueda, Rene Grzeszick, Gernot A. Fink , Sascha Feldhorst and Michael ten Hompel, Convolutional Neural Networks for Human Activity Recognition Using BodyWorn Sensors ,25 May 2018.
Vrushali A. Wankhede, Prakash S. Mohod, A Review on Content-Based Image Retrieval from Videos using Self Learning Object Dictionary, International Journal of Science and Research, 2012.
Byeong-Ho KANG, A Review on Image and Video Processing, International Journal of Multimedia And Ubiquitos Engineering,Volume 2, April 2017.

Implementation of Driver Vigilance System using Deep Learning and Advance Computer Vision

Leave a Reply