Differential Privacy based Preserving Data on Cloud Environment

Prachee Atmapoojya; Utkarsh Saini; Rohit Patida; Rishabh Gupta; Ashutosh Kumar Singh

doi:10.5281/zenodo.18618609

Volume 10, Issue 05 (May 2021)

Differential Privacy based Preserving Data on Cloud Environment

DOI : 10.5281/zenodo.18618609

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 355
Authors : Prachee Atmapoojya , Utkarsh Saini , Rohit Patida , Rishabh Gupta , Ashutosh Kumar Singh
Paper ID : IJERTV10IS050356
Volume & Issue : Volume 10, Issue 05 (May 2021)
Published (First Online): 02-06-2021
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Differential Privacy based Preserving Data on Cloud Environment

Prachee Atmapoojya

Department of Computer Application Nit Kurukshetra Haryana, India

Rohit Patidar

Department of Computer Application Nit Kurukshetra Haryana, India

Utkarsh Saini

Department of Computer Application Nit Kurukshetra Haryana, India

Rishabh Gupta

Department of Computer Application Nit Kurukshetra, Haryana

Prof. Ashutosh Kumar Singh Department Of Computer Application Nit Kurukshetra, Haryana

Abstract:- Cloud Computing offers several profits, including scalability, accessibility, and many services. But with its wide acceptance everywhere in the world, new risks and penetrability have appeared too. Storing the information on the cloud removes ones worries about space considerations, buying new storage gadgets, or managing their data, rather they're ready to operate their data any time from anywhere on the condition that they need internet access. But the rising security problem holds out against the organizations from connecting with cloud computing completely. Hence, security risks have come out because of the main drawbacks of cloud computing. The paper will include descriptions of approaches to information security and strategies used globally to ensure optimal data protection by reducing threats and risks. It is common that the information is always different from data providers for machine learning. Therefore, how to perform machine learning over cloud data from multiple users becomes a new challenge. Traditional differential privacy techniques and encryption methods are not practical for this environment. On the one hand, the information from different users is encrypted with other public keys or noises, making the computation difficult.

Keywords – Cloud computing, machine learning, data security, privacy techniques, differential privacy, privacy preserving.

INTRODUCTION

Cloud computing has been envisioned as the next generation paradigm in computation [1]. The advancement of technology has taken computing to a whole new level, and one of the latest developments in this field is the introduction of cloud computing [2]. It is used for sharing ideas, power processing, storing, connectivity, and virtualization. The Internet cloud offers a vast pool of tools that help to provide on-demand applications, storage media, and sharing media. In the cloud domain, security concerns are the biggest problem and thus the most significant obstacle to the aggravation of IT- based businesses that offer on-demand services to customers.

In the network process, the application phase, authentication phase, storage of information and virtualization phase, these security challenges can be visualized. These challenges are still an obstacle within the complete success path of cloud computing. One reason is that consumers and plenty of organizations keep their information on cloud databases. Hence, the main focus is that the users information should be safe, and the critical information shouldn't drift and tamper when traveling from one place to another across the network. Thus, it is essential that Integrity, Confidentiality, and Availability of user information ought to be ensured. Another reason is that the hacking of data as cloud computing makes it possible for companies to sharpen their Development and achievement. In addition, it also hosts more users with less effort to provide access to shared resources. But security concerns or risks remain a stumbling block in the effective direction of cloud computing. There are a variety of explanations. The first explanation is that consumers and several organizations store their data on cloud storage, so the primary emphasis is that the data must be safe and that when moving across the network from one location to another, the data is not lost and tampered with. It is therefore critical that data security, availability and integrity should be maintained. Organization: The literature review is discussed in Section II. Section III describes the background study containing service models in the cloud, deployment models, including cloud security threats. Machine learning techniques for cloud data security described in Section IV. Section V and VI described further works and the conclusion.

LITERATURE REVIEW

Several resources are consulted to know the fundamentals of cloud computing and storing data security on the cloud. This section provides a review of the literature to lay a foundation for discussing various data security aspects. Srinivas et al. [3] give superb insight into the essential concepts of cloud computing. Several key concepts are explored during this paper by providing samples of applications that can be developed using cloud computing and how they will help the developing world enjoy this emerging technology shared access or multi-tenancy is additionally considered together

with the major risks to data in cloud computing Since multiple users are using equivalent shared computing resources like CPU, Storage, and memory, etc.

Table 1. Comparison of techniques for cloud security

Refer ences	Proposed model/ work	Used Techniques	Pros	Cons
Li et a..[5]	privacy- preserving Naive Bayes learning scheme	Naive Bayes learning,	– differential privacy is utilized to preserve the privacy of every owner.	Forge & manipulati on of data possible.
Ma et al.[6]	privacy- preserving deep learning model	Deep learning	Reduces storage overhead	Low classificati on accuracy, high computati on cost
R.Yo netani , et al. [7.]	Doubly Permuted homomorphic Encryption	SVM, Semi Supervised learning technique	Reduced high computatio nal cost	Supports one operation at a time
Li et al.[8]	Outsourced privacy- preserving classification services over encrypted data	SVM, Naive Bayes, Logistic regression, least squares	Delegation of remote server	Frequently involved interaction of users.
Li et al..[9]	privacy- preserving machine learning under multiple keys	K-NN, SVM, Random forest, Naive Bayes	Protect the privacy of dataset	High computati onal cost
Li et al[10]	Privacy- preserving outsourced classification scheme	Naive Bayes, Homomorphi c Encryption	Used Fully homomorph ic Encryption proxy technique	Storage server no longer applicable in cloud environme nt
Gao et al.[11 ]	constructing a privacy- preserving NB classifier	Naive Bayes, privacy – preserving, double blinding technique	Reduce Communica tion & computatio n overheads	Disable to protect privacy
Hesa mifar d et al..[12 ]	new solutions for running neural network algorithms over encrypted data	Crypto deep learning, homomorphic encryption, neural network	Secures private data well	Non- practical keys are used

It is a threat to not only one user but various users. There's always a risk of personal data accidentally leaing to other users [4]. Multi-tenancy exploits can be exceptionally risky because one fault within the system can allow another user or hacker to access all other data. These sorts of issues are often

taken care of by wisely authenticating the users before they will have access to the info Several authentication techniques are in use to avoid multi-tenancy issues in cloud computing. A data protection scheme is proposed by Li et al. [5], which enables a trainer to train a Naive Bayes classifier over the dataset provided jointly by different data owners. -differential privacy is utilized in this scheme to preserve the confidentiality of every owner. In this approach, collusion is allowed, and adversaries can forge and manipulate the data. To solve the problem of training the model over the encrypted data under multiple keys, a privacy-preserving deep learning model (PDLM) is proposed by Ma et al. [6].

Amritpal et al. [16] suggest an improved steganography technique based on Least Significant Bit (LSB) for photos to provide better data protection. In non-adjacent and distinct pixel regions, it presents an embedding algorithm to hide ciphered messages in confines and smooth image domains. The sides are detected using an enhanced edge detection filter inside the cover image. This steganography technique means that the message bits concealed in the image are less likely to be suspected. Through usual steganography detection methods, it becomes difficult to estimate the true message length. The Proposed approach shows better leads to Peak Signal-to-Noise Ratio (PSNR) value and efficiency compared to other existing techniques.

BACKGROUND STUDY Cloud Computing (CC):

Because of the technology called virtualization, cloud computing is possible. As a platform for providing services over the internet, cloud computing has recently evolved. Without management by the consumer, cloud computing is best for data storage and computing resources. Cloud Computing is subject to confidentiality because service providers can access data at any time. It can alter or delete information. There are several security obstacles to cloud computing that hinder the intense adoption of the computing model.

Service Models of Cloud Computing

Software as a Service (SaaS): On cloud servers, SaaS applications are hosted, and users can access them over the internet. Salesforce, MailChimp, and Slack are all examples of SaaS applications. Security issues with SaaS are certification, acceptance, data privacy, availability, and network safety.

Platform as a Service (PaaS): PaaS dealers offer everything necessary for creating an application having development tools, infrastructure, and operating systems over the internet. Examples include Heroku, Microsoft Azure. The main security concerns are host insecurity, privacy-aware authentication, and fault tolerance [17].

Infrastructure as a Service (IaaS): It provides the basic structure through the Virtual Machine, but nowadays, Virtual Machine is not that much popular. Data deletion and disputes may be resolved by determining the time for data deletion by both the client and the cloud provider. Google Compute Engine and OpenStack are among the IaaS providers.

Figure 1. Service Models of Cloud Computing

Deployment Models in Cloud

PUBLIC CLOUD: According to the name, it is available for the general public. This cloud deployment is the first choice for businesses with low privacy concerns. In this third party, run your cloud infrastructure. It is hassle-free infrastructure management, and you can increase your cloud capacity according to your company's needs. Data security and privacy issues raise the concern.

PRIVATE CLOUD: From a technological perspective, there is no noticeable difference between public and private clouds since their architecture is the same. The private cloud provides several chances for customizing the infrastructure to the companys requirements. Only authorized people can access resources from the private cloud. It is costly, and that is the major disadvantage of it.

COMMUNITY CLOUD: It is a private cloud, only the difference in the number of users. Several companies can share the same cloud platform; because of this, sharing cost is reduced but more than the public deployment model.

HYBRID CLOUD: The best attribute of the other three models is the hybrid. In addition to providing security and managing strategically valuable properties, the hybrid cloud does so in a cost and resource-efficient manner. It is priced fairly [18].

Figure 2. Deployment Models of Cloud

Cloud Security Threats
- Access Management: The primary threat of store in the cloud is not a feature of the system but rather the result of the methods used by companies.
- Data Breaches and Data Leaks.
- Data Loss: Data loss is the most considerable adversity of cloud systems. Not used daily through backups is the most significant threat of the growth of ransomware; it encrypts data and demands payment for returning the data.
- Insecure APIs: The primary tools that allow interconnection with cloud storage systems are application user interfaces (APIs). Security concerns remain with APIs [19].
- Misconfigured Cloud Storage: It is a prevalent threat in cloud systems.
  
  Machine Learning (ML) and Cloud Security: When connected to the cloud, ML applications can be extended. The convergence of ML in the cloud is known as the "intelligent cloud." With Cloud Machine Learning, the ability of both cloud and ML algorithms can greatly increase while the cloud is first used for computing, networking, and storage. For instance, ML is currently a time-consuming task, but with the cloud computing instance, ML tasks can be activated to a great ex-task. As a result, even common statistical tools like R, Octave, and Python have also been transformed into the cloud. Machine Learning is the conceptual analysis of computation and observable models that are used by computer systems to execute a particular effort without using express headings, casual upon models, and acceptance. It is a computerized subset of reasoning. ML in the cloud is so common that any cloud can use ML in the near future.
ML TECHNIQUES FOR THE CLOUD SECURITY
FUTURE WORK Directions that required more researches in future:
- Before including new progressions, a compatible study of overhead should be conducted.
- ML datasets are a series of AI datasets spanning various fields, for which security data sets related to themes, such as spam, phishing, and so on, exist.
- The CSIC HTTP dataset contains a sufficient number of directly produced Web requests and will be used to evaluate Web attack security frameworks.
- Deep neural system disclosure is also an open-source deep neural system endeavor that attempts to discern legally accessible malicious URLs, document methods, and registry keys.
- Datasets within the sample scores are also located within the acquaintance or model's registry. Documents are on it.
- Although the analysis of ML with mob sourcing has been significantly promoted in recent years.
CONCLUSION

The increased use of cloud computing for data storage is undoubtedly accelerating cloud data storage methods. If data stored in the cloud is not adequately secured, it will be at risk. Security risks and assaults are the most complicated problems of cloud computing. Several machine learning algorithms are presented here, such as ANNs, K-NN, NaÃ¯ve Bayes, SVM to secure cloud data. It proves that the method adopted is more precise and efficient. And also reducing user time to encrypt/decrypt various types of data (basic, confidential, and highly confidential). We have also addressed some research avenues that will need further investigations in the future.

REFERENCES

Farhan, S.; Haider, S. Cloud Computing security risks. In the Internet Technology Proceedings and ICITST (Secured Transactions), Abu Dhabi, UAE, December 11-14, 2011; pp. 214-219
Sun, Y., Zhang, J., Xiong, Y., & G. Zhu. (Year 2014). Data Protection and Cloud Computing Privacy. 1-9, Researchgate,.
J. Srinivas, K. Reddy, and A. Qyser, Cloud Computing Basics, Build. Infrastruct. Cloud Security., vol. 1, no. September 2011, pp. 3 22, 2014.
L. Rodero-Merino, L. M. Vaquero, E. Caro, A. Muresan, and F. Desprez, Building safe PaaS clouds: A survey on security in multi- tenant software platforms, Comput. Secur., vol. 31, no. 1, pp. 96 108, 2012.
T. Li, J. Li, Z. Liu, P. Li, and C. Jia, Differentially private naive Bayes learning over multiple data sources, Inf. Sci., vol. 444, pp. 89 104, 2018
X.Ma, J.Ma, H. Li, Q. Jiang, and S. Gao, PDLM: Privacy-preserving deep learning model on cloud with multiple keys, IEEE Trans. Serv. Comput., to be published, doi: 10.1109/TSC.2018.2868750
R. Yonetani, V. Naresh Boddeti, K. M. Kitani, and Y. Sato, Privacy Preserving visual learning using doubly permuted homomorphic encryption, in Proc. IEEE Int. Conf. Comput. Vis., 2017, pp. 2040 2050.
T. Li, Z. Huang, P. Li, Z. Liu, and C. Jia, Outsourced privacy- preserving classification service over encrypted data, J. Netw. Comput. Appl., vol. 106, pp. 100110, 2018.
P. Li, T. Li, H. Ye, J. Li, X. Chen, and Y. Xiang, Privacy-preserving machine learning with multiple data providers, Future Gener. Comput. Syst., vol. 87, pp. 341350, 2018.
Hourani, H.; Cloud Computing: Legal and Security Issues: Abdallah,
1. In International Courts Computer Science and Information Technology (CSIT) Meeting, Helsinki, Finland, June 1314, 2018; Pp. 13 to 16.
Selamat, N.; Ali, F. Using a machine learning algorithm to compare malware detection techniques. Indones. J. Electr. Eng., Eng. Computing. Sci. 2019, 16th, 435th. [The CrossRef].
Shukla, S.; Maheshwari, H. Cloud Computing Security Discerning Risks. Oh, J. Comput. Theor. Theor. Nanosci. Nanosci. 16, 255-261 2019. [The CrossRef]
Yuhong, L.; Yan, S.; Jungwoo, R.; Syed, R.; Athanasios, V. A study of threats to security and privacy in Cloud Computing: Future Paths and Solutions. J. Comput. Sci. Sci. Eng., Eng. 2015, 9, 119-133.
Als olami, E. Security threats and legal issues related to Cloud based solutions. Int. J. Comput. Sci. Netw. Secur. 2018, 18, 156163.
Shamshirband, S.; Fathi, M.; Montieri, A.; Chronopoulos, A.T.; Palumbo, F.; PescapÃ¨, A. Computational, A. in Mobile Cloud Computing Environments, Intelligence Intrusion Detection Techniques: Analysis, Taxonomy, Issues of open science. Uh, J. Inf. Uh, Secur. Appl. Appl. 152, 2019.
Kaur, K., and Zandu, v (2016). A stable model of data classification in the machine learning approach to cloud computing. Computer Science International Journal of Advanced Research, 13-22.
Butt, Umer A., Muhammad Mehmood, Syed B. shah, Rashid Amin,
1. W. Shaukat, Syed M. Raza, Doug Y. Suh, and MD. j. Piran. 2020. A review of Machine Learning Algorithms for Cloud Computing Security. Electronics 1397, no. 19 july 2020 (August): 25. 10.3390/electronics9091379.
Zou, Caifeng, Huifang Deng, and Qunye Qui. 2013. Design and Implementation of Hybrid Cloud Computing Architecture Based on Cloud Bus. Researchgate 6, no. December 2013 (December): 6. 10.1109/MSN.2013.72.
Kazim, Muhammad, and Shao Y. Zhu. 2015. A Survey on top security threats in cloud computing. International journal of Advanced Computer Science and Applications. 6 (3): 5. 10.14569/IJACSA.2015.060316.
Zardari, M.A.; Jung, L.T.; Zakaria, N. K-NN classifier for data confidentiality in cloud computing. In Proceedings of the International Conference on Computer and Information Sciences (ICCOINS), Kuala Lumpur, Malaysia, 35 June 2014; pp. 16.
Zamzam, M.; Tallal, E.; Mohamed, A. Resource Management using Machine Learning in Mobile Edge Computing: A Survey. In Proceedings of the Ninth International Conference on Intelligent Computing and Information Systems (ICICIS), Cairo, Egypt, 810 December 2019; pp. 112117.
Hou, S.; Xin, H. Use of machine learning in detecting network security of edge computing systems. In Proceedings of the 4th International Conference on Big Data Analytics (ICBDA), Suzhou, China, 1315 March 2019; pp. 252256.
Wani, A.; Rana, Q.; Saxena, U.; Pandey, N. Analysis and Detection of DDoS Attacks on Cloud Computing Environment using Machine Learning Techniques. In Proceedings of the Amity International Conference on Artificial Intelligence (AICAI), Dubai, UAE, 46 February 2019; pp. 87087
Calderon, R. The Benefits of Artificial Intelligence in Cybersecurity. Available online: https:// digitalcommons.lasalle.edu/ecf- capstones/36 (accessed on 19 July 2020).
Elzamly, A.; Hussin, B.; Basari, A.S. Classification of Critical Cloud Computing Security Issues for Banking Organizations: A Cloud Delphi Study. Int. J. Grid Distrib. Comput. 2016, 9, 137158. [CrossRef].
Hanna, M.S.; Bader, A.A.; Ibrahim, E.E.; Adel A.A. Application of Intelligent Data Mining Approach in Securing Cloud Computing. Int. J. Adv. Comput. Sci. Appl. 2016, 7, 151159.

Differential Privacy based Preserving Data on Cloud Environment

Leave a Reply