Visual Role Mining Using Adviser and Extraction Algorithm in Business Environment

Merin Jose; Merlin Cyriac; Mereen Thomas

doi:10.17577/IJERTV3IS042393

Volume 03, Issue 04 (April 2014)

Visual Role Mining Using Adviser and Extraction Algorithm in Business Environment

DOI : 10.17577/IJERTV3IS042393

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 170
Total Downloads : 170
Authors : Merin Jose, Merlin Cyriac, Mereen Thomas
Paper ID : IJERTV3IS042393
Volume & Issue : Volume 03, Issue 04 (April 2014)
Published (First Online): 06-05-2014
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Visual Role Mining Using Adviser and Extraction Algorithm in Business Environment

Merin Jose

PG Scholar Department of Computer Science &

Engineering

KCG College of Technology Chennai, India

Merlin Cyriac

PG Scholar Department of Computer Science &

Engineering

KCG College of Technology Chennai, India

Mereen Thomas

PG Scholar Department of Computer Science &

Engineering Anna University

Guindy Campus,India

AbstractThis paper offers a new visualization approach to Role based Access Control mentioned to as Visual Role Mining. The main idea is to reduce the query generation time in the databases. So that it will provide a text file retrieval model instead of query in databases. First it formally converts the data in the databases to some text files. By the help of ADVISER algorithm and BicOverlapper tool, the data will be converted to some picture formats. Later on the entire data will be extracted by the help of an Extraction algorithm. The algorithm also helps to convert the data into appropriate visual representations which supports role engineering process.

KeywordsRole Based Access Control, Visual Role Mining, Visual Representation.

INTRODUCTION

Data mining is a process that uses a variety of data analysis tools to discover patterns and relationships in data that may be used to make different analysis. The first and simplest analytical step in data mining is to describe the data summarize its statistical attributes, visually review it using pictures, shapes, charts and graphs, and look for meaningful links among attributes. Data mining is increasingly popular because of some research aspects too. It involves databases, data management, data preprocessing etc.

Graphing and visualization tools are a vital aid in data preparation and their importance to effective data analysis cannot be over emphasized. Visualization works because it exploits the broader information bandwidth of graphics. It allows people to go more inside to the concepts rather than the outlook. It is easier to find out the mistakes in graphical view than other textual formats. It is similar to a smart art which gives more information than a simple description.

1.1 Role Based Access Control

The goal of role engineering, by Edward Coyne, is to define a set of roles that is essential, correct and efficient. In particular, role engineering requires defining roles and assigning permissions to them. Role engineering is essential before all the benefits of RBAC can be realized. Meanwhile, role engineering, considered as one of the major challenges

RBAC implementation is a time-consuming and costly process. Due to this, organizations are often reluctant to move to RBAC. Therefore, the increasing popularity of RBAC calls for efficient solutions for role engineering as results in tremendous research efforts in this area.

There are three basic approaches towards role engineering: top-down,bottom-up and hybrid. Under the top-down approach, roles are defined by proper analysis and decomposition of business processes into smaller units in a functionally independent ways. These functionalities are then associated with permissions on information systems. In other way, this approach begins with defining a particular job function and then creating a role for this job function by associating needed permissions.

Access control mechanisms are crucial design elements that aim at mediating requests to data and services. Among all models proposed in the literature, Role-Based Access Control (RBAC) has become the norm for managing permissions within commercial applications. The high-level formalism and the simplicity of its design made it an attractive and pragmatic choice for implementing access control. Under RBAC, a role is a set of permissions, while users posses the permissions to perform system functions only when they are assigned to specific roles. Because of the intuitiveness of RBAC, security policies can be easily defined by business users that do not usually have all the needed IT knowledge.

Role mining is the process of analyzing user-to-resource mapping data to determine or modify user permissions for role-based access control (RBAC) in an enterprise. In a business scenario, roles are defined according to jobspeculiarity, authority and responsibility. The ultimate intent of role mining is to achieve optimal security administration based on the role each individual plays within the organization. Role mining is commonly using in different environments. Other than security it holds simplicity too.
RELATED WORK

The role engineering problemthrough a top-down perspective was illustrated by Coyne et al. [3]. Kuhlmann et al
1. was first trying to apply existing data mining techniques to elicit roles from accessed data. He introduced the term role mining.After that, different algorithms explicitly designed for role engineering purposes were proposed. Molloy et al. [10] presented a comprehensive study to compare them and a brief survey on the subject. Colantonio et al. [1] recently addressed the problem of analyzing the role mining complexity by also proposing a way to reduce it. In general, this approach can be considered a complement for all the existing role engineering methodologies and tools. Indeed, it allows agood, executable, and visceral way to evaluate and select roles generated by other methodologies.
  
  There are different role mining techniques are available for clustering, classification, extraction, mining. Among them some filtering mechanisms are more common. They are scan count, divide skip, merge skip.
  
  Another related work is proposed by Geerts F. et al.[6], where a branch-and-bound algorithm for mining large tiles (that is, regions of database consisting purely of ones) is introduced. It shares with the interest on finding large tiles only indeed, here the focus on the problem of visually representing tiles. A similar problem is partially addressed in 2008. M. Frank et al.[5] show a possible way to build a matrix representation of user-permission relationships. However, this generation is limited to the special case of non overlapping roles, far from being general and optimal according to definition. Moreover, it is not applicable to generic role mining approaches.
  
  As for visual representation of mined data, a small number of visualizers have been proposed in different literature, and most of them are not explicitly designed for a particular data. The BicOverlapper tool integrates on a set of well-known visualization techniques that represent different data information on different levels. However, typical representations for each data such as repeating rows and columns of the analyzed matrix are confusing or not suitable for role mining. Jin R et al. [7] propose a visualization algorithm that extends existing graph sorting algorithms to offer a good matrix visualization of previously defined hyper graphs which can be mapped to the role concept in the RBAC terminology. Leung and Carmichael [9] developed a visualizer for frequent item sets based on multiline calls polyline. However, frequent item sets are not the only relevant patterns for role engineering. This approach greatly differs from the actual implementation:
  1. Adopt a different visualization cost metric that is more suitable for role engineering incompatible with the core of their theory.
  2. Show how to obtain a matrix representation without resorting to any existing mining algorithm.
Roles are treated as sets of permssions: Each row in the list is a role. Equivalently, a user role is characterized by the set of permissions that he owns. Vaidya J et al.[13] proposed a

model, these sets are determined by the row i in x., this model is equivalent to one of the instances of this model class if no underlying probability distribution is considered. In the algorithm proposed by Vaidya J et al [12], all existing users are initially considered as candidate roles. Thus, each candidate role consists of all permissions that are assigned to a particular user. Afterwards, candidate roles are picked in a greedy manner to determine the final set of roles. A similar procedure is proposed by Santamaria G et al.[11]). But there, roles and permissions are represented as sets of users. The initial roles are constructed from existing permissions. Cherichetti F et al [4] introduces, the roles are also represented as set of permissions .Candidate roles are generated and then merged, split, or placed in a role hierarchy, as determined by a small set of given rules. Namely, an initial role is the set of users that are assigned to a given permission. Again, the initial roles are iteratively merged, split, or placed in a role hierarchy according to the cardinality of intersections of the roles.

ROLE VISUALIZATION PROBLEM

Recently, there has been an increasing interest in using automated role engineering techniques. Despite much work dedicated to the design of role mining algorithms, existing methodologies deal with three main practical issues: meaning of roles, noise associated with the data, and interconnections among roles.

To address the issues, a new approach, referred to as visual role mining. User-permission patterns (i.e., RBAC roles) among each individual are managed as visual patterns. The principle behind this approach is that visual representations of roles can actually amplify cognition, leading to optimal analysis results. Visualization of the user-permission assignments is performed in such a way to remove the noise, allowing role engineers to focus on relevant patterns, purchasing their cognition capabilities. Further, connections among roles are shown as different patterns, hence providing a visual manner to discover and utilize these relations.

Role Visualization

Given a set of already discovered roles of interest, the task is to identify the best graphical representation for them. In particular, the representation for user-permission assignments that allows for both an intuitive role validation and a visual identification of the relationships among roles. The proposed method shows that roles are easier to recognize than describe via a binary matrix representation. The proposed method can answer questions that statistical or mining approaches cannot easily provide. It will provide an easy way to analyze data within the text files rather than databases. It will also represent data in some visual manner like charts or in other easily understandable form.
Binary Matrix Representation

A normal representation for this information is the binary matrix, where rows and columns correspond to users and permissions, and each cell is on when a certain user has a certain permission granted.

The table 1 shows the input data which contains the users and the corresponding permissions associated with it. There will be different users associated with different permissions. According to the permissions, the users will be classified into different groups. Now the roles can be retrieved according to these groups. Table2 represents the candidate roles retrieved.

{

<u0,p1>, u0,p3>,<u0,p8>,<u0,p9>,<u1,p1>,

<u1,p2>, <u1,p3>,<u1,p4>,<u1,p6>,<u1,p8>,

<u1,p9>,<u2,p1>,<u2,p2>,<u2,p4>,<u2,p6>,<u2,p9>,<u3,p1

>,<u3,p3>,<u3,p8>,<u3,p9>,<u4,p1>,<u4,p3>,<u4,p8>,<u4,p 9>,<u5,p1>,<u5,p2>,<u5,p4>,<u5,p5>,<u5,p6>,<u5,p9>,

<u6,p1>,<u6,p2>,<u6,p4>,<u6,p6>,

<u6,p9>,<u7,p0>,<u7,p1>, <u7,p7>,

<u8,p0>, <u8,p1>, <u8,p7>,<u9,p1>}

User-Permission Assignments

Table1 Input Data

Table 2 Candidate Roles

Role	Permissions	Users
r1	{p1}	{u0, u1, u2, u3, u4, u5, u6, u7, u8, u9}
r2	{p2, p4, p6, p9}	{u1, u2, u5, u6, u9}
r3	{p3, p8, p9}	{u0, u1, u3, u4}
r4	{p0, p7}	{u7, u8}
r5	{p5}	{u5}

PROPOSED SYSTEM

By leveraging on the observations made in the previous section, it describes a viable, fast heuristic algorithm called ADVISER (Access Data Visualizer). For a given a set of roles, this algorithm is able to provide a compact representation of them. In particular, it rearranges rows and columns of the user-permission matrix to minimize the fragmentation of each roles associated to it.

ADVISER, the more fragments in the visualization of a role, and thenthe role visualization cost will get increased. Reordering users but not permissions only affects the number of gaps between columns, and so do Permissions (i.e., Rows and columns are sorted independently).

According to the expectation, the visualization cost decreases as the number of samples increases. Finally, extensive applications over real and public data confirm that this approach is efficient, reliable both in terms of computational time and result quality of the product.
Extraction algorithm is used to extract data from the text files.

init(picture assigned to role,type Of Mining,values)

{

File file = new File(path); if(file.exists()) {

FileInputStream fis = new FileInputStream(path); byte buffer[] = new byte[fis.available()]; fis.read(buffer);

String a = newString(buffer); String b = a;

fis.close();
RESULT AND ANALYSIS

In order to represent the visual role mining, a bank application has been taken as a model. The bank manager can login to the system and he can perform certain updation. As described the options available to the manger are view employee details,add/remove users,setting permissions,employee registration, work assignment etc The manager is responsible for creating a cash manager. The Employee will be generated in that particular id. The manager can register and assign works to that corresponding employee. After creating each employee we have to register them into the application database, which will be added to the database. It includes employee id, password, address, mail id, and role.

Every user has the provision to change their password after their login. Every cashier will have an online desk, which includes all details of every transaction. It includes the attendance register of each employee. It adds the no of days that particular employee took leave.

There will be certain works associated with the users for getting into the system. The main step is the registration part. The user should register to start an account. All the basic information should be provided. The accountant will be allowing each customer to start an account. Hereafter the customer can use the facilities offered by the bank.

The data which is available in the application database has been converted in to text files as employee data and customer data. These files are considered as the input for the BicOverlapper tool. This data will be converted to different patterns as shown in the 6.. The color difference indicates the different roles associated with each employee. The type of loan makes the color difference in the case of customer data. The analysis part becomes easier when the size of the file increases, comparing with a database.

Figure 6.1 Picture Format

The Extract algorithm will be extracting the data from the text files. It is capable of representing the data into some pictorial representations like bar charts.

Figure 6.2 Extraction process

VI .CONCLUSION

This paper is mainly addressing the visual role mining problem. That is, visualizing user-permission assignments in a graphical form that makes it possible to simplify the role engineering process. The proposed representation of data allows role designers to gain insight, draw conclusions, and ultimately design meaningful roles in business applications. The paper offered a formal description of the visual role mining problem. Then it demonstrated a banking environment which includes all the transactions. The people included are employees and the customers associated with the bank. Moreover, it proposed a novel algorithm called ADVISER in conjunction with a bicOverlapper tool to generate a visual representation. The bicOverlapper tool produces approximate patterns that can be used in conjunction with ADVISER to obtain high-quality visualization results. Finally, extensive applications over public data confirm that this approach is efficient, both in terms of computational time and result quality. It also described an efficient algorithm referred to as Extract algorithm. The paper introduced role engineering as a process which can greatly benefit from the visual approach proposed in earlier years. Role engineering is definitely an active research topic with a high interest from both academy and industry, as witnessed by the rich literature. Our contributions, other than being useful for role engineering, can have interesting applications in other fields as well. For instance the query generation time is one of the disadvantages associated with databases. The paper proposes a novel solution for this problem by creating a text file instead of databases. All the algorithms will be dealing with the text files only. In particular, homogeneous sub matrices indicate subsets of rows co expressed under the same conditions columns. In this case, each transaction corresponds to a row and each item corresponds to column of the matrix. As for future work, our solutions can be extended in several directions. Approximated representations of data are just some examples of possible directions to investigate. Besides partitioning, as suggested in this paper, alternative representations might be taken into account to provide a compact representation of the information.

REFERENCES

Colantonio A, Di Pietro R, Ocello A, and Verde N. V, Taming Role Mining Complexity in RBAC, Computers Security, vol. 29, pp. 548- 564, 2010.
Colantonio A, Di Pietro R, Ocello A, and Verde N.V, Visual Role Mining: A Picture Is Worth a Thousand Roles, IEEE Transactions On Knowledge And Data Engineering, VOL. 24, NO. 6, pp. 1120- 1133, 2012.
Coyne E.J, Role-Engineering, Proc. ACM Workshop Role-Based Access Control (RBAC 95), pp. 15-16, 1995.
F. Chierichetti, R. Kumar, S. Pandey, and S. Vassilvitskii, Finding the Jaccard Median, Proc. 21st Ann. ACM-SIAM Symp. Discrete Algorithms (SODA 10), pp. 293-311, 2010.
Frank M, Basin D, and Buhmann J.M, A Class of Probabilistic Models for Role Engineering, Proc. 15th ACM Conf. Computer and Comm. Security (CCS 08), pp. 299-310, 2008.
Geerts F, Goethals B, and MielikaÂ¨inen T, Tiling Databases, Proc. Seventh Intl Conf. Discovery Science (DS 04), pp. 278-289, 2004.
Jin R, Xiang Y, Fuhry D, and Dragan F.F, Overlapping Matrix Pattern Visualization: A Hypergraph Approach, Proc. IEEE Intl Conf. Data Mining (ICDM 08), pp. 313-322, 2008.
Kuhlmann M, Shohat D, and Schimpf G, Role MiningRevealing Business Roles for Security Administration Using Data Mining Technology, Proc. Eighth ACM Symp. Access Control Models and Technologies (SACMAT 03), pp. 179-186, 2003.
Leung C.K.-S. and Carmichael C.L., FpViz: A Visualizer for Frequent Pattern Mining, Proc. ACM SIGKDD Workshop Visual Analytics and Knowledge Discovery (VAKD 09), pp. 30-39, 2009.
Molloy I, Li N, Li T, Mao Z, Wang Q, and Lobo J, Evaluating Role Mining Algorithms, Proc. 14th ACM Symp. Access Control Models and Technologies (SACMAT 09), pp. 95-104, 2009.
R. Santamaria, R. Theron, and L. Quintales, BicOverlapper: A Tool for Bicluster Visualization, Bioinformatics, vol. 24, no. 9, pp. 1212- 1213, 2008.
Vaidya J, Atluri V, and Guo Q, The Role Mining Problem: Finding a Minimal Descriptive Set of Roles, Proc. 12th ACM Symp. Access Control Models and Technologies (SACMAT 07), pp. 175-184, 2007.
Vaidya J, Atluri V, and Warner J, RoleMiner: Mining Roles Using Subset Enumeration, Proc. 13th ACM Conf. Computer and Comm. Security (CCS 06), pp. 144-153, 2006.

Visual Role Mining Using Adviser and Extraction Algorithm in Business Environment

Leave a Reply