A Novel Evaluation of Query Processing and Optimization in DBMS

Mohd Muntjir

doi:10.17577/IJERTV3IS111390

Volume 03, Issue 11 (November 2014)

A Novel Evaluation of Query Processing and Optimization in DBMS

DOI : 10.17577/IJERTV3IS111390

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 163
Total Downloads : 135
Authors : Mohd Muntjir
Paper ID : IJERTV3IS111390
Volume & Issue : Volume 03, Issue 11 (November 2014)
Published (First Online): 03-12-2014
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

A Novel Evaluation of Query Processing and Optimization in DBMS

Mohd Muntjir

College of Computers and Information Technology Tail University, Taif,

Saudi Arabia

Abstract- Query Processing is the systematic method of accessing the require information from a database system in an expected and reliable trend. Database systems must be agile to respond to requests for information from the user i.e. process queries. In huge database systems that may be running on unreliable and elusive domain it is no easy to outcome to dynamic database query plans based on information available exclusively at compile time. Obtaining and finding the database results in a prompt manner deals with the method of Query Optimization. Adequate processing of queries is a major requirement in various interactive environments that associates huge amounts of data. Dynamic query processing in environments such as the multimedia search, Web, and distributed systems has shown a main impact on performance and optimization. This paper will suggest and propose the main concepts of query processing and query optimization in the relational database systems. It is also describing and differentiating query-processing method in relational database systems.

Keywords: Query Processing, Query Optimization, and Database

INTRODUCTION

The basic part of any Database Management Systems is query processing and optimization. The outcomes of queries must be accessible in the timeframe required by the complying user [2]. Query processing techniques based on various design dimensions can be defined as [1]:
1. Query model:
  
  Processing techniques are defined according to the query model they consider. Few techniques recognize a selection query model, where outcomes are attached basically to base tuples. Alternative techniques speculate a join query model, where final outcomes are calculated over join results. A third section considers an aggregate query model, where we are responsive in ranking groups of tuples.
2. Implementation level:
  
  These processing techniques are defined according to their level of association with database systems. E.g., some techniques are designed in an application layer on top of the database systems, although others are implemented as query operators.
3. Data access methods:
  
  Processing method is classified according to the data access technique they consider to be accessible in the fundamentals data sources. For instance, some techniques define the availability of random access, although others are controlled to only classified access.
4. Ranking function:
  
  Processing techniques are classified based on the limitations they establish on the latent ranking (scoring) functions. Best suggested techniques expect monotone scoring functions.
5. Data and query uncertainty:
Processing techniques are defined based on the ambiguity elaborated in their data and query models. Many techniques establish exact results, whilst others order for proximate answers, or manage indefinite data.
QUERY PROCESSING

Query processing specifies to the range of activities integrated in extracting data from a database system. The activities comprise translation of queries in high-level database languages into expressions that can be used at the physical level of the file systems, and a variation of query-optimizing conversion, and real interpretation of queries. Furthermore, a database query is the vehicle for instructing a DBMS to update or fetch specific data to/from the physically stored intermediate. The real updating and fetching of data is established through different low- level procedures [10]. Instance of such operations for a relational DBMS can be relational algebra operations such as select, project, join, Cartesian product. [11]. As long as the DBMS is created and designed to process these low -level operations purposely, it can be quite the burden to a user to create requests to the DBMS in these designs.

There are three phases [12] that a query passes through during the DBMS processing of that query: 1. parsing and translation
1. Optimization3. Evaluation
  
  Furthermore, the first step in processing a query referred to a Database Management System is to convert the query into a form accessible by the query-processing engines. High- level query languages such as SQL defined a query as a sequence or string, of characters.
  
  Actual sequences of characters represent different types of tokens such as literal strings operators, keywords, operands, etc. Similar to all languages, there are rules (syntax and grammar) that control how the tokens can be integrated into (i. valid statements.
  
  The major job of the parser is to extract the tokens from the raw string of characters and translate them into the equivalent internal data elements and structures (i.e. query graph, query tree). The last task of the parser is to authenticate the validity and syntax of the real query strings. In second phase, the query processor implements rules to the internal data structures of the query to transform these structures into similar, but more adequate demonstrations. The standard can be based upon
  
  mathematical models of the relational algebra expression and tree, upon cost calculates of various algorithms used to operations or upon the semantics within the query and the relations it integrates. Electing the proper rules to implement, when to apply them and how they are implemented is the function of the query optimization engine.
MEASURES OF QUERY COST

Cost of query is basically measured as total overdue time for answering query in a database. Furthermore, expanse of query evaluation can be assumed in terms of a number of various assets, and CPU time to execute a query, along with disk accesses, and, the expanse of communication and broadcasting, in a distributed or parallel database system. The response time for a query-evaluation plan, assuming no other action is going on the computer systems, would scheduled for all these costs, and could be used as a better scope of the cost of the methods. In some database systems, although, disk approaches are usually the most extensive expanse, since disk accesses are low associated to in-memory operations. Further, the speed of CPU has been reestablishing much faster than have disk speed. Henceforth, it is more similar that the time spent in disk activity will continue to control and maintain the total time to execute queries in database systems. Decisively, manipulating and estimating the CPU time is basically conventional compared to calculating the disk-access expanses? Mostly the people consider that the disk-access cost feasible estimation of the cost of a query-evaluation strategy in databases.

Fig. 1. Query processing

The last step in processing a query is the evaluation phase. The best evaluation plan candidate developed by the optimization engine is selection and then execution. Note that there can stand various methods of executing a query. Beyond processing a query in easy consecutive methods, many of a querys individual operations can be oppressed in parallel either as autonomous processes or threads or as interdependent pipelines of processes. Unconcerned of the method selected, the permanent results should be same.

Consider for example in Fig.2 :

Fig. 2. A query-evaluation plan
QUERY ALGORITHMS
A dense index is a file with pairs of keys and pointers for every record in the data file. Each key in this file is related with a particular pointer to a record in the sorted data files.

index with a balanced-tree structure. Creating a search key value in a B+-tree is proportional to the height of the tree maximum number of seeks required is log(height). Although, this, on average is more than a single -level, dense index that requires only one seek. The B+-tree structure has a unique advantage in that it does not need rearrangement; it is self-developing because the tree is kept arranged during insertions and deletions.
CHOICE OF EVALUATION PLANS

In a DBMS, the query optimization engine originates a set of candidate evaluation plans. Although, soe will, in heuristic theory, creates faster, more sufficient executions. On the contrary, by previous historical summary, be more efficient than the theoretical model; this can very well be the case for queries dependent on the semantic nature of the data to be managed. Whereas, still others can be more efficient due to

outside agencies such as competing applications, network congestion, on the same CPU, etc.
CONCLUSIONS

In a DBMS, One of the major functional needs of a database system is its ability to process queries in convenient manner. It is basically true for huge, mission critical applications such as aeronautical applications, banking systems and weather forecasting, which can possess millions and even trillions of records. The basic need for faster and faster, immediate results never conclude. Hence, a big deal of research and resources is spent on creating and generating smarter, highly efficient query optimization engine for query optimization. Among them, some of the basic techniques of query processing and optimization have been presented and redefine in this paper.

REFERENCES

D. Calvanese, G. DeGiacomo, M. Lenzerini and M. Y. Vardi. Reasoning on Regular Path Queries. ACM SIGMOD Record, Vol. 32, No. 4, December 2003.
Henk Ernst Blok, DjoerdHiemstra and Sunil Choenni, Franciska de Jong, Henk M. Blanken and Peter M.G. Apers. Predicting the cost- quality trade-off for information retrieval queries: Facilitating database design and query optimization. Proceedings of the tenth international conference on Information and knowledge management, Pages 207 – 214.
Andrew Eisenberg and Jim Melton. Advancements in SQL/XML. ACM SIGMOD Record, Vol. 33, No. 3, September 2004..
AndrewEisenberg and Jim Melton. An Early Look at XQuery API for JavaTM (XQJ). ACM SIGMOD Record, Vol. 33, No. 2
RamezElmasri and Shamkant B. Navathe. Fundamentals of Database Systems, second edition. Addison-Wesley Publishing Company.
DonaldKossmann and Konrad Stocker. Iterative Dynamic Programming: A new Class ofQuery Optimization Algorithms. ACM Transactions on Database Systems, Vol. 25, No. 1, March 2000, Pages 43- 82.
Chiang Lee, Chi – Sheng Shih and Yaw – Huei Chen. A Graph-theoritic model for optimizing queries involving methods. The VLDB Journal The International Journal on Very Large Data Bases, Vol. 9,Issue 4, Pages327 – 343.
Hsiao-Fei Liu, Ya – Hui Chang and Kun-Mao Chao. An Optimal Algorithm for Querying Tree Structures and its Applications in Bioinformatics. ACM SIGMOD Record Vol. 33, No. 2, June 2004.
Reza Sadri, Carlo Zaniolo, Amir Zarkesh and JafarAdibi. Expressing and Optimizing Transactions on Database Systems, Vol. 29, Issue 2, Pages 282 – 318.
Reza Sadri, Carlo Zaniolo, Amir Zarkesh and JafarAdibi. Optimization of Sequence Queries in Database Systems. In Proceedings of the twentieth ACM SIGMOD -SIGACT-SIGART symposium on Principles of database systems, May 2001, Pages 71 -81.
Thomas Schwentick. XPath Query Containment. ACM SIGMOD Record, Vol. 33, No. 1, March 2004.
AviSilbershatz, Hank Korth and S. Sudarshan.Database Systems Concepts,7thEditions.McGraw Hill.
Dimitri Theodoratos and WugangXu. Constructing Search Spaces for Materialized View Selection. Proceedings of the 7th ACM international workshop on Data warehousing and OLAP, Pages 112 – 121.
Jingren Zhou and Kenneth A. Ross. Buffering Database Operations for Enhanced Instruction Cache Performance. Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 2004, Pages 191 202.

A Novel Evaluation of Query Processing and Optimization in DBMS

Leave a Reply