Parallel Databases: Architecture, Advantages, and Implementation

WHAT IS PARALLEL DATABASE?

Parallel Database improve processing and input/output speeds by using multiple CPU and disks in parallel.

A Parallel Database system seeks to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. In Parallel processing, many operations are performed simultaneously, as opposed to serial processing, in which the computational steps are performed sequentially.

Parallel Databases Architecture, Advantages, and Implementation
Parallel Databases Architecture, Advantages, and Implementation

Organizations of every size benefit from databases because they improve the management of information. The database has a server, a specialized program that oversees all user requests.  Organization use parallel database approach for a large user base and millions of records to process. They are fast, flexible and reliable.

Architecture For Parallel Databases

There are three main architectures for building parallel DBMS

1. Shared Memory.

2. Shared Disk System.

3. Shared Nothing System

1. Shared Memory System:

This is where multiple processors are attached to an interconnected network and access a common region of memory.

Advantages of Share Memory System

  1. It is closer to conventional machine and easy to program.
  2. Overhead is low.
  3. OS Services are leveraged to utilized the additional CPU

Disadvantages of: Share Memory System

  1. It leads to bottleneck problem.
  2. Expensive to build.
  3. It is less sensitive to partitioning

2. Shared Disk System:

This is where each processor has its main memory and direct access to all disks through an interconnected network.

Advantages of Shared Disk System:

The same with shared memory

Disadvantages of Shared Disk System:

  1. More interference
  2. Increases N/ W band width.
  3. Shared disk is less sensitive to partitioning.

3. Shared Nothing:

This is where each processor has local main memory and disk space, but no two processors can access the same storage area and all communication between processor is through a network connection. It has its own mass storage as well as main memory.

Advantages of: Shared Nothing

  1. It provides linear scale up and linear speed up.
  2. Shared nothing benefit from “ good” partitioning.
  3. Cheap to build.

Shared Nothing Disadvantages:

  1. It is hard to program.
  2. Addition of new nodes requires reorganization.

PARALLEL QUERY EVALUATION

A relational query execution plan is a graph/ tree of relational algebra operators ( based on this operators can execute in parallel) and the operators in a graph can be executed in parallel. If an operator consumes the output of a second operator, we have pipelined parallelism.

Data partitioning: In this case large database are partitioned horizontally across several disk, this enables us to exploit the I/O bandwidth of the disk by reading and writing them in parallel. This can be done in the following ways:

  1. Round Robin Partitioning:If there are n processors, the 1th tuple is assigned to processor i mod n round-robin partitioning. Round-robin partitioning is suitable for efficiently evaluating queries that access the entire relation. If only a subset of the tuples is required, hash partitioning and range partitioning are better than round-robin partitioning.
  2. Hash partitioning: A hash function is applied to (selected fields of) a tuple to determine its processor.Hash partitioning has the additional virtue that it keeps data evenly distributed even if the data grows and shrinks over time.
  3. Range Partitioning: Tuples are sorted and ranges are chosen for the sort key values so that each range contains roughly the same number of tuples, tuples in range, I re assigned to processor i. Range Partitioning can lead to data skew.

The Advantages of Parallel Databases

A parallel database runs on many computers at the same time.

  1. High Performances
  2. Speed
  3. Reliability
  4. Capacity

The  Disadvantages of Parallel database

1. Implementation is highly expensive.

2. Handling Parallel database simultaneously is difficult and complex.

3. A lot of resources are needed to support and maintain the database.

EVALUATION:

  1. Define Parallel query evaluation.
  2. State three methods data can be partitioned.
  3. What are the advantages and disadvantages of parallel database.

READING ASSIGNMENT

Understanding Data Processing for senior secondary schoolsby Dinehin Victoria pages 269 – 271

WEEKEND ASSIGNMENT:

  1. ………..system seeks to improve performance through parallelization of various operations. (a) parallel database (b) distributed database (c) relational database (d) flat database.
  2. The architecture where multiple processors are attached to an interconnected network and access a common region of memory is called ………. (a) shared memory (b) shared disk system      (c) share nothing(d) all of the above
  3. In ……….partitioning, tuples are sorted and ranges are chosen for the sort key values.(a) round robin      (b) hash       (c) range        (d) table
  4. ………..Partitioning is suitable for efficiently evaluating queries that access the entire relation. (a) range  (b) round robin   (c) hash   (d)  query
  5. The following are examples of parallel database except.  (a) implementation is highly expensive  (b) speed   (c) reliability   (d)  capacity

Questions

  1. Define parallel database.
  2. Enumerate the three architectures for database.
  3. State three methods data can be partitioned.
  4. What are the advantages and disadvantages of parallel database.

See also:

Crash Recovery in Databases: Definition, Phases, and ARIES Explained

IMG 20161027 WA0019
I.T Entrepreneur | Digital Marketing Specialist | Website Developer, | Computer Lecturer | Content Creator at Webbpedia Learning | +2347084530359 | admin@webbpedia.com

Samuel Okeke is a highly experienced and skilled Website Developer, Computer Lecturer, IT Instructor, Digital Marketing Expert, Computer Engineer, and Author with over a decade of experience in the educational, digital marketing and IT sectors. He has proven track records of developing and sustaining successful educational projects, including Acadlly, Audio School, and Certifications Exam Prep. He possess a strong passion for education and a commitment to making a positive impact on people and society.

Related Posts

Leave a Reply

Your email address will not be published. Required fields are marked *