Question 1

is a collection of multiple, logically interrelated databases spread across physically interconnected locations (nodes), appearing as a single, unified system to the user.

Accepted Answer

distributed database

Question 2

architecture consists of multiple interconnected nodes, where each node has its own local database and DBMS, but all nodes work together as a single system.

Accepted Answer

Distributed Database Management System (DDBMS)

Question 3

Basic Structure of DDBMS Architecture

Accepted Answer

Client → Network → Multiple Database Nodes

Question 4

Components of DDBMS

Accepted Answer

Query Processor
Transaction Manager
Data Manager
Communication Manager

Question 5

Handles user queries and decides where (which node) the data should be retrieved from

Accepted Answer

Query Processor

Question 6

Manages transactions across multiple nodes and ensures consistency during execution

Accepted Answer

Transaction Manager

Question 7

Responsible for storing, retrieving, and updating data within each node

Accepted Answer

Data Manager

Question 8

Handles communication between nodes to exchange data and coordinate operations.

Accepted Answer

Communication Manager

Question 9

Importance of Distributed Database Systems

Accepted Answer

Scalability
High Availability
Fault Tolerance

Question 10

Ability to add more servers (nodes) to handle growth

Accepted Answer

Scalability

Question 11

System remains accessible even if some nodes fail, so users can still access data at any time

Accepted Answer

High Availability

Question 12

System can continue operating despite failures

Accepted Answer

Fault Tolerance

Question 13

Types of Distributed Database Systems

Accepted Answer

Homogeneous Database
Heterogeneous Database
Client-Server Distributed Database System
Peer-to-Peer Distributed Database System
Cloud-Based Distributed Database System

Question 14

All sites use the same DBMS, data model, and structure, making communication and data sharing easier. The data may be distributed, but the system is uniform across all locations.

Accepted Answer

Homogeneous Database

Question 15

Different sites use different DBMSs, schemas, or data models, which makes integration and query processing more complex. Special middleware or translators are needed for communication between systems.

Accepted Answer

Heterogeneous Database

Question 16

The server manages and stores the database, while clients send requests and receive results over a network. It provides centralized control with distributed access.

Accepted Answer

Client-Server Distributed Database System

Question 17

All nodes have equal roles, and each can store data and process requests without a central server. This structure improves fault tolerance and decentralization.

Accepted Answer

Peer-to-Peer Distributed Database System

Question 18

These databases are hosted on cloud platforms and distributed across multiple regions for scalability and availability. They are often provided as managed services, reducing infrastructure management.

Accepted Answer

Cloud-Based Distributed Database System

Question 19

A relation is divided into smaller parts called fragments, and each fragment is stored at the site where it is needed. The fragments must be designed so that the original relation can be reconstructed without losing any data.

Accepted Answer

Data Fragmentation

Question 20

Divides a table into groups of rows (tuples). Each fragment contains a subset of records. Fragments are commonly based on location or on a condition (e.g., age > 18).

Accepted Answer

Horizontal Fragmentation
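
As an illustration of the definition above, here is a minimal Python sketch of horizontal fragmentation. The `customers` rows, their column names, and the helper `horizontal_fragments` are hypothetical and chosen only for this example; the split uses the age > 18 condition from the question, and the original relation is rebuilt as the union of the fragments.

```python
# Hypothetical sample relation; the column names are illustrative only.
customers = [
    {"id": 1, "name": "Ana", "city": "Manila", "age": 17},
    {"id": 2, "name": "Ben", "city": "Cebu", "age": 25},
    {"id": 3, "name": "Cara", "city": "Manila", "age": 31},
]


def horizontal_fragments(rows, predicate):
    """Split rows into (matching, non-matching) fragments by a row condition."""
    matching = [row for row in rows if predicate(row)]
    rest = [row for row in rows if not predicate(row)]
    return matching, rest


# Each fragment keeps every column but only a subset of the rows.
adults, minors = horizontal_fragments(customers, lambda row: row["age"] > 18)

# Reconstruction: the union of the fragments gives back the original relation.
reconstructed = sorted(adults + minors, key=lambda row: row["id"])
```

Because every row satisfies the condition or its negation, the two fragments are disjoint and together cover the whole table, which is what makes lossless reconstruction possible.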

Question 21

Divides a table into groups of columns (attributes). Each fragment contains selected attributes.

Accepted Answer

Vertical Fragmentation
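
A comparable sketch for vertical fragmentation, again in Python. The `employees` rows and the helpers `vertical_fragment` and `join_on_key` are hypothetical. The sketch assumes that every fragment repeats the primary key (`id` here), which is the usual way to make the original relation recoverable by a join.

```python
# Hypothetical sample relation; the column names are illustrative only.
employees = [
    {"id": 1, "name": "Ana", "salary": 50000},
    {"id": 2, "name": "Ben", "salary": 62000},
]


def vertical_fragment(rows, key, attributes):
    """Keep only the key column plus the selected attributes of each row."""
    return [{key: row[key], **{a: row[a] for a in attributes}} for row in rows]


# Each fragment holds every row but only some of the columns.
names = vertical_fragment(employees, "id", ["name"])
salaries = vertical_fragment(employees, "id", ["salary"])


def join_on_key(left, right, key):
    """Rebuild full rows by matching the two fragments on the shared key."""
    right_by_key = {row[key]: row for row in right}
    return [{**row, **right_by_key[row[key]]} for row in left]


reconstructed = join_on_key(names, salaries, "id")
```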

Question 22

ensure that data is available across multiple locations while remaining correct and synchronized.

Accepted Answer

Replication and Data Consistency

Question 23

is the process of creating and maintaining copies of data across multiple nodes in a distributed system to improve availability, fault tolerance, and performance.

Accepted Answer

Replication

Question 24

stores a copy of all data on every node, ensuring high availability and reliability but increasing storage and update overhead.

Accepted Answer

Full Replication

Question 25

stores only selected data on certain nodes, reducing storage costs while requiring careful planning to maintain data availability.

Accepted Answer

Partial Replication
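
The trade-off between full and partial replication can be sketched as a placement map from nodes to the data items they store. This is a simplified Python illustration; the node names, the item names, and all three helper functions are hypothetical.

```python
NODES = ["node_a", "node_b", "node_c"]  # hypothetical node names


def full_replication(items, nodes):
    """Every node stores a copy of every item."""
    return {node: set(items) for node in nodes}


def partial_replication(placement, nodes):
    """Each item is stored only on the nodes listed for it in `placement`."""
    stored = {node: set() for node in nodes}
    for item, holders in placement.items():
        for node in holders:
            stored[node].add(item)
    return stored


def available_items(stored, failed):
    """Items still readable when the nodes in `failed` are down."""
    alive = [items for node, items in stored.items() if node not in failed]
    return set().union(*alive) if alive else set()


full = full_replication(["orders", "users"], NODES)
partial = partial_replication(
    {"orders": ["node_a"], "users": ["node_b", "node_c"]}, NODES
)
```

In this sketch, `full` keeps both items readable even when two of the three nodes fail, while `partial` loses `orders` as soon as `node_a` fails, because that item has only one copy. That is the "careful planning" the definition refers to.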

Question 26

ensures that all copies of data in a distributed database have the same value at a given time. It is difficult to maintain due to network delays, system failures, and simultaneous updates.

Accepted Answer

Data Consistency

Question 27

ensures that all replicas always reflect the latest updated data before any read operation. This guarantees accurate results but may reduce system performance due to synchronization delays.

Accepted Answer

Strong Consistency

Question 28

allows temporary differences between replicas but ensures they become consistent over time. It improves performance and availability but may return outdated data during synchronization.

Accepted Answer

Eventual Consistency
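
To make the "temporary differences between replicas" concrete, here is a small Python sketch of eventual consistency. It assumes a single writer, a version counter, and an explicit `synchronize` step standing in for background propagation. The classes `Replica` and `EventuallyConsistentStore` are hypothetical and not taken from any real database.

```python
class Replica:
    def __init__(self):
        self.value = None
        self.version = 0

    def apply(self, value, version):
        # Keep an update only if it is newer than what the replica already has.
        if version > self.version:
            self.value, self.version = value, version


class EventuallyConsistentStore:
    def __init__(self, replica_count):
        self.replicas = [Replica() for _ in range(replica_count)]
        self.pending = []  # updates not yet delivered to the other replicas
        self.clock = 0

    def write(self, value):
        # The write is acknowledged after reaching one replica only.
        self.clock += 1
        self.replicas[0].apply(value, self.clock)
        self.pending.append((value, self.clock))

    def read(self, replica_index):
        return self.replicas[replica_index].value

    def synchronize(self):
        # Stand-in for background propagation to the remaining replicas.
        for value, version in self.pending:
            for replica in self.replicas[1:]:
                replica.apply(value, version)
        self.pending.clear()


store = EventuallyConsistentStore(replica_count=3)
store.write("v1")
stale = store.read(2)   # replica 2 has not received the update yet
store.synchronize()
fresh = store.read(2)   # replica 2 now agrees with replica 0
```

A strongly consistent variant would apply the update to every replica inside `write` before acknowledging it, trading write latency for the guarantee that no read returns outdated data.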

Question 29

is a transaction that involves multiple databases or systems, ensuring that all operations either succeed or fail together. It is essential for maintaining data integrity across different locations.

Accepted Answer

Distributed Transaction

Question 30

Need for Distributed Transactions

Accepted Answer

Atomicity, Consistency, Isolation, and Durability (ACID)

Question 31

2PC is a protocol that ensures all participating systems agree before committing a transaction.
It consists of a prepare phase and a commit/rollback phase to guarantee atomic execution.

Accepted Answer

Two-Phase Commit (2PC)

Question 32

Two-Phase Commit (2PC)

Accepted Answer

➢ Step 1: Start Transaction (ST)
➢ Step 2: Execute Operations (EO)
➢ Step 3: Prepare Phase (Voting) (PP)
➢ Step 4: Commit Decision (CD)
➢ Step 5: Final Execution (FE)
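
The five steps can be sketched in Python as a coordinator function driving a list of participants. This is a simplified, failure-free illustration: the `Participant` class, its `can_commit` flag, and the function `two_phase_commit` are hypothetical, and a real protocol would also need logging, timeouts, and recovery.

```python
class Participant:
    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit  # hypothetical flag simulating a vote
        self.pending = []             # tentative, not yet permanent changes
        self.state = "active"

    def execute(self, operation):
        self.pending.append(operation)

    def prepare(self):
        # Vote yes only if this node is able to make its changes permanent.
        self.state = "prepared" if self.can_commit else "aborted"
        return self.can_commit

    def commit(self):
        self.state = "committed"

    def rollback(self):
        self.pending.clear()
        self.state = "aborted"


def two_phase_commit(participants, operation):
    # Steps 1-2: start the transaction and execute the operation tentatively.
    for p in participants:
        p.execute(operation)
    # Step 3: prepare phase, where every participant votes.
    votes = [p.prepare() for p in participants]
    # Step 4: commit decision, which requires a unanimous yes.
    decision = "commit" if all(votes) else "rollback"
    # Step 5: final execution of the same decision on every participant.
    for p in participants:
        if decision == "commit":
            p.commit()
        else:
            p.rollback()
    return decision


healthy = [Participant("a"), Participant("b")]
outcome_ok = two_phase_commit(healthy, "debit 100")

mixed = [Participant("a"), Participant("b", can_commit=False)]
outcome_fail = two_phase_commit(mixed, "debit 100")
```

A single "no" vote forces every participant to roll back, which is how the protocol keeps the transaction atomic across nodes.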

Question 33

Three-Phase Commit (3PC)

Accepted Answer

3PC improves 2PC by adding a pre-commit phase to reduce the chances of system blocking. It allows better fault tolerance by separating decision-making into more steps.
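
The extra pre-commit phase can be sketched in Python as follows. The phase names can-commit, pre-commit, and do-commit follow the common textbook description of 3PC. The `Participant` class and `three_phase_commit` function are hypothetical, and the timeout handling that lets real 3PC avoid blocking is omitted.

```python
class Participant:
    def __init__(self, name, vote_yes=True):
        self.name = name
        self.vote_yes = vote_yes  # hypothetical flag simulating a vote
        self.state = "initial"

    def can_commit(self):
        self.state = "ready" if self.vote_yes else "aborted"
        return self.vote_yes

    def pre_commit(self):
        # The participant now knows that every node voted yes.
        self.state = "pre-committed"

    def do_commit(self):
        self.state = "committed"

    def abort(self):
        self.state = "aborted"


def three_phase_commit(participants):
    # Phase 1 (can-commit): collect a vote from every participant.
    if not all([p.can_commit() for p in participants]):
        for p in participants:
            p.abort()
        return "aborted"
    # Phase 2 (pre-commit): announce the unanimous yes before committing.
    for p in participants:
        p.pre_commit()
    # Phase 3 (do-commit): make the changes permanent.
    for p in participants:
        p.do_commit()
    return "committed"


agreeing = [Participant("a"), Participant("b")]
result_ok = three_phase_commit(agreeing)

disagreeing = [Participant("a"), Participant("b", vote_yes=False)]
result_fail = three_phase_commit(disagreeing)
```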

DATABASE - 5

Distributed Database Management Systems
