DB Chapter 11 Concepts - Transactions

0.0(0)

Studied by 0 people

0.0(0)

Call with Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/35

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No study sessions yet.

36 Terms

New cards

transaction

a unit of program execution that accesses and possibly updates various data items

New cards

3 qualities of transactions

a transaction must see a consistent (stable) database before its execution → wait for finished transaction before starting/accepting another
during transaction execution, the database may be inconsistent
when the transaction is committed, the database must be consistent

New cards

For transactions, what are the two main issues to deal with?

failures of various kinds, such as hardware failures and system crashes (behind-the-scene workers deal w/this)
concurrent execution of multiple transactions

New cards

ACID Properties

to preserve integrity of data, the db system must ensure → atomicity, consistency, isolation, and durability

New cards

Atomicity

either all operations of the transaction are properly reflected in the db, or non are

i.e., all of it or none of it → especially after failure!

New cards

consistency

execution of a transaction in isolation preserves the consistency of the db

New cards

isolation

Although multiple transactions may execute concurrently, each transaction must be unaware of other concurrently executing transactions. Intermediate transaction results must be hidden form other concurrently executed transactions

New cards

What is another way to think of Isolation?

for every pair of transactions T_i and T_j, it appears to T_i that either T_j finished execution before T_i started, or T_j started execution after T_i finished

New cards

durability

after a transaction completes successfully, the changes it has made to the db persist, even if there are system failures

New cards

Example of a transaction and the ACID properties:

given the following fund transfer transaction → transaction to transfer $50 from account A to account B
1. read(A)
2. A := A - 50
3. write(A)
4. read(B)
5. B := B + 50
6. write(B)
how does it fulfill the atomicity requirement?

if the transaction fails after step 3 and before step 6, the system should ensure that its updates are not reflected in the db, else an inconsistency will result

New cards

Example of a transaction and the ACID properties:

given the following fund transfer transaction → transaction to transfer $50 from account A to account B
1. read(A)
2. A := A - 50
3. write(A)
4. read(B)
5. B := B + 50
6. write(B)
how does it fulfill the consistency requirement?

the sum of A and B is unchanged by the execution of the transaction

New cards

Example of a transaction and the ACID properties:

given the following fund transfer transaction → transaction to transfer $50 from account A to account B
1. read(A)
2. A := A - 50
3. write(A)
4. read(B)
5. B := B + 50
6. write(B)
how does it fulfill the isolation requirement?

if between steps 3 and 6, another transaction is allowed to access the partially updated db, it will see an inconsistent db (the sum A + B will be less than it should be) → can be ensured trivially by running transactions serially (one after the other)

however, important to note that executing multiple transactions concurrently has significant benefits

New cards

Example of a transaction and the ACID properties:

given the following fund transfer transaction → transaction to transfer $50 from account A to account B
1. read(A)
2. A := A - 50
3. write(A)
4. read(B)
5. B := B + 50
6. write(B)
how does it fulfill the durability requirement?

once the user has been notified that the transaction has completed (i.e., the transfer of the $50 has taken place), the updates to the db by the transaction must persist despite failures

New cards

What are the different transaction states (TS)?

active, partially committed, failed, aborted, committed

New cards

active TS

the initial state; the transaction stays in this state while it is executing

New cards

partially committed TS

after the final statement has been executed

New cards

failed TS

after the discovery that normal execution can no longer proceed (i.e., flight full, bad math, etc.)

New cards

aborted TS

After the transaction has been rolled back and the db restored to its state prior to the start of the transaction. Two options after it has been aborted:

restart the transaction - only if no internal logical error
kill the transaction

New cards

committed TS

after successful completion

New cards

serial execution

transactions are run one after the other → leads to CPU sitting idle

New cards

concurrent execution

transactions can run at the same time → faster and no idle BUT need to make sure there are no conflict

conflicts occurs from working on the same item or data

New cards

Advantages of running multiple transactions concurrently

increased processor and disk utilization, leading to better transaction throughput
- throughput = one transaction can be using the CPU while another is reading from or writing to the disk
reduced average response time for transactions
- short transactions need not wait behind long ones

New cards

Concurrency control schemes

mechanisms to achieve isolation

i.e., to control the interaction among the concurrent transactions in order to prevent them from destroying the consistency of the db

New cards

Schedules

sequences that indicate the chronological order in which instructions of concurrent transactions are executed

New cards

Properties of schedules

a schedule for a set of transactions must consist of all instructions of those transactions
must preserve the order in which the instruction appear in each individual transactions

New cards

What is one advantage of serial schedule over concurrent schedule?

serial schedule ensure that no problem/inconsistencies occur

New cards

tracing

how we can determine if a concurrent schedule is good or not BUT this method is inefficient

New cards

serializability

basic assumption - each transaction preserves db consistency
serial execution of a set of transactions preserves db consistency
a concurrent schedule is serializable if it is equivalent to any serial schedule

*NOTE: serializable = good b/c no conflicts!

New cards

What do we remove when checking serializability?

we ignore operations other than read or write instructions

our simplified schedules should consist of only read and write instructions?

New cards

Conflict serializability

Instructions I_i and I_j of transaction T_i and T_j respectively, conflict if and only if there exists some item Q accessed by both I_i and I_j, and at least one of these instructions wrote Q

I_i = read(Q), I_j = read(Q) → they don’t conflict
I_i = read(Q), I_j = write(Q) → they conflict
I_i = write(Q), I_j = read(Q) → they conflict
I_i = write(Q), I_j = write(Q) → they conflict

a conflict between I_i and I_j forces a (logical) temporal order between them → NOT CONFLICT SERIALIZABLE = BAD SCHEDULE

on the other hand, if not conflict, results would remain the same even if they had been interchanged in the schedule → conflict serializable = good schedule

New cards

Conflict equivalent

If a schedule S can be transformed into a schedule S’ by a series of swaps of non-conflicting instructions

schedule S is conflict serializable if it is conflict equivalent to a serial schedule S’

New cards

precedence graph

a direct graph where the vertices are the transactions (names)

draw an arc from T_i to T_j if the two transactions conflict and T_i access the data item on which the conflict arose earlier
label the arc by the item that was accessed

New cards

Testing for serializability w/a precedence graph

a schedule is conflict serializable if and only if its precedence graph is acyclic
- i.e., if the graph has a cycle, the schedule is BAD and can’t be converted into a serial schedule (not conflict serializable)
- Cycle creates a conflict!!!

New cards

serializability order?

can be obtained by a topological sorting of the precedence graph (if it is acyclic)

this is a linear order consistent w/the partial order of the graph

*Bottom line: use topological sort to get serial schedule

New cards

Concurrency Control vs Serializability Tests

tests for serializability help understand why a concurrency control protocol is correct
testing a schedule for serializability after it has been executed is a little too late
Goal - to develop concurrency control protocols that will assure (before) serializability.
- They will generally not examine the precedence graph as it is being created; instead, a protocol will impose a discipline that avoids non-serializable schedules

New cards

Locking

For concurrency control → Each item has a lock; if item is unlocked transaction can use it. During use, item is locked & another transaction has to wait until the item is unlocked (i.e., previous transaction is finished)