Lossless join decomposition explained

r

into relations

r1,r2

such that a natural join of the two smaller relations yields back the original relation. This is central in removing redundancy safely from databases while preserving the original data.[1] Lossless join can also be called non-additive.[2]

Definition

A relation

r

on schema

R

decomposes losslessly onto schemas

R1

and

R2

if
\pi
R1

(r)\bowtie

\pi
R2

(r)=r

, that is

r

is the natural join of its projections onto the smaller schemas.A pair

(R1,R2)

is a lossless-join decomposition of

R

or said to have a lossless join with respect to a set of functional dependencies

F

if any relation

r(R)

that satisfies

F

decomposes losslessly onto

R1

and

R2

.[3]

Decompositions into more than two schemas can be defined in the same way.

Criteria

A decomposition

R=R1\cupR2

has a lossless join with respect to

F

if and only if the closure of

R1\capR2

includes

R1\setminusR2

or

R2\setminusR1

. In other words, one of the following must hold:[4]

(R1\capR2)\to(R1\setminusR2)\inF+

(R1\capR2)\to(R2\setminusR1)\inF+

Criteria for multiple sub-schemas

Multiple sub-schemas

R1,R2,...,Rn

have a lossless join if there is some way in which we can repeatedly perform lossless joins until all the schemas have been joined into a single schema. Once we have a new sub-schema made from a lossless join, we are not allowed to use any of its isolated sub-schema to join with any of the other schemas. For example, if we can do a lossless join on a pair of schemas

Ri,Rj

to form a new schema

Ri,j

, we use this new schema (rather than

Ri

or

Rj

) to form a lossless join with another schema

Rk

(which may already be joined (e.g.,

Rk,l

)).

Example

R=\{A,B,C,D\}

be the relation schema, with attributes,, and .

F=\{ABC\}

be the set of functional dependencies.

R1=\{A,B,C\}

and

R2=\{A,D\}

is lossless under because

R1\capR2=A

and we have a functional dependency

ABC

. In other words, we have proven that

(R1\capR2R1\setminusR2)\inF+

.[5] [6]

Notes and References

  1. Pohler . K . Lossless-Join Decomposition: applications in quantitative computing metrics . International Journal of Applied Computer Science . 2015 . 21 . 4 . 190–212.
  2. Book: Elmasri . Ramez . Fundamentals of database systems . 2016 . Pearson . 978-0133970777 . Seventh . Hoboken, NJ . 461.
  3. Book: Maier . David . The theory of relational databases . 1983 . Computer Science Press . 0-914894-42-0 . 101 . 16 August 2024.
  4. Book: Ullman . Jeffrey D. . Principles of Database and Knowledge-base Systems . 1988 . Computer Science Press . 0-88175188-X . 397 . 1 . 16 August 2024.
  5. Web site: Lossless-Join Decomposition. Cs.sfu.ca. 2016-02-07.
  6. Web site: www.data-e-education.com - Lossless Join Decomposition . 2014-02-12 . https://web.archive.org/web/20140221081148/http://www.data-e-education.com/E121_Lossless_Join_Decomposition.html . 2014-02-21 . dead.