In temporal databases, valid-time is the time period when an event happened or something was true in the real world, or more formally when a fact was valid in the modeled reality.
The valid-time period is an interval based on event times, which are referred to as event datetime in data vault.[1] Other names are application-time period[2] or real-world timeline. supports valid time through so-called application time-period tables.[3] [4] [5] [6] In a database table, valid-time is often represented by two extra table-columns, such as start_validtime
and end_validtime
. The time interval is closed at its lower bound (denoted by [
) and open at its upper bound (denoted by )
).
In integration layers (for example a data warehouse), the valid time is controlled by the source system which delivers data to the data warehouse.[7] For many reasons, the valid timeline is different from the transaction timeline (which is when data arrives in the warehouse), and it is important that the data warehouse is capable of unambiguously reporting what actually happened in the past by combining these two timelines. In bitemporal data models, valid time and transaction time can be represented two-dimensionally in a Cartesian coordinate system. When data are delivered from the integration layer and is to be represented in a presentation layer (often in a dimensional model or wide table) it is often desirable to have the data on only one timeline.
The term valid time was coined by Richard T. Snodgrass and his doctoral student (1986).[8]
As of December 2011, ISO/IEC 9075, Database Language Part 2: SQL/Foundation included clauses in table definitions to define "application-time period tables" (that is, valid-time tables).
Date | What happened in the real world | Database action | What the database shows | |
---|---|---|---|---|
John is born | Nothing | There is no person called John Doe | ||
John's father officially reports John's birth | Inserted:Person(John Doe, Smallville) | John Doe lives in Smallville | ||
After graduation, John moves to Bigtown, but forgets to register his new address | Nothing | John Doe lives in Smallville | ||
Nothing | Nothing | John Doe lives in Smallville | ||
John registers his new address | Updated:Person(John Doe, Bigtown) | John Doe lives in Bigtown | ||
John dies | Deleted:Person(John Doe) | There is no person called John Doe |
Valid time is the time for which a fact is true in the real world. In the example above, the Person table gets two extra fields, valid_from
and valid_to
, specifying when a person's address was valid in the real world. On 1975-04-04, John's father proudly registered his son's birth. An official will then insert a new entry to the database stating that John lives in Smallville from April, 3rd. Notice that although the data was inserted on the 4th, the database states that the information is valid since the 3rd. The official does not yet know if or when John will ever move to another place so in the database the valid_to
is filled with infinity (∞) or a very late date (like for example 2300-01-01). Resulting in this entry in the database:
Person(John Doe, Smallville,, ∞) |
On 1994-12-27 John reports his new address in Bigtown where he has been living since 1994-08-26. The Bigtown official does not change the address of the current entry of John Doe in the database. He adds a new one:
Person (John Doe, Big Town,, ∞) |
The original entry Person (John Doe, Smallville, 1975-04-03, ∞) is then updated (not removed!). Since it is now known that John stopped living in Smallville on 1994-08-26, the valid_to
entry can be filled in. The database now contains two entries for John Doe
Person(John Doe, Smallville,,) |
Person(John Doe, Bigtown,, ∞) |
When John dies the database is once more updated. The current entry will be updated stating the date of death as the last valid_to
for Bigtown, as John does not live in Bigtown any longer. No new entry is being added. The database now looks like this:
Person(John Doe, Smallville, |