Data degradation explained

Data degradation is the gradual corruption of computer data due to an accumulation of non-critical failures in a data storage device. It is also referred to as data decay, data rot or bit rot.[1] The result is a decline in data quality over time, even when the data is not in use.

Primary storage

Data degradation in dynamic random-access memory (DRAM) can occur when the electric charge representing a bit disperses, possibly altering program code or stored data. The contents of DRAM may also be altered by cosmic rays[2] or other high-energy particles. Such data degradation is known as a soft error.[3] ECC memory can be used to mitigate this type of data degradation.
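The idea behind error-correcting memory can be illustrated with a toy Hamming(7,4) code, which stores three parity bits alongside four data bits so that any single flipped bit can be located and corrected. This is only a minimal sketch: the function names are hypothetical, and real ECC memory generally uses wider codes over whole memory words rather than this small example.

```python
def hamming74_encode(data_bits):
    """Encode 4 data bits as a 7-bit codeword laid out as p1 p2 d1 p3 d2 d3 d4."""
    d1, d2, d3, d4 = data_bits
    p1 = d1 ^ d2 ^ d4  # parity over codeword positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # parity over codeword positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4  # parity over codeword positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(codeword):
    """Return (data_bits, error_position); error_position is 0 if no error was detected."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    # The syndrome, read as a binary number, is the 1-based position of the flipped bit.
    error_position = s1 + 2 * s2 + 4 * s3
    if error_position:
        c[error_position - 1] ^= 1  # correct the single-bit error
    return [c[2], c[4], c[5], c[6]], error_position


stored = hamming74_encode([1, 0, 1, 1])
stored[4] ^= 1  # simulate a soft error in one stored bit
recovered, position = hamming74_decode(stored)
print(recovered, position)
```

A code of this kind corrects only one flipped bit per codeword; two simultaneous flips in the same codeword would be miscorrected, which is why practical schemes add further parity to at least detect double errors.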

Secondary storage

Data degradation results from the gradual decay of storage media over the course of years or longer. Causes vary by medium:

Solid-state media
  • EPROMs, flash memory and other solid-state storage devices store data using electrical charges, which can slowly leak away due to imperfect insulation. Modern flash controller chips account for this leakage by retrying reads at successively lower threshold voltages until ECC passes, which extends the usable lifetime of the data. Multi-level cells, which have a much smaller margin between voltage levels, cannot be considered stable without this functionality.[4]
  • The chip itself is not damaged by this charge leakage, so reprogramming it approximately once per decade prevents decay. An undamaged copy of the master data is required for the reprogramming. A checksum can be used to verify that the on-chip data is not yet damaged and is ready for reprogramming.
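The checksum test described in the list above can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, SHA-256 is one possible choice of checksum, and the digest is assumed to have been recorded when the data was first programmed.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def is_intact(image: bytes, expected_digest: str) -> bool:
    """True if the image still matches the digest recorded at programming time."""
    return sha256_hex(image) == expected_digest


def choose_refresh_source(chip_image: bytes, master_image: bytes, expected_digest: str) -> bytes:
    """Pick the image to write back: the chip's own data if intact, else the master copy."""
    if is_intact(chip_image, expected_digest):
        return chip_image
    if is_intact(master_image, expected_digest):
        return master_image
    raise ValueError("no undamaged copy available for reprogramming")


master = b"firmware-image" * 64
digest = sha256_hex(master)  # recorded when the chip was first programmed

decayed = bytearray(master)
decayed[100] ^= 0x01  # simulate one leaked bit on the chip
source = choose_refresh_source(bytes(decayed), master, digest)
print(source == master)
```

Checking before rewriting matters because refreshing a chip from its own already-damaged contents would simply make the damage permanent.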
Magnetic media
  • Magnetic media, such as hard disk drives, floppy disks and magnetic tapes, may experience data decay as bits lose their magnetic orientation. Higher temperatures speed up the rate of magnetic loss. As with solid-state media, rewriting the data is useful as long as the medium itself is not damaged (see below). Modern hard drives use giant magnetoresistance and have a longer magnetic lifespan, on the order of decades. They also automatically correct any errors detected by ECC by rewriting the affected data. However, their reliance on a factory-written servo track can complicate data recovery if that track becomes unreadable.
  • Floppy disks and tapes are poorly protected against ambient air. In warm or humid conditions, they are prone to physical decomposition of the storage medium.[5][6]
Optical media
  • Optical media, such as CD-R, DVD-R and BD-R, may experience data decay from the breakdown of the storage medium. This can be mitigated by storing discs in a dark, cool, low-humidity location. "Archival quality" discs are available with an extended lifetime, but are still not permanent. However, data integrity scanning that measures the rates of various types of errors can predict data decay on optical media well before uncorrectable data loss occurs.[7]
  • Both the disc dye and the disc backing layer are potentially susceptible to breakdown. Early cyanine-based dyes used in CD-R were notorious for their lack of UV stability. Early CDs also suffered from CD bronzing, which is related to a combination of bad lacquer material and failure of the aluminum reflection layer.[8] Later discs use more stable dyes or forgo them for an inorganic mixture. The aluminum layer is also commonly swapped out for gold or silver alloy.
Paper media
  • Paper media, such as punched cards and punched tape, may literally rot. Mylar punched tape is another approach that does not rely on electromagnetic stability. Degradation of books and printing paper is primarily driven by acid hydrolysis of glycosidic bonds within the cellulose molecule as well as by oxidation;[9] degradation of paper is accelerated by high relative humidity, high temperature, as well as by exposure to acids, oxygen, light, and various pollutants, including various volatile organic compounds and nitrogen dioxide.[10]

Example

Below are several digital images illustrating data degradation, all consisting of 326,272 bits. The original photo is displayed first. In the next image, a single bit was changed from 0 to 1. In the next two images, two and three bits were flipped. On Linux systems, the binary difference between files can be revealed using a file-comparison command.
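The effect of flipping individual bits, and of comparing two files byte by byte, can be reproduced with a short sketch. The buffer below is a zero-filled stand-in of the same size as the images described above (326,272 bits, or 40,784 bytes), not the actual photo, and the function names are hypothetical.

```python
def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with one bit inverted (bit 0 is the MSB of byte 0)."""
    buf = bytearray(data)
    buf[bit_index // 8] ^= 1 << (7 - bit_index % 8)
    return bytes(buf)


def differing_bytes(a: bytes, b: bytes):
    """List (offset, byte_in_a, byte_in_b) for every position where the inputs differ."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


original = bytes(326_272 // 8)  # placeholder data: 40,784 zero bytes

one_flip = flip_bit(original, 12_345)
three_flips = original
for index in (1_000, 150_000, 300_000):
    three_flips = flip_bit(three_flips, index)

print(differing_bytes(original, one_flip))
print(len(differing_bytes(original, three_flips)))
```

In a compressed image format a single changed byte can alter how all subsequent data is decoded, which is why a one-bit difference in the file can produce a visibly damaged picture.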

Causes

This deterioration can be caused by a variety of factors that impact the reliability and integrity of digital information, including physical factors, software errors, security breaches, human error, obsolete technology, and unauthorized access incidents.[11][12][13][14]

Hardware failures

Most disk, disk controller and higher-level systems are subject to a slight chance of unrecoverable failure. With ever-growing disk capacities, file sizes, and increases in the amount of data stored on a disk, the likelihood of the occurrence of data decay and other forms of uncorrected and undetected data corruption increases.[15]
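The way this likelihood grows with the amount of data can be made concrete with a simple model. Assuming, purely for illustration, that each bit fails independently with the same small probability p, the chance of at least one error among n bits is 1 - (1 - p)^n. The error rate used below is a hypothetical figure chosen for the example, not a measured value for any device.

```python
import math


def prob_at_least_one_error(bits: float, error_rate_per_bit: float) -> float:
    """Probability of one or more errors among `bits` independent bits.

    Computes 1 - (1 - p)**n via log1p/expm1 to stay accurate for very small p.
    """
    return -math.expm1(bits * math.log1p(-error_rate_per_bit))


ASSUMED_RATE = 1e-15  # hypothetical per-bit error probability (an assumption)

for terabytes in (1, 10, 100):
    bits = terabytes * 8e12  # 1 TB taken as 10**12 bytes
    probability = prob_at_least_one_error(bits, ASSUMED_RATE)
    print(f"{terabytes:>4} TB: {probability:.4f}")
```

Under this model the probability rises roughly in proportion to the volume of data while it is small, then approaches 1 as the volume keeps growing. Real devices do not fail independently bit by bit, so the figures indicate only the trend.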

Low-level disk controllers typically employ error correction codes (ECC) to correct erroneous data.[16]

Higher-level software systems may be employed to mitigate the risk of such underlying failures by increasing redundancy and implementing integrity checking, error correction codes and self-repairing algorithms.[17] The ZFS file system was designed to address many of these data corruption issues.[18] The Btrfs file system also includes data protection and recovery mechanisms,[19] as does ReFS.[20]
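The combination of redundancy, integrity checking and self-repair can be sketched with a toy block store that keeps two copies of every block plus a checksum. This is an illustrative assumption about the general approach, not a description of how ZFS, Btrfs or ReFS are implemented; the class and its methods are hypothetical, and the checksums are assumed to be stored somewhere that is not itself corrupted.

```python
import hashlib


class MirroredStore:
    """Toy block store: several replicas per block, verified against a checksum on read."""

    def __init__(self, copies: int = 2):
        self.replicas = [dict() for _ in range(copies)]
        self.checksums = {}

    def write(self, key: str, data: bytes) -> None:
        self.checksums[key] = hashlib.sha256(data).digest()
        for replica in self.replicas:
            replica[key] = data

    def read(self, key: str) -> bytes:
        expected = self.checksums[key]
        # Integrity check: find the first replica whose contents match the checksum.
        good = next(
            (r[key] for r in self.replicas
             if hashlib.sha256(r[key]).digest() == expected),
            None,
        )
        if good is None:
            raise OSError(f"all replicas of {key!r} are corrupt")
        # Self-repair: overwrite any replica that no longer matches the good copy.
        for replica in self.replicas:
            if replica[key] != good:
                replica[key] = good
        return good


store = MirroredStore()
store.write("block-0", b"important data")
store.replicas[0]["block-0"] = b"important dbta"  # simulate silent corruption
print(store.read("block-0"))
print(store.replicas[0]["block-0"])
```

Without the checksum, a mirror alone could report that its two copies disagree but could not tell which one is correct.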

Notes and References

1. Rouse, Margaret. "What is Bit Rot?". Techopedia Dictionary. 25 March 2020. 10 April 2024.
2. "The Invisible Neutron Threat National Security Science Magazine". Los Alamos National Laboratory. 2020-03-13.
3. O'Gorman, T. J.; Ross, J. M.; Taber, A. H.; Ziegler, J. F.; Muhlfeld, H. P.; Montrose, C. J.; Curtis, H. W.; Walsh, J. L. "Field testing for cosmic ray soft errors in semiconductor memories". IBM Journal of Research and Development. January 1996. 40 (1): 41–50. doi:10.1147/rd.401.0041.
4. Li, Qianhui; Wang, Qi; Yang, Liu; Yu, Xiaolei; Jiang, Yiyang; He, Jing; Huo, Zongliang. "Optimal read voltages decision scheme eliminating read retry operations for 3D NAND flash memories". Microelectronics Reliability. April 2022. 131: 114509. doi:10.1016/j.microrel.2022.114509.
5. Riss, Dan. "Conserve O Gram (number 19/8) Preservation Of Magnetic Media". nps.gov. National Park Service / Department of the Interior (US). July 1993. p. 2. Harpers Ferry, West Virginia. Quote: "The longevity of magnetic media is most seriously affected by processes that attack the binder resin. Moisture from the air is absorbed by the binder and reacts with the resin. The result is a gummy residue that can deposit on tape heads and cause tape layers to stick together. Reaction with moisture also can result in breaks in the long molecular chains of the binder. This weakens the physical properties of the binder and can result in a lack of adhesion to the backing. These reactions are greatly accelerated by the presence of acids. Typical sources would be the usual pollutant gases in the air, such as sulphur dioxide (SO2) and nitrous oxides (NOx), which react with moist air to form acids. Though acid inhibitors are usually built into the binder layer, over time they can lose their effectiveness."
6. "Preserving magnetic media". National Archives of Australia. 3 November 2020. Quote: "High temperature and humidity and fluctuations may cause the magnetic and base layers in a reel of tape to separate, or cause adjacent loops to block together. High temperatures may also weaken the magnetic signal, and ultimately de-magnetise the magnetic layer."
7. "QPxTool glossary". qpxtool.sourceforge.io. QPxTool. 22 July 2020. 2008-08-01. QPx-Glossary.
8. "Bronzed CD alert!". IASA Information Bulletin no. 22. July 1997. 3 August 2007. Archived at https://web.archive.org/web/20110722224026/http://www.iasa-web.org/content/information-bulletin-no-22-july-1997 (22 July 2011); original link dead.
9. Małachowska, Edyta; Pawcenis, Dominika; Dańczak, Jacek; Paczkowska, Joanna; Przybysz, Kamila. "Paper Ageing: The Effect of Paper Chemical Composition on Hydrolysis and Oxidation". Polymers. 26 March 2021. 13 (7): 1029. doi:10.3390/polym13071029. PMID 33810293. PMC 8036582.
10. Menart, Eva; De Bruin, Gerrit; Strlič, Matija. "Dose–response functions for historic paper". Polymer Degradation and Stability. 9 September 2011. 96 (12): 2029–2039. doi:10.1016/j.polymdegradstab.2011.09.002. 5 June 2023.
11. Li, Sheng Lance. "What is data decay?". Tech in Asia. 22 July 2015. 10 April 2024.
12. "Definition of data degradation". PC Magazine. 10 April 2024.
13. Hakob, Mike. "Data Decay: What are the Causes?". FormStory. 10 April 2024.
14. Triches, Robert. "Forskare: Billiga cd-skivor håller bara i två år". Aftonbladet. 16 March 2006. 10 April 2024.
15. Gray, Jim; van Ingen, Catharine. "Empirical Measurements of Disk Failure Rates and Error Rates". Microsoft Research Technical Report MSR-TR-2005-166. December 2005. 4 March 2013.
16. "ECC and Spare Blocks help to keep Kingston SSD data protected from errors". Kingston Technology Company. 30 March 2021.
17. Salter, Jim. "Bitrot and atomic COWs: Inside 'next-gen' filesystems". 15 January 2014. 15 January 2014. Archived at https://web.archive.org/web/20150306225935/http://arstechnica.com/information-technology/2014/01/bitrot-and-atomic-cows-inside-next-gen-filesystems/ (6 March 2015); original link dead.
18. Bonwick, Jeff. "ZFS: The Last Word in File Systems". Storage Networking Industry Association (SNIA). 4 March 2013. Archived at https://web.archive.org/web/20130921055345/http://www.snia.org/sites/default/files2/sdc_archives/2009_presentations/monday/JeffBonwickzfs-Basic_and_Advanced.pdf (21 September 2013); original link dead.
19. "btrfs Wiki: Features". The btrfs Project. 19 September 2013.
20. Wlodarz, Derrick. "Windows Storage Spaces and ReFS: is it time to ditch RAID for good?". Betanews. 15 January 2014. 2014-02-09.