Dell EMC Data Domain explained

Dell EMC Data Domain
Discontinued:2019 (Renamed)
Successor:Dell EMC PowerProtect DataDomain (Renamed)
Processor:x86
Releasedate: (as Data Domain DD series) (as EMC Data Domain)
Developer:Dell EMC (2016 - Present) EMC Corporation (2009-2016) Data Domain (2004 - 2009)
Type:Data-management Storage server
Website:delltechnologies.com/.../data-protection/data-domain-series/data-domain-dd6300-data-backup-appliance

Dell EMC Data Domain was Dell EMC’s data deduplication storage system. Development began with the founding of Data Domain, and continued since that company’s acquisition by EMC Corporation (and EMC’s later merger with Dell to form Dell EMC).

History

The technology started in a separate company, which was then acquired and re-branded twice.

Data Domain Corporation

Data Domain Corporation
Type:Subsidiary of EMC Corporation
Successor:Dell EMC Data Domain
Foundation:2001
Location:Santa Clara, California
Founder:Kai Li, Brian Biles
Key People:Kai Li, Brian Biles
Industry:deduplication
Fate:Acquired by EMC Corporation in 2009
Products:Data Duplication
Owner:Dell EMC
Homepage:www.emc.com/datadomain

The Data Domain Corporation was founded by Kai Li, Ben Zhu, and Brian Biles in 2001 as a company specializing in target-based data deduplication products for disk-based backup.[1] [2] [3] Hugo Patterson joined as chief architect 3 months after initial funding. The company started operations in a series of venture capital offices around Palo Alto, California, pre-funding at U.S. Venture Partners, where Zhu was an entrepreneur in residence (EIR), then at New Enterprise Associates (NEA), where Li was an EIR, and post-funding at Greylock Partners.NEA and Greylock provided Series A funding in 2002.[4]

The first product revenue was realized in the beginning of 2004.[5]

Funding, IPO and Acquisition

NEA and Greylock led the company’s $9.3 million Series A funding round in 2002. Sutter Hill Ventures led its $17 million Series B funding round in 2003, joined again by NEA and Greylock. Through 2005, the three companies invested a total of $40 million in Data Domain.[6]

The company had their initial public offering on June 27, 2007, with a total market capitalization of $776.5 million, above its forecast range despite years of losses.[7] This put the stock price at $15 per share, above the forecasted range of $11.50 to $13.50. The company’s market capitalization was $776.5 million at the time of the IPO.[7] It was listed on Nasdaq with symbol DDUP.

EMC Data Domain

In May 2009, NetApp announced it would acquire Data Domain for about $1.5 billion.[5] In June 2009, EMC Corporation announced their intention to acquire Data Domain Corp for $2.4 billion, outbidding the previous offer. In July, the two companies agreed to the acquisition.[8] [9] [10] Post-acquisition, Data Domain would operate as a brand and line of products under EMC, known as EMC Data Domain.[11]

Former CEO Frank Slootman published a book about his experiences in 2011.[12]

Since acquiring Data Domain, EMC integrated the Data Domain platform with its Data Protection Suite software and expanded software enhancements. According to a 2013 analysis sponsored by EMC, Data Domain reduced loss of user productivity from backup, restore, and retrieval operations.

Dell EMC Data Domain

In 2016, EMC merged with Dell to become Dell EMC, which continued the Data Domain brand until 2019.[13] [14] During this period, the brand was named Dell EMC Data Domain.[15] On September 24, 2019, Dell EMC announced via blog post that Data Domain products will be branded as PowerProtect DD products going forward.[16]

Technologies

The goal of the Data Domain technology was to eliminate logistical concerns of using backup or archival tape libraries, by implementing a suitable disk-based substitute for backup tapes. It did this by inventing a fast implementation of lossless data compression, optimized for streaming workloads, which compares incoming large data segments against all others in its store. This provided significant speed advantages compared to tape.[17] Originally categorized as "capacity optimization" by industry analysts, it became more widely known as inline "data deduplication."[18] Also, unlike most non-archival computer storage products, Data Domain went to technical lengths to ensure data longevity (vs. system longevity).[19]

Unlike most of Data Domain's early competition, it was first packaged as a file-system appliance; this made it more predictable than a software product and simpler to manage than a virtual tape library system.[20] This product package included the storage hardware itself, as well as a specialized proprietary OS and file system.[21]

Alongside the standalone appliances, Data Domain also created a method to unify multiple of their appliances into a larger data storage system called a DDX Array. A DDX Array is a singular rack-mounted storage system, consisting of multiple individual Data Domain storage appliances acting as "controllers". This system's data storage capacity could be further expanded by connecting to and controlling "integrated or third party external storage". DDX Arrays provided increased throughput (scaling with the number of appliances used as controllers) into a single storage source, and greater overall storage capacity, when compared to an individual Data Domain appliance.[22]

Products and Services

The first Data Domain system, the DD200 in 2004, had a 1.25 TB addressable capacity and was able to accept data at a rate of 40 MB/sec. Because its implementation put most of the system stress on CPU/RAM, rather than disk I/O, it was able to improve at the rate of Intel technology.

In May 2008, Data Domain Corporation announced the DD690, which used quad-core CPUs and could accept data at a rate of 166 MB/sec. This singular rack-mounted appliance could be combined with other DD690s to form a "DDX Array".[23]

By 2018, Dell EMC would produce the DD9800, which had an addressable capacity of up to 50 PB (depending on configuration), and could accept data at a rate of 8611 MB/sec.

External links

Notes and References

  1. http://www.datadomain.com/company/ "Data Domain, an EMC company." Data Domain.
  2. Web site: The Entrepreneur Questionnaire: Brian Biles, Co-Founder of Data Domain . greylockvc.com . March 11, 2011 . August 11, 2021 . https://web.archive.org/web/20110826084618/http://greylockvc.com/2011/03/11/the-entrepreneur-questionnaire-brian-biles-co-founder-of-data-domain/ . August 26, 2011 . dead.
  3. Web site: Brian D. Biles . Bloomberg . Executive data . August 11, 2021 .
  4. News: Data Domain Founder, Kai Li, on EMC Acquisition and the Future of Data Storage. Xconomy. Xconomy, Inc.. July 9, 2009.
  5. News: NetApp to acquire Data Domain for $1.5 billion. Computerworld. IDG . May 20, 2009 . Stephen Lawson . August 11, 2021 .
  6. News: Breaking Down The VC Investment Returns Of Data Domain. Wall Street Journal. 2009-07-10.
  7. News: Data Domain IPO prices above forecast range . Press release . Reuters . June 27, 2007 . August 11, 2021 .
  8. News: EMC Tops NetApp's Bid for Data Domain . Dealbook . The New York Times Company. June 1, 2009 . August 11, 2021 . https://web.archive.org/web/20110813113048/https://dealbook.nytimes.com/2009/06/01/emc-tops-netapps-bid-for-data-domain/ . August 13, 2011 .
  9. News: Data Domain boosts de-duplication performance. InfoStor. ITBusinessEdge. 2008-06-01.
  10. News: Dell EMC Debuts Software-Only Version Of Data Domain. CRN. The Channel Co.. 2016-10-19.
  11. Web site: The ROI of Consolidating Backup and Archive Data . Randy Perry. Ashish Nadkarni . July 2013. August 11, 2021 . dead . https://web.archive.org/web/20171107014919/https://www.emc.com/collateral/analyst-report/idc-roi-consolidating-backup-archive-data-ar.pdf . November 7, 2017.
  12. Book: Tape Sucks: Inside Data Domain, a Silicon Valley Growth Story . Frank Slootman . 9780615484068 . Together Editing . 2011 .
  13. News: Dell soups up low-end Data Domain deduper: Refreshes SMB-sized deduping backup-to-disk box . Chris Mellor . February 5, 2018 . The Register . August 11, 2021 .
  14. News: Dell stamps on the gas for backup devices with speed and cloud boost . Chris Mellor . February 6, 2019 . The Register . August 11, 2021 .
  15. Web site: New Dell EMC Data Domain DD3300: Big Opportunities to Address Commercial and ROBO Needs . dell.com . Spring . Cindy . February 13, 2018 . January 22, 2023 .
  16. Web site: Introducing Dell EMC PowerProtect DD Series Appliances, the Next Generation of Data Domain, Setting a New Bar for Data Protection in a Modern Digital Economy . dell.com . Phalen . Beth . September 24, 2019 . January 21, 2023 . https://web.archive.org/web/20220405101117/https://www.dell.com/en-us/blog/introducing-dell-emc-powerprotect-dd-series-appliances/ . April 5, 2022 . live .
  17. News: EMC Data Domain De-duplication 2011. Wikibon. 2011-01-25.
  18. News: EMC pushes Data Domain for backup and archiving . SearchDataBackup . TechTarget . 2013-04-16 . https://web.archive.org/web/20171125221236/http://searchdatabackup.techtarget.com/news/2240181803/EMC-pushes-Data-Domain-for-backup-and-archiving . 2017-11-25.
  19. Web site: Archiving & Compliance . datadomain.com . Data Domain . January 22, 2023 . dead . https://web.archive.org/web/20090221103456/http://www.datadomain.com/solutions/archiving.html . February 21, 2009.
  20. News: EMC Data Domain De-duplication 2011. Wikibon. 2011-01-25.
  21. Avoiding the Disk Bottleneck in the Data Domain Deduplication File System . Zhu . Benjamin . Li . Kai . Patterson . Hugo . 2008-02-29 . 6th USENIX Conference on File and Storage Technologies, FAST 2008 . https://www.usenix.org/legacy/event/fast08/ . https://web.archive.org/web/20220531172649/https://www.usenix.org/legacy/event/fast08/tech/full_papers/zhu/zhu.pdf . 2022-05-31 . live . San Jose, California, USA . 14 . 2023-01-22 . en.
  22. Web site: Data Domain DDX Array Series . datadomain.com . Data Domain . January 22, 2023 . dead . https://web.archive.org/web/20090220153752/http://www.datadomain.com/products/arrays.html . February 20, 2009.
  23. Web site: Data Domain Boosts De-Duplication Performance . Komiega . Kevin . June 1, 2008 . infostor.com . https://web.archive.org/web/20220705062926/https://www.infostor.com/index/articles/display/331820/articles/infostor/volume-12/issue-6/news-analysis-trends/data-domain-boosts-de-duplication-performance.html . July 5, 2022 . January 22, 2023.