Clustrix Explained

Clustrix Inc
Type:Private
Industry:Computer database
Parent:MariaDB Corporation AB
Founder:Paul Mikesell, Sergei Tsarev, Eric Hoffman
Founded: in San Francisco, California, U.S.
Location City:San Francisco, CA
Location Country:United States
Products:Clustrix Database Server
Num Employees:40–50

Clustrix, Inc. is a San Francisco-based private company founded in 2006 that developed a database management system marketed as NewSQL.[1] [2]

History

Clustrix was founded in November 2006, and is sometimes called Sprout-Clustrix as it formed with the help of Y Combinator.[3] Founders include Paul Mikesell (formerly of EMC Isilon) and Sergei Tsarev.Some of its technology tested at customers since 2008.[4]

Initially called Sierra during the development phase, at its official announcement in 2010, the product was launched with the product name Clustered Database System (CDS).[5] [6] The company received $10 million in funding from Sequoia Capital, U.S. Venture Partners (USVP), and ATA Ventures in December 2010.[7] Robin Purohit became chief executive in October 2011, and another round of $6.75 million was raised in July 2012.[8] [9] Another round of funding from the original backers of $16.5 million was announced in May 2013,[10] and a round of $10 million in new funding in August 2013 was led by HighBAR Ventures.[7] Purohit was replaced by Mike Azevedo in 2014.[11] A round of over $23 million in debt financing was disclosed in February 2016.[12] On September 20, 2018 it was announced that Clustrix was acquired by MariaDB Corporation.[13]

Technology

Clustrix supports workloads that involve scaling transactions and real-time analytics. The system is a drop-in replacement for MySQL, and is designed to overcome MySQL scalability issues with a minimum of disruption.[14] It also has built in fault-tolerance features for high availability within a cluster. It has parallel backup and parallel replication among clusters for disaster recovery.Clustrix is a scale-out SQL database management system and part of what are often called the NewSQL database systems (modern relational database management systems), closely following the NoSQL movement.[15]

The product was marketed as a hardware "appliance" using InfiniBand through about 2014.[16] [17] Clustrix's database was made available as downloadable software and from the Amazon Web Services Marketplace by 2013.[18] [19]

The primary competitors like Microsoft SQL Server and MySQL supported online transaction processing and online analytical processing but were not distributed. Clustrix provides a distributed relational, ACID database that scales transactions[20] and support real-time analytics. Other distributed relational databases are columnar (they don't support primary transaction workload) and focus on offline analytics and this includes EMC Greenplum, HP Vertica, Infobright, and Amazon Redshift. Notable players in the primary SQL database space are in-memory. This includes VoltDB and MemSQL, which excel at low-latency transactions, but do not target real-time analytics. NoSQL competitors, like MongoDB are good at handling unstructured data and read heavy workloads, but do not compete in the space for write heavy workloads (no transactions, coarse grained (DB-level) locking, and no SQL features (like joins), so the NewSQL and NoSQL databases are complementary.

Query evaluation

The Clustrix database operates on a distributed cluster of shared-nothing nodes using a query to data approach.[21] Here nodes typically own a subset of the data. SQL queries are split into query fragments and sent to the nodes that own the data. This enables Clustrix to scale horizontally (scale out) as additional nodes are added.

Data distribution

The Clustrix database automatically splits and distributes data evenly across nodes with each slice having copies on other nodes.[22] Uniform data distribution is maintained as nodes are added, removed or if data is inserted unevenly. This automatic data distribution approach removes the need to shard and enables Clustrix to maintain database availability in the face of node loss.[23]

Performance

In a performance test completed by Percona in 2011, a three-node cluster saw about a 73% increase in speed over a similarly equipped single MySQL server running tests with 1024 simultaneous threads.[24] [25] Additional nodes added to the Clustrix cluster provided roughly linear increases in speed.[26]

Project cancellation

MariaDB announced in October of 2023 that Xpand (formerly known as Clustrix) had been discontinued.[27]

External links

Notes and References

  1. Web site: What we talk about when we talk about NewSQL . 2011-12-16 . 2012-09-05 . https://web.archive.org/web/20120905141151/http://blogs.the451group.com/information_management/2011/04/06/what-we-talk-about-when-we-talk-about-newsql/ . dead .
  2. Web site: The NewSQL Movement . 2011-12-16 . https://web.archive.org/web/20120201174235/http://www.readwriteweb.com/cloud/2011/04/the-newsql-movement.php . 1 February 2012 . dead .
  3. Web site: Form D: Notice of Sale of Securities . United States Securities and Exchange Commission . July 5, 2007 . September 5, 2016 . https://web.archive.org/web/20160408003951/https://www.sec.gov/Archives/edgar/vprr/07/9999999997-07-032188 . April 8, 2016 . dead .
  4. Web site: The Clustrix story . DBMS2 Blog . May 12, 2010 . September 5, 2016 .
  5. Web site: Y Combinator's Clustrix rolls out databases that scale . Venture Beat . May 3, 2010 . Camille Riketts . September 5, 2016 .
  6. News: Clustrix Builds the Webscale Holy Grail: A Database That Scales . Stacey Higginbotham . Gigaom . May 3, 2010 . September 5, 2016 .
  7. News: Clustrix bags $10M more in funding to keep scaling out its SQL database . Barb Darrow . Gigaom . August 19, 2013 . September 5, 2016 .
  8. Web site: Clustrix Lands Former Hewlett-Packard VP Robin Purohit As Its New CEO . Robin Wauters . Tech Crunch . October 18, 2011 . September 5, 2016 .
  9. Web site: Big Data Startup Clustrix Raises $6.75 Million From Sequoia And Others To Build Scalable Databases . Ryan Lawler . Tech Crunch . July 5, 2012 . September 5, 2016 .
  10. News: Clustrix nets $16.5M to push its database outside the box . Barb Darrow . Gigaom . May 6, 2013 . September 5, 2016 .
  11. Web site: Clustrix Names New CEO Mike Azevedo and Executive Chairman Bruce Armstrong . Wall Street Journal . September 9, 2014 . September 5, 2016 .
  12. Web site: Form D: Notice Exempt Offering of Securities . United States Securities and Exchange Commission . February 12, 2016 . September 5, 2016 .
  13. Web site: MariaDB Acquires Clustrix Adding Distributed Database Technology . February 20, 2018 . September 20, 2018 .
  14. News: Clustrix Lifts the Curtain on Early Database Customers . Derrick Harris . Gigaom via The New York Times . January 17, 2011 . September 5, 2016 .
  15. http://highscalability.com/blog/2012/9/24/google-spanners-most-surprising-revelation-nosql-is-out-and.html / Google Spanner's most surprising revelation NoSQL is Out and NewSQL is in
  16. Web site: Clustrix Database Appliance . May 5, 2010 . James Hamilton . September 5, 2016 .
  17. Web site: Clustrix Database Appliance . Company Documentation . https://web.archive.org/web/20140202180754/http://docs.clustrix.com/display/CLXDOC/Clustrix%2BDatabase%2BAppliance . February 2, 2014 . September 5, 2016 . dead .
  18. Web site: Your Database Is Probably Terrible . January 19, 2013 . Jon Evans . Tech Crunch . September 5, 2016 .
  19. Web site: Clustrix Announces General Availability of ClustrixDB as a Software Release . Database Trends and Applications . October 31, 2013 . September 5, 2016 .
  20. Web site: 10 Companies & Technologies to Watch in 2013 | Inside Analysis . 2013-02-21 . https://web.archive.org/web/20130310052306/http://www.insideanalysis.com/2013/01/companies-technologies-to-watch-in-2013/ . 2013-03-10 . dead .
  21. Web site: Archived copy . 2013-02-21 . https://web.archive.org/web/20130929220824/http://www.clustrix.com/Portals/146389/docs/clustrix_a_new_approach.pdf . 2013-09-29 . dead .
  22. http://cs.brown.edu/courses/cs227/slides/checkpointing/clustrix.pdf{{dead link|date=August 2017 |bot=InternetArchiveBot |fix-attempted=yes }}
  23. http://cattell.net/datastores/Datastores.pdf
  24. Web site: Clustrix tpcc-mysql Benchmark . October 20, 2011 . https://web.archive.org/web/20120212102758/http://www.clustrix.com/Default.aspx?app=LeadgenDownload&shortpath=docs%2FClustrix_TPCC_Percona.pdf . February 12, 2012 . Vadim Tkachenko and Rodrigo Gadea . Percona . September 5, 2016 .
  25. Web site: Opening Keynote: Characterizing Performance . Paul Mikesell and Aaron Passey . Percona Live London . October 25, 2011 . September 5, 2016 .
  26. http://cloudcomputingexpo2010west.sys-con.com/node/2159758 Clustrix Delivers Software-Only Kit to Demo Shard-less MySQL Scaling
  27. Web site: Clark . Lindsay . 2023-10-13 . MariaDB ditches products and staff in restructure, bags $26.5M loan to cushion fall . 2024-06-19 . theregister.com.