Paxata Explained

Paxata
Type:Private[1]
Industry:Enterprise analytics software
Location City:Redwood City, CA[2]
Area Served:Worldwide
Products:The Paxata suite of self-service data preparation software

Paxata is a privately owned software company headquartered in Redwood City, California. It develops self-service data preparation software that gets data ready for data analytics software. Paxata's software is intended for business analysts, as opposed to technical staff. It is used to combine data from different sources, then check it for data quality issues, such as duplicates and outliers. Algorithms and machine learning automate certain aspects of data preparation and users work with the software through a user-interface similar to Excel spreadsheets.

The company was founded in January 2012 and operated in stealth mode until October 2013. It received more than $10 million in venture funding before being acquired by DataRobot.[3] [4]

History

Paxata was founded in January 2012. It initially raised $2 million in venture capital.[5] The company came out of stealth mode in October 2013. Simultaneously with its public release, Paxata announced an $8 million funding round led by Accel Partners.[6] [7] Adoption of the software grew quickly.[6] [8] In March 2014, In-Q-Tel acquired an interest in the startup.[9] It raised an additional $18 million in funding in September 2015.[10] It also began working with Cisco to jointly develop the Cisco Data Preparation suite of software and services.[11]

Software

Paxata refers to its suite of cloud-based data quality, integration, enrichment and governance products as "Adaptive Data Preparation."[7] [12] [13] The software is intended for business analysts, who need to combine data from a variety of sources, then check the data for duplicates, empty fields, outliers, trends and integrity issues before conducting analysis or visualization in a third-party software tool.[13] [14] It uses algorithms and machine-learning to automate certain aspects of data preparation.[13] For example, it may automatically detect records belonging to the same person or address, even if the information is formatted differently in each record in different data sets.[15]

The software has a spreadsheet-based user interface.[13] [16] Patterns and anomalies in the data are color-coded in the spreadsheet. Then users are provided with instructions on how to resolve data quality issues or to supplement the data with contextual information.[17] Data sets and related quality issues can also be addressed in a collaborative environment through the "Paxata Share" feature.[16] It runs on Apache Spark.[10] [18]

According to analyst firm Ovum, the software is made possible through advances in predictive analytics, machine learning and the NoSQL data caching methodology.[13] The software uses semantic algorithms to understand the meaning of a data table's columns and pattern recognition algorithms to find potential duplicates in a data-set.[13] [5] It also uses indexing, text pattern recognition and other technologies traditionally found in social media and search software.[19] One of the software's users is dairy producer Danone, which uses the software so that business staff can create their own reports on merchandising, supply chain and product data, without the IT department.[20]

Reception

In its 2014 report "Cool Vendors in Data Integration and Data Quality", Gartner praised Paxata for developing a "business-user-friendly" data quality product that does not use code.[17] Ventana Research said its spreadsheet-based user interface "should resonate well with business analysts," who are resistant to move away from familiar Excel-like programs.[16] Gartner also said Paxata was recognized in the report due to its automated, algorithm-based features and how it tracks any changes made to the data.[17]

Ventana Research said Paxata was in a "noisy marketplace".[16] According to Gartner, while Paxata is an early entrant into the market, many startups and large corporations are making investments in developing similar competing products.[17] According to Gigaom and IT Business Edge, one way Paxata differs is that it automatically merges multiple data-sets into a single table, so it can be easily imported into a visualization or analysis tool.[5] [21]

Gartner said Paxata will have a difficult time finding a compelling pricing model, when many data discovery tools that it supplements provide some similar features.[17] In contrast, Ventana said Paxata's pricing was "a pretty small amount" compared to the amount of time users can save.[16]

Notes and References

  1. Web site: Paxata: Company Profile. Bloomberg L.P.. September 28, 2014.
  2. Web site: Contact us. Paxata. September 28, 2014. https://web.archive.org/web/20141027100417/http://www.paxata.com/contact-us. October 27, 2014. dead.
  3. Web site: DataRobot Acquires Paxata to Extend AI Platform. Vizard. Michael. 2019-12-19. RTInsights. en-US. 2020-03-18.
  4. Web site: DataRobot is acquiring Paxata to add data prep to machine learning platform. TechCrunch. 12 December 2019 . en-US. 2020-03-18.
  5. News: Gigaom. With $10M from Accel, Paxata wants to make data prep a breeze. Derrick. Harris. October 28, 2013. June 19, 2014.
  6. News: Paxata grabs $8M to help data scientists skip the dirty work. VentureBeat. October 28, 2013. Eric . Blattberg. June 19, 2014.
  7. News: October 28, 2013. Paxata Debuts Data Quality Tools at Strata. Alex. Woodie. Datanami. June 19, 2014.
  8. News: Paxata: streamlining data analytics. Alan. McStravick. February 12, 2014. SiliconAngle. The SiliconAngle Network. June 19, 2014.
  9. News: March 7, 2014. In-Q-Tel Invests in Data-Prep Platform Paxata. InformationWeek. UBM Tech. June 19, 2014. Patience. Wait.
  10. Web site: Harris . Derrick . This startup raised $18 million to make data analysis less of a chore . Fortune . September 9, 2015 . October 13, 2015.
  11. Web site: Cisco Makes Move Into Data Preparation Space . eWeek.com . September 30, 2015 . October 13, 2015.
  12. News: Startup Paxata automates the dirty work of big data. Conner. Forrest. March 4, 2014. TechRepublic. CBS Interactive. June 26, 2014.
  13. News: Tony. Baer. Ovum. Paxata puts a business-user face on data preparation. October 28, 2013. June 19, 2014. https://web.archive.org/web/20150112002806/http://www.ovum.com/paxata-puts-a-business-user-face-on-data-preparation/. January 12, 2015. dead.
  14. News: On the Radar: Paxata. Tony. Baer. December 13, 2013. June 13, 2014. Ovum. https://web.archive.org/web/20150112002556/http://www.ovum.com/research/on-the-radar-paxata/. January 12, 2015. dead.
  15. News: February 11, 2014. Michael. Fitzgerald. Is Your Company Running a Data Dump?. InformationWeek. UBM Tech. June 19, 2014.
  16. News: Paxata Give Analysts Valuable Time Back for Analytics. January 29, 2014. June 19, 2014. Tony. Cosentino. Ventana Research. https://web.archive.org/web/20140620162013/http://tonycosentino.ventanaresearch.com/2014/01/29/paxata-give-analysts-valuable-time-back-for-analytics/. June 20, 2014. dead.
  17. News: Cool Vendors in Data Integration and Data. April 24, 2014. Eric. Thoo. Ted. Friedman. Saul. Judah. Rita L.. Sallam. Roxane. Edjlali. Gartner. June 19, 2014.
  18. Web site: Paxata Applies Data Governance Controls to Big Data . IT Business Edge . April 23, 2015 . August 20, 2015.
  19. News: Automating the Pain Out of Big Data Transformation. Alex. Woodie. June 19, 2014. January 24, 2014. Datanami.
  20. News: Dannon Speeds Up Data Preparation and Analysis. March 26, 2014. Eileen. Feretic. Baseline. Bradbourne Publishing. June 22, 2014 .
  21. News: Paxata Rises to the Challenge of Big Data Preparation. Mike. Vizard. November 27, 2013. IT Business Edge. QuinStreet. June 19, 2014.