Tidyverse Explained

Tidyverse
Logo Caption:The tidyverse hex logo
Logo Alt:A black hexagon logo with the word "tidyverse" in white letters in the middle and smaller colorful hexagons scattered across the larger black hexagon
Programming Language:R
Genre:Package collection

The tidyverse is a collection of open source packages for the R programming language, introduced by Hadley Wickham[1] and his team, that "share an underlying design philosophy, grammar, and data structures" of tidy data.[2] Characteristic features of tidyverse packages include extensive use of non-standard evaluation and encouragement of piping.[3] [4]
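As a rough illustration of the two features named above, the following sketch contrasts a piped call chain with the equivalent nested calls. It assumes the dplyr package is installed and uses R's built-in mtcars dataset; the variable names piped and nested are chosen only for this example.

```r
library(dplyr)  # assumption: dplyr is installed; it re-exports the %>% pipe

# Piped form: each step receives the result of the previous step as its
# first argument. The columns cyl and mpg are written as bare names,
# without quotes or mtcars$, because filter() and summarise() evaluate
# them inside the data frame (non-standard evaluation).
piped <- mtcars %>%
  filter(cyl == 4) %>%
  summarise(mean_mpg = mean(mpg))

# The same computation written as nested calls, which must be read
# from the inside out.
nested <- summarise(filter(mtcars, cyl == 4), mean_mpg = mean(mpg))

# Both forms produce the same one-row data frame.
identical(piped, nested)
```

The `%>%` operator used here is the magrittr pipe; R 4.1 and later also provide a native `|>` pipe that could be substituted in this example.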

As of November 2018, the tidyverse package and some of its individual packages accounted for 5 of the 10 most downloaded R packages.[5] The tidyverse is the subject of multiple books and papers.[6] [7] [8] [9] In 2019, a paper describing the ecosystem was published in the Journal of Open Source Software.[10]

Its syntax has been referred to as "supremely readable",[11] and some[12] have argued that the tidyverse is an effective way to introduce complete beginners to programming, because it allows students to begin doing data processing tasks quickly.[13] Moreover, some practitioners have pointed out that data processing tasks are intuitively easier to chain together with the tidyverse than with pandas, Python's equivalent data processing package.[14]

There is also an active R community around the tidyverse. For example, the TidyTuesday social data project, organised by the Data Science Learning Community (DSLC),[15] releases varied real-world datasets each week so that community members can participate, share, practice, and make learning to work with data easier.

Critics of the tidyverse have argued that it promotes tools that are harder to teach and learn than their built-in base R equivalents and that are too dissimilar to some programming languages.[16] [17]
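To make the comparison between tidyverse tools and their base R equivalents concrete, the sketch below performs the same grouped summary both ways. It assumes dplyr is installed and uses the built-in mtcars dataset; the task and the names tidy_result and base_result are hypothetical choices for illustration. It is one possible comparison and is not meant to settle which style is easier to teach or learn.

```r
library(dplyr)  # assumption: dplyr is installed

# tidyverse style: a chain of verbs read from top to bottom.
tidy_result <- mtcars %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg)) %>%
  arrange(desc(mean_mpg))

# Base R style: aggregate with a formula, then reorder the rows
# by descending mean.
base_result <- aggregate(mpg ~ cyl, data = mtcars, FUN = mean)
base_result <- base_result[order(-base_result$mpg), ]

# Both approaches yield the same group means in the same order.
all.equal(tidy_result$mean_mpg, base_result$mpg)
```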

More generally, the tidyverse principles encourage a universe of streamlined packages, which in principle helps alleviate dependency issues and maintain compatibility with current and future features.[18] An example of this approach is the pharmaverse, a collection of R packages for clinical reporting in the pharmaceutical industry.[19]

Packages

The core tidyverse packages provide functionality to model, transform, and visualize data.[20]

Additional packages assist the core collection.[21] Other packages based on the tidy data principles are regularly developed, such as tidytext for text analysis, tidymodels[22] for machine learning, or tidyquant[23] for financial operations.
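The tidy data principles mentioned above are commonly summarised as one column per variable and one row per observation. The following sketch, which assumes the tidyr (version 1.0.0 or later) and tibble packages are installed, reshapes a small made-up table into that form. The table contents and the names wide and long are hypothetical.

```r
library(tidyr)   # assumption: tidyr >= 1.0.0, which provides pivot_longer()
library(tibble)  # assumption: tibble is installed

# Hypothetical "wide" table: one column per year, so the variable
# "year" is hidden in the column names.
wide <- tibble(
  country = c("A", "B"),
  `2019`  = c(10, 20),
  `2020`  = c(11, 22)
)

# Tidy ("long") form: one row per country-year observation and
# one column per variable.
long <- pivot_longer(
  wide,
  cols      = c(`2019`, `2020`),
  names_to  = "year",
  values_to = "value"
)

long
```

In the long form, year becomes an ordinary column, so it can be filtered, grouped, or plotted like any other variable.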

Notes and References

  1. Web site: Welcome to the Tidyverse. Revolutions. 2018-11-26.
  2. Web site: Tidyverse. www.tidyverse.org. en-us. 2018-11-26.
  3. Book: Wickham, Hadley. 4 Pipes The tidyverse style guide.
  4. Book: Wickham . Hadley . Advanced R . May 30, 2019 . 2019 . 978-0815384571 . 2nd . New York.
  5. Web site: RDocumentation. www.rdocumentation.org. 2018-11-26.
  6. Duggan. Jim. 2018-09-07. Input and output data analysis for system dynamics modelling using the tidyverse libraries of R. System Dynamics Review. 34. 3. 438–461. en. 10.1002/sdr.1600. 0883-7066. 10379/15029. 70005357. free.
  7. Book: Chang, Winston. R Graphics Cookbook. 2013. "O'Reilly Media, Inc.". 9781449316952. en.
  8. Book: Boehmke, Bradley C. Data wrangling with R. 9783319455990. Cham. 964404346. 2016-11-17.
  9. Book: Wickham, Hadley. R for data science : import, tidy, transform, visualize, and model data. Grolemund, Garrett. 9781491910399. First. Sebastopol, CA. 968213225. 2017.
  10. Wickham . Hadley . Averick . Mara . Bryan . Jennifer . Chang . Winston . McGowan . Lucy D'Agostino . François . Romain . Grolemund . Garrett . Hayes . Alex . Henry . Lionel . Hester . Jim . Kuhn . Max . Pedersen . Thomas Lin . Miller . Evan . Bache . Stephan Milton . Müller . Kirill . Ooms . Jeroen . Robinson . David . Seidel . Dana Paige . Spinu . Vitalie . Takahashi . Kohske . Vaughan . Davis . Wilke . Claus . Woo . Kara . Yutani . Hiroaki . Welcome to the Tidyverse . Journal of Open Source Software . 21 November 2019 . 4 . 43 . 1686 . 10.21105/joss.01686 . 2019JOSS....4.1686W . 214002773 . free .
  11. Web site: Steinmetz . Art . 2024-04-10 . Outsider Data Science - The Truth About Tidy Wrappers . 2024-04-11 . outsiderdata.netlify.app . en.
  12. Web site: Heppler . Jason . 2018-02-27 . Teaching the tidyverse to R novices . 2023-08-24 . Medium . en.
  13. Web site: Teach the tidyverse to beginners . 2022-07-15 . Variance Explained . 5 July 2017 . en.
  14. Web site: Why pandas feels clunky when coming from R . 2024-03-30 . Rasmus Bååth's Blog . en-us.
  15. Web site: dslc.io . 2024-08-11 . dslc.io . en.
  16. Web site: Matloff . Norm . 30 September 2019 . An opinionated view of the Tidyverse "dialect" of the R language . 28 October 2019 . GitHub.
  17. Web site: Muenchen . Bob . 23 March 2017 . The Tidyverse Curse . r4stats.com . en.
  18. Web site: The Power of Transitioning to a '-verse' Approach in R Package Development . 2024-08-11 . www.appsilon.com . en.
  19. Web site: pharmaverse . 2024-08-11 . pharmaverse.org.
  20. News: Tidyverse packages - Tidyverse. 2018-11-26. en-us.
  21. Web site: Tidyverse packages. 2020-12-22. www.tidyverse.org. en-us.
  22. Web site: Tidymodels . 2023-02-03 . www.tidymodels.org . en-us.
  23. Web site: Tidy Quantitative Financial Analysis . 2023-02-03 . business-science.github.io . en.