Spaghetti plot explained

A spaghetti plot (also known as a spaghetti chart, spaghetti diagram, or spaghetti model) is a method of viewing data to visualize possible flows through systems. Flows depicted in this manner appear like noodles, hence the coining of this term.[1] This method of statistics was first used to track routing through factories. Visualizing flow in this manner can reduce inefficiency within the flow of a system. In regards to animal populations and weather buoys drifting through the ocean, they are drawn to study distribution and migration patterns. Within meteorology, these diagrams can help determine confidence in a specific weather forecast, as well as positions and intensities of high and low pressure systems. They are composed of deterministic forecasts from atmospheric models or their various ensemble members. Within medicine, they can illustrate the effects of drugs on patients during drug trials.

Applications

Biology

Spaghetti diagrams have been used to study why butterflies are found where they are, and to see how topographic features (such as mountain ranges) limit their migration and range.[2] Within mammal distributions across central North America, these plots have correlated their edges to regions which were glaciated within the previous ice age, as well as certain types of vegetation.[3]

Meteorology

Within meteorology, spaghetti diagrams are normally drawn from ensemble forecasts. A meteorological variable e.g. pressure, temperature, or precipitation amount is drawn on a chart for a number of slightly different model runs from an ensemble. The model can then be stepped forward in time and the results compared and be used to gauge the amount of uncertainty in the forecast. If there is good agreement and the contours follow a recognizable pattern through the sequence, then the confidence in the forecast can be high. Conversely, if the pattern is chaotic, i.e., resembling a plate of spaghetti, then confidence will be low. Ensemble members will generally diverge over time and spaghetti plots are a quick way to see when this happens.

Spaghetti plots can be a more favorable choice compared to the mean-spread ensemble in determining the intensity of a coming cyclone, anticyclone, or upper-level ridge or trough. Because ensemble forecasts naturally diverge as the days progress, the projected locations of meteorological features will spread further apart. A mean-spread diagram will take a mean of the calculated pressure from each spot on the map as calculated by each permutation in the ensemble, thus effectively smoothing out the projected low and making it appear broader in size but weaker in intensity than the ensemble's permutations had actually indicated. It can also depict two features instead of one if the ensemble clustering is around two different solutions.[4]

Various forecast models within tropical cyclone track forecasting can be plotted on a spaghetti diagram to show confidence in five-day track forecasts.[5] When track models diverge late in the forecast period, the plot takes on the shape of a squashed spider, and can be referred to as such in National Hurricane Center discussions.[6] Within the field of climatology and paleotempestology, spaghetti plots have been used to correlate ground temperature information derived from boreholes across central and eastern Canada.[7] As in other disciplines, spaghetti diagrams can be used to show the motion of objects, such as drifting weather buoys over time.[8]

Business

Spaghetti diagrams were first used to track routing through a factory.[9] Spaghetti plots are a simple tool to visualize movement and transportation.[10] Analyzing flows through systems can determine where time and energy are wasted, and identify where streamlining would be beneficial.[1] This is true not only with physical travel through a physical place, but also during more abstract processes such as the application of a mortgage loan.[11]

Medicine

Spaghetti plots can be used to track the results of drug trials amongst a number of patients on one individual graph to determine their benefit.[12] They have also been used to correlate progesterone levels to early pregnancy loss.[13] The half-life of drugs within people's blood plasma, as well as discriminating effects between different populations, can be diagnosed quickly via these diagrams.[14]

External links

Notes and References

  1. Book: Introduction to Engineering Statistics and Lean Sigma: Statistical Quality Control and Design of Experiments and Systems. Theodore T. Allen. 128. Springer. 2010. 978-1-84882-999-2.
  2. Book: The Butterflies of North America: A Natural History and Field Guide. 103. James A. Scott. 1992. Stanford University Press. 978-0-8047-2013-7.
  3. Book: Handbook of mammals of the north-central states. registration. 52–55. J. Knox Jones . Elmer C. Birney . 1988. University of Minnesota Press. 978-0-8166-1420-2.
  4. Web site: NCEP Medium-Range Ensemble Forecast (MREF) System Spaghetti Diagrams. Environmental Modeling Center. 2003-08-21. National Oceanic and Atmospheric Administration. 2011-02-17. Environmental Modeling Center.
  5. Book: The storm: what went wrong and why during hurricane Katrina : the inside story from one Louisiana scientist. Ivor Van Heerden . Mike Bryan . 2007. Penguin. 978-0-14-311213-6.
  6. Web site: Tropical Depression Two-E Discussion Number 3. John L. Beven, III. 2007-05-30. 2011-02-17. National Hurricane Center.
  7. Book: Borehole climatology: a new method on how to reconstruct climate. 76. Louise Bodri . Vladimír Čermák . 2007. Elsevier. 978-0-08-045320-0.
  8. Book: 341. The turbulent ocean. S. A. Thorpe. 2005. Cambridge University Press. 978-0-521-83543-5.
  9. Book: Beyond the theory of constraints: how to eliminate variation and maximize capacity. 97. William A. Levinson. 2007. Productivity Press. 978-1-56327-370-4.
  10. Book: How to Implement Lean Manufacturing. 127. Lonnie Wilson. 2009. McGraw Hill Professional. 978-0-07-162507-4.
  11. Book: Supply Chain Management For Competitive Advantage. 130. Rangaraj. 978-0-07-022163-5. Tata McGraw-Hill. 2009.
  12. Book: Hedeker, Donald R. . Longitudinal data analysis . Gibbons . Robert D. . 2006 . John Wiley and Sons . 978-0-471-42027-9 . 52–54 . Robert D. Gibbons.
  13. Book: 2–4. Nonparametric regression methods for longitudinal data analysis. Hulin Wu . Jin-Ting Zhang . John Wiley and Sons. 2006. 978-0-471-48350-2.
  14. Book: 263–264. Pharmacokinetic/pharmacodynamic data analysis: concepts and applications, Volume 1. Johan Gabrielsson . Daniel Weiner . Taylor & Francis. 2001. 978-91-86274-92-4.