Scatterplot matrix of the scagnostics measures for the 91 scatterplots of the variables of the Boston Housing data set

Scagnostics (scatterplot diagnostics) is a series of measures that characterize certain properties of a point cloud in a scatter plot. The term and idea was coined by John Tukey and Paul Tukey, though they didn't publish it; later it was elaborated by Wilkinson, Anand, and Grossman. The following nine dimensions are considered:[1][2]

  1. For the outliers in the data:
    1. outlying
  2. For the density of data points:
    1. skewed
    2. clumpy
    3. sparse
    4. striated
  3. For the shape of the point cloud:
    1. convex
    2. skinny
    3. stringy
  4. For trends in the data:
    1. monotony

References

edit
  1. ^ Wilkinson, Leland (23 April 2008). "Scagnostics". Retrieved 25 March 2022. {{cite journal}}: Cite journal requires |journal= (help)
  2. ^ Wilkinson, L.; Anand, A.; Grossman, R. (2005). "Graph-theoretic scagnostics". IEEE Symposium on Information Visualization, 2005. INFOVIS 2005. pp.ย 157โ€“164. doi:10.1109/INFVIS.2005.1532142. ISBNย 0-7803-9464-X.
edit

๐Ÿ“š Artikel Terkait di Wikipedia

Scatter plot

visualization Rug plot Bar graph Line chart List of mathematical art software Scagnostics Dot plot (statistics) Parity plot Jarrell, Stephen B. (1994). Basic Statistics

John Tukey

Slash distribution Theory of conjoint measurement Coining the term 'bit' Scagnostics Awards Wilks Memorial Award (1965) National Medal of Science (1973) Shewhart

OPTICS algorithm

of density-levels by Hartigan. The OPTICS Cordillera is a descriptive Scagnostics measure of how clustered a data set is. It uses OPTICS to create a dendrogram