Statistically Sound Visual Data Analysis: Difference between revisions

From VrlWiki
Jump to navigation Jump to search
 
(4 intermediate revisions by 2 users not shown)
Line 1: Line 1:
== Overview ==
== Overview ==


This page documents our research effort towards developing visual interfaces that help people perform statistically sound data analysis.
This page documents our research effort towards developing visual interfaces that help people perform statistically sound visual data analysis.


== Proposals ==
== Proposals ==
Line 13: Line 13:


* [http://cidrdb.org/cidr2017/papers/p56-binnig-cidr17.pdf Towards Sustainable Insights (or why polygamy is bad for you)]: outlines a few examples of false discovery issues in existing visual data exploration (e.g. [http://www.vldb.org/pvldb/vol8/p2024-crotty.pdf Vizdom]) and visual recommendation tools (e.g. [http://www.vldb.org/pvldb/vol8/p2182-vartak.pdf SeeDB])
* [http://cidrdb.org/cidr2017/papers/p56-binnig-cidr17.pdf Towards Sustainable Insights (or why polygamy is bad for you)]: outlines a few examples of false discovery issues in existing visual data exploration (e.g. [http://www.vldb.org/pvldb/vol8/p2024-crotty.pdf Vizdom]) and visual recommendation tools (e.g. [http://www.vldb.org/pvldb/vol8/p2182-vartak.pdf SeeDB])
Interactive control of false discover rate during data exploration
* [https://arxiv.org/pdf/1612.01040.pdf Controlling False Discoveries during Interactive Data Exploration]: the paper reports case study results on applying a new procedure for controlling FDR during data exploration and reports; not much emphasis on the interface yet but it sounds like Emanuel is working on that; the case study could provide inspirations for ways in which visualizations could help users better understand the FDR control procedural.
* [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3427907/ Graphical approaches for multiple comparison procedures using weighted Bonferroni, Simes, or parametric tests]: presents a graphical representation to illustrate test procedures that control family-wise error rate (FWER)


Previous work on visualizations and statistical analysis
Previous work on visualizations and statistical analysis
Line 21: Line 26:
* Visual estimation of coefficients of statistical models
* Visual estimation of coefficients of statistical models
** [http://journals.sagepub.com/doi/pdf/10.1057/ivs.2008.13 Judging correlation from scatterplots and parallel coordinate plots], Li, Martens, and van Wijk, 2008. This paper reports experimental results on how well people can assess correlation under varying sample size and visualization methods (scatter plots v.s. parallel coordinate plots). The results suggest that the accuracy of statistical judgements varies across visualization designs, motivating careful analysis and experimentation when designing visualizations to facilitate statistical data analysis.
** [http://journals.sagepub.com/doi/pdf/10.1057/ivs.2008.13 Judging correlation from scatterplots and parallel coordinate plots], Li, Martens, and van Wijk, 2008. This paper reports experimental results on how well people can assess correlation under varying sample size and visualization methods (scatter plots v.s. parallel coordinate plots). The results suggest that the accuracy of statistical judgements varies across visualization designs, motivating careful analysis and experimentation when designing visualizations to facilitate statistical data analysis.
* Visualization of statistics parameters
** [https://publik.tuwien.ac.at/files/PubDat_242383.pdf Visual Encodings of Temporal Uncertainty: A Comparative User Study] Evaluation of six methods for visualization temporal uncertainty, three of which are for statistical uncertainty.
* Integrating statistics and visualizations for exploratory analysis
** [http://hcil.cs.umd.edu/trs/2008-03/2008-03.pdf Integrating Statistics and Visualization: Case Studies of Gaining Clarity during Exploratory Data Analysis] "Statistics" refers to network metrics specifically, e.g. node degrees, betweenness, closeness. The paper focus on using statistics to help with navigation (e.g. filter and zoom based on network metric values).

Latest revision as of 03:41, 21 March 2017

Overview

This page documents our research effort towards developing visual interfaces that help people perform statistically sound visual data analysis.

Proposals

  • BIGDATA: F: Statistically Sound and Computationally Efficient Massive Data Analysis - Sample Complexity, Uniform Convergence, and False Discovery Rate

References

Vision and use case papers

Interactive control of false discover rate during data exploration

Previous work on visualizations and statistical analysis