The Set Size Bias in Ensemble Comparison (Or Why Showing Raw Data May Be Misleading)

Steve Haroz

doi:10.1167/jov.20.11.741

Journal article, Journal of Vision, 2020

ORCID : 0000-0002-2725-9173

Analysis and Visualization

Abstract

Ensemble perception is characterized by the ability to rapidly estimate a summary statistic from a set without needing serial inspection. But which stimulus properties influence how that summary is made? In a within-subject experiment with per-trial feedback, subjects chose which of two sets had the larger average value. Using data visualizations as stimuli, subjects were asked which set had a higher position (dot plots), a larger size (floating bar graphs), or redundantly coded highest position and largest size (regular bar graphs). The experiment also varied set size (1vs1, 12vs12, 20vs20, 12vs20, and 20vs12), mean difference between the sets (0 to 80 pixels in 10 pixel increments), and which set had the largest single value. With 25 repetitions per condition, each subject completed over 5,000 trials. For single-item comparisons, position was unsurprisingly more precise than length alone. However, for set comparison, the noisiness of ensemble coding appears to overpower these differences, so position, length, and the redundant combination have indistinguishable discriminability, which contradicts Cleveland & McGill (1984). Moreover, for all visual features, responses were biased towards the larger set size. Previous results (Yuan, Haroz, & Franconeri 2018) suggested that this bias is caused by estimating a sum or total area. But because the effect also occurs in the position (dot plot) condition, where a sum or total area is unhelpful, that model is unlikely. Additional analyses did not reveal a bias towards the set with the largest single value, the smallest single value, or the largest range of values. These results imply that this bias is holistic and not driven by simpler proxies. As showing raw data rather than only summary statistics is common advice in visualization design, the set size bias could cause people to misinterpret visualizations that do not have the same number of items in each group.
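
The trial count stated in the abstract can be checked with a small enumeration of the design. The Python sketch below is illustrative only: the factor levels for visual encoding, set size, and mean difference come from the abstract, but the number of levels for "which set had the largest single value" is not given, so two levels (left or right set) are assumed here, as is a full crossing of all factors. All variable names are hypothetical.

```python
from itertools import product

# Factor levels taken from the abstract.
ENCODINGS = [
    "dot plot (position)",
    "floating bar graph (size)",
    "regular bar graph (position + size)",
]
SET_SIZES = [(1, 1), (12, 12), (20, 20), (12, 20), (20, 12)]
MEAN_DIFFS_PX = list(range(0, 81, 10))  # 0 to 80 pixels in steps of 10

# ASSUMPTION: the abstract does not say how many levels this factor has;
# two levels (largest single value in the left or the right set) are assumed.
LARGEST_ITEM_IN = ["left set", "right set"]

REPETITIONS = 25  # repetitions per condition, from the abstract

# ASSUMPTION: all factors are fully crossed.
conditions = list(product(ENCODINGS, SET_SIZES, MEAN_DIFFS_PX, LARGEST_ITEM_IN))
total_trials = len(conditions) * REPETITIONS

print(f"{len(conditions)} conditions, {total_trials} trials per subject")
```

Under these assumptions the design has 270 conditions and 6,750 trials per subject, which is consistent with the "over 5,000 trials" reported above but is not a figure confirmed by the source.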

Domains

Psychology

https://inria.hal.science/hal-02989967

Submitted on : Thursday, November 5, 2020-1:11:48 PM

Last modification on : Friday, March 24, 2023-2:53:19 PM

Dates and versions

hal-02989967 , version 1 (05-11-2020)

Identifiers

HAL Id : hal-02989967 , version 1
DOI : 10.1167/jov.20.11.741

Cite

Steve Haroz. The Set Size Bias in Ensemble Comparison (Or Why Showing Raw Data May Be Misleading). Journal of Vision, 2020, 20 (11), pp.741. ⟨10.1167/jov.20.11.741⟩. ⟨hal-02989967⟩
