Hypothetical Outcome Plots

(This article was first published on max humber, and kindly contributed to R-bloggers)

suppressPackageStartupMessages(library(tidyverse))
# devtools::install_github("dgrtwo/gganimate")
# install.packages("cowplot")
library(gganimate)
# install ImageMagick in the terminal: "brew install imagemagick"

If the weatherman says that there is a 30% chance of rain tomorrow, and it rains, was he wrong? It’s an important question, because it rained on November 8th, and a lot of people think that it wasn’t supposed to.

Communicating uncertainty is hard. It’s hard because uncertainty can be convoluted and opaque. And it’s hard because a lot of people can’t get down with boxplots or measures of spread.

Take this toy data for example:

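set.seed(2016)
# 50 simulated values per team, on a percent scale
df <- tibble(
  `Blue Team` = round(rnorm(50, mean = 48, sd = 5)),
  `Red Team` = round(rnorm(50, mean = 45, sd = 3))
) %>%
  mutate(simulation = row_number()) %>%
  # reshape to long format: one row per team per simulation
  gather(team, probability, -simulation) %>%
  # cap at 100 and convert percentages to proportions
  mutate(probability = ifelse(probability >= 100, 100, probability) / 100)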

Imagine that this data was generated from some model. How might we represent the uncertainty in our model and around our predictions?

Perhaps we might push the data through a boxplot:

df %>%
  ggplot(aes(x = team, y = probability)) +
  geom_boxplot()

Or a density chart:

df %>%
  ggplot(aes(x = probability, fill = team)) +
  geom_density(alpha = 1/2) +
  scale_fill_manual(values = c("blue", "red"))

Or use some errorbars:

df %>%
  group_by(team) %>%
  summarise(
    mean = mean(probability),
    low = quantile(probability, 0.025),
    high = quantile(probability, 0.975)) %>%
  ggplot(aes(x = team, y = mean)) +
  geom_errorbar(aes(ymin = low, ymax = high))

These options all kind of suck, though. They’re not super intuitive. And they aren’t all that convincing, because they can intimidate a lot of people!

Instead of boxplots, density charts, or regular errorbars, we can hack errorbars to generate a proto-Hypothetical Outcome Plot:

df %>%
  ggplot(aes(x = team, y = probability)) +
  geom_errorbar(aes(ymin = probability, ymax = probability))

Hypothetical Outcome Plots (HOPs) are a way to build and visualize uncertainty the way we actually experience it: in and by countable events. The depth and theory behind HOPs are beyond the scope of this quick post, but if you’re interested in learning more, check out this awesome Medium story by the UW Interactive Data Lab.

Extending our hacked errorbars with gganimate, we can implement an actual HOP in just a few lines of R:

p <- df %>%
  ggplot(aes(x = team, y = probability, frame = simulation)) +
  geom_errorbar(aes(ymin = probability, ymax = probability))
gganimate(p, title_frame = FALSE)
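A note for anyone running this on a newer setup: the `frame` aesthetic and the `gganimate()` function come from the original dgrtwo/gganimate package. Later releases of gganimate (1.0 and up, as far as I know) replaced them with `transition_*()` functions and `animate()`. Here is a rough sketch of the same HOP under that newer API. It assumes gganimate 1.0 or later is installed, rebuilds the toy data in base R so the block stands alone, and uses names of my own (`df_new`, `p_new`, `anim`) along with an arbitrary `fps`:

```r
library(ggplot2)
library(gganimate)  # assumption: version 1.0 or later

# Rebuild the toy data with base R (same seed and same draw order as above)
set.seed(2016)
blue <- round(rnorm(50, mean = 48, sd = 5))
red  <- round(rnorm(50, mean = 45, sd = 3))
df_new <- data.frame(
  simulation  = rep(1:50, times = 2),
  team        = rep(c("Blue Team", "Red Team"), each = 50),
  probability = pmin(c(blue, red), 100) / 100
)

# transition_manual() maps each simulation to its own frame, with no tweening
p_new <- ggplot(df_new, aes(x = team, y = probability)) +
  geom_errorbar(aes(ymin = probability, ymax = probability)) +
  transition_manual(simulation)

# animate() renders the frames; fps = 5 is an illustrative choice
anim <- animate(p_new, fps = 5)
```

Printing `anim` should display the animation, and `anim_save("hop.gif", anim)` should write it to disk (the file name is just an example).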

In this way we can directly experience the uncertainty of our model and the predictions that it happens to make.

Looking at the HOP, it seems as though the Blue Team is supposed to win our imaginary game. Importantly, however, the Red Team comes out on top in certain simulations. So, don’t say that the model is wrong if they happen to actually win our imaginary game!
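To put a number on "certain simulations", one option is to count how often each team has the higher draw. This is a minimal sketch in base R, assuming the same seed and draw order as the toy data and treating each simulation row as one paired outcome. The names `blue`, `red`, `red_wins`, `blue_wins`, and `ties` are mine, not part of the original code:

```r
# Recreate the two sets of simulated values (percent scale, before dividing by 100)
set.seed(2016)
blue <- round(rnorm(50, mean = 48, sd = 5))
red  <- round(rnorm(50, mean = 45, sd = 3))

# Share of the 50 simulations in which each team comes out ahead, or ties
red_wins  <- mean(red > blue)
blue_wins <- mean(blue > red)
ties      <- mean(blue == red)

round(c(red = red_wins, blue = blue_wins, tie = ties), 2)
```

The three shares sum to one, so `red_wins` is roughly how often the "unexpected" outcome shows up while the HOP cycles through its frames.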

If you want to add a little polish to your HOP, I might suggest extending it with some ghost bars:

p <- df %>%
  ggplot(aes(x = team, y = probability)) +
  geom_errorbar(aes(
    ymin = probability, ymax = probability,
    frame = simulation, cumulative = TRUE),
    color = "grey80", alpha = 1/8) +
  geom_errorbar(aes(
    ymin = probability, ymax = probability,
    frame = simulation),
    color = "#00a9e0") +
  scale_y_continuous(
    limits = c(0, 1),
    labels = scales::percent_format()) +
  theme(panel.background = element_rect(fill = "#FFFFFF")) +
  labs(title = "", y = "Probability of winning", x = "")
gganimate(p, title_frame = FALSE)

But that’s it. Thanks for reading!

I have two more posts in the queue about visualizing models and model performance. Look for them in the near future!
