datamations is a framework for the automatic generation of explanation of the steps of an analysis pipeline. It automatically turns code into animations, showing the state of the data at each step of an analysis.

For more information, please visit the package website, which includes additional examples, defaults and conventions, and more.

Installation

You can install datamations from GitHub with:

# install.packages("devtools")
devtools::install_github("microsoft/datamations")

Usage

To get started, load datamations and dplyr:

A datamation shows a plot of what the data looks like at each step of a tidyverse pipeline, animated by the transitions that lead to each state. The following shows an example taking the built-in small_salary data set, grouping by Degree, and calculating the mean Salary.

First, define the code for the pipeline, then generate the datamation with datamation_sanddance():

library(datamations)
library(dplyr)

"small_salary %>% 
  group_by(Degree) %>%
  summarize(mean = mean(Salary))" %>%
  datamation_sanddance()

datamations supports the following dplyr functions: