This function computes the Gini coefficient and plots the Lorenz curve for a selected metric in a Person Query data frame. The Gini coefficient summarises how unequally the metric is distributed across the population: 0 indicates a perfectly equal distribution, and values closer to 1 indicate greater concentration. The function can be integrated into a larger analysis pipeline to assess inequality in metric distribution.
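To make the quantity concrete, here is a minimal base R sketch of a Gini coefficient computed from a plain numeric vector. The helper name `gini_sketch` is hypothetical and the sorted-rank formula is one standard way to compute the coefficient; it is an assumption, not necessarily the method `create_lorenz()` uses internally.

```r
# Hypothetical helper, not part of the package: Gini coefficient of a
# non-negative numeric vector using the sorted-rank formula
#   G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n
gini_sketch <- function(x) {
  x <- sort(x[!is.na(x)])  # drop missing values, then sort ascending
  n <- length(x)
  2 * sum(seq_len(n) * x) / (n * sum(x)) - (n + 1) / n
}

gini_sketch(c(5, 5, 5, 5))   # everyone has the same value: 0
gini_sketch(c(0, 0, 0, 10))  # one person holds everything: 0.75
```

With only four people the maximum attainable value is (n - 1) / n = 0.75 rather than 1, which is why the second call does not return exactly 1.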


create_lorenz(data, metric, return = "plot")



data: Data frame containing a Person Query.


metric: Character string identifying the metric to be used for the Lorenz curve and Gini coefficient calculation.


return: Character string identifying the return type. Options are:

  • "gini" - Numeric value representing the Gini coefficient.

  • "table" - Data frame containing a summary table of population share and value share.

  • "plot" (default) - ggplot object representing a plot of the Lorenz curve.
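The "table" output pairs each population share with the share of the metric's total held by that part of the population. The sketch below shows one way such a table could be built in base R. The helper name `lorenz_table_sketch` and the rule of taking the bottom `round(p * n)` people are assumptions for illustration; the package's own binning may differ.

```r
# Hypothetical helper, not part of the package: cumulative value share
# held by the bottom p share of people, for p = 0, 0.1, ..., 1.
lorenz_table_sketch <- function(x, steps = seq(0, 1, by = 0.1)) {
  x <- sort(x[!is.na(x)])               # ascending, missing values dropped
  n <- length(x)
  cum_share <- c(0, cumsum(x) / sum(x)) # share held by the bottom k people, k = 0..n
  k <- round(steps * n)                 # number of people in the bottom p share
  data.frame(
    population_share = steps,
    value_share = cum_share[k + 1]      # +1 because cum_share starts at k = 0
  )
}

# Toy input standing in for a metric column such as Emails_sent
tbl <- lorenz_table_sketch(1:10)
tbl
```

Plotting `value_share` against `population_share` from such a table gives the Lorenz curve; the further it bows below the diagonal, the more unequal the distribution.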


create_lorenz(data = pq_data, metric = "Emails_sent", return = "gini")
#> [1] 0.2523283

create_lorenz(data = pq_data, metric = "Emails_sent", return = "plot")

create_lorenz(data = pq_data, metric = "Emails_sent", return = "table")
#> # A tibble: 11 × 2
#>    population_share value_share
#>               <dbl>       <dbl>
#>  1              0      0.000142
#>  2              0.1    0.0425  
#>  3              0.2    0.101   
#>  4              0.3    0.170   
#>  5              0.4    0.246   
#>  6              0.5    0.331   
#>  7              0.6    0.426   
#>  8              0.7    0.528   
#>  9              0.8    0.643   
#> 10              0.9    0.780   
#> 11              1      1
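As a rough cross-check of how the table and the Gini coefficient relate, the coefficient can be approximated as one minus twice the area under the Lorenz curve. The sketch below applies the trapezoid rule to the rounded values printed above; the variable names are illustrative, and this is an approximation rather than the function's own calculation.

```r
# Values copied from the printed table above (rounded as displayed)
lorenz_tbl <- data.frame(
  population_share = seq(0, 1, by = 0.1),
  value_share = c(0.000142, 0.0425, 0.101, 0.170, 0.246, 0.331,
                  0.426, 0.528, 0.643, 0.780, 1)
)

p <- lorenz_tbl$population_share
v <- lorenz_tbl$value_share

# Trapezoid rule: width of each step times the mean of its two heights
area_under_curve <- sum(diff(p) * (v[-length(v)] + v[-1]) / 2)

# Gini is approximately 1 - 2 * (area under the Lorenz curve)
gini_approx <- 1 - 2 * area_under_curve
round(gini_approx, 3)
#> [1] 0.246
```

The result sits slightly below the 0.2523283 returned with return = "gini". That is expected: straight segments between only eleven points lie on or above a convex Lorenz curve, so the area is overestimated and the coefficient underestimated.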