API Reference

Tables

Descriptive, correlation and extreme-observation tables.

prepare_descriptive_table Report descriptive statistics for the numeric/logical variables of df.
prepare_correlation_table Correlation table with Pearson above and Spearman below the diagonal.
prepare_ext_obs_table Display the top and bottom n observations sorted by var.
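
The layout behind prepare_correlation_table (Pearson above, Spearman below the diagonal) can be illustrated with plain pandas and NumPy. This is a minimal sketch of the idea, not the library's implementation: the helper name mixed_correlation_matrix and the demo data are hypothetical, and it does not call prepare_correlation_table itself.

```python
import numpy as np
import pandas as pd


def mixed_correlation_matrix(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: Pearson above the diagonal, Spearman below it."""
    numeric = df.select_dtypes(include="number")
    pearson = numeric.corr(method="pearson").to_numpy()
    spearman = numeric.corr(method="spearman").to_numpy()

    # Boolean mask that is True strictly above the diagonal.
    k = pearson.shape[0]
    upper = np.triu(np.ones((k, k), dtype=bool), k=1)

    # Take Pearson where the mask is True, Spearman everywhere else.
    mixed = np.where(upper, pearson, spearman)
    np.fill_diagonal(mixed, 1.0)
    return pd.DataFrame(mixed, index=numeric.columns, columns=numeric.columns)


demo = pd.DataFrame(
    {
        "x": [1.0, 2.0, 3.0, 4.0, 5.0],
        "y": [1.0, 4.0, 9.0, 16.0, 25.0],  # monotone but non-linear in x
        "z": [2.0, 1.0, 4.0, 3.0, 5.0],
    }
)
table = mixed_correlation_matrix(demo)
print(table.round(3))
```

Because y is a monotone but non-linear function of x, the Spearman entry (row y, column x) is 1 while the Pearson entry (row x, column y) is below 1, which shows why reporting both coefficients in one table can be informative.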

Graphs

Interactive Plotly figures.

prepare_correlation_graph Visualise a correlation matrix (Pearson above, Spearman below the diagonal).
prepare_trend_graph Line-plot the mean (with standard-error bars) of variables over time.
prepare_quantile_trend_graph Line-plot quantiles of a single variable over time.
prepare_by_group_bar_graph Bar chart of a statistic of var computed within each by_var group.
prepare_by_group_trend_graph Line-plot the mean of var over time, one line per group_var level.
prepare_by_group_violin_graph Violin plots of the distribution of var across by_var groups.
prepare_histogram Histogram of a numeric variable.
prepare_bar_chart Bar chart of category counts for a (typically categorical) variable.
prepare_missing_values_graph Heatmap of missing-value frequency by variable and time period.
prepare_scatter_plot Scatter plot of y against x with optional aesthetics and a LOESS smoother.
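
The quantity that prepare_trend_graph is described as plotting, a per-period mean with standard-error bars, can be computed with a pandas group-by. The sketch below covers only that aggregation step and draws nothing. The column names year and sales are made up for illustration, and the sample standard error (ddof=1, the pandas default for sem) is an assumption about how the bars are defined.

```python
import pandas as pd

# Hypothetical panel: three observations per year.
panel = pd.DataFrame(
    {
        "year": [2000, 2000, 2000, 2001, 2001, 2001],
        "sales": [10.0, 12.0, 14.0, 11.0, 15.0, 19.0],
    }
)

# Per-period mean, standard error of the mean, and observation count.
trend = panel.groupby("year")["sales"].agg(mean="mean", se="sem", n="count")
print(trend)
```

Each row of trend then supplies one point of the line (mean) and the half-length of its error bar (se).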

Regression

OLS with fixed effects and clustered standard errors (pyfixest).

prepare_regression_table Build a regression table of one or more OLS models.
prepare_fwl_plot Frisch-Waugh-Lovell scatter of dv against the focal regressor var.
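
The Frisch-Waugh-Lovell result that prepare_fwl_plot visualises can be checked numerically with NumPy alone. In this sketch, run on simulated data, the slope from regressing the residualised dv on the residualised focal regressor equals the focal coefficient from the full regression. The variable names and the data-generating process are illustrative assumptions, and neither pyfixest nor prepare_fwl_plot is called.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500

# Simulated data: the focal regressor is correlated with the control.
control = rng.normal(size=n)
focal = 0.5 * control + rng.normal(size=n)
dv = 2.0 * focal - 1.0 * control + rng.normal(size=n)


def ols(y, X):
    """Least-squares coefficients of y on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


def residualize(y, X):
    """Residuals of y after projecting it on the columns of X."""
    return y - X @ ols(y, X)


ones = np.ones(n)

# Full regression: dv on intercept, focal regressor and control.
beta_full = ols(dv, np.column_stack([ones, focal, control]))[1]

# FWL: partial the intercept and control out of both dv and focal,
# then regress residual on residual.
X_controls = np.column_stack([ones, control])
dv_resid = residualize(dv, X_controls)
focal_resid = residualize(focal, X_controls)
beta_fwl = (focal_resid @ dv_resid) / (focal_resid @ focal_resid)

print(beta_full, beta_fwl)
```

A scatter of dv_resid against focal_resid is the kind of plot the FWL theorem motivates: its fitted slope is the focal coefficient net of the controls.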

Data preparation

treat_outliers Treat numerical outliers by winsorizing or truncating.
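
The difference between the two treatments named above can be sketched with NumPy. Winsorizing clips extreme values to the chosen percentile bounds, while truncating is assumed here to replace them with missing values. The function names, the percentile parameters and that missing-value convention are illustrative assumptions, not the signature or behaviour of treat_outliers.

```python
import numpy as np


def winsorize(values, lower=0.01, upper=0.99):
    """Clip values outside the [lower, upper] quantiles to those quantiles."""
    values = np.asarray(values, dtype=float)
    lo, hi = np.nanquantile(values, [lower, upper])
    return np.clip(values, lo, hi)


def truncate(values, lower=0.01, upper=0.99):
    """Replace values outside the [lower, upper] quantiles with NaN."""
    values = np.asarray(values, dtype=float)
    lo, hi = np.nanquantile(values, [lower, upper])
    return np.where((values < lo) | (values > hi), np.nan, values)


data = np.array([1.0, 2.0, 3.0, 4.0, 100.0])

# Treat only the upper tail, at the 80th percentile (23.2 for this data
# under NumPy's default linear interpolation).
w = winsorize(data, lower=0.0, upper=0.8)
t = truncate(data, lower=0.0, upper=0.8)
print(w)
print(t)
```

Winsorizing keeps the sample size unchanged and pulls the outlier in to the bound, whereas truncating removes the observation from later statistics.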

Interactive apps

The no-code ExPdPy exploration apps (Streamlit and Shiny).

streamlit_app.ExPdPy Launch (or describe) the interactive Streamlit ExPdPy app.
app.ExPdPy Launch (or build) the interactive Shiny ExPdPy app.

Datasets

load_kuznets Load the synthetic kuznets dataset (country-year, N-shaped regional Kuznets curve).
load_kuznets_data_def Return variable definitions for load_kuznets.
load_gapminder Load the gapminder dataset (life expectancy, population, GDP per capita).
get_config Return a startup configuration for ExPdPy.