Regression Table

Motivating example

Regression tables with multiple models displayed in the table columns are common in academic publications, and they usually follow the same standard format. The table below is an example from Fournier, Soroka, and Nir (2020) showing the effect of negative and positive televised news reports and political ideology on people's emotional arousal and activation, captured by physiological galvanic skin activity. It is easy to produce this type of table with tidypolars $^{4sci}$ , keeping everything in a tidy format.

Data

The synthetic data vote contains information about Democratic and Republican voters, including demographics and voting behavior:

Loading data and modules
import tidypolars4sci as tp
import tools4sci as t4
from tidypolars4sci.data import vote as df
import numpy as np
# 
from statsmodels.formula.api import ols as lm
from statsmodels.formula.api import glm as glm
from statsmodels.api import families as family

# variables:
df.__codebook__.print()

shape: (9, 3)
┌──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│ Variable            Type    Description                                                                                  │
│ str                 str     str                                                                                          │
╞══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
│ age                 int     Age                                                                                          │
│ income              float   Income (standardized)                                                                        │
│ gender              int     Gender (Male=0; Female=1)                                                                    │
│ ideology            float   Ideology self-placement (left=-10 to right=10)                                               │
│ treatment           int     Treatment group (treated=1; control=0)                                                       │
│ group               str     Group                                                                                        │
│ partisanship        str     Partisanship (Democrat or Republican)                                                        │
│ vote_conservative   int     Voted for the most conservative in-party candidate (Yes=1, No=0)                             │
│ rate_conservative   float   Voters rate of the most conservative in-party candidate (Dislike=low value; Like=high value) │
└──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┘

Estimating

Here are the functions for the estimation, prediction, and summarizing:

Functions for estimation, summary, and prediction
def create_formula(outcome, adjusted):
    if adjusted:
        # Adjustments are hard-coded here but could have been provided
        # as arguments for the function instead.
        adjustments = "income + age + gender"
    else:
        adjustments = "1"
    formula = f"{outcome} ~ treatment * ideology + {adjustments}"
    return formula

def estimate(data, model, formula):
    # need to covert to pandas for statsmodels
    data = data.to_pandas()
    if model == 'Linear':
        res = lm(formula, data=data).fit()
    else:
        # logit  model with clustered std. errors by the variable 'group'
        res = glm(formula, data=data, family=family.Binomial()).fit(cov_type="cluster",
                                                                    cov_kwds={"groups": data["group"]})
    return res

def get_summary(fit):
    res = fit.summary2().tables[1].reset_index(drop=False, names='term')
    return tp.from_pandas(res)

def predict(fit, data, at):
    newdata = t4.simulate.newdata(data, at=at)
    pred = fit.get_prediction(newdata.to_pandas()).summary_frame(alpha=0.05)
    return newdata.bind_cols(tp.from_pandas(pred))

And here is how to run the estimation in tidypolars $^{4sci}$ and produce a table with tidy results (click on the (+) sign to see code comments):

Tidy estimation, summary, and prediction
res = (df
       .nest('partisanship') Nest the data by partisanship.

       .crossing(outcome = ['rate_conservative', "vote_conservative"], crossing() expands (replicates) each row of the nested data for
    different outcomes and an indicator of whether the model uses
    adjustment variables.

                 adjusted = ['Yes', 'No'])
       .mutate(
           model = tp.case_when(tp.col("outcome").str.contains('rate'), 'Linear', This variable indicates which model is estimated depending on the
    outcome variable: rate_conservative (continuous) uses a linear
    model; vote_conservative (binary) uses a logit model.

                                tp.col("outcome").str.contains('vote'), 'Logit'),
           formula = tp.map(['outcome', 'adjusted'], lambda row: create_formula(*row))) The function map() performs a row-wise operation, creating the
    regression formula depending on the outcome and whether the
    estimation is adjusted; the star (*) used in *row unpacks the
    columns for the function create_formula().

       .mutate(
           fit     = tp.map(['data', 'model', 'formula'], lambda row: estimate(*row)), Fit the models in each row.

           summ    = tp.map(["fit"], lambda fit: get_summary(*fit)), Create a tidy summary (tibble) for each estimated model in the rows.

           pred    = tp.map(["fit", "data"], lambda row: predict(*row,
                                                                 at={'treatment':[0, 1],
                                                                     'ideology':range(-10, 10)}))  Create tables (tibbles) with predicted values at specified values of
    the predictors treatment and ideology.

       )
       )

Check the resulting tibble
1	`res`

shape: (8, 9)
┌────────────────────────────────────────────────────────────────────────────────┐
│ parti…   data     outco…   adjus…   model    formu…   fit      summ     pred   │
│ str      object   str      str      str      str      object   object   object │
╞════════════════════════════════════════════════════════════════════════════════╡
│ repub…   shape…   rate_…   Yes      Linea…   rate_…   <stat…   shape…   shape… │
│ repub…   shape…   rate_…   No       Linea…   rate_…   <stat…   shape…   shape… │
│ repub…   shape…   vote_…   Yes      Logit    vote_…   <stat…   shape…   shape… │
│ …        …        …        …        …        …        …        …        …      │
│ democ…   shape…   rate_…   No       Linea…   rate_…   <stat…   shape…   shape… │
│ democ…   shape…   vote_…   Yes      Logit    vote_…   <stat…   shape…   shape… │
│ democ…   shape…   vote_…   No       Logit    vote_…   <stat…   shape…   shape… │
└────────────────────────────────────────────────────────────────────────────────┘

Summarizing

Single model

Let us see statmmodel summary the results for a particular model:

pty = 'democrat'
model = 'Logit'
adjusted = 'Yes'
tab = (res
       .filter(tp.col("partisanship")==pty)
       .filter(tp.col("model")==model)
       .filter(tp.col("adjusted")==adjusted)
       .pull('fit')
       )

# result of the first model estimated
tab[0].summary()

                 Generalized Linear Model Regression Results                  
==============================================================================
Dep. Variable:      vote_conservative   No. Observations:                 1017
Model:                            GLM   Df Residuals:                     1010
Model Family:                Binomial   Df Model:                            6
Link Function:                  Logit   Scale:                          1.0000
Method:                          IRLS   Log-Likelihood:                -512.79
Date:                Thu, 06 Mar 2025   Deviance:                       1025.6
Time:                        18:07:44   Pearson chi2:                 1.02e+03
No. Iterations:                     5   Pseudo R-squ. (CS):             0.2843
Covariance Type:              cluster                                         
======================================================================================
                         coef    std err          z      P>|z|      [0.025      0.975]
--------------------------------------------------------------------------------------
Intercept             -0.1600      0.159     -1.008      0.314      -0.471       0.151
treatment             -0.4336      0.092     -4.724      0.000      -0.613      -0.254
ideology              -0.0805      0.031     -2.562      0.010      -0.142      -0.019
treatment:ideology    -0.2886      0.043     -6.765      0.000      -0.372      -0.205
income                -0.0467      0.064     -0.731      0.465      -0.172       0.079
age                    0.0200      0.005      3.972      0.000       0.010       0.030
gender                -0.1203      0.124     -0.967      0.333      -0.364       0.123
======================================================================================

Here is the tidy summary:

shape: (7, 7)
┌─────────────────────────────────────────────────────────────────────────┐
│ term                 Coef.   Std.Err.       z   P>|z|   [0.025   0.975] │
│ str                    f64        f64     f64     f64      f64      f64 │
╞═════════════════════════════════════════════════════════════════════════╡
│ Intercept            -0.16       0.16   -1.01    0.31    -0.47     0.15 │
│ treatment            -0.43       0.09   -4.72    0.00    -0.61    -0.25 │
│ ideology             -0.08       0.03   -2.56    0.01    -0.14    -0.02 │
│ treatment:ideology   -0.29       0.04   -6.76    0.00    -0.37    -0.20 │
│ income               -0.05       0.06   -0.73    0.46    -0.17     0.08 │
│ age                   0.02       0.01    3.97    0.00     0.01     0.03 │
│ gender               -0.12       0.12   -0.97    0.33    -0.36     0.12 │
└─────────────────────────────────────────────────────────────────────────┘

Multiple models

The goal is to create something like this:

To create a regression table with different models displayed in the columns, formatted for publication, we can use the function models2tab() from the model tools4sci. One of the outcomes will be a tibble with the models (tab), the other a string with the latex table (tabl). The function uses a dictionary with the estimated models. The keys are the column names. Line breaks with \n can be used.

# select the models that will show in the table
mods = res.filter(tp.col("partisanship")=='democrat')

# prepare the dictionary (keys will be column names)
mods = {f"Model {m}\nAdjusted: {a}" : fit
        for m, a, fit in zip(mods.pull('model'),
                             mods.pull('adjusted'),
                             mods.pull('fit'))
        }
mods

# from the tools4sci module
tab, tabl = t4.report.models2tab(mods,
                                 latex=True,
                                 # we can rename covariates
                                 covar_labels={"income": "Income (std)"},
                                 kws_latex={'caption': "Example table",
                                            'label': "tab-example",
                                            'header':None,
                                            'align':"lcccc",
                                            'escape':True,
                                            'longtable':False,
                                            'rotate':False
                                            },
                                 sanitize='partial'
                                 )

# here is the tidy table (one can save it in xlsx, or csv)
tab.print()

shape: (20, 5)
┌────────────────────────────────────────────────────────────────────────────────────┐
│                        Model Linear    Model Linear   Model Logit     Model Logit  │
│ str                    Adjusted: Yes   Adjusted: No   Adjusted: Yes   Adjusted: No │
│                        str             str            str             str          │
╞════════════════════════════════════════════════════════════════════════════════════╡
│ Intercept              -0.1194         -0.1194        -0.1600         -0.1600      │
│                        (0.1030)        (0.1030)       (0.1588)        (0.1588)     │
│ treatment              -0.5137***      -0.5137***     -0.4336***      -0.4336***   │
│                        (0.0609)        (0.0609)       (0.0918)        (0.0918)     │
│ ideology               -0.1021***      -0.1021***     -0.0805*        -0.0805*     │
│                        (0.0074)        (0.0074)       (0.0314)        (0.0314)     │
│ treatment x ideology   -0.2804***      -0.2804***     -0.2886***      -0.2886***   │
│                        (0.0104)        (0.0104)       (0.0427)        (0.0427)     │
│ Income (std)           0.0348          0.0348         -0.0467         -0.0467      │
│                        (0.0307)        (0.0307)       (0.0639)        (0.0639)     │
│ age                    0.0234***       0.0234***      0.0200***       0.0200***    │
│                        (0.0020)        (0.0020)       (0.0050)        (0.0050)     │
│ gender                 -0.5098***      -0.5098***     -0.1203         -0.1203      │
│                        (0.0610)        (0.0610)       (0.1244)        (0.1244)     │
│ N. Obs.                1017            1017           1017            1017         │
│ R2 (adj)               0.7641          0.7641                                      │
│ R2 (pseudo)                                           0.2843          0.2843       │
│ BIC                    2859.2311       2859.2311      -5968.2760      -5968.2760   │
│ AIC                    2824.7588       2824.7588      1039.5825       1039.5825    │
│ Std. Error             Classical       Classical      Clustered       Clustered    │
└────────────────────────────────────────────────────────────────────────────────────┘

And here is the latex version (note the footnote with p-values; it can be changed using the parameter footnote of the function t4.report.models2tab() of the tools4sci module):

\begin{table}[!htb]
\caption{Example table}
\label{tab-example}
\centering
\resizebox{\ifdim\width>\linewidth\linewidth\else\width\fi}{!}{
\begin{tabular}{lcccc}
\toprule
  & \makecell{Model Linear\\Adjusted: Yes} & \makecell{Model Linear\\Adjusted: No} & \makecell{Model Logit\\Adjusted: Yes} & \makecell{Model Logit\\Adjusted: No}\\
\midrule
Intercept  &  -0.1194   &  -0.1194   &  -0.1600   &  -0.1600  \\
  &  (0.1030)  &  (0.1030)  &  (0.1588)  &  (0.1588) \\
treatment  &  -0.5137***  &  -0.5137***  &  -0.4336***  &  -0.4336*** \\
  &  (0.0609)  &  (0.0609)  &  (0.0918)  &  (0.0918) \\
ideology  &  -0.1021***  &  -0.1021***  &  -0.0805*  &  -0.0805* \\
  &  (0.0074)  &  (0.0074)  &  (0.0314)  &  (0.0314) \\
treatment x ideology  &  -0.2804***  &  -0.2804***  &  -0.2886***  &  -0.2886*** \\
  &  (0.0104)  &  (0.0104)  &  (0.0427)  &  (0.0427) \\
Income (std)  &  0.0348   &  0.0348   &  -0.0467   &  -0.0467  \\
  &  (0.0307)  &  (0.0307)  &  (0.0639)  &  (0.0639) \\
age  &  0.0234***  &  0.0234***  &  0.0200***  &  0.0200*** \\
  &  (0.0020)  &  (0.0020)  &  (0.0050)  &  (0.0050) \\
gender  &  -0.5098***  &  -0.5098***  &  -0.1203   &  -0.1203  \\
  &  (0.0610)  &  (0.0610)  &  (0.1244)  &  (0.1244) \\
N. Obs.  &  1017  &  1017  &  1017  &  1017 \\
R2 (adj)  &  0.7641  &  0.7641  &    &   \\
R2 (pseudo)  &    &    &  0.2843  &  0.2843 \\
BIC  &  2859.2311  &  2859.2311  &  -5968.2760  &  -5968.2760 \\
AIC  &  2824.7588  &  2824.7588  &  1039.5825  &  1039.5825 \\
Std. Error  &  Classical  &  Classical  &  Clustered  &  Clustered \\
\bottomrule
\multicolumn{5}{r}{+ $p<0.1$; * $p<0.05$; ** $p<0.01$; *** $p<0.001$}\\
\end{tabular}}
\end{table}

Bonus

Grouping rows

We can group the rows in the table by post-processing the tibble outcome from the models2tab() function using tidypolars $^{4sci}$ function to_latex(). Something like this:

We need to create a column indicating the row group:

tab_rows_grouped = tab.mutate(groups = np.array(['Baseline']*2 +
                                                ['Core effects']*6 + 
                                                ['Demographics']*6 +
                                                ['Fit statistics']*6
                                                )
                              )
tab_rows_grouped.print()

shape: (20, 6)
┌─────────────────────────────────────────────────────────────────────────────────────────────────────┐
│                        Model Linear    Model Linear   Model Logit     Model Logit    groups         │
│ str                    Adjusted: Yes   Adjusted: No   Adjusted: Yes   Adjusted: No   str            │
│                        str             str            str             str                           │
╞═════════════════════════════════════════════════════════════════════════════════════════════════════╡
│ Intercept              -0.1194         -0.1194        -0.1600         -0.1600        Baseline       │
│                        (0.1030)        (0.1030)       (0.1588)        (0.1588)       Baseline       │
│ treatment              -0.5137***      -0.5137***     -0.4336***      -0.4336***     Core effects   │
│                        (0.0609)        (0.0609)       (0.0918)        (0.0918)       Core effects   │
│ ideology               -0.1021***      -0.1021***     -0.0805*        -0.0805*       Core effects   │
│                        (0.0074)        (0.0074)       (0.0314)        (0.0314)       Core effects   │
│ treatment x ideology   -0.2804***      -0.2804***     -0.2886***      -0.2886***     Core effects   │
│                        (0.0104)        (0.0104)       (0.0427)        (0.0427)       Core effects   │
│ Income (std)           0.0348          0.0348         -0.0467         -0.0467        Demographics   │
│                        (0.0307)        (0.0307)       (0.0639)        (0.0639)       Demographics   │
│ age                    0.0234***       0.0234***      0.0200***       0.0200***      Demographics   │
│                        (0.0020)        (0.0020)       (0.0050)        (0.0050)       Demographics   │
│ gender                 -0.5098***      -0.5098***     -0.1203         -0.1203        Demographics   │
│                        (0.0610)        (0.0610)       (0.1244)        (0.1244)       Demographics   │
│ N. Obs.                1017            1017           1017            1017           Fit statistics │
│ R2 (adj)               0.7641          0.7641                                        Fit statistics │
│ R2 (pseudo)                                           0.2843          0.2843         Fit statistics │
│ BIC                    2859.2311       2859.2311      -5968.2760      -5968.2760     Fit statistics │
│ AIC                    2824.7588       2824.7588      1039.5825       1039.5825      Fit statistics │
│ Std. Error             Classical       Classical      Clustered       Clustered      Fit statistics │
└─────────────────────────────────────────────────────────────────────────────────────────────────────┘

Then, we apply the to_latex() function:

tabl = tab_rows_grouped.to_latex(group_rows_by='groups')
print(tabl)

\begin{table}[!htb]
\centering
\resizebox{\ifdim\width>\linewidth\linewidth\else\width\fi}{!}{
\begin{tabular}{lllll}
\toprule
  & \makecell{Model Linear\\Adjusted: Yes} & \makecell{Model Linear\\Adjusted: No} & \makecell{Model Logit\\Adjusted: Yes} & \makecell{Model Logit\\Adjusted: No}\\
\midrule
\addlinespace[0.3em]\multicolumn{5}{l}{ \textbf{Baseline} }\\
\hspace{1em}Intercept  &  -0.1194   &  -0.1194   &  -0.1600   &  -0.1600  \\
\hspace{1em}  &  (0.1030)  &  (0.1030)  &  (0.1588)  &  (0.1588) \\
\addlinespace[0.3em]\multicolumn{5}{l}{ \textbf{Core effects} }\\
\hspace{1em}treatment  &  -0.5137***  &  -0.5137***  &  -0.4336***  &  -0.4336*** \\
\hspace{1em}  &  (0.0609)  &  (0.0609)  &  (0.0918)  &  (0.0918) \\
\hspace{1em}ideology  &  -0.1021***  &  -0.1021***  &  -0.0805*  &  -0.0805* \\
\hspace{1em}  &  (0.0074)  &  (0.0074)  &  (0.0314)  &  (0.0314) \\
\hspace{1em}treatment x ideology  &  -0.2804***  &  -0.2804***  &  -0.2886***  &  -0.2886*** \\
\hspace{1em}  &  (0.0104)  &  (0.0104)  &  (0.0427)  &  (0.0427) \\
\addlinespace[0.3em]\multicolumn{5}{l}{ \textbf{Demographics} }\\
\hspace{1em}Income (std)  &  0.0348   &  0.0348   &  -0.0467   &  -0.0467  \\
\hspace{1em}  &  (0.0307)  &  (0.0307)  &  (0.0639)  &  (0.0639) \\
\hspace{1em}age  &  0.0234***  &  0.0234***  &  0.0200***  &  0.0200*** \\
\hspace{1em}  &  (0.0020)  &  (0.0020)  &  (0.0050)  &  (0.0050) \\
\hspace{1em}gender  &  -0.5098***  &  -0.5098***  &  -0.1203   &  -0.1203  \\
\hspace{1em}  &  (0.0610)  &  (0.0610)  &  (0.1244)  &  (0.1244) \\
\addlinespace[0.3em]\multicolumn{5}{l}{ \textbf{Fit statistics} }\\
\hspace{1em}N. Obs.  &  1017  &  1017  &  1017  &  1017 \\
\hspace{1em}R2 (adj)  &  0.7641  &  0.7641  &    &   \\
\hspace{1em}R2 (pseudo)  &    &    &  0.2843  &  0.2843 \\
\hspace{1em}BIC  &  2859.2311  &  2859.2311  &  -5968.2760  &  -5968.2760 \\
\hspace{1em}AIC  &  2824.7588  &  2824.7588  &  1039.5825  &  1039.5825 \\
\hspace{1em}Std. Error  &  Classical  &  Classical  &  Clustered  &  Clustered \\
\bottomrule
\end{tabular}}
\end{table}

Grouping columns

We can also group columns instead, producing something like this:

We need to post-process the tibble outcome from the models2tab() function using tidypolars $^{4sci}$ function to_latex(). The code:

caption = "A regression table"
label = 'tab-regression'
header = [('', ''),
          ('Linear Models', 'Adjusted: Yes'),
          ('Linear Models', 'Adjusted: No'),
          ('Logit Models', 'Adjusted: Yes'),
          ('Logit Models', 'Adjusted: No'),
          ]
tabl = tab.to_latex(caption = caption,
                    label = label,
                    header = header,
                    align = 'lcccc',
                    footnotes = None)
print(tabl)

\begin{table}[!htb]
\caption{A regression table}
\label{tab-regression}
\centering
\resizebox{\ifdim\width>\linewidth\linewidth\else\width\fi}{!}{
\begin{tabular}{lcccc}
\toprule
  &  \multicolumn{2}{c}{Linear Models}  &  \multicolumn{2}{c}{Logit Models} \\
\cmidrule(lr){2-3} \cmidrule(lr){4-5}
  &  Adjusted: Yes  &  Adjusted: No  &  Adjusted: Yes  &  Adjusted: No \\
\midrule
Intercept  &  -0.1194   &  -0.1194   &  -0.1600   &  -0.1600  \\
  &  (0.1030)  &  (0.1030)  &  (0.1588)  &  (0.1588) \\
treatment  &  -0.5137***  &  -0.5137***  &  -0.4336***  &  -0.4336*** \\
  &  (0.0609)  &  (0.0609)  &  (0.0918)  &  (0.0918) \\
ideology  &  -0.1021***  &  -0.1021***  &  -0.0805*  &  -0.0805* \\
  &  (0.0074)  &  (0.0074)  &  (0.0314)  &  (0.0314) \\
treatment x ideology  &  -0.2804***  &  -0.2804***  &  -0.2886***  &  -0.2886*** \\
  &  (0.0104)  &  (0.0104)  &  (0.0427)  &  (0.0427) \\
Income (std)  &  0.0348   &  0.0348   &  -0.0467   &  -0.0467  \\
  &  (0.0307)  &  (0.0307)  &  (0.0639)  &  (0.0639) \\
age  &  0.0234***  &  0.0234***  &  0.0200***  &  0.0200*** \\
  &  (0.0020)  &  (0.0020)  &  (0.0050)  &  (0.0050) \\
gender  &  -0.5098***  &  -0.5098***  &  -0.1203   &  -0.1203  \\
  &  (0.0610)  &  (0.0610)  &  (0.1244)  &  (0.1244) \\
N. Obs.  &  1017  &  1017  &  1017  &  1017 \\
R2 (adj)  &  0.7641  &  0.7641  &    &   \\
R2 (pseudo)  &    &    &  0.2843  &  0.2843 \\
BIC  &  2859.2311  &  2859.2311  &  -5968.2760  &  -5968.2760 \\
AIC  &  2824.7588  &  2824.7588  &  1039.5825  &  1039.5825 \\
Std. Error  &  Classical  &  Classical  &  Clustered  &  Clustered \\
\bottomrule
\end{tabular}}
\end{table}

Plotting coefficients

The tidy format facilitates plotting the model coefficients. One can use the unnest() function. Here is the code:

model = 'Linear'
adjusted = 'Yes'
tab = (res
       .filter(tp.col("model")==model)
       .filter(tp.col("adjusted")==adjusted)
       .select('partisanship', 'summ')
       .unnest('summ')
       #
       .filter(~tp.col("term").str.contains('Intercept'))
       )
tab.print()

shape: (12, 8)
┌─────────────────────────────────────────────────────────────────────────────────────────┐
│ partisanship   term                 Coef.   Std.Err.        t   P>|t|   [0.025   0.975] │
│ str            str                    f64        f64      f64     f64      f64      f64 │
╞═════════════════════════════════════════════════════════════════════════════════════════╡
│ republican     treatment            -0.55       0.07    -8.35    0.00    -0.67    -0.42 │
│ republican     ideology             -0.12       0.01   -14.41    0.00    -0.13    -0.10 │
│ republican     treatment:ideology   -0.29       0.01   -25.66    0.00    -0.31    -0.27 │
│ republican     income               -0.01       0.03    -0.25    0.81    -0.07     0.06 │
│ republican     age                   0.02       0.00     7.81    0.00     0.01     0.02 │
│ republican     gender               -0.44       0.07    -6.76    0.00    -0.57    -0.31 │
│ democrat       treatment            -0.51       0.06    -8.44    0.00    -0.63    -0.39 │
│ democrat       ideology             -0.10       0.01   -13.86    0.00    -0.12    -0.09 │
│ democrat       treatment:ideology   -0.28       0.01   -27.00    0.00    -0.30    -0.26 │
│ democrat       income                0.03       0.03     1.13    0.26    -0.03     0.10 │
│ democrat       age                   0.02       0.00    11.58    0.00     0.02     0.03 │
│ democrat       gender               -0.51       0.06    -8.36    0.00    -0.63    -0.39 │
└─────────────────────────────────────────────────────────────────────────────────────────┘

Here is an example of a possible plot using Altair:

Plotting fitted line

The tidy format facilitates plotting the model prediction or fitted values. One can use the unnest() function. Here is the code:

model = 'Linear'
adjusted = 'Yes'
tab = (res
       .filter(tp.col("model")==model)
       .filter(tp.col("adjusted")==adjusted)
       .select('partisanship', "pred")
       .unnest("pred")
       )
tab.head().print()

shape: (5, 15)
┌──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│ partisanship     age   income   gender   ideology   treatment   group   vote_conservative   rate_conservative   mean   mean_se   mean_ci_lower   mean_ci_upper   obs_ci_lower   obs_ci_upper │
│ str              f64      f64      f64        i64         i64   str                   f64                 f64    f64       f64             f64             f64            f64            f64 │
╞══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
│ republican     43.92    -0.04     0.49        -10           0   a                    0.59                0.46   1.81      0.09            1.64            1.98          -0.19           3.82 │
│ republican     43.92    -0.04     0.49         -9           0   a                    0.59                0.46   1.70      0.08            1.54            1.85          -0.31           3.70 │
│ republican     43.92    -0.04     0.49         -8           0   a                    0.59                0.46   1.58      0.07            1.43            1.73          -0.42           3.58 │
│ republican     43.92    -0.04     0.49         -7           0   a                    0.59                0.46   1.46      0.07            1.33            1.60          -0.54           3.46 │
│ republican     43.92    -0.04     0.49         -6           0   a                    0.59                0.46   1.35      0.06            1.22            1.47          -0.66           3.35 │
└──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┘

The plot with predicted values:

References

Fournier, P., Soroka, S., & Nir, L. (2020). Negativity Biases and Political Ideology: A Comparative Test across 17 Countries. American Political Science Review, 114(3), 775–791. http://dx.doi.org/10.1017/S0003055420000131