Blog 5: The Air War - Advertisements

Introduction

This week, I explored the effect of political advertisements on the Democratic vote share at the Congressional district level. Using political advertisement data from 2006-2018, I am able to run models for 152 congressional districts. Then, I specifically looked at Pennsylvania’s 10th district to make a model that includes both Democratic advertisement share (%) and Democratic generic ballot polling and to make a prediction for this district’s Democratic vote share in 2022.

In the United States, it’s not election season without a bombardment of political ads. According to a CNBC article from September 2022, campaigns in the 2022 elections have poured $6.4 billion into TV, radio and digital ads for U.S. House, Senate, gubernatorial and local races. The 2022 election is on track to become the most expensive election ever. But do these ads actually change voters’ opinions or are they just a waste of money?

Prior Literature

Gerber and authors ran an experiment on the effects of TV and radio advertising on voting preferences. The authors found a “rapid decay of advertising effects” in which “televised ads have strong but short-lived effects on voting preferences” (Gerber et al., 2011). This matches with the exploration we did in section where we found that both Democrats and Republicans spend the most on TV ads closest to the election.

Do Ads Matter in 152 Congressional Districts from 2006-2018?

With this theoretical background, I now explore the relationship between Democratic advertisement share (the percentage of ads run in the district that were for the Democratic candidate) and Democratic vote share in 152 congressional districts from 2006-2018. My main data set that I will be using is from the Wesleyan Media Project, which tracks political TV advertisements.

I ran a simple linear regression model between Democratic vote share and Democratic ad share for 152 congressional districts. The table below shows each district’s linear regression model and its r-squared value and coefficient for Democratic ad share. To interpret the coefficient, it is for every 1% increase in Democratic ad share, there is X% increase/decrease in Democratic vote share.

## # A tibble: 152 × 4
##     state          st_cd_fips r_squared coefficient
##     <chr>          <chr>          <dbl>       <dbl>
##   1 Alabama        0102       1             0.276  
##   2 Alabama        0106       0.995        -0.124  
##   3 Arizona        0401       0.738         0.153  
##   4 Arizona        0402       1.00          0.270  
##   5 Arizona        0405       1            -0.561  
##   6 Arizona        0408       0.666         1.69   
##   7 Arizona        0409       1             0.173  
##   8 Arkansas       0501       1             0.163  
##   9 Arkansas       0502       0.120         0.180  
##  10 Arkansas       0504       0.742         0.225  
##  11 California     0603       1            -0.565  
##  12 California     0607       1.00          0.158  
##  13 California     0610       1            51.0    
##  14 California     0611       1            -0.0857 
##  15 California     0616       1             0.0268 
##  16 California     0620       1            -0.920  
##  17 California     0621       0.895         0.0880 
##  18 California     0624       0.754        -0.0506 
##  19 California     0636       0.661         0.191  
##  20 California     0652       0.933         0.387  
##  21 Colorado       0803       0.992         0.559  
##  22 Colorado       0804       1            -0.360  
##  23 Colorado       0806       0.0539        0.423  
##  24 Colorado       0807       0.358        -0.0178 
##  25 Connecticut    0902       1             0.227  
##  26 Connecticut    0904       1             3.34   
##  27 Connecticut    0905       0.258        -0.0710 
##  28 Florida        1202       0.787        -0.291  
##  29 Florida        1208       1            -0.243  
##  30 Florida        1209       1             0.467  
##  31 Florida        1213       0.00932       0.0808 
##  32 Florida        1216       1             0.283  
##  33 Florida        1218       0.738         0.269  
##  34 Florida        1219       0.488         0.238  
##  35 Florida        1222       0.315         0.278  
##  36 Florida        1226       0.562        -0.263  
##  37 Georgia        1302       1           -19.4    
##  38 Georgia        1312       0.291         0.188  
##  39 Hawaii         1501       0.618        -0.107  
##  40 Idaho          1601       0.161        -0.177  
##  41 Idaho          1602       1             0.0810 
##  42 Illinois       1708       1             0.0500 
##  43 Illinois       1710       0.0000389     0.00106
##  44 Illinois       1711       1            -0.336  
##  45 Illinois       1712       0.173        -0.415  
##  46 Illinois       1713       1            -2.37   
##  47 Illinois       1717       0.0823        0.0831 
##  48 Indiana        1802       0.663         0.736  
##  49 Indiana        1803       1             0.333  
##  50 Indiana        1807       1            -0.191  
##  51 Indiana        1808       0.892         0.926  
##  52 Indiana        1809       0.0245        0.0330 
##  53 Iowa           1901       0.555        -0.626  
##  54 Iowa           1902       0.558         0.177  
##  55 Iowa           1903       0.00114      -0.0140 
##  56 Iowa           1904       0.325        -0.0791 
##  57 Kansas         2001       1            -0.0500 
##  58 Kansas         2003       0.983         0.331  
##  59 Kansas         2004       1             0.281  
##  60 Kentucky       2103       0.300         0.400  
##  61 Kentucky       2106       0.887         0.312  
##  62 Louisiana      2203       0.663         0.628  
##  63 Louisiana      2204       0.274        -0.111  
##  64 Maine          2301       0.186        -0.0455 
##  65 Maine          2302       0.964         1.16   
##  66 Maryland       2402       1             0.128  
##  67 Massachusetts  2504       1             0.282  
##  68 Massachusetts  2506       0.590         0.110  
##  69 Michigan       2606       0.309        -0.0849 
##  70 Michigan       2607       0.999        -0.122  
##  71 Michigan       2608       1             0.125  
##  72 Michigan       2611       0.954         0.238  
##  73 Minnesota      2702       0.944         0.127  
##  74 Minnesota      2703       1             0.0225 
##  75 Minnesota      2706       0.977        -0.611  
##  76 Minnesota      2707       1             0.372  
##  77 Minnesota      2708       0.137         0.141  
##  78 Mississippi    2802       1            -2.20   
##  79 Missouri       2904       1             0.521  
##  80 Missouri       2908       0.0556       -0.0198 
##  81 Nebraska       3102       0.570         0.394  
##  82 Nevada         3203       0.870         0.317  
##  83 Nevada         3204       0.918        -0.231  
##  84 New Hampshire  3301       0.864         0.299  
##  85 New Hampshire  3302       0.0227        0.0205 
##  86 New Jersey     3403       0.781         0.0544 
##  87 New Mexico     3501       0.986         0.190  
##  88 New Mexico     3502       0.629         0.531  
##  89 New York       3601       0.363        -0.117  
##  90 New York       3603       1             0.303  
##  91 New York       3618       1             0.0240 
##  92 New York       3619       0.105         0.287  
##  93 New York       3620       0.526         0.399  
##  94 New York       3621       0.211        -0.720  
##  95 New York       3622       1             0.154  
##  96 New York       3623       0.225        -0.200  
##  97 New York       3624       0.547        -0.734  
##  98 New York       3625       0.0491        0.0528 
##  99 New York       3626       1            -0.352  
## 100 New York       3627       1             1.18   
## 101 New York       3629       1             0.205  
## 102 North Carolina 3708       0.805         0.230  
## 103 North Carolina 3711       1.00          0.327  
## 104 North Carolina 3713       1             0.389  
## 105 Ohio           3901       0.693         0.239  
## 106 Ohio           3902       1             5.58   
## 107 Ohio           3905       1            -0.112  
## 108 Ohio           3906       0.711         0.549  
## 109 Ohio           3912       1             0.278  
## 110 Ohio           3913       1             0.276  
## 111 Ohio           3915       1            -5.99   
## 112 Ohio           3916       1            -0.510  
## 113 Ohio           3918       1            -1.19   
## 114 Oklahoma       4002       1             0.419  
## 115 Oklahoma       4005       0.0847        0.0229 
## 116 Oregon         4101       1             0.500  
## 117 Oregon         4105       0.173         0.0263 
## 118 Pennsylvania   4203       1             0.00722
## 119 Pennsylvania   4204       1            -0.0984 
## 120 Pennsylvania   4206       1            -1.30   
## 121 Pennsylvania   4207       1           -47.9    
## 122 Pennsylvania   4208       0.752        -0.0895 
## 123 Pennsylvania   4210       0.758         0.447  
## 124 Pennsylvania   4211       0.997         0.677  
## 125 Pennsylvania   4212       0.0341       -0.0598 
## 126 Pennsylvania   4216       0.00486       0.0105 
## 127 Rhode Island   4401       1            -0.119  
## 128 Rhode Island   4402       1             0.0767 
## 129 South Carolina 4505       0.538        -0.711  
## 130 Tennessee      4703       1             5.71   
## 131 Tennessee      4704       0.560        -0.109  
## 132 Tennessee      4709       0.335         0.0153 
## 133 Texas          4815       1             0.248  
## 134 Texas          4816       1            -1.65   
## 135 Texas          4817       1            -5.48   
## 136 Texas          4823       0.182         0.190  
## 137 Texas          4827       0.907         0.324  
## 138 Utah           4902       0.633         0.220  
## 139 Utah           4904       0.136         0.187  
## 140 Virginia       5102       0.303        -0.351  
## 141 Virginia       5105       0.534         0.367  
## 142 Virginia       5107       0.387         0.0606 
## 143 Virginia       5109       1            -1.03   
## 144 Virginia       5110       0.000441     -0.00541
## 145 Virginia       5111       1             0.119  
## 146 Washington     5305       0.850         0.214  
## 147 Washington     5308       0.973         0.163  
## 148 West Virginia  5402       0.974         0.127  
## 149 West Virginia  5403       0.639        -0.340  
## 150 Wisconsin      5503       1             0.202  
## 151 Wisconsin      5507       1             0.0386 
## 152 Wisconsin      5508       0.0559        0.327

To better understand and visualize these 152 different linear regression models, I created graphs displaying the distributions of the coefficients and r-square values.

This graph is the distribution of the coefficients of 152 congressional districts from 2006-2018. Almost all of the coefficients fell in the middle “0” range. However, this graph is a bit misleading because the distribution is so widespread that it simplifies the middle of the distribution. Thus, I “zoom” into the middle of the graph to better see the distribution.

## Warning: Removed 10 rows containing non-finite values (stat_bin).

This is the same graph as above but zoomed into the -2 to 2 range. As expected, this “zoom in” has allowed for more nuance in the distribution. Although the distribution is still roughly normally shaped, there is slightly a greater concentration in the positive coefficient zone (>0). This perhaps indicates that there are more districts where there is a positive relationship between Democratic ad share and Democratic vote share than the opposite. In other words, the greater the Democratic ad share, the greater the Democratic vote share.

This graph is the distribution of the r-squared values of 152 congressional districts from 2006-2018. Almost 50% of the models have r-squared values of 1, which at first glance is exciting, especially considering that I have had models with negative adjusted r-squared values, but I will talk more about this in my limitations section.

Pennsylvania’s 10th Congressional District

I was not able to run predictions for all 152 congressional districts like I did with the models, so I focused on Pennsylvania’s 10th Congressional District as an example. The current district is located in [south-central region of the state]“https://en.wikipedia.org/wiki/Pennsylvania%27s_10th_congressional_district”, but this is after it was redrawn in 2018. Prior to 2019, the district was located in the northeastern part of the state. This issue of gerrymandering is further explored in my limitations section. Since 2010, the seat, in its many different forms, has been held by a Republican, but prior to 2010 it has been held by a Democrat. The elections in this district are a bit varied with close elections like in 2006 where the Democratic candidate won with 52.90% of the vote or blow out elections like in 2016 where the Republican incumbent won with 70.2% of the vote.

Besides the interesting nature of its election history, I picked PA-10 because the Wesleyan Media Project had data on it for every election between 2006-2018 except 2008.

I ran a linear regression model for Democratic vote share in PA-10 based on Democratic ad share and nation-wide generic ballot polling for the Democrats.

## 
## Pennsylvania's 10th Congressional District
## ===================================================
##                             Dependent variable:    
##                         ---------------------------
##                          Democratic Vote Share (%) 
## ---------------------------------------------------
## Democratic Ad Share (%)           0.352**          
##                                   (0.069)          
##                                                    
## Generic Ballot Polling            1.440*           
##                                   (0.439)          
##                                                    
## Constant                          -39.773          
##                                  (19.346)          
##                                                    
## ---------------------------------------------------
## Observations                         5             
## R2                                 0.967           
## Adjusted R2                        0.935           
## Residual Std. Error           2.601 (df = 2)       
## F Statistic                29.664** (df = 2; 2)    
## ===================================================
## Note:                   *p<0.1; **p<0.05; ***p<0.01

From this model, we see that for every 1% increase in Democratic ad share, we expect a 0.352% increase in Democratic vote share in PA-10. Similarly, for every 1% increase in the nation-wide generic ballot polling for the Democrats, we expect a 1.440% increase in Democratic vote share in PA-10. The coefficient for Democratic ad share is statistically significant at p=0.05, while the coefficient for generic ballot is statistically significant at p=0.1. Furthermore, the adjusted r-squared value is an astonishing 0.935. However, we must also account for the fact that there are only 5 observations in this model, thus the adjusted r-squared value is not perfect.

###What is the Democratic vote share prediction for Pennsylvania’s 10th district in 2022? Because the data from the Wesleyan Media Project does not have data for 2020 nor 2022, I had to use the 2018 Democratic ad share as a stand-in for 2022. This was 58.67%. In addition, I pulled October 14’s generic ballot polling for Democrats from FiveThirtyEight as the generic ballot polling for 2022. This was 45.8%.

Using the linear regression model from above and this data for 2022, I predict that the Democratic vote share for Pennsylvania’s 10th district in 2022 will be 46.84%.

Limitations

The first limitation with my 152 district models is that the distribution of r-squared values was overwhelmingly 1. This is a bit suspicious and I am in the midst of trying to figure out what happened. I did filter out all districts that only had one row (which would then result in a r-squared value of 1). The second limitation I faced was with my model for PA-10. This district went through multiple rounds of gerrymandering that makes it hard for me to predict for the current district. This raises a potential difficulty I will face if I want to continue doing district-level modeling and predicting. The third limitation is that the Wesleyan Media Project only had data up until 2018 so I had to use the 2018 data as a stand-in for 2022, but we know from earlier that this 2022 election cycle is on-track to become the most expensive, so the Democratic ad share percentage might be different. The fourth limitation is that this data only looks at TV advertisements. It would be interesting to look at if and how online political advertising affects voter behavior.

Conclusion

This week, I explored at the district-level but faced many problems. Thus, I will consider if I want to continue doing district-level predictions in the future. Nonetheless, it was a good exercise.

Data: House Vote (given by class) Wesleyan Media Project (2006-2018) (given by class) House Generic Ballot Polls 1948-2020 (given by class)