Some supporters of Bernie Sanders are circulating
this paper from two Stanford students as proof that the Clinton camp committed electoral fraud. It's an interesting piece of work, but a weak reed for the claims surrounding it.
One of the key arguments is that states differ in whether they have a paper trail for their electronic voting or not, and that Sanders did much better in the states that have the trail, where fraud is presumably harder to commit.
The basic issue is whether there are other explanations for the discrepancy they found, and whether they adequately investigated them.
The initial numbers look undeniably bad. Sanders won only 35% in states without a paper trail, but 51% in states with a trail. If the 51% is the more accurate number, because it’s insulated from fraud, then the implication is that Sanders would have won in a fair election.
The first obvious question is whether states differed in some other way besides paper or no-paper. The authors are aware of that, and after presenting the raw 51% vs. 35% figure, they do a regression where they can at least in part control for those other factors.
The two variables that they add, besides paper vs. no-paper, are:
- Percent of non-Hispanic whites in a given state
- The “blueness” of the state (its history of voting for Democrats rather than Republicans in presidential elections since 1992).
These are both reasonable first efforts. Much was made of Clinton’s “firewall” among people of color, and Sanders’ better performance among whites, so the first control variable is getting at something real. The two candidates also differed in their appeal to long-time party regulars vs. people who don’t necessarily identify as Democrats. I don’t know how much the exit polling bears this out, but commentary suggested that Sanders appealed both to people who were critical of Clinton from the left, and to some whose sense of the system being rigged left them open either to Sanders or to Trump. So it makes sense to have some sort of variable for the party identification of a given state.
But there are problems with both of the variables they chose, and the clearer problem is with the race variable.