Demystifying the Election Prediction 2016 Contest

September 30, 2016

ASA is sponsoring the Prediction 2016 contest, giving students a chance to take part in this year’s election season by using statistics to predict the next president of the United States. Think you’re up for the challenge? We’ve compiled a few resources to help you get started.

FiveThirtyEight

Nate Silver, a well-known American statistician, offers some great tips on how to create good prediction models on his website, FiveThirtyEight. He suggests models be probabilistic and empirical while also responding sensibly to change in inputs and avoiding changes of rules in midstream. There is also a very helpful “User’s Guide” on FiveThirtyEight’s prediction process for the 2016 election that explains the four major steps Silver follows.

Polling Data

So where’s a good first stop on the journey to predicting the next U.S. president? Polls. Data need to be collected, weighted and averaged before any models can be created or any predictions can be made. There are several different places on the web where election polling data can be found, including: Gallup, Quinnipiac and RealClearPolitics, among others. It’s also important to adjust and combine polls with other data points and account for uncertainty when making predictions.

Statistical Methods

After polling data have been collected and adjusted, it’s time to choose which statistical methods will be used to predict the election. This is an opportunity to get creative, as there are several different methods that can be used. Biostatistician Bob O’Hara wrote an article speculating how Nate Silver correctly predicted the outcome of every state in the 2012 U.S. presidential election, suggesting Silver used Bayes’ Theorem, graph theory and hierarchical modeling to make his predictions.

Jo Hardin, professor of mathematics at Pomona College, offers a different possible approach in her post on R-statistics blog. Hardin provides an analysis of data that uses weighted means and a formula for the standard error of a weighted mean. She also suggests the possibility of using a generalized linear model.

Finally, it may be worth taking a look at the Time for Change forecasting model for some additional inspiration. This model estimates the weights of just three predictors and has correctly predicted every U.S. presidential election since 1988.

Additional Resources

For additional reading on statistical approaches for election prediction, see the following:

Forsberg OJ, and Payton ME. (2015) “Analysis of Battleground State Presidential Polling Performances, 2004–2012,” Statistics and Public Policy, 2, 1-10. (Source)

Christensen WF, and Florence LW. (2008) “Predicting presidential and other multistage election outcomes using state-level pre-election polls,” The American Statistician, 62, 1-10. (Source)

Park, D. K., Gelman, A., and Bafumi, J. (2004), “Bayesian Multilevel Estimation with Poststratification: State-Level Estimates from National Polls,” Political Analysis, 12, 375–385. (Source)

Hindsight Is 2023 for Former Statistics and Data Science Students

Posted: September 14, 2023

It’s back-to-school season! Gear up for the upcoming semester and consider diving into the captivating world of statistics and data science. Looking for diverse job opportunities that span across every industry? Look no further! With a variety of graduate programs and jobs, now is a great time for students to become data scientists and statisticians….

Elizabeth J. Kelly: “Statistics is for Adrenaline Junkies”

Posted: June 27, 2023

Elizabeth J. Kelly has always loved math, and as a professional statistician at Los Alamos National Laboratory (LANL) and a recreational rock climber, Elizabeth is an avid thrill-seeker who enjoys a challenge. “Math reminds me of climbing, including the need to focus, problem solve and persevere. I guess I ended up in statistics because I…

Comments are closed.