PollAccuracy

A Python project that simulates, and makes a "prediction" about, the 2016 presidential election (well, 3 months after the election). The idea is very simple: assuming that the outcome of the presidential race in each state follows a Bernoulli distribution in which Donald Trump (now president-elect) wins with probability p, we can simulate the race in every state a large number of times and compute the mean number of electoral votes.
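The idea above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the code in notebook.ipynb: the function name `simulate_elections`, the four-state dictionary and its win probabilities are all assumptions made up for the example (only the electoral-vote counts are real), and every state is treated as winner-takes-all.

```python
import random


def simulate_elections(states, n_sims=10_000, seed=0):
    """Simulate n_sims elections and return the electoral-vote total of each run.

    states maps a state name to (p_win, electoral_votes).
    """
    rng = random.Random(seed)
    totals = []
    for _ in range(n_sims):
        total = 0
        for p_win, votes in states.values():
            # One Bernoulli(p_win) draw per state: the winner takes all its votes.
            if rng.random() < p_win:
                total += votes
        totals.append(total)
    return totals


# Hypothetical win probabilities, for illustration only.
example_states = {
    "Florida": (0.5, 29),
    "Ohio": (0.6, 18),
    "Vermont": (0.0, 3),
    "Wyoming": (1.0, 3),
}

totals = simulate_elections(example_states)
mean_votes = sum(totals) / len(totals)
print(f"mean electoral votes: {mean_votes:.1f}")
```

With all states and realistic values of p, the same loop yields the distribution of electoral votes that the project summarizes.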

The question is: how do we find p?

Two methods are used:

Regression based on Washington Post pre-election polls (adjusted by the 2012 bias)
Bayesian inference with the Metropolis-Hastings algorithm
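To give a feel for the second method, here is a minimal random-walk Metropolis-Hastings sketch in Python. It is not the model in Bayesian_MCMC.R: it assumes a single hypothetical poll (520 of 1,000 two-party respondents favoring the candidate), a binomial likelihood and a uniform prior on the vote share theta, and it takes p to be the share of posterior samples with theta above 0.5. All names and numbers here are illustrative assumptions.

```python
import math
import random


def log_posterior(theta, k, n):
    """Unnormalized log posterior: Binomial(k | n, theta) likelihood, Uniform(0, 1) prior."""
    if not 0.0 < theta < 1.0:
        return -math.inf
    return k * math.log(theta) + (n - k) * math.log(1.0 - theta)


def metropolis_hastings(k, n, n_samples=20_000, burn_in=2_000, step=0.05, seed=0):
    """Draw samples of theta with a symmetric Gaussian random-walk proposal."""
    rng = random.Random(seed)
    theta = 0.5
    current = log_posterior(theta, k, n)
    samples = []
    for i in range(n_samples + burn_in):
        proposal = theta + rng.gauss(0.0, step)
        candidate = log_posterior(proposal, k, n)
        # Accept with probability min(1, posterior ratio), compared in log space.
        # 1.0 - random() lies in (0, 1], so the logarithm is always defined.
        if math.log(1.0 - rng.random()) < candidate - current:
            theta, current = proposal, candidate
        if i >= burn_in:
            samples.append(theta)
    return samples


# Hypothetical poll: 520 of 1,000 respondents favor the candidate.
samples = metropolis_hastings(k=520, n=1000)
posterior_mean = sum(samples) / len(samples)
p_win = sum(s > 0.5 for s in samples) / len(samples)
print(f"posterior mean share: {posterior_mean:.3f}, estimated p: {p_win:.2f}")
```

Because the proposal is symmetric, the acceptance ratio reduces to the ratio of posterior densities, which is why no proposal-density correction appears in the code.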

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

The project uses Python and several statistical and graphical libraries, notably:

pandas
numpy
matplotlib

Information on how to install these packages can be found online. I highly recommend using the Anaconda platform.

conda install pandas numpy matplotlib

R is used for the Bayesian simulation, mainly due to its computational prowess. I will soon add a Python version of that code.

Running:

Jupyter notebook

I prefer using a Jupyter notebook, as it retains the logical flow of the code and the way I broke the project down into smaller problems. Usage is very simple: just run notebook.ipynb cell by cell. At some point, you will have to switch to R and run Bayesian_MCMC.R to conduct the Bayesian MCMC.

I will soon add a detailed discussion of the project's results.

The project allows you to run a simulation of 10,000 election races and present the data in a nice histogram and a choropleth map.

A nice histogram that presents the results from 10,000 simulations:

With a less than 3% chance of winning, who would have thought that he is now the President-elect?
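A winning chance like the one quoted above is simply the share of simulated elections in which the candidate reaches 270 of the 538 electoral votes, and a histogram is a count of simulations per bin. The sketch below shows both steps with the standard library; the ten totals, the function names and the bin width are hypothetical and are not taken from the project's results.

```python
from collections import Counter

VOTES_TO_WIN = 270  # majority of the 538 electoral votes
BIN_WIDTH = 10      # assumed bin width for the text histogram


def win_probability(totals, threshold=VOTES_TO_WIN):
    """Share of simulated elections in which the candidate reaches the threshold."""
    return sum(t >= threshold for t in totals) / len(totals)


def histogram_counts(totals, bin_width=BIN_WIDTH):
    """Count simulations per bin, keyed by each bin's lower edge."""
    return Counter((t // bin_width) * bin_width for t in totals)


# Hypothetical totals from ten simulated elections (illustration only).
totals = [232, 251, 268, 270, 305, 198, 240, 269, 281, 215]

p_win = win_probability(totals)
counts = histogram_counts(totals)
print(f"estimated chance of winning: {p_win:.0%}")
for lower in sorted(counts):
    print(f"{lower}-{lower + BIN_WIDTH - 1}: {'#' * counts[lower]}")
```

The same bin counts could be handed to matplotlib to draw the chart; the text version keeps the example free of plotting dependencies.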

A choropleth map that presents the probability of Trump winning in each state:

Authors

Tuan Nguyen Doan - Initial work - tuangauss

This is a self-learning project and I hope to learn from the expertise of the community. Please reach out to me if you have any suggestions or ideas.

Acknowledgments

This is a self-learning project and I am proud to present the following sources as my reference (and inspiration):

Harvard Open Course CS109: Data Science (the idea of this project comes from one of the problem sets)
Open-source code: for the choropleth map, basic questions, run-time issues
Yale Stat 238
FiveThirtyEight
Open poll data from here
