These notes go over some aspects of:
- Downloading data in R
- Managing large desktop-sized files (a few GB in size)
- Assessing parsing errors from
readr
- Manipulating data quickly with
dplyr
anddata.table
(in progress) - Implementing the split, apply, combine approach with
dplyr
, and reshaping and plotting withreshape2
andggplot2
- Plotting with
ggvis
I might also put together notes on visualizing missing data with pheatmap
and mi
.
You can view the html files as web pages here.
To run tutorials 1-4, you'll either need to replicate the directory structure on your computer or change dataPath
(at the top of each tutorial) to wherever you downloaded the data. One way to replicate the directory structure is to clone this repository. If you clone the repository, you can run the code directly from the markdown files.
Thanks, and happy computing!