This is my small effort to pickup streamgraph support in R developed by Bob Rudis. (Described here).
What you see is per year aggregations of results of all India v/s Pakistan One day Internationals. I pulled the records from Wikipedia and used rvest by Hadley Wickham. for extracting the results. After that a little data munging using dplyr and lubridate and voilà. Blue’s are India and Green’s are Pakistan in accordance with their team colors.
These are my notes for installing RHadoop on a Cloudera CDH 4.3.0 Hadoop Cluster. Although the notes are geared towards installing on CDH, they can be used to install RHadoop on any other Hadoop distro.
The default installation instructionsas per RHadoop wiki tell you to install the ‘R’ package from the EPEL Repo. The problem with that is ‘R’ pulls in ‘R-core-devel’ package, and that pulls in all sort of ‘build tools’ including the gcc compiler and and a host of other library ‘dev’ packages.