R

Main Purpose

Large datasets (1M+ rows), statistical methods on datasets of any size


Benefits of this Resource

  • ggplot creates many different types of useful data visualizations

  • Robust ecosystem--if you want to do something, there’s almost certainly a package that makes it easy to use

  • Open source and free


Limitations

  • No GUI, a little bit of programming is required

  • Slow when working with huge datasets, can’t handle huge datasets as easily as SAS or Pandas (Python)


File Type Compatibility

Probably all, definitely CSV, XLS(X), TXT, PSV


Level of Expertise Required

High
*Comfort with programming (programming for R in and of itself is not that tough, but you need to be willing to write commands instead of manipulating a GUI)


Where to access

R system package free at rstudio


Learning Resources

Technical Books

R for Data Science  

Statistical Inference via Data Science

Tidy Modeling with R

R for Marketing Research and Analytics

Business Data Science

An Introduction to Statistical Learning with Applications in R

Hands-On Machine Learning with R

MOOCs

Jumpstart with R

Business Analysis with R

Analytics Edge (R)