R
Main Purpose
Large datasets (1M+ rows), statistical methods on datasets of any size
Benefits of this Resource
-
ggplot creates many different types of useful data visualizations
-
Robust ecosystem--if you want to do something, there’s almost certainly a package that makes it easy to use
-
Open source and free
Limitations
-
No GUI, a little bit of programming is required
-
Slow when working with huge datasets, can’t handle huge datasets as easily as SAS or Pandas (Python)
File Type Compatibility
Probably all, definitely CSV, XLS(X), TXT, PSV
Level of Expertise Required
High
*Comfort with programming (programming for R in and of itself is not that tough, but you need to be willing to write commands instead of manipulating a GUI)
Where to access
R system package free at rstudio
Learning Resources
Technical Books
Statistical Inference via Data Science
R for Marketing Research and Analytics
An Introduction to Statistical Learning with Applications in R
Hands-On Machine Learning with R
MOOCs