BA Concept Note 2
Since the domain of big data and analytics is itself an ‘invention’ of this millennium, it seems
difficult to believe that its most widely used statistical package was launched more than two
decades ago. But it is true. R first appeared in 1996, when the statistics professors Ross
Ihaka and Robert Gentleman of the University of Auckland in New Zealand
released the code as a free software package. The professors wanted technology that was better
suited and easily accessible to their statistics students, who needed to analyze data and
produce graphical models of the information. Most comparable software, e.g. SAS, had been
designed by computer scientists, carried expensive licensing fees and was not user-friendly.
R is open source software and the language of statisticians and data analysts. Its syntax
and structure have been explicitly designed to formulate expressions about statistical objects.
It is particularly useful for data analysis because it contains a number of built-in, easy-to-use
commands for organizing data and creating both numerical and graphical summaries of data. It is
also a platform for predictive analytics and data science, and it runs on Windows, UNIX and Mac OS.
Even in the field of big data analytics, R is used for handling large and complex
data and can be run on high-performance clusters.
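As a small illustration of the built-in commands for numerical and graphical summaries, here is a minimal sketch in base R. The exam scores and the variable names (scores, avg, med, spread) are hypothetical, made up only for this example.

```r
# Hypothetical exam scores (illustrative data, not from any real source)
scores <- c(62, 75, 81, 58, 90, 77, 69)

# Numerical summaries using base R functions
avg    <- mean(scores)     # arithmetic mean
med    <- median(scores)   # middle value of the sorted scores
spread <- sd(scores)       # sample standard deviation
print(summary(scores))     # minimum, quartiles, mean and maximum

# Graphical summary: a histogram drawn with base graphics
hist(scores, main = "Exam scores", xlab = "Score")
```

When run in an interactive session the histogram opens in a plot window; when run as a script, R normally writes it to a file instead.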
As a language, R supports effective programming, including object-oriented programming,
with all the usual features such as conditionals, loops and user-defined recursive functions.
Unlike Python, it is not a general-purpose language but is heavily geared towards statistical work.
As a work environment, R offers “an integrated suite of software facilities for data
manipulation” or “an environment within which statistical techniques are
implemented”.
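The language features mentioned above (conditionals, loops and user-defined recursive functions) can be sketched in a few lines of R. The function name factorial_rec and the variable results are hypothetical names chosen for this illustration; base R already provides its own factorial() function.

```r
# User-defined recursive function: factorial of a positive integer
factorial_rec <- function(n) {
  if (n <= 1) {               # conditional: base case stops the recursion
    return(1)
  }
  n * factorial_rec(n - 1)    # recursive call on a smaller value
}

# Loop: store the factorials of 1 to 5 in a numeric vector
results <- numeric(5)
for (i in 1:5) {
  results[i] <- factorial_rec(i)
}
print(results)
```

The loop is used here only to show the construct; the same result could be obtained with sapply(1:5, factorial_rec), which is closer to the vectorised style R users usually prefer.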