Introduction to R
Session Format
Theory and hands-on will go on side by side.
●
Focuss will be on the data analysis and analytics part.
●
●
Assumptions/Pre-requisite
Familiarity with at least one programming language. (Python, Java,
●
C/C++).
During the session, similarity or differences of R w.r.t. other languages
●
will be discussed.
●
Background & Usage
Lineage from the statistical languages which causes off-bit syntax.
●
Usage in the different industry segments.
●
●
Current Position
In-memory vs. distributed
●
License.
●
●
Installation
Here the relevant links are given w.r.t. Ubuntu OS but for the other OS
also, plenty of helpful links can be googled.
●
R (Essential)
– https://www.digitalocean.com/community/tutorials/how-to-install-r-on-ubuntu-20-
04
●
●
Rstudio (Demos will be given using this IDE)
– https://linuxconfig.org/how-to-install-rstudio-on-ubuntu-20-04-focal-fossa-linux
●
●
R in Jupyter Notebook (Good to have as an alternative)
– https://docs.anaconda.com/anaconda/navigator/tutorials/r-lang/
–
Resources and Acknowledgements
1.Good tutorial:
2.https://kingaa.github.io/R_Tutorial/
3.Edureka tutorial:
https://www.edureka.co/blog/r-tutorial/?utm_source=youtube&utm_camp
aign=r-tutorial-020617-wr&utm_medium=description#dataOperators
4.Youtube video: https://www.youtube.com/watch?v=_V8eKsto3Ug
Roadmap-1
1 What is this and why is it chosen?
2 What is R ?
3 Getting started with R
3.1 Installing R on your computer
3.2 Starting R
3.3 Stopping R
4 Interactive calculations
5 The help system
Roadmap-2
6 A first interactive session
6.1 Descriptive statistics
6.2 Linear regression
7 Statistics in R
8 The R package system
Roadmap-3
9 Data structures in R
9.1 Vectors
9.1.1 Element recycling
9.1.2 Functions for creating vectors
9.1.3 Vector indexing
9.1.4 Logical operations
9.1.5 More on vector indexing
Roadmap-4
9.2 Matrices and arrays
9.2.1 Creating matrices
9.2.2 cbind and rbind
9.2.3 Matrix indexing
9.3 Arrays
9.4 Factors
Roadmap-5
9.5 Lists
9.6 Data frames
10 Probability distributions in R
11 Scripts and data files
12 Looping in R
12.1 for loops
12.2 while loops
12.3 repeat loops
Roadmap-6
13 Functions and environments
13.1 Definitions and examples
13.2 Function scope
13.3 Nested functions and environments
Roadmap-7
14 The apply family of functions
14.1 List apply: lapply
14.2 Sloppy list apply: sapply
14.3 Multiple-list apply: mapply
14.4 Array apply: apply
14.5 Table apply: tapply
14.6 sapply with expected result: vapply
15 Vectorized functions vs loops
Demo
One small demo covering various steps of a ML based process
using R including:
●
Data collection
●
Data exploration and pre-processing
●
Applying ML based algorithm
●
Graphically result Analysis
Thank You