Data Science
Foundation
Program
450+ Hiring Partners
Hybrid Model for Project
Sessions
175% Average
Salary Hike
www.learnbay.co
CONTEXT
01 About the program
02 Program Highlights
03 Program details
04 Alumni Spotlight
05 Learnbay’s ProjectLab
06 Project Innvoation Lab
07 Career Service
08 Certificate
09 Learning path
10 Program syllabus
11 Real-time projects and case-studies
About The Program
The Data Science Foundation Program is tailored
2cr for individuals looking to excel in the dynamic field
worth of data science. This comprehensive program
scholarship
s awarded encompasses essential topics such as data analysis
and machine learning. Participants gain valuable
industry insights and hands-on experience. Our
mission is to offer accessible education that
empowers individuals to thrive in the evolving
world of data science.
600+
professionals
secured jobs
after a
career break We exist to provide accessible, reasonable, and
industry-relevant education that empowers
India's workforce to grow and develop.
35k+
Trusted
Learners
Program Details
COURSE PREREQUISITE
Prior knowledge of programming/coding is not mandatory. Just the urge
to learn programming and basic ideas about advanced math is enough.
PROGRAM ELIGIBILITY
Working professionals having more than 6 months of experience in
any domain (Technical/Non-Technical)
KEY FEATURES
Dedicated Placement Cell | 100% Guaranteed Interview calls
Globally Recognised Certification from IBM & Microsoft
JOB ROLES TO TARGET
Get equipped with the industry relevant skills and aim for job roles like
Financial Data Analyst, Risk Analyst, Insurance Data Analyst, Fraud
Detection Analyst etc.
Click below
Check Eligibility
Alumni Spotlight
Learnbay has helped me a lot to learn data science applications
in the e-commerce industry. The live class concept was really
helpful in receiving proper DS training. Thanks to all my
mentors and the placement team.
Shravanthi A
Mechanical
Data Scientist @
230%
Data Scientist
Domain Salary Hike
The course structure is excellent with emphasis on concept
building and tools & software at the same time. The support
team is excellent and supportive and quite agile to respond to
doubts.
Preksha Mishra
Telecom
Data Scientist @ 140%
Lead Data Scientist
Domain Salary Hike
Karan Chawala Jaya Sinha Shubham Dev
Data Scientist Senior Analyst Lead Data Analyst
Alumni Spotlight
Thanks to the Learnbay data science course & excellent
guidance, I was able to ace the TCS interview and secure a job
with a 210% pay raise. The real-world time projects helped me
develop my concepts as a data scientist.
Mohd. Israr
Data Scientist
Mechanical 210%
Data Scientist @
Domain Salary Hike
When I joined Learnbay I did not have any knowledge apart
from the very basics. I gradually build my concept via various
trainers and get trained in data science with strong
knowledge/concepts.
Saurabh Kumar
Mathematics 135%
Data Scientist Data Scientist @
Professor Salary Hike
Aravind Ritesh Kumar Ramki
Senior Data Scientist Data Scientist Data Analyst
Learnbay’s ProjectLab
Choose Learnbay for your career journey because we're more than
just a training provider. Our Project Innovation Lab lets you apply
your skills in real-world scenarios. Get dual certifications for a
competitive edge. Specialize in your desired domain. Discover how
Learnbay can boost your career growth. Don't settle for less – choose
Learnbay, your path to success!
1 Project Innovation Lab
Work in an industry like environment and gain practical
hands-on experience of data scientist with dedicated
mentors from industry.
2 Dedicated Placement Cell
Experience 100% job assistance with guaranteed interview call
from leading MNCs and startups globally.
3 Degree & Certification
Gain top-notch skills for a successful career through our degree
and certification program
1 Project Innovation Lab
Learnbay's Project Innovation Lab replicates industry like
environment for real time projects. With our ProjectLab, you gain
real proof of hands-on experience by having your project certified by
the industry.
In our ProjectLab, you work like a data scientist with dedicated
project mentors from industry and get certified on capstone project.
450+ 1-1 Doubt Session
Hiring
HYDERABAD
Partners
PUNE
Capstone
Project
Certificate
from IBM
35k
Trusted
Learners DELHI
Project
Innovation Labs
Across India
BENGALURU
2 Career Service
Get 1 Year of Job and Placement support
Unleash your career potential with 1 year of unlimited
job access, interview support, and profile review.
1 Mock Interview with Industry Experts
Master the art of data science and stay ahead of the
curve with mockups and industry insights
Resume Building Session
Craft a powerful resume showcasing your expertise in
software development to stand out from the
competition
4 Guaranteed Interview Calls
Receive 4 interview calls from a diverse pool
of interested employers/recruiters.
3
Certificate
IBM Course Certificate
Obtain an internationally recognized
certificate through training.
Validate your Data Science skills with
IBM Certificate
Enhance your IT profile with IBM's
certification
Others Vs Learnbay
Benefits Learnbay Others
Guaranteed Interview
Calls
Industry capstone project
certificate from IBM
Domain specialized
programs for professionals
100% live interactive sessions
with industry experts
On-demand video call
with industry experts
Personalised Resume
Review Session
Program Fee
& Financing
Scholarship Financing as low as
Rs. 4,917/month
No Cost EMI
Scholarships are awarded based on
profile review. Eligible candidates
can avail upto 25% scholarship on
desired courses. Click the button
below to apply.
Click below Program Fee
Check Scholarship Eligibility Rs. 75,000/- +18% GST
Learning Path
L1 Cohort Orientation + Special Programming
Classes
Python Programming (Basic + Advance)
L2 Python, Anaconda, Github, Pandas
Statistics and Machine Learning
L3
Matplotlib, Scikit-Learn, Seaborn
Data Science Tools
L4
SQL, MongoDB, Tableau, PowerBI, Big Data & Spark Analytics, Time Series
AI Tools
L5
Deep Learning, NLP
Deplyment
L6
AWS+Azure
L7 AI Generative Tools and Future Trends
ChatGPT, Midjourney, DALL·E
TERM 1
Program Syllabus
Python Programming Module 1 (50 hours)
Programming Basics & Python Programming Overview
Environment Setup Python Overview
Installing Anaconda, Anaconda Basics Python 2.7 vs Python 3
and Introduction Writing your First Python Program
Get familiar with version control, Git Lines and Indentation, Python
and GitHub. Identifiers
Basic Github Commands. Various Operators and Operators
Introduction to Jupyter Notebook Precedence
environment. Basics Jupyter notebook Getting input from User, Comments,
Commands. Multi line Comments
Programming language basics
Python Data Types
Strings, Decisions & Loop Control List, Tuples, Dictionaries
Working With Numbers, Booleans Python Lists, Tuples, Dictionaries
and Strings, String types and Accessing Values, Basic Operations
formatting, String operations Indexing, Slicing, and Matrixes
Simple if Statement, if-else Statement Built-in Functions & Methods
if-elif Statement. Exercises on List, Tuples And
Introduction to while Loops, for Dictionary
Loops, Using continue and break
Class Hands-on:
Functions And Modules
6 programs/coding exercise on string, Anonymous Functions - Lambda
loop and conditions in classroom Using Built-In Modules, User-Defined
Modules, Module Namespaces,
Iterators And Generators
Functions And Modules Class Hands-on:
Introduction To Functions 8+ Programs to be covered in class of
Defining & Calling Functions functions, Lambda, modules, Generators
Functions With Multiple Arguments and Packages.
TERM 1
Program Syllabus
Python Programming Module 1 (50 hours)
File I/O An d Exceptional Handling Data Analysis Using Numpy
and Regular Expression Introduction to Numpy. Array
Opening and Closing Files Creation, Printing Arrays, Basic
open Function, file Object Attributes Operation - Indexing, Slicing and
close() Method, Read, write, seek. Iterating, Shape Manipulation -
Exception Handling, try-finally Clause Changing shape, stacking and
Raising an Exceptions, User-Defined splitting of array
Exceptions Vector stacking, Broadcasting with
Regular Expression- Search and Numpy, Numpy for Statistical
Replace Operation
Regular Expression Modifiers
Regular Expression Patterns
Assignment 1 (Week 2):
Class hands-on :
10 Coding exercises on Python
10+ Programs to be covered in class
Basics - Variables, Operators,
from File IO, Reg-ex and exception
Strings, Loops, Control Statement
handling.
Assignment 2 (Week 3):
10 Python programs and practice
set on List, Tuples, Dictionaries &
Data Analysis Using Pandas Matrices operations
Pandas : Introduction to Pandas Assignment 3 (Week 4):
Importing data into Python 10 Coding exercises on Functions,
Pandas Data Frames, Indexing Data Lambda, Input-Output, File and
Frames ,Basic Operations With Data Regular Expression
frame, Renaming Columns,
Subsetting and filtering a data frame.
TERM 1
Program Syllabus
Python Programming Module 1 (50 hours)
Data Visualization using Matplotlib Data Visualization using Seaborn
Matplotlib: Introduction, plot(), Seaborn: Intro to Seaborn And
Controlling Line Properties, Subplot Visualizing statistical relationships ,
with Functional Method, Multiple Plot, Import and Prepare data. Plotting
Working with Multiple Figures, with categorical data and Visualizing
Histograms linear relationships.
Seaborn Exercise
CASE STUDY
3 Case Study on Numpy, Pandas, Matplotlib
1 Case Study on Pandas And Seaborn
Assessment Test in Python :
2 hour of Assesment Test in Python (
Coding & Objective Questions )
Real time Use cases in Python to be Covered in Class with 5 assignments
TERM 2
Program Syllabus
Statistics Module 1 (30 hours)
Fundamentals of Math and All about Population & Sample
Probability Population vs Sample, Sample Size
Probability distributed function & Simple Random Sampling, Systematic
cumulative distribution function. Sampling, Cluster Sampling, Stratified
Conditional Probability, Baye’s Sampling, Convenience Sampling,
Theorem Quota Sampling, Snowball Sampling
Problem solving for probability and Judgement Sampling
assignments
Random Experiments, Mutually
Exclusive Events, Joint Events, Descriptive Statistics
Dependent & Independent Events
Measures of Central Tendency –
Mean, Median and Mode
Introduction to Statistics, Measures of Dispersion – Standard
Statistical Thinking Deviation, Variance, Range, IQR (Inter-
Quartile Range)
Variable and its types Measure of Symmetricity/ Shape –
Quantitative, Categorical, Discrete, Skewness and Kurtosis
Continuous,
*all with examples
Five Point Summary and Box Plot
Inferential Statistics
Outliers, Causes of Outliers, How to
Characteristics of Z-distribution and
treat Outliers, I-QR Method and Z-
T-Distribution.
Score Method
Type of test and rejection region.
Type of errors in Hypothesis Testing
Inferential Statistics
Central Limit Theorem
Point estimate and Interval estimate
Creating confidence interval for
population parameter
TERM 2
Program Syllabus
Statistics Module 1 (30 hours)
Hypothesis Testing Linear Algebra
Type of test and Rejection Region Dot Product, Projecting Point on Axis.
Type o errors-Type 1 Errors, Type 2 Matrices in Python, Element Indexing,
Errors. P value method, Z score Square Matrix, Triangular Matrix,
Method. The Chi-Square Test of Diagonal Matrix, Identity Matrix,
Independence. Addition of Matrices, Scalar
Regression. Factorial Analysis of Multiplication, Matrix Multiplication,
Variance. Pearson Correlation Matrix Transpose, Determinant, Trace
Coefficients in Depth. Statistical T-Test, Analysis of variance (ANOVA),
Significance and Analysis of Covariance (ANCOVA)
Null and Alternative Hypothesis One- Regression analysis in ANOVA
tailed and Two-tailed Tests, Critical
Class Hands-on:
Value, Rejection region, Inference
EXCEL Problem solving for C.L.T Problem
based on Critical Value solving Hypothesis Testing Problem
Binomial Distribution: Assumptions solving for T-test, Z-score test Case
of Binomial Distribution, Normal study and model run for ANOVA,
Distribution, Properties of Normal
ANCOVA
Distribution, Z table, Empirical Rule of
Normal Distribution & Central Limit
Theorem and its Applications
Data Processing & Exploratory
Data Analysis
What is Data Wrangling
Data Pre-processing and cleaning?
How to Restructure the data?
What is Data Integration and
Transformation
TERM 2
Program Syllabus
Statistics Module 1 (30 hours)
EDA
Finding and Dealing with Missing Values.
What are Outliers?
Using Z-scores to Find Outliers.
Bivariate Analysis, Scatter Plots and Heatmaps.
Introduction to Multivariate Analysis
Note: Problem-Solving Techniques and Case Studies using Statistics will be covered
in class from week 2
Statistics Assignments : Total 4 practice set and Assignments from Statistics
TERM 2
Program Syllabus
Machine Learning Module 2 (40 hours)
Machine Learning Introduction Data Preprocessing
Definition, Examples, Importance of Types of Missing values (MCAR, MAR,
Machine Learning MNAR) , Methods to handle missing
Definition of ML Elements: Algorithm, values
Model, Predictor Variable, Response Outliers, Methods to handle outliers:
Variable, Training - Test Split, Steps in IQR Method, Z Method
Machine Learning, Feature Scaling: Definition , Methods:
ML Models Type: Supervised Absolute Maximum Scaling, Min-Max
Learning, Unsupervised Learning and Scaler , Normalization,
Reinforcement Learning Standardization, Robust Scaling
Data Preprocessing Logistic Regression Model
Encoding the data: Definition, Definition. Why is it called the
Methods: OneHot Encoding, Mean “Regression model”?
Encoding, Label Encoding, Target Sigmoid Function, Transformation &
Guided Ordinal Encoding Graph of Sigmoid Function
K Nearest Neighbours Model
Evaluation Metrics for
Classification model Definition, Steps in KNN Model,
Types of Distance: Manhattan
Confusion Matrix, Accuracy,
Distance, Euclidean Distance, ‘Lazy
Misclassification, TPR, FPR, TNR,
Learner Model’.
Precision, Recall, F1 Score, ROC Curve,
Confusion Matrix of Multi Class
and AUC. Using Python library Sklearn
Classification
to create the Logistic Regression
Using Python library Sklearn to create
Model and evaluate the model
the K Nearest Neighbours Model and
created
evaluate the model
TERM 2
Program Syllabus
Machine Learning Module 2 (40 hours)
Decision Tree Model Random Forest Model
Definition, Basic Terminologies, Tree Ensemble Techniques:
Splitting Constraints, Splitting Bagging/bootstrapping & Boosting.
Algorithms: Definition of Random Forest, OOB
CART, C4.5, ID3, CHAID Score
Splitting Methods: K-Fold Cross-Validation
GINI, Entropy, Chi-Square, and
Reduction in Variance
Using Python library Sklearn to create
Naive Baye’s Model
the Decision Tree Model and evaluate
the model created Definition, Advantages, Baye’s
Theorem Applicability, Disadvantages
of Naive Baye’s Model, Laplace’s
Hyperparameter Tuning Correction, Types of Classifiers:
Gaussian, Multinomial and Bernoulli
GridSearchCV, Variable Importance.
Using Python library Sklearn to create
Using Python library Sklearn to create
the Naive Baye’s Model and evaluate
the Random Forest Model and
the model created
evaluate the model created.
Use cases
CASE STUDY
Business Case Study for Kart Model
Business Case Study for Random Forest
Business Case Study for SVM
To classify an email as spam or not spam using logistic Regression.
Application of Linear Regression for Housing Price Prediction
TERM 2
Program Syllabus
Machine Learning Module 2 (40 hours)
K Means and Hierarchical Hierarchical Clustering
Clustering Dendrogram, Agglomerative
Definition of Clustering, Use cases of Clustering, Divisive Clustering,
Clustering Comparison of K Means Clustering
K Means Clustering Algorithm, and Hierarchical Clustering
Assumptions of K Means Clustering Using Python library Sklearn to create
Sum of Squares Curve or Elbow Curve and evaluate the clustering model
Principal Component Support Vector Machine(SVM)
Analysis(PCA)
Model: Definition, Use Cases, Kernel
Definition, Curse of Dimensionality, Function, Aim of Support Vectors,
Dimensionality Reduction Technique, Hyperplane, Gamma Value,
When to use PCA, Regularization Parameter
Use Cases Using Python library Sklearn to create
Steps in PCA, EigenValues and and evaluate the SVM Model
EigenVectors, Scree Plot.
Using Python library Sklearn to create
Principal Components
Summary of all Machine Learning Models and Discussion about the Capstone
Project
Note : All Machine Learning Algorithms are covered in depth with real time case
studies for each algorithm. Once 60% of ML is completed, Capstone Project will be
released for the batch.
TERM 2
Program Syllabus
CASE STUDY Module 2 (40 hours)
Recommendation Engine for e-commerce/retail chain
Twitter data analysis using NLP
TERM 3
Program Syllabus
SQL Module 1 (14 hours)
SQL and RDBMS Advance SQL
RDBMS And SQL Operations. Advance SQL Operations
Single Table Queries - SELECT, Data Aggregations and summarizing
WHERE, the data
ORDER BY, Distinct, And, OR Ranking Functions: Top-N Analysis
Multiple Table Queries: INNER, SELF, Advanced SQL Queries for Analytics
CROSS, and OUTER, Join, Left Join,
Right
Join, Full Join, Union
JSON Data & CRUD
Basics and CRUD Operation
Databases, Collection & Documents
NoSQL, HBase & MongoDB Shell & MongoDB drivers
NoSQL Databases What is JSON Data
Introduction to HBase Create, Read, Update, Delete
HBase Architecture, HBase Finding, Deleting, Updating,
Components, Storage Model of HBase Inserting Elements
HBase vs RDBMS Working with Arrays
Introduction to Mongo DB, CRUD Understanding Schemas and
Advantages of MongoDB over RDBMS Relations
Programming with SQL Programming with SQL
Mathematical Functions Partitioning
Variables Filtering Data
Conditional Logic Subqueries
Loops
Custom Functions
Grouping and Ordering
TERM 3
Program Syllabus
SQL Module 1 (14 hours)
Assignments
Working with multiple tables
Practice Joins, Grouping and Subqueries
Using GROUP BY and HAVING Clauses
Practice Aggregation Queries
TERM 3
Program Syllabus
Tableau Module 3 (14 hours)
Introduction to Tableau Visual Analytics
Connecting to data source Getting Started With Visual Analytics
Creating dashboard pages Sorting and grouping
How to create calculated columns Working with sets, set action
Different charts Filters: Ways to filter, Interactive
Filters
Forecasting and Clustering
Dashboard and Stories
Working in Views with Dashboards
Tableau (Advance)
and Stories
Working with Sheets Mapping
Fitting Sheets Coordinate points
Legends and Quick Filters Plotting Latitude and Longitude
Tiled and Floating Layouts, Floating Custom Geocoding
Objects Polygon Maps
WMS and Background Image
Hands-on Assignments
Connecting data source and Tools covered
data cleansing
Working with various charts
Deployment of Predictive
model in visualization
TERM 3
Program Syllabus
Power BI Module 4 (14 hours)
Getting Started With Power BI Programming with Power BI
Installing Power BI Desktop and Working with Time Series
Connecting to Data Understanding aggregation and
Overview of the Workflow in Power BI granularity
Desktop Filters and Slicers in Power BI Maps
Introducing the Different Views of the Scatterplots and BI Reports
Data Mode Connecting Dataset with Power BI
Query Editor Interface Creating a Customer Segmentation
Working on Data Model Dashboard Analyzing the Customer
Segmentation Dashboard
Assignments
Create Bar charts
Tools covered
Create Pie charts
Create Tree maps
Create Donut Charts
Create Waterfall Diagrams
Creating Table Calculations
for Gender
The IBM exam will be conducted for all the
modules after completion of the course
Real-time Projects
12 hours 17 hours
Learn and develop Building a content
classification techniques for recommendation model on the
the digital transformation of basis of regional viewer
banking categorization
JPMorgan offers tax-friendly Netflix is a global entertainment video
insurance choices. You can help them streaming site. They offer content in
forecast insurance premiums. various regional languages. Build a
Targeted marketing using your local recommendation engine for
Random Forest Algorithm skills can Netflix customers residing in south
help obtain better premium values. Bangalore on their weekend and
weekday activities, utilizing NLP.
18 hours 14 hours
Reduction of waiting time via a Understanding in-depth
highly precise forecasting about logging while drilling
model (LWD) technique
Make a demand forecasting model Saudi Aramco company is working on
based on specific time period rider the development of high-efficiency
demands. Such a model will help drilling models. Use the bright sides of
both riders and cab drivers to ensure big data analytics to identify the most
the least possible waiting time. You cost- effective and highly productive
can include measures like latitude drilling sites.
and longitude identification.
Contact Us
Click here to whatsapp
or call us at
+91 77956 87988
www.learnbay.co