Data Analytics Course handout 2024 29.11.24 anjamma


COURSE FILE

Subject : DATA ANALYTICS


Subject Code : 22CS513PE
Academic Year : 2024-25

Name of the Faculty : N.ANJAMMA


Department : CSE
Branch & Year : CSE & II
Check List of the Course File
Department: CSE Date:
Subject Code: 22CS513PE
Title of the subject: DATA ANALYTICS

S.No Attributes Yes/No


1 Vision and Mission of institute
2 Course handout& its contents
a) Vision and Mission of department
b) PEOs of the program
c) Program Outcomes (POs)
d) Prerequisites
e) Course Outcomes (COs)
f) Detailed syllabus
g) Course Plan
h) Evaluation scheme
3 CO-PO mapping
4 Course material
5 Teaching diary for the course
6 Question bank prepared by faculty (unit wise)
7 Descriptive question bank for assignment
8 Sets of copies of old question papers
9 Analysis of student performance
10 Answer book copies
11 a) Internal books (Excellent/Good/Fair) = 3 Nos.
b) Assignment copies (Excellent/Good/Fair) = 3 Nos.
c) Laboratory records (if any) (Excellent/Good/Fair) = 3 Nos.
Whether remedial measures were taken by faculty members
after completion of first module (with supporting documents)
12 IQAC review report

HOD
Review Report
Department: CSE Date:
Subject Code:22CS513PE
Title of the subject: Data Analytics

S. No Observations Excellent/Good/Fair Suggestions/Remarks


1 Subject coverage alignment with the academic calendar
2 Quality of the question bank
a) Quiz question bank -NA-
b) Descriptive assignment
3 Correlation between Course Outcomes (COs), Program Outcomes (POs) and Program Specific Outcomes (PSOs).
4 Rate the alignment of the course content in bridging the gap for achieving Program Outcomes (POs) and Program Specific Outcomes (PSOs) through meaningful Course Outcomes (COs).
5 Faculty-initiated remedial measures and the resulting outcomes achieved for improvement.

Name and Signature of the Members

1. Subject Expert:

2. IQAC Coordinator 1:

3. IQAC Coordinator 2:
COURSE FILE
COURSE DESCRIPTION / COURSE INFORMATION SHEET

Name of the Dept.: COMPUTER SCIENCE AND ENGINEERING

Course Title Data Analytics

Course Code 22CS513PE Programme B.Tech

Regulation R22 Year/Semester III-I

Lectures Tutorials Practical Credits
Course Structure 5 1 0 3

Course Teacher N.ANJAMMA

Email nanjamma@tkrec.ac.in

Phone No 8919192173

Lectures Tutorial Practical
No. of Hours Allotted per Week 6 1 0
I. COURSE OVERVIEW:

1. Vision & Mission of the Institution

Vision: Imparting knowledge and instilling skills to the aspiring students in the field of Engineering, Technology, Science and Management to face the emerging challenges of the society.

Mission:
• Encouraging scholarly activities that transfer knowledge in the areas of Engineering, Technology, Science and Management.
• Ensuring students of all levels, well trained to meet the needs of education and their future endeavors.
• Inculcating human values and ethics into the education system for the all-round development of the students.

2. Course Handout

a) Vision & Mission of the Department

Vision: Enhance learning that promotes techno graduates aiming employability and entrepreneurship with human values to face the challenges in the global technological society.

Mission:
MISSION 1: Empowering students for professional career and higher studies by providing hands on experience and value education to become successful technocrats in the society.
MISSION 2: Nurturing students with interpersonal and entrepreneurial skills, so that they gain ability to work as a team.
MISSION 3: Imparting quality education, employability skills and techno ethical values among the students for the benefit of the society.

b) Program Educational Objectives (PEOs)
1. The students of the program will have strong foundation in the fundamental principles and gain advanced knowledge in the Basic Sciences, Mathematics and other application of Advanced Computer Engineering.
2. The students of the program will be prepared for their successful careers in the software industry / seek higher studies and continue to develop.
3. The students of the program will prepare to engage in professional development through self-study, graduate and professional studies in engineering & business.
4. Graduates shall have good communication skills, leadership skills, professional, ethical and social responsibilities.

c) Program Outcomes (POs) & Program Specific Outcomes (PSOs)
PO 1. Engineering knowledge: Apply the knowledge of mathematics, science, engineering fundamentals and an engineering specialization to the solution of complex engineering problems.
PO 2. Problem analysis: Identify, formulate, review research literature, and analyze complex engineering problems reaching substantiated conclusions using first principles of mathematics, natural sciences, and engineering sciences.
PO 3. Design/development of solutions: Design solutions for complex
engineering problems and design system components or processes that
meet the specified needs with appropriate consideration for the public
health and safety, and the cultural, societal, and environmental
considerations.
PO 4. Conduct investigations of complex problems: Use research-
based knowledge and research methods including design of
experiments, analysis and interpretation of data, and synthesis of the
information to provide valid conclusions.
PO 5. Modern tool usage: Create, select and apply appropriate
techniques, resources and modern engineering and IT tools including
prediction and modeling to complex engineering activities with an
understanding of the limitations.
PO 6. The engineer and society: Apply reasoning informed by the
contextual knowledge to assess societal, health, safety, legal and cultural
issues and the consequent responsibilities relevant to the professional
engineering practice.
PO 7. Environment and sustainability: Understand the impact of the
professional engineering solutions in societal and environmental
contexts and demonstrate the knowledge of, and need for sustainable
development.
PO 8. Ethics: Apply ethical principles and commit to professional ethics
and responsibilities and norms of the engineering practice.
PO 9. Individual and team work: Function effectively as an individual,
and as a member or leader in diverse teams, and in multidisciplinary
settings.
PO 10. Communication: Communicate effectively on complex
engineering activities with the engineering community and with society
at large, such as being able to comprehend and write effective reports
and design documentation, make effective presentations, and give and
receive clear instructions.
PO 11. Project management and finance: Demonstrate knowledge
and understanding of the engineering and management principles and
apply these to one’s own work, as a member and leader in a team, to
manage projects and in multidisciplinary environments.
PO 12. Life-long learning: Recognize the need for, and have the
preparation and ability to engage in independent and life-long learning
in the broadest context of technological change.
PSO 1: Acquired knowledge will be used to design and modify principles in the development of software and hardware systems to get a better quality product.
PSO 2: An ability to identify the state of professional development in preparing for competitive examinations that offers successful career and career building.

d) Prerequisites
• A course on “Database Management Systems”.
• Knowledge of probability and statistics.
e) Course Outcomes (COs)
CO1: Understand the impact of data analytics for business decisions and strategy.
CO2: Carry out data analysis/statistical analysis.
CO3: To carry out standard data visualization and formal inference procedures.
CO4: Design Data Architecture.
CO5: Understand various Data Sources.

f) Detailed Syllabus
UNIT-I: Data Management: Design Data Architecture and manage the data for analysis, understand various sources of Data like Sensors/Signals/GPS etc. Data Management, Data Quality (noise, outliers, missing values, duplicate data) and Data Preprocessing & Processing.
UNIT-II: Data Analytics: Introduction to Analytics, Introduction to Tools and Environment, Application of Modeling in Business, Databases & Types of Data and Variables, Data Modeling Techniques, Missing Imputations etc. Need for Business Modeling.
UNIT-III: Regression: Concepts, BLUE property assumptions, Least Square Estimation, Variable Rationalization, and Model Building etc.
Logistic Regression: Model Theory, Model Fit Statistics, Model Construction, Analytics applications to various Business Domains etc.
UNIT-IV: Object Segmentation: Regression vs Segmentation, Supervised and Unsupervised Learning, Tree Building (Regression, Classification), Overfitting, Pruning and Complexity, Multiple Decision Trees etc. Time Series Methods: ARIMA, Measures of Forecast Accuracy, STL approach, Extract features from generated model as Height, Average Energy etc. and Analyze for prediction.
UNIT-V: Data Visualization: Pixel-Oriented Visualization Techniques, Geometric Projection Visualization Techniques, Icon-Based Visualization Techniques, Hierarchical Visualization Techniques, Visualizing Complex Data and Relations.
Topics Covered Beyond Syllabus
• Data Mining Analysis and Concepts.
• Mining of Massive Datasets.
Text Books
1. Student’s Handbook for Associate Analytics – II, III.
2. Data Mining Concepts and Techniques, Han, Kamber, 3rd Edition, Morgan Kaufmann Publishers.
Reference Books
1. Introduction to Data Mining, Tan, Steinbach and Kumar, Addison Wesley, 2006.
2. Data Mining Analysis and Concepts, M. Zaki and W. Meira.
3. Mining of Massive Datasets, Jure Leskovec (Stanford Univ.), Anand Rajaraman (Milliway Labs), Jeffrey D. Ullman (Stanford Univ.).

ACADEMIC CALENDAR FOR III B. TECH. I SEM. 2024-25


TEEGALA KRISHNA REDDY ENGINEERING COLLEGE
(UGC-Autonomous)
Approved by AICTE, Affiliated to JNTUH, Accredited by NAAC - ‘A’ Grade
Medbowli, Meerpet, Balapur, Hyderabad, Telangana - 500097
Mob: 9393959597. Email: info@tkrec.ac.in, deanacademics@tkrec.ac.in

No. Topic(s) No. of Lecture Hours
UNIT-I - Data Management
1 Design Data Architecture and manage the data for analysis 2
2 Manage the data for Analysis 1
3 Understand various sources of Data like Sensors/Signals/GPS etc. 2
4 Data Management 1
5 Data Quality (noise, outliers, missing values, duplicate data) 2
6 Data Processing 1
7 Data Processing 1
8 Assignment 1
9 Slip Test 1
Total 12
UNIT-II - Data Analytics
10 Introduction to Data Analytics 1
11 Introduction to Tools and Environment 1
12 Application of Modeling in Business 1
13 Databases & Types of Data and Variables 2
14 Data Modeling Techniques 2
15 Missing Imputations 1
16 Need for Business Modeling 1
17 Assignment 1
18 Slip Test 1
Total 11
UNIT-III - Regression
19 Regression Concepts 1
20 BLUE Property Assumptions 1
21 Least Square Estimation 1
22 Variable Rationalization 1
23 Model Building 1
24 Logistic Regression 1
25 Model Fit Statistics 1
26 Model Construction 1
27 Analytics Applications 1
28 Assignment 1
29 Slip Test 1
Total 11
UNIT-IV - Object Segmentation
30 Regression vs Segmentation 1
31 Supervised and Unsupervised Learning 1
32 Tree Building - Regression 1
33 Classification & Overfitting 1
34 Pruning and Complexity 1
35 Multiple Decision Trees 1
36 Time Series Methods - ARIMA 1
37 Measures of Forecast Accuracy 1
38 STL Approach 1
39 Extract Features from Models 1
40 Assignment 1
41 Slip Test 1
Total 12
UNIT-V - Data Visualization
42 Pixel-Oriented Visualization Techniques 2
43 Geometric Projection Visualization Techniques 2
44 Icon-Based Visualization Techniques 2
45 Hierarchical Visualization Techniques 2
46 Visualizing Complex Data and Relations 2
47 Assignment 1
48 Slip Test 1
Total 12
Grand Total 58
h) Evaluation Scheme (Theory)
Evaluation Criteria Marks
Assignment I 05
Midterm-1 Descriptive Paper 20
Total 25
Assignment II 05
Midterm-2 Descriptive Paper 20
Total 25
Average of Midterm-1 and Midterm-2 25
End-Examination 75
Total 100
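The arithmetic of the scheme above can be sketched as follows. This is a minimal illustration, assuming the "average" is a plain arithmetic mean of the two 25-mark internal totals with no rounding; the function names `internal_total` and `final_marks` are hypothetical and not part of the handout.

```python
def internal_total(assignment: float, midterm: float) -> float:
    """One internal assessment: assignment (max 5) plus descriptive midterm (max 20)."""
    if not (0 <= assignment <= 5 and 0 <= midterm <= 20):
        raise ValueError("assignment must be in 0..5 and midterm in 0..20")
    return assignment + midterm


def final_marks(a1: float, m1: float, a2: float, m2: float, end_exam: float) -> float:
    """Average of the two internal totals (max 25) plus the end examination (max 75)."""
    if not 0 <= end_exam <= 75:
        raise ValueError("end examination marks must be in 0..75")
    # Assumption: plain arithmetic mean, no rounding rule applied.
    internal = (internal_total(a1, m1) + internal_total(a2, m2)) / 2
    return internal + end_exam


# Example with made-up marks: internals of 20 and 23 average to 21.5.
example = final_marks(4, 16, 5, 18, 60)
print(example)
```

Any institutional rounding rule (for instance rounding the average up) would need to be added on top of this sketch.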
1. Mapping of CO-PO & PSO

Course Objectives

1. To explore the fundamental concepts of data analytics.


2. To learn the principles and methods of statistical analysis.
3. Discover interesting patterns, analyze supervised and unsupervised models and estimate
the accuracy of the algorithms.
4. To understand the various search methods and visualization techniques.

Course Outcomes

CO1. Understand the impact of data analytics for business decisions and strategy.
CO2. Carry out data analysis/statistical analysis.
CO3. To carry out standard data visualization and formal inference procedures.
CO4. Design Data Architecture.
CO5. Understand various Data Sources.

CO/PO PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
CO-1 2 2 1 2 3
CO-2 1 1 1 1 1 2 3
CO-3 1 2 1 2 2 1

CO-4 1 1 2 2 1

CO-5 2 1 1 2 2 2 1

Average 1

*To be rated with 1- slightly, 2 – moderately, 3- substantial

Contribution of course to Program Outcomes & Program Specific Outcomes

Type Course Code, Title PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
Theory 22CS513PE, Data Analytics (Professional Elective - I) 2.25 2.5 2 2.66 2 2.66 2.5 2.25 1.8 1.8
Delivery Methodology

Course Delivery Methods/Modes:

1. Class room lectures : Blackboard


2. Presentations : Yes
3. Laboratory sessions : No
4. Demos : Yes
5. Assignments : Yes
6. Case studies : No
7. Seminars : Yes
8. Projects : No
9. E-Learning Resources: Yes

Mapping between Course Delivery Methodology and Program Outcomes

Course Delivery Methods/COs 1 2 3 4 5


Class room lecture 3 3 3 3 3
Presentations 1 1 1 1 1
Laboratory sessions - - - - -
Demo or simulations 1 1 1 1 1
Assignments 3 2 2 2 2
Case studies - - - - -
Projects - - - - -
Seminars 1 1 1 1 1
E-Learning resources 1 1 1 1 1
Weightage 37% 33% 33% 33% 33%

*To be rated with 1- slightly, 2 – moderately, 3- substantial
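
The weightage row in the table above is consistent with dividing each CO column's total by the maximum possible total (9 delivery methods, each rated at most 3). That formula is inferred from the numbers and is not stated in the handout; the sketch below reproduces it, treating "-" entries as 0. The names `RATINGS`, `MAX_RATING` and `weightage` are hypothetical.

```python
# Ratings per delivery method for CO1..CO5, copied from the table; "-" is taken as 0.
RATINGS = {
    "Class room lecture": [3, 3, 3, 3, 3],
    "Presentations": [1, 1, 1, 1, 1],
    "Laboratory sessions": [0, 0, 0, 0, 0],
    "Demo or simulations": [1, 1, 1, 1, 1],
    "Assignments": [3, 2, 2, 2, 2],
    "Case studies": [0, 0, 0, 0, 0],
    "Projects": [0, 0, 0, 0, 0],
    "Seminars": [1, 1, 1, 1, 1],
    "E-Learning resources": [1, 1, 1, 1, 1],
}
MAX_RATING = 3  # "3 - substantial" is the highest rating on the scale


def weightage(ratings: dict) -> list:
    """Percentage of the maximum possible score reached by each CO column."""
    rows = list(ratings.values())
    max_total = len(rows) * MAX_RATING  # assumed denominator: methods x top rating
    n_cos = len(rows[0])
    return [round(100 * sum(row[i] for row in rows) / max_total) for i in range(n_cos)]


print(weightage(RATINGS))
```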


Assessment Methodology

Outcome Assessment Tool Activity aligned to the Outcome
CO1, CO2, CO3, CO4, CO5 Test Conducted mid exams and slip tests.
CO1, CO2, CO3, CO4, CO5 Assignment Gave problems and questions to solve, and asked students to write multiple choice questions.
CO1, CO2, CO3, CO4, CO5 Rubric Evaluated minor and mid exam question papers.
CO1, CO2, CO3, CO4, CO5 Quiz Conducted multiple choice and fill-in-the-blank questions for mid exams and minor-2 exam.
CO1, CO2, CO3, CO4, CO5 Laboratory Students practiced various programs with different logic in the laboratory.
CO1, CO2, CO3, CO4, CO5 E-Learning Resources Followed YouTube or NPTEL videos.
CO1, CO2, CO3, CO4, CO5 End Semester Test Delivered the contents according to the syllabus and gave important questions unit-wise.

Note - Mention other assessment tools (if any)
Teaching diary for the course
At the end of the course, the students are able to achieve the following course learning outcomes

UNITS Course Learning Outcomes Topics to be Covered Text Book / Reference Book Method of Teaching (PPT / Blackboard) Covered Date

I I Data Management: Design Data Architecture and manage the data for analysis TB & RB Blackboard
Manage the data for Analysis TB Blackboard
Understand various sources of Data like Sensors/Signals/GPS etc. TB Blackboard
Data Management TB Blackboard
Data Quality (noise, outliers, missing values, duplicate data) TB Blackboard
Data Processing TB Blackboard
Data Processing TB Blackboard

II CO2, CO3 Data Analytics: Introduction to Data Analytics TB & RB Blackboard
Introduction to Tools and Environment TB Blackboard
Application of Modeling in Business TB Blackboard
Databases & Types of Data and Variables TB & RB Blackboard
Data Modeling Techniques TB Blackboard
Missing Imputations TB Blackboard
Need for Business Modeling TB Blackboard

III CO2, CO3 Regression: Regression Concepts TB Blackboard
BLUE Property Assumptions TB Blackboard
Least Square Estimation TB Blackboard
Variable Rationalization TB & RB Blackboard
Model Building TB Blackboard
Logistic Regression TB Blackboard
Model Fit Statistics TB Blackboard
Model Construction TB Blackboard
Analytics Applications TB & RB Blackboard

IV CO3, CO4 Object Segmentation: Regression vs Segmentation
Supervised and Unsupervised Learning TB & RB Blackboard
Tree Building - Regression TB Blackboard
Classification & Overfitting TB & RB Blackboard
Pruning and Complexity TB & RB Blackboard
Multiple Decision Trees TB & RB Blackboard
Time Series Methods - ARIMA TB & RB Blackboard
Measures of Forecast Accuracy TB & RB Blackboard
STL Approach TB Blackboard
Extract Features from Models TB & RB Blackboard

V CO3, CO4 & CO5 Data Visualization: Pixel-Oriented Visualization Techniques TB Blackboard
Geometric Projection Visualization Techniques TB & RB Blackboard
Icon-Based Visualization Techniques TB & RB Blackboard
Hierarchical Visualization Techniques TB Blackboard
Visualizing Complex Data and Relations TB Blackboard

Question Bank prepared by faculty (unit wise)


UNIT - I
SECTION – A
(Short Answer Questions for 1 Mark)

Q. No Question CO BL
1. What is Data Management? 1 1
2. Define Design Data Architecture. 1 1
3. List out Enterprise Requirements. 3 1
4. What is Randomized Block Design? 1 1
5. Define Data Quality. 1 1
6. What is Data Preprocessing? 1 1
7. Distinguish between Data Analytics and Data Analysis. 3 3

SECTION – B
(Essay Questions for 10 Marks)

Q. No Question CO BL
1. Explain Design Data Architecture and manage the data for analysis in detail with neat sketch. 1 2
2. Explain the sources of primary data. 1 2
3. Demonstrate briefly about data preprocessing. 1 3
4. Explain in detail the generation of primary data. 1 2
5. Explain survey methods and experimental method. 1 2
UNIT - II
SECTION – A
(Short Answer Questions for 1 Mark)
Q. No Question CO BL
1. What is data analytics? 1 1
2. Explain about tools used for data analytics. 1 2
3. List out some data modeling techniques. 3 1
4. Explain missing imputations. 2 1
5. Define data variables. Interpret the use of variables for business modeling. 1 1
6. What is the need for Business Modelling? 1 1

SECTION – B
(Essay Questions for 10 Marks)
Q. No Question CO BL
1. Discuss the importance of data analytics. 2 2
2. Describe the tools used for data analytics with an example. 2 2
3. Explain how and where missing imputations are involved in real world scenario. 2 2
4. Explain databases and types of data and variables involved in data analytics. 2 6
5. Explain with example the need for business modeling. 2 2

UNIT - III
SECTION – A
(Short Answer Questions for 1 Mark)
Q. No Question CO BL
1. State BLUE property assumptions. 3 2
2. What is variable rationalization? 1 1
3. Explain theoretically an analytics application in business domain. 2 2
4. How to calculate a LSE regression line? 3 3
5. Explain OLS. 2 2
6. What is Linear Regression? 3 1
7. Define Logistic Regression. 1 1
8. Write about Model Fit Statistics. 5 6

SECTION – B
(Essay Questions for 10 Marks)

Q. No Question CO BL
1. Explain about regression and discuss with an example. 2 2
2. Summarize how LSE works. 2 2
3. Describe the working procedures of Logistic Regression in business world. 3 2
4. Discuss briefly about variable rationalization. 3 3
5. Explain about model fit statistics used for regression with an example and also discuss about model construction. 2 2
UNIT - IV
SECTION – A
(Short Answer Questions for 1 Mark)
Q. No Question CO BL
1. What is Segmentation in Data Analytics? 1 1
2. Describe segmentation with an example. 2 1
3. Give real-time examples of supervised learning. 4 2
4. What are decision trees? 4 1
5. Briefly describe ARIMA method. 4 2
6. What is STL approach? 1 1
7. Tell about Pruning and Complexity in Object Segmentation. 1 1

SECTION – B
(Essay Questions for 10 Marks)

Q. No Question CO BL
1. What is Linear Regression? Explain with an example. 2 1&2
2. Differentiate between supervised and unsupervised learning. 4 6
3. Write briefly about overfitting and pruning. 5 6
4. Explain time series method with an example. 4 2
5. Generate a model to measure forecast accuracy. 4 3


UNIT – V
SECTION – A
(Short Answer Questions for 1 Mark)

Q. No Question CO BL
1. Describe the purpose of data visualization in data analytics. 2 1
2. Define Pixel Oriented Visualization Techniques. 1 1
3. Write short notes on Hierarchical visualization techniques. 5 6
4. Identify some of the important tools to visualize complex data and relationships in business analytics. 2 2
5. Define Geometric Projection Visualization. 1 1
6. Tell about Icon-based Visualization Techniques. 1 1

SECTION – B
(Essay Questions for 10 Marks)

Q. No Question CO BL
1. How can pixel-oriented visualization techniques be applied to large datasets? Discuss the challenges and solutions related to scalability and interpretability. 2 2
2. Describe geometric projection visualization techniques in detail. 2 2
3. Illustrate briefly about Icon Based Visualization Techniques. 3 3
4. Write notes on a) circle segment technique and b) space filling curves. 5 6
5. Discuss briefly about Hierarchical Visualization Techniques. 2 2
6. Discriminate the challenges in visualizing complex data and relations and suggest suitable mechanisms to address them. 4 4
Descriptive Question Bank for Assignment

UNIT I: Data Management

1. Explain Design Data Architecture and manage the data for analysis in detail with neat sketch.

2. Write briefly about sources of primary data.

3. Demonstrate briefly about data preprocessing.

4. Illustrate survey methods and experimental method.

5. Discuss briefly about Data Quality (noise, outliers, missing values, duplicate data) and show in data sets.

UNIT II: Data Analytics

1. What is data analytics? Why is data analytics important in the real world?

2. Write briefly about Tools and Environment in Data Analytics.

3. Summarize data modeling techniques.

4. Explain databases and types of data and variables involved in data analytics.

5. Illustrate with example the need for business modeling.


Question Bank Prepared by Faculty (Unit Wise)

UNIT-I
S. No Questions Bloom’s Taxonomy Level
1 What is data analytics? L1
2 Explain about tools used for data analytics. L1
3 List out some data modeling techniques. L3
4 Explain missing imputations. L2
5 Define data variables. Interpret the use of variables for business modeling. L1
6 What is the need for Business Modelling? L1
7 Explain the sources of primary data. L2
8 Write about data preprocessing needs. L6
9 Explain in detail for generating primary data. L2
10 Illustrate survey methods and experimental method. L3


UNIT-II
S. No Questions Bloom’s Taxonomy Level
1 What is data analytics? L1
2 Explain about tools used for data analytics. L2
3 Summarize data modeling techniques. L2
4 Explain missing imputations. L2
5 Define data variables. Interpret the use of variables for business modeling. L1
6 Discuss the importance of data analytics. L4
7 Describe the tools used for data analytics with an example. L4
8 Explain how and where missing imputations are involved in real world scenario. L2
9 Explain databases and types of data and variables involved in data analytics. L2
10 Demonstrate with example the need for business modeling. L2
UNIT-III
S. No Questions Bloom’s Taxonomy Level
11 State BLUE property assumptions. L1
12 What is variable rationalization? L1
13 Explain theoretically an analytics application in business domain. L2
14 How to calculate a LSE regression line? L3
15 Define briefly about OLS. L1
16 Explain about regression and discuss with an example. L2
17 Summarize how LSE works. L2
18 Describe the working procedures of Logistic Regression in business world. L4
19 Discuss about variable rationalization. L4
20 Explain about model fit statistics used for regression with an example and also discuss about model construction. L2
UNIT-IV
S. No Questions Bloom’s Taxonomy Level
21 What is regression? L1
22 Describe segmentation with an example. L2
23 Give real-time examples of supervised learning. L2
24 What are decision trees? L1
25 Briefly describe ARIMA method. L3
26 What is Linear Regression? Explain with an example. L1
27 Differentiate between supervised and unsupervised learning. L4
28 Detail overfitting and pruning. L3
29 Explain time series method with an example. L2
30 Generate a model to measure forecast accuracy. L4

UNIT-V
S. No Questions Bloom’s Taxonomy Level
31 Name some frequently used 2-D space-filling curves. L2
32 What is a scatter plot and scatter-plot matrix? L1
33 Specify the dimensionality of Chernoff faces. L4
34 Write a short note on Hierarchical visualization techniques. L6
35 Explain tag cloud briefly. L2&L3
36 Explain complex data and deduce its relationships. L5
37 Explain a visualization technique using parallel coordinates. L5
38 Explain asymmetrical Chernoff faces. L5
39 Explain geometric projection visualization. L5
40 Write notes on a) circle segment technique and b) space filling curves. L5

NAME & SIGN OF THE SUBJECT FACULTY


N. ANJAMMA

UNIVERSITY PAPERS
Code No: 138FU R16
JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY HYDERABAD
B.Tech IV Year II Semester Examinations, September - 2020
DATA ANALYTICS
(Computer Science and Engineering)
Time: 2 Hours Max. Marks: 75
Answer any Five Questions
All Questions Carry Equal Marks
---

1. Make a comparison of Randomized block design and Latin square design. Quote appropriate examples. [15]

2.a) Explain data preprocessing in data management.
b) Discuss the process of handling duplicate values in organizational data. [7+8]

3. Explain data imputation and how can repeated imputations enormously improve the quality of estimation. [15]

4.a) Discuss the importance of business modeling.
b) Compare the techniques for dealing numerical data with categorical data. [7+8]

5. Apply linear regression using the method of least squares to the following data and predict the crop yield for rainfall of 5 cm. [15]
Rainfall (in cms) 10.5 8.8 13.4 12.5 18.8 10.3 7.0 15.6 16
Paddy yield (quintal per acre) 30.3 46.2 58.8 59.0 82.4 49.2 31.9 76.0 78.8
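
As a study aid for question 5, the least squares calculation can be sketched in a few lines. This is a minimal illustration and not a model answer from the university: it uses the textbook formulas slope = Sxy / Sxx and intercept = mean(y) - slope * mean(x), and the function name `least_squares_fit` is hypothetical.

```python
def least_squares_fit(xs, ys):
    """Return (slope, intercept) of the simple least squares line y = intercept + slope * x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sxx: sum of squared deviations of x; Sxy: sum of cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Data copied from the question paper.
rainfall = [10.5, 8.8, 13.4, 12.5, 18.8, 10.3, 7.0, 15.6, 16]
paddy_yield = [30.3, 46.2, 58.8, 59.0, 82.4, 49.2, 31.9, 76.0, 78.8]

slope, intercept = least_squares_fit(rainfall, paddy_yield)
predicted = intercept + slope * 5.0  # predicted yield for 5 cm of rainfall
print(f"slope={slope:.3f}, intercept={intercept:.3f}, yield at 5 cm={predicted:.2f}")
```

Note that 5 cm lies below the smallest observed rainfall (7.0 cm), so the prediction is an extrapolation and should be interpreted with caution.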

6.a) Explain the advantages of decision trees.
b) Describe the need of tree pruning in decision trees. [7+8]
c) Discuss in detail the steps involved in ETL process and tools available for this process. [15]
7. Explain the challenges in visualizing complex data and relations and suggest suitable mechanisms to address them. [15]

---ooOoo---
Code No: 138FU R16
JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY HYDERABAD
B.Tech IV Year II Semester Examinations, September - 2020
DATA ANALYTICS
(Electronics and Communication Engineering)
Time: 2 Hours Max. Marks: 75
Answer any Five Questions
All Questions Carry Equal Marks
---

1.a) Data set D {10K, 15K, 22K, 25K, 36K, 40K, 13K, 19K, 88K, 94K} represents packages of the students placed in an interview where "K represents thousand". Identify the outliers in the dataset and analyze its impact in studying the spread of data.
b) Illustrate techniques of missing values treatment with example. [8+7]
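
One common way to approach question 1(a) is the 1.5 x IQR rule; the paper itself does not prescribe a method, so the sketch below is only one possible reading. It assumes quartiles computed by linear interpolation (Python's `statistics.quantiles` with `method="inclusive"`), and the function name `iqr_outliers` is hypothetical. Other quartile conventions can give slightly different fences.

```python
from statistics import mean, quantiles, stdev


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Packages from data set D, in thousands (K).
packages = [10, 15, 22, 25, 36, 40, 13, 19, 88, 94]

outliers = iqr_outliers(packages)
trimmed = [v for v in packages if v not in outliers]

# Compare centre and spread with and without the flagged values.
print("outliers:", outliers)
print("mean / stdev with outliers:   ", mean(packages), round(stdev(packages), 2))
print("mean / stdev without outliers:", mean(trimmed), round(stdev(trimmed), 2))
```

Comparing the two printed lines shows how strongly the flagged values inflate the mean and the standard deviation, which is the "impact on spread" the question asks about.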

2.a) Demonstrate Missing Imputation methods in detail with examples.
b) Illustrate Data modeling techniques. [8+7]

3.a) Explain different types of variables used in Regression modeling.
b) Demonstrate linear regression with suitable example. [8+7]

4.a) Outline major steps of decision tree classification with a suitable example.
b) What is tree pruning? Illustrate drawback of using separate set of tuples to evaluate pruning. [7+8]

5.a) Apply dimensional stacking and explain how to visualize multivariate data.
b) Analyze and outline the importance of scales and dimensions in spreadsheet visualization. [7+8]

6.a) Demonstrate data preprocessing techniques in detail.
b) What is data deduplication? Explain deduplication methods. [9+6]

7.a) Qualitative variables are not categorical. Justify with suitable example.
b) Discuss storage mechanism of unstructured data in distributed computing. [7+8]

8.a) Apply logistic regression to demonstrate binary classification.
b) What is least square estimate? Illustrate its importance in regression modeling. [7+8]

---ooOoo---
LIST OF TOPICS FOR STUDENT SEMINARS (Optional):
1. Data Management
2. Data Analytics
3. Regression
4. Logistic Regression
5. Object Segmentation
6. Time Series Methods
7. Data Visualization

MID QUESTION PAPERS

TEEGALA KRISHNA REDDY ENGINEERING COLLEGE
(UGC-Autonomous)
B. Tech (III-I) Semester [R22] MID-I Examinations, SEP-2024
Name of the subject: Data Analytics Date: 03-10-2024
Branch: CSE
Time: 2:00 hrs. Max. Marks: 30
All questions carry equal marks

Q. No Questions Questions from UNIT Marks Blooms Level CO
Q.1 Discuss briefly the various sources of data, and explain data sources such as Sensors/Signals/GPS etc. I 6 L6 V
Q.2 Explain briefly about data preprocessing in data management. I 6 L2 II
Q.3 Define Data Analytics. Why is data analytics important in the real world? Also explain tools and environment in data analytics. II 6 L1 I
Q.4 Illustrate Data modeling techniques with examples. II 6 L4 III
Q.5 Demonstrate linear regression with suitable example. III 6 L2 II


TEEGALA KRISHNA REDDY ENGINEERING COLLEGE
(UGC-Autonomous)
B. Tech (III-I) Semester [R22] MID-I Examinations, SEP-2024
Name of the subject: Data Analytics Date: 03-10-2024
Branch: CSE
Time: 2:00 hrs. Max. Marks: 30
All questions carry equal marks

Q. No Questions Questions from UNIT Marks Blooms Level CO
Q.1 Explain briefly about Design Data Architecture and manage the data for analysis in data management with neat sketch. I 6 L2 II
Q.2 Discuss briefly about Data Quality (noise, outliers, missing values, duplicate data) with examples. I 6 L6 V
Q.3 Define Data Analytics. And also explain briefly about tools and environment in data analytics. II 6 L1 I
Q.4 Demonstrate Missing Imputation methods in detail with examples. II 6 L2 II
Q.5 Illustrate Least Square Estimation method with example. III 6 L4 III

Analysis of student performance in the course
Performance Index (Theory)

CSE-A

S. No H.T. No Name of the Student MID (40) EXT (60) Grade
1 22R91A0501 Akkenapalli Kirthan
2 22R91A0502 Akkineni Tharun
3 22R91A0503 Aleti Teja Reddy
4 22R91A0504 Alugani Praneeth Goud
5 22R91A0505 Anil Kumar Malik
6 22R91A0506 Anugonti Prathyusha
7 22R91A0507 Anugu Sathwika
8 22R91A0508 Appani Sai Rakshith
9 22R91A0509 Arakala Ravali
10 22R91A0510 Asamagari Varsha Reddy
11 22R91A0511 Azmeera Bhavanjali
12 22R91A0512 B Poojitha
13 22R91A0513 B Vamshi
14 22R91A0514 Bagara Sangeetha
15 22R91A0515 Bakka Mounika
16 22R91A0516 Banala Manasa
17 22R91A0517 Bandari Sriram
18 22R91A0518 Bandi Laxmi Prasuna
19 22R91A0519 Bandi Prashanth Reddy
20 22R91A0520 Bandlapelli Raghu
21 22R91A0521 Banothu Bhagya Teja
22 22R91A0522 Bayyaram Rishika Chary
23 22R91A0523 Bhukya Anusha
24 22R91A0524 Bijili Sriya
25 22R91A0525 Boda Bhuvaneshwari
26 22R91A0526 Boda Ram Kumar
27 22R91A0527 Boge Ashritha
28 22R91A0528 Bogini Shiva Kumar
29 22R91A0529 Bonala Sriharshini
30 22R91A0530 Boniga Bala Akshith
31 22R91A0531 Borra Dinesh Reddy
32 22R91A0532 Botumanchi Ezra Harsha
33 22R91A0533 Brahmanlapally Praneeth
34 22R91A0534 Busheera Begum
35 22R91A0535 Butharaju Prathyusha
36 22R91A0536 Byagari Abhishek
37 22R91A0537 Chamakuri Navya
38 22R91A0538 Chenchu Swathi
39 22R91A0539 Chenna Sujan Kumar
40 22R91A0540 Chetkuri Ramanjay
41 22R91A0541 Chikurthi Srinivas
42 22R91A0542 Chintakatla Saikumar Goud
43 22R91A0543 Chinthakindi Ganesh
44 22R91A0544 Chinthareddy Madhuveer Re
45 22R91A0545 Chinthkoti Gayathri
46 22R91A0546 Chippakurthi Alekhya
47 22R91A0547 D Bharath Kumar
48 22R91A0548 Dadigela Varun
49 22R91A0549 Daida Gopinadh
50 22R91A0550 Damidi Srinitha Reddy
51 22R91A0551 Dandrekala Madhav
52 22R91A0552 Daravath Naveen
53 22R91A0553 Degloorkar Bhargavarama
54 22R91A0554 Dhanavath Ranga
55 22R91A0555 Dhanavath Simhadri
56 22R91A0556 Dharmendar Yadav
57 22R91A0557 Dharshanala Shravya
58 22R91A0558 Dokkari Dileep
59 22R91A0559 Dommata Sreeja Reddy
60 22R91A0560 Dundigalla Varun Raj
61 22R91A0561 Dundra Shravan Kumar
62 22R91A0562 Durshetty Shivkumar Rames
63 22R91A0563 E Joshitha
64 22R91A0564 Eluri Ramcharan Reddy
65 22R91A0565 Enagandula Keerthana
66 23R95A0501 Akula Harshitha Patel
67 23R95A0502 Are Akshay Kumar
68 23R95A0503 Bairoju Sanjay Kumar
69 23R95A0504 Bhandinagar Meenakshi
70 23R95A0505 Darshanapally Lakshmikant
71 23R95A0506 Gaddha Surya Teja
CSE-B

S. No H.T. No Name of the Student MID EXT Grade
1 22R91A0566 ENDRAKANTI ABHISHEK
2 22R91A0567 G SHIRISHA
3 22R91A0568 GANGADHARI THARUN
4 22R91A0569 GANTA NAGARJUNA
5 22R91A0570 GARDAS GANESH
6 22R91A0571 GHANTA EESHA
7 22R91A0572 GILLA SHIVA
8 22R91A0573 GOLKONDA KEERTHANA
9 22R91A0574 GOMASA SHARANYA
10 22R91A0575 GUGGILLA BHARGAVI
11 22R91A0576 GUMMULA SINDHUJA
12 22R91A0577 GUNDAPUNENI AKHIL
13 22R91A0578 GUNDU ADARSH SAI
14 22R91A0579 INDLA RAMYA SRI
15 22R91A0580 ISLAVATH ASHOK KUMAR
16 22R91A0581 ITIKELA ABHINAYA
17 22R91A0582 J BHAVANA
18 22R91A0583 J KARTHIK
19 22R91A0584 JABU SRIMAN NARAYANA
20 22R91A0585 JADHAV SHIV RAJ
21 22R91A0586 JAGANNATH SINGH
22 22R91A0587 JAKKALA SAGAR
23 22R91A0588 JAKKENA VARSHA
24 22R91A0589 JAPA SAMPATH
25 22R91A0590 JAYAVARAPU YUGESH
26 22R91A0591 JIDUGU ANUSHA LAVANYA
27 22R91A0592 JINNA SUDHEER
28 22R91A0593 JINUKALA SHIVA KUMAR
29 22R91A0594 K NIKHIL
30 22R91A0595 KADABOINA SANJANA
31 22R91A0596 KAKINADA AJITH KUMAR
32 22R91A0597 KALERI MANIKANTA
33 22R91A0598 KALLU VARSHITHA
34 22R91A0599 KALUVALA BABY LAHARI
35 22R91A05A0 KAMATHAM VIGNESH
36 22R91A05A1 KAMBALA BHAVANA
37 22R91A05A2 KAMBHAMPATI SUJANA HARITHA
38 22R91A05A3 KANCHARLA JESHWANTH REDDY
39 22R91A05A4 KANDENI TILAK
40 22R91A05A5 KANDUKURI SAI SRUTHI
41 22R91A05A6 KANMARALAPUDI SAI KIRITI
42 22R91A05A7 KANNEVENI ASHWITHA
43 22R91A05A8 KANUGULA ANUSHA
44 22R91A05A9 KANUGULA JEEVAN AVINASH
45 22R91A05B0 KAPILAVAI BHANU PRAKASH
46 22R91A05B1 KARNE VIJAY
47 22R91A05B2 KARTHIK DERANGULA
48 22R91A05B3 KASIREDDY SRIYA REDDY
49 22R91A05B4 KATAKAM LAXMI
50 22R91A05B5 KATTA ASHWINI
51 22R91A05B6 KATTEBOINA SANDEEP
52 22R91A05B7 KEMA SAI SANDEEP
53 22R91A05B8 KETHAVATH NANDU NAIK
54 22R91A05B9 KINIKAR RAHUL
55 22R91A05C0 KOLA MAHESH
56 22R91A05C1 KOMARAJU PAVANI
57 22R91A05C2 KOMIREDDY KOMALIKA
58 22R91A05C3 KOMIRISHETTI PREM KUMAR
59 22R91A05C4 KOMIRISHETTI SANJAY
60 22R91A05C5 KONDAPALLY KRISHNA PRASAD
61 22R91A05C6 KORATLA KARTHIK
62 22R91A05C7 KORMILLA AANUJ REDDY
63 22R91A05C8 KOTHAKAPU SAI KIRAN REDDY
64 22R91A05C9 KUNCHAM VENU
65 22R91A05D0 KUNTA ASHWITHA
66 23R95A0507 GUDIMALLA CHANDANA

CSE-C

S. No   H.T. No   Name of the Student   MID (40)   EXT (60)   Grade
1 22R91A05D1 Kyatham Sahithi
2 22R91A05D2 Lakavath Priyanka
3 22R91A05D3 Macha Ajay Kumar
4 22R91A05D4 Macherla Rohith
5 22R91A05D5 Maddi Sri Vidya
6 22R91A05D6 Madugu Shiva Kumar
7 22R91A05D7 Madugula Sarika
8 22R91A05D8 Maheshwaram Kavya
9 22R91A05D9 Maile Sabitha
10 22R91A05E0 Mallavarapu Drakshavalli
11 22R91A05E1 Malreddy Nisha Reddy
12 22R91A05E2 Malyala Naveen
13 22R91A05E3 Manchana Vikas
14 22R91A05E4 Mancharla Anup Kumar
15 22R91A05E5 Manda Sai Priya
16 22R91A05E6 Manukonda Nikhil Nagasai
17 22R91A05E7 Marijuddin Ghulam Khaja
18 22R91A05E8 Marupaka Sandhya
19 22R91A05E9 Masku Jessi
20 22R91A05F0 Maya Venkatesh
21 22R91A05F1 Md Adnan Umez
22 22R91A05F2 Medhidha Kavya
23 22R91A05F3 Mendikar Shiva Sai
24 22R91A05F4 Mogili Devi Vaishnavi
25 22R91A05F5 Mohammad Abdul Abrar
26 22R91A05F6 Mohammad Lateef
27 22R91A05F8 Mohammad Sharmila
28 22R91A05F9 Mohammed Abdul Shabbir Ah
29 22R91A05G0 Mohd Muzammiluddin
30 22R91A05G1 Mora Upendra Chary
31 22R91A05G2 Morla Venkata Sai Krishna
32 22R91A05G3 Mote Mahesh
33 22R91A05G4 Mote Shiva
34 22R91A05G5 Mothupally Yeshwanth Redd
35 22R91A05G6 Mudiam Santosh
36 22R91A05G7 Mulagundla Naga Jaya Kris
37 22R91A05G8 Munagapati Usha Sree
38 22R91A05G9 Munjala Navya Sri Goud
39 22R91A05H0 Nalla Nithish Reddy
40 22R91A05H1 Nalla Vardhan Reddy
41 22R91A05H2 Nallangi Sai Teja Reddy
42 22R91A05H3 Nallani Chalapathi Naidu
43 22R91A05H4 Nampally Indu
44 22R91A05H5 Nandi Vasanthi
45 22R91A05H6 Nandipati Naresh
46 22R91A05H7 Nanju Kavya
47 22R91A05H8 Nellikanti Kirankumar
48 22R91A05H9 Nimmala Rahul
49 22R91A05J0 Nimmanagoti Mahendar
50 22R91A05J1 Nimmanagoti Sruthi
51 22R91A05J2 Nirati Varun Chand
52 22R91A05J3 Nula Yashwanth
53 22R91A05J4 Pagilla Sohith Krishna
54 22R91A05J5 Palvayi Naveen
55 22R91A05J6 Pandiri Bharath Kumar
56 22R91A05J8 Pantagi Varun Goud
57 22R91A05J9 Pashikanti Chiranjeevi
58 22R91A05K0 Pasuladhi Prashanth
59 22R91A05K1 Pasunuti Indu
60 22R91A05K2 Peetla Shanmuka
61 22R91A05K3 Peramalapalli Sanjay
62 22R91A05K4 Perati Harshith Reddy
63 22R91A05K5 Perumalla Praveen Kumar
64 23R95A0513 Lagishetty Naresh
65 23R95A0514 Maila Praveen
66 23R95A0515 Maloth Bhargav Sai Siddar
67 23R95A0516 Masam Charan
68 23R95A0517 Mohammed Arman Ali
69 23R95A0518 Nadikatla Anirudh
CSE-D

S. No   H.T. No   Name of the Student   MID   Ext   Grade
1 22R91A05K6 POCHAMPALLY BABU
2 22R91A05K7 POKURI PREMALATHA
3 22R91A05K8 PULI VAMSHI GOUD
4 22R91A05K9 PUTTA MADHAVI
5 22R91A05L0 PUTTA SREEJIT KUMAR
6 22R91A05L1 RAJKUNDAL BALAJI
7 22R91A05L2 RAJULA NALINKUMAR
8 22R91A05L3 RAMAVATH RAMESH
9 22R91A05L4 RAPARTHI RAHUL
10 22R91A05L5 RAPOLU HARIKA
11 22R91A05L6 SABHAVAT SRINU
12 22R91A05L7 SAMUDRALA ABHINESH
13 22R91A05L8 SANGOJI PRANAV
14 22R91A05L9 SARTHAK KUMAR
15 22R91A05M0 SARVIGARI BHARATH REDDY
16 22R91A05M1 SHAIK AFREEN
17 22R91A05M2 SHAIK AFZAL
18 22R91A05M3 SHAIK ASIF
19 22R91A05M4 SHAIK MALIK
20 22R91A05M5 SHAIK NASEEMA
21 22R91A05M6 SHARTA ABHINAY
22 22R91A05M7 SINGIREDDY VARSHA
23 22R91A05M8 SINGIREDDY YUGANDHAR REDDY
24 22R91A05M9 SIRIGIRI SRUTHI
25 22R91A05N0 SIRIKONDA NITHIN
26 22R91A05N1 SIRVATI MANASA
27 22R91A05N2 SOMISHETTY AKHILA
28 22R91A05N3 SUDIREDDY PALLAVI
29 22R91A05N4 SUNKARABOINA RAKESH
30 22R91A05N5 SYED NABRAAS
31 22R91A05N6 TALAGADADEEVI KHYATHI GAYATHRI
32 22R91A05N7 TALLA ANILKUMAR REDDY
33 22R91A05N8 TALLAPELLI SRINIVAS
34 22R91A05N9 TEKULA SURYAVARDHAN REDDY
35 22R91A05P0 TEKULAPALLY SRILATHA
36 22R91A05P1 THIPPARAPU SAHITH
37 22R91A05P2 THIRUMANI AKHILA
38 22R91A05P3 THODIMA SHASHIDHAR REDDY
39 22R91A05P4 THOGITI ADITHYA
40 22R91A05P5 THOKALA VIVEK REDDY
41 22R91A05P6 THUMBURI ABHINAV
42 22R91A05P7 THUMMA MOUNIKA
43 22R91A05P8 THUMMANNAPALLY NAVYASRI
44 22R91A05P9 THURPU VINAY
45 22R91A05Q0 UPPALA SAMYOG
46 22R91A05Q1 V PRANAV REDDY
47 22R91A05Q2 VALLAPU REDDY MOHIT REDDY
48 22R91A05Q3 VALLAPUREDDY HARSHINI
49 22R91A05Q4 VEMULA VINAY
50 22R91A05Q5 VENKATI SANDEEP
51 22R91A05Q6 VIPPARTHI HEMANTH
52 22R91A05Q7 VORIMALLA SAI CHARAN
53 22R91A05Q8 VORUGANTI SHREYA
54 22R91A05Q9 VUTUKURU HIMA BINDU
55 22R91A05R0 VUYYURU SAI VENKATA REDDY
56 22R91A05R2 Y MAHESH
57 22R91A05R3 YALAM NAGA PRAVALLIKA
58 22R91A05R4 YASANI ABHINAYA SRI
59 22R91A05R5 YELLA RAGHUNANDHU
60 22R91A05R6 YERNAGI VAMSHI KRISHNA
61 23R95A0519 NAGAVARAPU VENKATA RAMANA TILAK
62 23R95A0520 P BUGGA SAI RAM REDDY
63 23R95A0521 RAJANALA RAVINDER
64 23R95A0522 S NARENDRA
65 23R95A0523 THURPATI GANESH
66 23R95A0524 VADLA ROHITH CHARY
67 23R95A0525 VANAM PRASHANTH
68 23R95A0526 VANKUNAVATH SRIKANTHNAYAK

Analysis of PI (for the past three batches)

Summary statistics
NO. OF STUDENTS APPEARED: 70
NO. OF STUDENTS PASSED:
NO. OF STUDENTS FAILED:

For Batch: 2024-25

Subject   % of A Grades   % of B Grades   % of C Grades
DATA ANALYTICS

Graphical statistics (pie chart for CAY and bar chart for the past three years)
REMEDIAL CLASSES DETAILS

Name of the Subject: DATA ANALYTICS

Year and Semester: III Year 1st Semester
Name of the Faculty: N. Anjamma Date:

Date Roll numbers Topic Absentees Remarks


Signature of the Faculty Signature of the HOD

Analysis of student feedback

Specify the feedback collection process:

Percentage of students participating:

Specify the feedback analysis process:

Basis of reward/corrective measures, if any:

Attainment of COs (using course end survey): bar chart of CO scores vs. each CO

Corrective actions taken in the last three years:

Teacher self-assessment (at the completion of course)

All five units were completed, and 80% of students understood the subject with the help
of lab practicals and the various examples given in the classroom.

Recommendation/Suggestions for improvement by faculty

CERTIFICATE

I, the undersigned, have completed the course allotted to me as shown below.

S. No   Semester   Subject with code   Total units   Remarks
1   I   DATA ANALYTICS (Professional Elective-I) (22CS513PE)   5

Date: Signature of faculty


Submitted to HOD

Certificate by HOD

I, the undersigned, certify that N. ANJAMMA has completed the coursework allotted to him/her
satisfactorily/not satisfactorily.

Date: Signature of HOD
