Date: 16/APR/2024 Time: 20:00 Venue: Google Meets platform
Course Overview
This comprehensive data analysis course provides a structured approach to mastering essential
data analysis concepts, tools, and techniques. Through a combination of lectures, hands-on
exercises, quizzes, and exams, students will gain a deep understanding of data analysis principles
and practical skills applicable to real-world scenarios. Additionally, the course will introduce
students to the fundamentals of data science, covering key concepts and methodologies.
Course Objectives
Understand foundational concepts in data analysis.
Learn various data manipulation and cleaning techniques.
Develop proficiency in data visualization for effective communication of insights.
Gain skills in statistical analysis and hypothesis testing.
Explore advanced topics such as machine learning and predictive modelling.
Apply learned techniques to analyze real-world datasets.
Enhance critical thinking and problem-solving abilities in data-driven decision-making.
Course Structure (April 16 - June 30):
Week 1 (April 16 - April 20): Introduction to Data Analysis
- Overview of data analysis and its applications
- Introduction to Python programming language for data analysis
- Basics of Jupyter Notebook for interactive data analysis
Week 2 (April 23 - April 27): Data Manipulation with Pandas
- Introduction to Pandas library for data manipulation
- Data loading, cleaning, and preprocessing techniques
- Hands-on exercises with Pandas for data manipulation
Week 3 (April 30 - May 4): Data Visualization with Matplotlib and Seaborn
- Principles of effective data visualization
- Introduction to Matplotlib and Seaborn libraries
- Plot types: line plots, scatter plots, bar plots, histograms
- Customizing plots for better communication of insights
Week 4 (May 7 - May 11): Statistical Analysis with NumPy and SciPy
- Overview of NumPy and SciPy libraries for numerical computing and statistical analysis
- Descriptive statistics: mean, median, variance, standard deviation
- Statistical hypothesis testing: t-tests, ANOVA, chi-square tests
- Hands-on exercises applying statistical analysis techniques
Week 5 (May 14 - May 18): SQL for Data Analysis
- Introduction to SQL (Structured Query Language) for data retrieval and manipulation
- Querying databases using SQL: SELECT, FROM, WHERE, JOIN
- Hands-on exercises with SQL for data analysis tasks
Week 6 (May 21 - May 25): Data Visualization with Power BI
- Overview of Power BI for interactive data visualization and business intelligence
- Creating visualizations, dashboards, and reports in Power BI
- Connecting Power BI to various data sources for analysis
Week 7 (May 28 - June 1): Introduction to Data Science
- Overview of data science and its applications
- The data science workflow: problem formulation, data collection, data preprocessing,
modeling, evaluation, and deployment
- Ethical considerations in data science
Week 8 (June 4 - June 8): Machine Learning Fundamentals
- Introduction to machine learning concepts and algorithms
- Supervised vs. unsupervised learning
- Regression and classification algorithms
- Model evaluation metrics: accuracy, precision, recall, F1-score
Week 9 (June 11 - June 15): Advanced Data Science Techniques
- Feature engineering and selection methods
- Dimensionality reduction techniques: PCA (Principal Component Analysis)
- Time series analysis with Pandas
- Introduction to natural language processing (NLP) with NLTK library
Week 10 (June 18 - June 22): Midterm Exam Preparation
- Review of key concepts covered in weeks 1-9
- Practice quizzes and exercises
- Discussion of midterm exam format and expectations
Week 11 (June 25 - June 29): Midterm Exam (June 25)
- Comprehensive exam covering theoretical concepts and practical applications
- Closed-book format, duration: 2 hours
- Exam will include multiple-choice questions, short-answer questions, and coding exercises
Week 12 (July 2 - July 6): Applied Data Analysis Projects
- Group projects applying learned techniques to real-world datasets
- Data-driven decision-making exercises
- Presentation of project findings and insights
Week 13 (July 9 - July 13): Advanced Topics and Case Studies
- Guest lectures on advanced data analysis topics
- Case studies showcasing real-world applications of data analysis techniques
- Discussion and analysis of recent trends in data analysis and data science
Week 14 (July 16 - July 20): Final Project and Course Wrap-Up
- Completion of final data analysis project
- Presentation of final project findings and insights
- Course review and feedback session
- Distribution of course completion certificates
Assessment:
- Weekly quizzes to assess understanding of key concepts
- Midterm exam covering theoretical concepts and practical applications (June 25)
- Final data analysis project demonstrating application of learned techniques
- Class participation and engagement in discussions
Thank you!