0% found this document useful (0 votes)
42 views14 pages

Data Science Task List Pfsinterns

Data science

Uploaded by

Kunal Pal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views14 pages

Data Science Task List Pfsinterns

Data science

Uploaded by

Kunal Pal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Pinnacle

Full-Stack Interns

Data Science
Task List
Instructions

Start your tasks only after the internship begins. The start date will be
specified in your Offer Letter email.

To qualify for the Completion Certificate, complete a minimum of 3 tasks


from this task list.

You have the freedom to select your own learning resources to study and
complete the tasks. No training will be provided during the internship.

Maintain a separate public GitHub repository for all the completed tasks
name it as "pfsinterns". Refer this guide for any doubts.
Instructions

A video need to be created for each task that you do and should be post
on LinkedIn for proof of your work and build credibility among your peers.

You need to tag Pinnacle Full-Stack Interns, use hastags


#pinnaclefullstackinterns, #pfsinterns, and any other relevant ones.

Task submission form will be shared with you later through email. where
you will need to provide the repository link and LinkedIn video post link
for each completed task. Till then please continue your task.

If you have any questions or need clarification on any task, feel free to reach
out to us. We're here to support you throughout your internship journey.
LinkedIn Profile Improvement Task (Mandatory)

The assigned task is designed to aid you in building your professional profile
on LinkedIn and enhance your visibility to recruiters.

Post Your internship Offer Letter on your Linkedin profile

Go through these articles and follow them to improve your LinkedIn


Profile. #1 #2 #3

Add "Intern at Pinnacle Full-Stack Interns" in your profile headline & add
"Pinnacle Full-Stack Interns" as your current company in the work
experience section.
Task 1 Email Spam Detection

Develop an email spam detection system to classify emails as spam or non-spam (ham).

The system will analyze email content and metadata to identify spam messages. The

completed project should involve text preprocessing, feature extraction, and training a

classification model (e.g., decision trees, ensemble methods) to detect spam emails.

Interns will gain experience in text classification, understand spam detection techniques,

and learn to build models for email filtering. Skills Required: Proficiency in Python

programming, familiarity with text processing libraries, understanding of classification

algorithms.

Download the dataset from here.


Task 2 Customer Segmentation for E-commerce

Conduct customer segmentation analysis on transaction data from an e-commerce

platform to identify distinct customer groups. This segmentation will help tailor marketing

strategies and improve customer satisfaction. The completed project should involve

clustering algorithms such as K-means or hierarchical clustering to segment customers

based on their purchasing behavior. Interns will deepen their understanding of data

analysis, gain experience in segmentation techniques, and learn how to leverage

customer insights for business growth. Skills Required: Proficiency in Python

programming, understanding of data manipulation with libraries like Pandas, basic

statistical concepts.

Download the dataset from here.


Task 3 Movie Genre Classification

Build a movie genre classification model to predict the genre(s) of a movie based on its

plot summary or metadata. The model will categorize movies into predefined genres. The

completed project should involve data preprocessing, feature extraction, and training a

classification model (e.g., Naive Bayes, SVM) to classify movie genres. Interns will

deepen their understanding of text classification, gain experience in genre prediction, and

learn how to apply machine learning to movie data. Skills Required: Proficiency in Python

programming, understanding of classification algorithms, familiarity with text processing

techniques.

Download the dataset from here.


Task 4 Heart Disease Prediction

Develop a machine learning model to predict the likelihood of heart disease based on

patient health data and risk factors. The model assists healthcare professionals in early

diagnosis and intervention. The completed project should involve data preprocessing,

feature engineering, and training a classification model (e.g., logistic regression, random

forest) to predict the presence or absence of heart disease. Interns will gain expertise in

healthcare analytics, understand cardiovascular risk factors, and learn to develop

predictive models for medical diagnosis. Skills Required: Proficiency in Python

programming, knowledge of classification algorithms, understanding of healthcare data

privacy regulations.

Download the dataset from here.


Task 5 Credit Card Fraud Detection

Develop an advanced fraud detection system for credit card transactions using ensemble

learning techniques. Ensemble models combine multiple base models to improve

prediction accuracy. The completed project should involve data preprocessing, feature

engineering, and training ensemble models such as Random Forest, Gradient Boosting,

or XGBoost to detect anomalies in transaction data. Interns will deepen their

understanding of ensemble learning, gain expertise in fraud analytics, and learn to

develop robust solutions for fraud prevention with high accuracy. Skills Required:

Proficiency in Python programming, knowledge of ensemble learning algorithms,

experience with feature engineering and model evaluation techniques.

Download the dataset from here.


Task 6 House Price Prediction

Develop a model to predict house prices based on features such as location, size, and

amenities. The model will help homebuyers or real estate agents estimate property

values. The completed project should involve data preprocessing, feature engineering,

and training a regression model (e.g., gradient boosting, neural networks) to predict house

prices. Interns will deepen their understanding of regression analysis, gain experience in

predictive modeling for real estate, and learn to make data-driven decisions in housing

markets. Skills Required: Proficiency in Python programming, knowledge of regression

analysis, experience with feature engineering techniques.

Download the dataset from here.


Remember

“Even if you've tackled these tasks before, there's always room


to innovate and improve. Explore advanced techniques, adopt new
approaches, and aim for even higher accuracy. Challenge yourself
to surpass your previous accomplishments and strive for
excellence in every project. Keep pushing the boundaries of your
skills and knowledge!”
Ask Us For Help!

The purpose of this internship is to Learn and Grow.


We're here to support, not dictate; the choice to seek guidance is
yours to make.
The given tasks may seems very easy or very difficult, embrace each
challenge with enthusiasm and dedication. Your commitment will fuel
your success!"
Get Social With Us

pfsinterns.com

internship@pfsinterns.com

@pinnacle-full-stack-interns
Thank You

You might also like