EDS Mini Project
Noor Hasan
Mat no: 5008008
Date: 18.03.2025
Background
This study analyzes survival factors of passengers aboard the Titanic. The sinking of
the Titanic in 1912 led to the loss of over 1,500 lives, but survival rates varied
significantly based on demographic and socioeconomic factors.
● The dataset contains records for 891 passengers, including age, gender, class, fare,
and family relations.
● The dataset needs cleaning and transformation before analysis.
● The goal is to find which factors significantly influenced survival.
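The cleaning and transformation step mentioned above is not detailed on the slides. The following is a minimal sketch of what such a step might look like, written in Python for illustration only (the project itself was done in R). The field names, the median imputation of missing ages, and the family-size feature are all assumptions, and the records are made-up toy rows, not values from the dataset.

```python
from statistics import median

# Hypothetical toy records; the field names are assumptions,
# not the project's actual column names.
passengers = [
    {"sex": "male", "age": 22.0, "siblings_spouses": 1, "parents_children": 0},
    {"sex": "female", "age": None, "siblings_spouses": 0, "parents_children": 2},
    {"sex": "female", "age": 38.0, "siblings_spouses": 1, "parents_children": 0},
    {"sex": "male", "age": 30.0, "siblings_spouses": 0, "parents_children": 0},
]


def clean(records):
    """Return cleaned copies of the records without modifying the input."""
    # Assumed strategy: fill missing ages with the median of the known ages.
    known_ages = [r["age"] for r in records if r["age"] is not None]
    fill_age = median(known_ages)

    cleaned = []
    for r in records:
        row = dict(r)  # shallow copy so the original record stays untouched
        if row["age"] is None:
            row["age"] = fill_age
        # Encode gender as a 0/1 indicator so a model can use it.
        row["is_male"] = 1 if row["sex"] == "male" else 0
        # Derived feature: number of relatives travelling with the passenger.
        row["family_size"] = row["siblings_spouses"] + row["parents_children"]
        cleaned.append(row)
    return cleaned


cleaned = clean(passengers)
```

Median imputation is only one possible choice; dropping incomplete rows or imputing by group would be equally plausible readings of "cleaning".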
Research Questions
1. Did gender affect survival chances?
2. Did social class influence survival?
3. Did younger passengers have a better survival rate?
4. Did having family members onboard improve survival chances?
5. Can machine learning models predict survival accurately?
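Questions 1 to 4 above each come down to comparing the survival rate between groups of passengers. A minimal sketch of that comparison, in Python for illustration only (the project itself used R); the function name `survival_rate_by` is hypothetical and the rows are made-up toy data, not figures from the dataset:

```python
from collections import defaultdict

# Made-up toy rows of (group label, survived flag); not real Titanic data.
toy_rows = [
    ("female", 1), ("female", 1), ("female", 0),
    ("male", 0), ("male", 0), ("male", 1), ("male", 0),
]


def survival_rate_by(rows):
    """Return {group: share of passengers in that group who survived}."""
    totals = defaultdict(int)
    survivors = defaultdict(int)
    for group, survived in rows:
        totals[group] += 1
        survivors[group] += survived
    return {group: survivors[group] / totals[group] for group in totals}


rates = survival_rate_by(toy_rows)
```

The same function can be reused for class, age band, or family size by changing what goes into the group label.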
Data Analysis Methods
This analysis was conducted in R with RStudio, following these steps:
Result and Discussion
Logistic Regression Findings
● A logistic regression model was trained using all passenger details.
● Key Predictors of Survival:
○ Gender: Males had a much lower survival probability (p < 2e-16).
○ Passenger Class: Lower-class passengers were less likely to survive (p < 2e-16).
○ Age: Older passengers had a lower survival rate (p = 4.35e-07).
● Accuracy: The logistic regression model predicted survival correctly for approximately
78% of passengers.
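To make the modelling step above concrete, here is a minimal sketch of fitting a logistic regression and measuring its accuracy. It is written in plain Python for illustration only and is not the R code used in the project. The two predictors, the gradient-descent fitting, the 0.5 decision threshold, and the toy rows are all assumptions. It does not compute p-values and will not reproduce the 78% figure.

```python
import math

# Made-up toy rows: (is_male, passenger_class, survived). Not real data.
rows = [
    (0, 1, 1), (0, 1, 1), (0, 2, 1), (0, 3, 1), (0, 3, 0),
    (1, 1, 1), (1, 1, 0), (1, 2, 0), (1, 3, 0), (1, 3, 0), (1, 3, 1),
]


def sigmoid(z):
    """Map a linear score to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(data, learning_rate=0.1, steps=5000):
    """Fit weights by batch gradient descent on the mean log-loss."""
    w_male, w_class, bias = 0.0, 0.0, 0.0
    n = len(data)
    for _ in range(steps):
        g_male = g_class = g_bias = 0.0
        for is_male, pclass, survived in data:
            # Prediction error for this row drives the gradient.
            error = sigmoid(w_male * is_male + w_class * pclass + bias) - survived
            g_male += error * is_male
            g_class += error * pclass
            g_bias += error
        w_male -= learning_rate * g_male / n
        w_class -= learning_rate * g_class / n
        bias -= learning_rate * g_bias / n
    return w_male, w_class, bias


def accuracy(data, params, threshold=0.5):
    """Share of rows whose thresholded prediction matches the outcome."""
    w_male, w_class, bias = params
    correct = 0
    for is_male, pclass, survived in data:
        prob = sigmoid(w_male * is_male + w_class * pclass + bias)
        correct += int((prob >= threshold) == bool(survived))
    return correct / len(data)


params = fit_logistic(rows)
acc = accuracy(rows, params)
```

A negative weight on `is_male` corresponds to the slide's finding that males had a lower survival probability. Note that this sketch scores the model on the same rows it was fitted on; a held-out test set gives a more honest accuracy estimate.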
Key Takeaways:
1. Women were significantly more likely to survive than men.
2. First-class passengers had the highest survival rates.
3. Younger passengers had a better survival rate.
4. Having family members onboard slightly improved survival chances.
5. Random Forest was more accurate than Logistic Regression for survival prediction.
Implications:
● This analysis confirms that survival on the Titanic was influenced by socioeconomic and
demographic factors.
● The findings are consistent with the historical "Women and children first" evacuation policy.
Use of LLM
● ChatGPT was used to debug code and improve the appearance of the plots.
Thank you