Titanic Classification Disaster Kaggle
| Field | Action | Comment |
|---|---|---|
| PassengerId | Removed | Not needed for analysis, as it is just an identifier |
| Survived | Converted to No/Yes | Needed nominal identifier |
| Pclass | Removed -> created Class column instead | Needed nominal identifier |
| Class | New column | Simple calculation based upon Pclass |
| Age | Removed -> created AgeGroup class | Wanted simple classification coding |
| AgeGroup | Formula based; some values not supplied. Ended up with 4 groups other than Unknown (Child, Adolescent, Adult, Old) | Arbitrarily did the following: =IF(F2="", "Unk", IF(F2<10, "Child", IF(F2<20, "Adolescent", IF(F2<50, "Adult", "Old")))) |
| Ecode | Removed -> created class Embarked | Needed nominal identifier |
| Embarked | New column | Converted Ecode to the real name of the departure point for the passenger |
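The recoding rules in the table above can be sketched in Python as follows. This is only an illustration of the spreadsheet logic, not the workflow actually used. The function names, the "2nd" and "Queenstown" labels, and the "?" fallback for a blank departure code are assumptions; the cut points 10, 20, and 50 come from the AgeGroup formula.

```python
from typing import List, Optional

# Label mappings. "2nd" and "Queenstown" are inferred; the source rows only
# show "1st", "3rd", "Southampton", and "Cherbourg".
CLASS_LABELS = {1: "1st", 2: "2nd", 3: "3rd"}
PORT_NAMES = {"S": "Southampton", "C": "Cherbourg", "Q": "Queenstown"}


def age_group(age: Optional[float]) -> str:
    """Mirror the spreadsheet formula: blank -> "Unk", then cut points 10/20/50."""
    if age is None:
        return "Unk"
    if age < 10:
        return "Child"
    if age < 20:
        return "Adolescent"
    if age < 50:
        return "Adult"
    return "Old"


def transform_row(survived: int, pclass: int, sex: str,
                  age: Optional[float], ecode: str) -> List[str]:
    """Return the five nominal values kept after the raw columns are dropped."""
    return [
        "Yes" if survived == 1 else "No",
        CLASS_LABELS[pclass],
        sex,
        age_group(age),
        # Assumption: a blank or unknown code becomes "?", the ARFF missing marker.
        PORT_NAMES.get(ecode, "?"),
    ]


print(transform_row(0, 3, "male", 22.0, "S"))
# → ['No', '3rd', 'male', 'Adult', 'Southampton']
```

Because each comparison is strict, an age of exactly 10 falls into "Adolescent" and an age of exactly 50 into "Old", matching the nested IF formula.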
@relation 'train4-weka.filters.unsupervised.attribute.Remove-R1,3,6,8'
@data
No,3rd,male,Adult,Southampton
Yes,1st,female,Adult,Cherbourg
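For readers who want to produce a file of this shape without the Weka GUI, the following is a minimal sketch of writing nominal data as ARFF text. The attribute names and value sets are assumptions inferred from the data rows and the recoding described earlier (the original attribute declarations are not shown here), and the relation name "titanic-sketch" is a placeholder.

```python
from typing import List, Tuple


def to_arff(relation: str,
            attributes: List[Tuple[str, List[str]]],
            rows: List[List[str]]) -> str:
    """Render nominal attributes and data rows as ARFF text."""
    lines = [f"@relation '{relation}'", ""]
    for name, values in attributes:
        # Nominal attributes list their allowed values inside braces.
        lines.append(f"@attribute {name} {{{','.join(values)}}}")
    lines += ["", "@data"]
    lines += [",".join(row) for row in rows]
    return "\n".join(lines) + "\n"


# Assumed attribute names and value sets; not taken from the original file.
ATTRIBUTES = [
    ("Survived", ["No", "Yes"]),
    ("Class", ["1st", "2nd", "3rd"]),
    ("Sex", ["male", "female"]),
    ("AgeGroup", ["Unk", "Child", "Adolescent", "Adult", "Old"]),
    ("Embarked", ["Southampton", "Cherbourg", "Queenstown"]),
]
ROWS = [
    ["No", "3rd", "male", "Adult", "Southampton"],
    ["Yes", "1st", "female", "Adult", "Cherbourg"],
]

arff_text = to_arff("titanic-sketch", ATTRIBUTES, ROWS)
print(arff_text)
```

Declaring every column as nominal keeps the file usable by tree learners such as J48 without further filtering in Weka.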
J48 pruned tree
------------------
Number of Leaves : 11