Weka Weka: A - Antony Alex MCA DR G R D College of Science - CBE Tamil Nadu - India
Weka Weka: A - Antony Alex MCA DR G R D College of Science - CBE Tamil Nadu - India
Java based Routines are implemented as classes and logically arranged in packages Comes with an extensive GUI interface
9/10/2012
Main Features
49 data preprocessing tools 76 classification/regression algorithms 8 clustering algorithms 3 algorithms for finding association rules 15 attribute/subset evaluators + 10 search algorithms for feature selection
9/10/2012
java weka.core.converters.CSVLoader data.csv > data.arff java weka.core.converters.C45Loader c45_filestem > data.arff java weka.classifiers.rules.ZeroR -t weather.arff java weka.classifiers.trees.J48 -t weather.arff java weka.filters.supervised.attribute.Discretize -i data/iris.arff \ -o iris-nom.arff -c last java weka.filters.supervised.attribute.Discretize -i data/cpu.arff \ -o cpu-classvendor-nom.arff -c first
Main GUI
Three graphical user interfaces
The Explorer (exploratory data analysis) The Experimenter (experimental environment) The KnowledgeFlow (new process model inspired interface)
9/10/2012
Explorer: pre-processing the data preData can be imported from a file in various formats: ARFF, CSV, C4.5, binary Data can also be read from a URL or from an SQL database (using JDBC) Pre-processing tools in WEKA are called filters WEKA contains filters for:
Discretization, normalization, resampling, attribute selection, transforming and combining attributes,
9/10/2012 7
ACCESSING DATABASE
jdbcDriver jdbcURL
DatabaseUtils.props.hsql - HSQLDB DatabaseUtils.props.msaccess - MS Access DatabaseUtils.props.mssqlserver - MS SQL Server DatabaseUtils.props.mysql - MySQL DatabaseUtils.props.odbc - ODBC access via ODBC/JDBC bridge, DatabaseUtils.props.oracle - Oracle 10g DatabaseUtils.props.postgresql - PostgreSQL 7.4 DatabaseUtils.props.sqlite3 - sqlite 3.x
9/10/2012
9/10/2012
University of Waikato
10
9/10/2012
University of Waikato
11
9/10/2012
University of Waikato
12
9/10/2012
University of Waikato
13
Meta-classifiers include:
Bagging, boosting, stacking, error-correcting output codes, locally weighted learning,
9/10/2012
16
Apriori can compute all rules that have a given minimum support and exceed a given confidence
9/10/2012 17
Color-coded class values Jitter option to deal with nominal attributes (and to detect hidden data points) Zoom-in function
9/10/2012 19
Thank U