DWBI4
DWBI4
DWBI4
AIM:
Classification Algorithms are Supervised Machine Learning Algorithms that use
labeled data (aka training datasets) to train classifier models. These models then
predict outcomes with the best possible accuracy when new data (aka testing
datasets) is fed to them.
The outcome predicted by a classification algorithm is categorical in nature. These
algorithms classify variables into a specific set of classes – such as classifying a text
message into transactions or promotions through an SMS filter on your iPhones.
Classification techniques predict discrete class label output(s) to which the data
elements belong. For example, weather prediction is a type of classification problem
– ‘hot’ and ‘cold’ being the class labels. This is called binary classification since there
are only two classes.
Random Forest
Like a forest has trees, a random forest is a collection of decision trees. This
classifier aggregates the results from multiple predictors. It additionally utilizes the
bagging technique that allows each tree to be trained on a random sampling of the
original dataset and takes the majority vote from trees. A random forest classifier has
better generalization but is less interpretable than a decision tree classifier, naturally
because more layers are added to the model.
Naïve Bayes
This classifier data into different classes according to the Bayes’ Theorem. But
assumes that the relationship between all input features in a class is independent.
Hence, the model is called naïve. This algorithm works relatively well even when the
size of the training dataset is small. Naïve Bayes is commonly used for text
classification, sentiment analysis, etc.
How to Run a Classification Task with Naive Bayes
Conclusion:
5 AIM
Clustering Algorithms
There are many types of clustering algorithms.
1. Clustering
2. Clustering Algorithms
3. Examples of Clustering Algorithms
1. Library Installation
2. Clustering Dataset
3. Affinity Propagation
4. Agglomerative Clustering
5. BIRCH
6. DBSCAN
7. K-Means
8. Mini-Batch K-Means
9. Mean Shift
10. OPTICS
11. Spectral Clustering
12. Gaussian Mixture Model
Clustering
Cluster analysis, or clustering, is an unsupervised machine learning
task.
For example:
The phylogenetic tree could be considered the result of a manual clustering analysis.
Separating normal data from outliers or anomalies may be considered a clustering
problem.
Separating clusters based on their natural behavior is a clustering problem, referred to
as market segmentation.
Clustering can also be useful as a type of feature engineering, where
existing and new examples can be mapped and labeled as belonging to
one of the identified clusters in the data.
Clustering Algorithms
There are many types of clustering algorithms.
Agglomerative Clustering
BIRCH
DBSCAN
K-Means
Mini-Batch K-Means
Conclusion: