Yubraj Shrestha
Yubraj Shrestha
Yubraj Shrestha
2020-21 Autumn
College ID:np05cp4s210098@iic.edu.np
I confirm that I understand my coursework needs to be submitted online via Google Classroom under the relevant
module page before the deadline for my assignment to be accepted and marked. I am fully aware that late
submissions will be treated as non-submission and a mark of zero will be awarded.
Table of Contents
1. Introduction ............................................................................................................... 1
Synonymy .................................................................................................................... 3
Privacy ......................................................................................................................... 3
Sparsity........................................................................................................................ 3
3. Solution ..................................................................................................................... 6
Algorithm ..................................................................................................................... 7
Pseudocode ................................................................................................................. 8
Flowchart ..................................................................................................................... 9
4. Conclusion .............................................................................................................. 10
References .................................................................................................................... 12
Table of Figures
Figure 1: Example of K-Means Clustering ....................................................................... 6
Figure 2: Flowchart of K-Means Clustering. .................................................................... 9
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
1. Introduction
Artificial intelligence (AI) refers to the simulation of human intelligence in machines
programmed to think like humans and mimic their behavior. The term can also be
applied to any machine exhibiting properties related to the human mind, such as:
Learning and problem solving.
From the user's perspective, it is designed to meet the user's needs in the shortest
possible time. For example, the type of content you watch on Netflix or Hulu. People
who only watch Korean dramas will only see related titles, but those who like action
shows this on the home screen.
1
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
2
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
know what your users like, you can recommend them new, relevant content. And that’s
true for everything from movies and music, to romantic partners.
Netflix, YouTube, Tinder, and Amazon are all examples of recommender systems in
use. The systems entice users with relevant suggestions based on the choices they
make.
Synonymy
Synonymy is the tendency of very similar articles to have different names or entries. Most
recommendation systems have difficulty distinguishing between closely related items such as
baby clothes and baby clothes.
Privacy
Providing personal information to the recommendation system leads to data problems
Privacy and Security. Users are reluctant to provide data to affected recommender systems
Sparsity
However, all the proposed techniques require a rich history of interactions, especially in
systems where the number of users, the number of items, and the number of observed ratings
profoundly affect the performance. system capacity. This is called an analysis problem.
3
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
2. Back Ground
To make this project of the movie recommendation system i did a lot if research. as now
online movie streaming has become popular nowadays, instead of going into movie
center you can watch movies online of your choice, by checking your browsing history
movie recommendation system will provide you the movies similar to your choice. many
algorithms were studied while carrying out this project.
Netflix, YouTube, Tinder, and Amazon are all examples of recommender systems in
use. The systems entice users with relevant suggestions based on the choices they
make.
In recommendation system there are two approaches, they are collaborative Filtering
and content Based filtering.
Collaborative Filtering
Collaborative filtering filters information using interactions and data collected by the
system from other users. It is based on the idea that people who have already agreed in
their assessment of certain items are likely to agree again in the future.
The concept is very simple, when we want to find a new movie to watch, we often ask
friends for recommendations. Naturally, we trust more recommendations from friends
with similar interests to us.
Most of the collaborative filtering systems apply the so-called similarity index-based
technique. In the neighborhood-based approach, several users are selected based on
their similarity to the active user. Inference for active users is done by averaging the
ratings of selected users.
The collaborative filtering system focuses on the relationship between users and items.
The similarity of items is determined by the similarity of ratings of those items by users
who have rated both items.
Collaborative methods are often developed using utility matrices. The task of the
proposed model is to learn a function that predicts the usefulness of the fit or similarity
for each user. Utility matrices are often very sparse, huge, and have suppressed values.
4
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
Here, the system uses your characteristics and preferences to suggest things you might
like. It uses the information you provide on the internet and what they can gather, then
they make recommendations based on that.
The goal of content-based filtering is to rank products for specific keywords, find out
what customers like, search for those terms in the database, and then recommend
similar things.
5
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
3. Solution
Clustering is the task of dividing the data points into a number of groups such that data
points in the same groups are more similar to other data points in the same group and
dissimilar to the data points in the group. (GeekForGeeks, 2022)
The main task of K-Means clustering is to determines the best value for K center points
or centroids by an iterative process and assigns each data point to its closest k-center
and those data points which are near to the particular k-center, create a cluster.
(GeekForGeeks, 2022)
6
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
Algorithm
K-Means clustering is a centroid-based algorithm, where each cluster is related with a
centroid. This algorithm minimizes the sum of distance between the data point and their
corresponding clusters. The algorithm takes the unlabeled dataset as input, splits the
dataset into k-number of clusters, and repeats the process until it does not find the best
clusters. The value of k should be predetermined in this algorithm. (Javatpoint, 2021)
Step 2: Select K data at random from the dataset as an initial cluster centroid.
Step 3.2: Compute the distance between the datapoint and the cluster centroid.
Step 4: For each cluster calculate the new mean based on the datapoints in the cluster.
Step 5: Repeat step 3 and step 4 until the mean of the clusters stops changing or
7
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
Pseudocode
Choose number of clusters, K
Repeat
Compute the distance between the datapoint and the cluster centroid
END
8
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
Flowchart
9
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
4. Conclusion
Analysis of work done
This is the first coursework of the artificial intelligence so it was quite tough to do the
coursework. I faced so much problem while choosing the topic for this coursework but
finally with the help of my research I was able to overcome the problem. I have
researched a lot and chose movie recommendation system for the research topic. I
have used the K-Means Clustering algorithm to find the solution for movie
recommendation system and have create the pseudocode and flowchart of the
algorithm was developed to ease the coding part in future.
10
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
Further Work
After this course work, I have learned more about recommendation system so I will start
developing the model by using python programming language. I will be focused on
making my system accurate.
11
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
References
(2021). Retrieved from Javatpoint: https://www.javatpoint.com/k-means-clustering-
algorithm-in-machine-learning
12
CU6051NT
YUBRAJ SHRESTHA ARTIFICIAL INTELLEGENCE
13
CU6051NT