The purpose of this repository is to build a goods recommendation system for users, deploy it to a production environment, and support it with various frameworks and services.
To fulfill this objective, we optimize the system that recommends goods for users' add-to-cart actions by maximizing the following metrics:

- `precision@10`
- `recall@10`

These metrics are chosen both for their widespread use in evaluating the quality of recommendation systems and because they let us quantify what we focus on here: the relevance of recommendations to users. Since users will realistically see only a part of the recommendations we generate, we consider the first 10 recommendations when computing metric values and judging the quality of the recommendation system. The `precision@10` metric tells us whether a recommended item is liked by a user, while `recall@10` measures the system's ability to detect and anticipate users' preferences.
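For reference, here is a minimal sketch of how these two metrics can be computed for a single user (the function names and data below are illustrative and are not taken from the repository code):

```python
def precision_at_k(recommended, relevant, k=10):
    """Fraction of the top-k recommendations the user actually interacted with."""
    top_k = recommended[:k]
    hits = sum(1 for item in top_k if item in relevant)
    return hits / k

def recall_at_k(recommended, relevant, k=10):
    """Fraction of the user's relevant items that appear in the top-k recommendations."""
    if not relevant:
        return 0.0
    top_k = recommended[:k]
    hits = sum(1 for item in top_k if item in relevant)
    return hits / len(relevant)

# Illustrative example: 2 of the top-10 recommendations are relevant
recommended = [101, 202, 303, 404, 505, 606, 707, 808, 909, 111]
relevant = {202, 909, 999}
print(precision_at_k(recommended, relevant))  # 0.2
print(recall_at_k(recommended, relevant))     # 0.666...
```

In practice these per-user values are averaged over all users to obtain the final metric.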
The project is divided into several blocks, each described in the table below:
| Component | Description | Frameworks | Link |
|---|---|---|---|
| Modeling experiments | Deploying an MLflow service with an artifact store; building and optimizing the goods recommendation system | `mlflow`, `catboost`, `implicit`, `sklearn` | [recsys](recsys) |
| Pipelines | Deploying a multi-container Airflow service in a Docker environment; creating pipelines for data loading and model re-training | `docker`, `docker-compose`, `airflow`, `catboost` | [airflow_service](airflow_service) |
| Web-service deployment | Deploying a FastAPI recommendation service in a Docker environment with additional monitoring via Prometheus and Grafana | `docker`, `docker-compose`, `fastapi`, `prometheus`, `grafana`, `requests` | [fastapi_service](fastapi_service) |
Each link in the last column of the table leads to the respective directory, which contains a detailed description in its own `README`.
This repository is also equipped with additional tools for working with S3 cloud storage and a PostgreSQL database; each has a dedicated folder with useful scripts:
| Component | Description | Frameworks | Link |
|---|---|---|---|
| S3 Object Storage | Scripts for pushing files to the cloud and checking the contents/space of an S3 bucket | `boto3` | [s3_scripts](s3_scripts) |
| PostgreSQL | Scripts for dropping, loading, and inspecting Postgres tables | `psycopg2` | [postgres_scripts](postgres_scripts) |
The `s3_scripts` folder contains scripts for convenient interaction with the cloud storage. The current version includes the most basic ones:
- Inspecting S3-bucket contents/space:

```bash
python s3_scripts/check_storage.py --option=contents
python s3_scripts/check_storage.py --option=space
```
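For context, inspecting a bucket's contents and used space boils down to listing its objects with `boto3`. A minimal sketch of what such a script might do (the bucket name is a placeholder; the actual script may read it from a config or the environment):

```python
import boto3

s3 = boto3.client("s3")
bucket = "my-recsys-bucket"  # placeholder bucket name

# Page through all objects, printing keys and accumulating their sizes
paginator = s3.get_paginator("list_objects_v2")
total_size = 0
for page in paginator.paginate(Bucket=bucket):
    for obj in page.get("Contents", []):
        print(obj["Key"], obj["Size"])
        total_size += obj["Size"]

print(f"Total space used: {total_size / 1024 ** 2:.2f} MiB")
```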
- Pushing files to S3:

```bash
python s3_scripts/push_file.py --local-file-path=data/test.parquet --s3-file-path=recsys/test/test.parquet
```

Note: one needs to specify the full path to the file in the local directory and the desired path to this file in the cloud storage.
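Internally such an upload is a single `boto3` call. A hedged sketch of what `push_file.py` might do (argument parsing omitted; the bucket name is a placeholder):

```python
import boto3

s3 = boto3.client("s3")
# Mirrors the CLI arguments: local path on disk and the target key in the bucket
s3.upload_file(
    Filename="data/test.parquet",    # --local-file-path
    Bucket="my-recsys-bucket",       # placeholder bucket name
    Key="recsys/test/test.parquet",  # --s3-file-path
)
```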
The `postgres_scripts` folder provides similar utilities for the PostgreSQL database:

- Inspecting a table:

```bash
python postgres_scripts/show_table.py --table-name=<table_name>
```
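With `psycopg2`, inspecting a table can look roughly like this (connection parameters and the table name are placeholders; the repository script likely reads them from the environment):

```python
import psycopg2

# Placeholder connection settings
conn = psycopg2.connect(host="localhost", dbname="recsys",
                        user="postgres", password="postgres")
with conn, conn.cursor() as cur:
    cur.execute("SELECT * FROM my_table LIMIT 5")  # <table_name> from the CLI argument
    for row in cur.fetchall():
        print(row)
conn.close()
```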
- Loading/saving a Postgres table locally:

```bash
python postgres_scripts/load_table.py --save-dir=<save_dir> --table-name=<table_name>
```

Note: when running the script from the root directory there is no need to specify `--save-dir`, as the required table will be loaded to the needed folder in accordance with the default path.
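Saving a table locally can be done efficiently with PostgreSQL's `COPY`; a sketch assuming the script exports CSV (paths and connection details are illustrative):

```python
import psycopg2

conn = psycopg2.connect(host="localhost", dbname="recsys",
                        user="postgres", password="postgres")
# Stream the whole table into a local CSV file, header included
with conn.cursor() as cur, open("data/my_table.csv", "w") as f:
    cur.copy_expert("COPY my_table TO STDOUT WITH CSV HEADER", f)
conn.close()
```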
- Dropping a Postgres table:

```bash
python postgres_scripts/drop_table.py --table-name=<table_name>
```
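Dropping a table amounts to a single SQL statement; a minimal `psycopg2` sketch (connection parameters are placeholders):

```python
import psycopg2

conn = psycopg2.connect(host="localhost", dbname="recsys",
                        user="postgres", password="postgres")
with conn, conn.cursor() as cur:  # the transaction commits on success
    cur.execute("DROP TABLE IF EXISTS my_table")  # <table_name> from the CLI argument
conn.close()
```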