0% found this document useful (0 votes)

18 views2 pages

FM Algorithm Theory Explanation

Uploaded by

21106053.rohit.negi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views2 pages

FM Algorithm Theory Explanation

Uploaded by

21106053.rohit.negi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Why FM algorithm?

Every day on the internet, more than 2.5 quintillion bytes of data are created. This data

is increasing in terms of variety, velocity and volume, hence called big data. To analyze this

data, one has to collect this data, store it in a safe place, clean it and then perform analysis.

One of the major problems faced by big data engineers is dealing with unuseful or redundant

data. A lot of time and memory is used to store and analyze this extra data which turns out

to be fruitless in the end. Thus, the removal of duplicate data becomes extremely essential

to cut the analysis cost and reduce redundancy.

Data cleaning can be done using various techniques but before cleaning the data, it is

necessary to know the amount of useful data present in the dataset. Therefore, before the

removal of duplicate data from a data stream or database, it is necessary to have knowledge

of distinct or unique data present. A way to do so is by hashing the elements of the universal

set using the Flajolet Martin Algorithm. The FM algorithm is used in a database query, big

data analytics, spectrum sensing in cognitive radio sensor networks, and many more areas.

It shows superior performance as compared with many other methods to find distinct

elements in a stream of data.

Flajolet Martin Algorithm:

Flajolet Martin Algorithm, also known as FM algorithm, is used to approximate the number

of unique elements in a data stream or database in one pass. The highlight of this algorithm

is that it uses less memory space while executing.

Pseudo Code-Stepwise Solution:

1. Selecting a hash function h so each element in the set is mapped to a string to at least

log2n bits.

2. For each element x, r(x)= length of trailing zeroes in h(x)

3. R= max(r(x))

=> Distinct elements= 2R

For small values of m(where m is the number of unique elements), the brute force approach

can work, but for large data sets or data streams, where m is very large, a lot of space is

required. The compiler may not let us run the algorithm in some cases.This is where the

Flajolet Martin Algorithm can be used. Not only does it occupy less memory, but it also

shows better results in terms of time in seconds.

This approach is used to maintain a count of distinct values seen so far, given a large number

of values. For example, getting an approximation of the number of distinct URLs surfed by

a person on the web. Many companies want to check how many unique users logged in to

their website the previous day to check if their advertisement was successful or not. Here,

the FM algorithm is an excellent solution for these companies.

Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Communication Networks by Leon Garcia and Indra Widjaja PDF
33% (3)
Communication Networks by Leon Garcia and Indra Widjaja PDF
2 pages
ICT Final Exam For Grade 8
80% (10)
ICT Final Exam For Grade 8
3 pages
Bda Exp5 Chinmay
No ratings yet
Bda Exp5 Chinmay
3 pages
Viden Io Data Analytics Lecture8 Counting Distinct Elements PDF
No ratings yet
Viden Io Data Analytics Lecture8 Counting Distinct Elements PDF
13 pages
Bda Exp8
No ratings yet
Bda Exp8
4 pages
Flajolet-Martin Algorithm
No ratings yet
Flajolet-Martin Algorithm
28 pages
Counting Distinct Elements in A Stream
No ratings yet
Counting Distinct Elements in A Stream
4 pages
Search Algorithm: Fundamentals and Applications
From Everand
Search Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Experiment No 8
No ratings yet
Experiment No 8
7 pages
Estimating Distinct Elements Using Flajolet-Martin Algorithm On A Data Stream
No ratings yet
Estimating Distinct Elements Using Flajolet-Martin Algorithm On A Data Stream
3 pages
Lexicon of Computer Science Terminology: Lexicon of Tech and Business, #16
From Everand
Lexicon of Computer Science Terminology: Lexicon of Tech and Business, #16
Mustafa Al-Dori
4/5 (1)
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
From Everand
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
Byron Ellis
No ratings yet
FP Growth Alg
No ratings yet
FP Growth Alg
17 pages
Efficient Memory Optimization for IoT Intrusion Detection
From Everand
Efficient Memory Optimization for IoT Intrusion Detection
Ethan Evelyn
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
DA Numericals
No ratings yet
DA Numericals
15 pages
Learn Hadoop in 24 Hours
From Everand
Learn Hadoop in 24 Hours
Alex Nordeen
No ratings yet
Unit 4 - 4.4
No ratings yet
Unit 4 - 4.4
23 pages
Mastering Data Structures and Algorithms with Python: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering Data Structures and Algorithms with Python: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
From Everand
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
Peter Bradley
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Blooms Filter
No ratings yet
Blooms Filter
15 pages
Expo Sys
No ratings yet
Expo Sys
2 pages
Svyatoslav Covanov Rapport de Stage Recherche 2014
No ratings yet
Svyatoslav Covanov Rapport de Stage Recherche 2014
25 pages
Tutorial 02
No ratings yet
Tutorial 02
17 pages
Data Mining Unit 2 (Part 2) - 1
No ratings yet
Data Mining Unit 2 (Part 2) - 1
7 pages
Learn Design and Analysis of Algorithms in 24 Hours
From Everand
Learn Design and Analysis of Algorithms in 24 Hours
Alex Nordeen
No ratings yet
Modified Frequent Pattern Mining From Data Stream
No ratings yet
Modified Frequent Pattern Mining From Data Stream
38 pages
Python Machine Learning: Machine Learning Algorithms for Beginners - Data Management and Analytics for Approaching Deep Learning and Neural Networks from Scratch
From Everand
Python Machine Learning: Machine Learning Algorithms for Beginners - Data Management and Analytics for Approaching Deep Learning and Neural Networks from Scratch
Ahmed Ph. Abbasi
No ratings yet
Week 6
No ratings yet
Week 6
12 pages
FP Tree Example
No ratings yet
FP Tree Example
11 pages
Probabilistic Counting Algorithms For Database Applications - Flajolet
No ratings yet
Probabilistic Counting Algorithms For Database Applications - Flajolet
28 pages
DM Unit-2
No ratings yet
DM Unit-2
14 pages
Tables Needed Pt 2
No ratings yet
Tables Needed Pt 2
12 pages
03 Pre Processing
No ratings yet
03 Pre Processing
20 pages
Data Analytics with Generative AI
From Everand
Data Analytics with Generative AI
Younish P
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
Group Method of Data Handling: Fundamentals and Applications for Predictive Modeling and Data Analysis
From Everand
Group Method of Data Handling: Fundamentals and Applications for Predictive Modeling and Data Analysis
Fouad Sabry
No ratings yet
Bda PT 2
No ratings yet
Bda PT 2
35 pages
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
From Everand
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
César Pérez López
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Automatic Target Recognition: Fundamentals and Applications
From Everand
Automatic Target Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Algorithm Fast Fourier Transform
No ratings yet
Algorithm Fast Fourier Transform
2 pages
Efficient Linux Tracing with LTTng: The Complete Guide for Developers and Engineers
From Everand
Efficient Linux Tracing with LTTng: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Mastering Data Structures and Algorithms in C and C++
From Everand
Mastering Data Structures and Algorithms in C and C++
Sachin Naha
No ratings yet
PYTHON DATA ANALYTICS: Mastering Python for Effective Data Analysis and Visualization (2024 Beginner Guide)
From Everand
PYTHON DATA ANALYTICS: Mastering Python for Effective Data Analysis and Visualization (2024 Beginner Guide)
FLOYD BAX
No ratings yet
Exp 3
No ratings yet
Exp 3
14 pages
FM Algorithm
No ratings yet
FM Algorithm
3 pages
Association Rule Mining Lesson PDF
No ratings yet
Association Rule Mining Lesson PDF
9 pages
Heuristic: Fundamentals and Applications
From Everand
Heuristic: Fundamentals and Applications
Fouad Sabry
No ratings yet
Machine Learning with Python: Foundations and Applications: ML, #1
From Everand
Machine Learning with Python: Foundations and Applications: ML, #1
Mohammed Nurudeen
No ratings yet
Automatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition
From Everand
Automatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition
Fouad Sabry
No ratings yet
Artificial Intelligence Algorithms
From Everand
Artificial Intelligence Algorithms
akosnemeth
No ratings yet
Forward Chaining: Fundamentals and Applications
From Everand
Forward Chaining: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mock Test
No ratings yet
Mock Test
13 pages
DM Exp 1.42637
No ratings yet
DM Exp 1.42637
3 pages
Computer Data
From Everand
Computer Data
Angel Gabaldon
No ratings yet
PYTHON MACHINE LEARNING: Leveraging Python for Implementing Machine Learning Algorithms and Applications (2023 Guide)
From Everand
PYTHON MACHINE LEARNING: Leveraging Python for Implementing Machine Learning Algorithms and Applications (2023 Guide)
Roberta Bowman
No ratings yet
Mining Data Streams (Part 2)
No ratings yet
Mining Data Streams (Part 2)
56 pages
Untitled Document
No ratings yet
Untitled Document
5 pages
Experiment 8
No ratings yet
Experiment 8
2 pages
Exp 5
No ratings yet
Exp 5
2 pages
Game Theory Lab Assignment 1 - Colab
No ratings yet
Game Theory Lab Assignment 1 - Colab
5 pages
LRDI Live Class
No ratings yet
LRDI Live Class
21 pages
Agile Methodology 2016
No ratings yet
Agile Methodology 2016
17 pages
CST Diag R12
No ratings yet
CST Diag R12
36 pages
Social Media Management System Project Report
No ratings yet
Social Media Management System Project Report
92 pages
ISO Implementation Guide
No ratings yet
ISO Implementation Guide
74 pages
Latest Log
No ratings yet
Latest Log
14 pages
Centricity Pacs Quick Guide
No ratings yet
Centricity Pacs Quick Guide
6 pages
TDA7264 TDA7264A: 25 + 25W Stereo Amplifier With Mute/St-By
No ratings yet
TDA7264 TDA7264A: 25 + 25W Stereo Amplifier With Mute/St-By
12 pages
Installation or Run AstroHora File
No ratings yet
Installation or Run AstroHora File
5 pages
How To Make Custom Shops in Elden Ring - Introduction To Talk Menus
No ratings yet
How To Make Custom Shops in Elden Ring - Introduction To Talk Menus
35 pages
BHT1500B UsersManual E3 PDF
No ratings yet
BHT1500B UsersManual E3 PDF
238 pages
Navigation With Compose - Jetpack Compose - Android Developers
No ratings yet
Navigation With Compose - Jetpack Compose - Android Developers
15 pages
Fronius - Xplorer - Basisfunktionen - en
No ratings yet
Fronius - Xplorer - Basisfunktionen - en
14 pages
SQL Injection
No ratings yet
SQL Injection
3 pages
Electronic Gear
No ratings yet
Electronic Gear
6 pages
LabManual (18 21)
No ratings yet
LabManual (18 21)
11 pages
Recitation05 Cachelab
No ratings yet
Recitation05 Cachelab
97 pages
Source Data For FI - Accounts Payable Open Item
No ratings yet
Source Data For FI - Accounts Payable Open Item
68 pages
ES Teaser Example
100% (1)
ES Teaser Example
4 pages
Comparative Study of Seven Level Boost Inverters Using Sinusoidal Multicarrier PWM Technique
No ratings yet
Comparative Study of Seven Level Boost Inverters Using Sinusoidal Multicarrier PWM Technique
10 pages
Tutorial - 5 and 6
100% (1)
Tutorial - 5 and 6
2 pages
(Ijcst-V13i2p3) :dr.d.j.samatha Naidu, M.lahya
No ratings yet
(Ijcst-V13i2p3) :dr.d.j.samatha Naidu, M.lahya
3 pages
APQP PQP Flow Chart PDF
100% (2)
APQP PQP Flow Chart PDF
1 page
05 - CH73,76 - Full Autority Digital Engine Control FADEC
100% (2)
05 - CH73,76 - Full Autority Digital Engine Control FADEC
54 pages
Shinymanager
No ratings yet
Shinymanager
20 pages
Unit Testing Tutorial: What Is, Types, Tools & Test Example
100% (1)
Unit Testing Tutorial: What Is, Types, Tools & Test Example
7 pages
AllTorque Gen II Manual
100% (1)
AllTorque Gen II Manual
43 pages
Process Simulator Product Summary
No ratings yet
Process Simulator Product Summary
2 pages
MyWalboxApp QuickStartGuide EU
No ratings yet
MyWalboxApp QuickStartGuide EU
15 pages

FM Algorithm Theory Explanation

Uploaded by

FM Algorithm Theory Explanation

Uploaded by

Why FM algorithm?

to cut the analysis cost and reduce redundancy.

elements in a stream of data.

is that it uses less memory space while executing.

Pseudo Code-Stepwise Solution:

2. For each element x, r(x)= length of trailing zeroes in h(x)

=> Distinct elements= 2R

shows better results in terms of time in seconds.

the FM algorithm is an excellent solution for these companies.

You might also like