0% found this document useful (0 votes)
7 views

Fake News Detection Using Deep Learning

The document presents a system for detecting fake news using various classification techniques, including Logistic Regression, Naïve Bayes, Support Vector Machine, Random Forest, and Deep Neural Networks. It discusses the challenges of identifying fake news in the context of increasing misinformation online and provides experimental results comparing the performance of these algorithms in terms of memory usage, execution time, and accuracy. The findings indicate that Deep Neural Networks outperform other methods in accuracy and execution time, although they require more memory.

Uploaded by

luizz etc
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

Fake News Detection Using Deep Learning

The document presents a system for detecting fake news using various classification techniques, including Logistic Regression, Naïve Bayes, Support Vector Machine, Random Forest, and Deep Neural Networks. It discusses the challenges of identifying fake news in the context of increasing misinformation online and provides experimental results comparing the performance of these algorithms in terms of memory usage, execution time, and accuracy. The findings indicate that Deep Neural Networks outperform other methods in accuracy and execution time, although they require more memory.

Uploaded by

luizz etc
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

2019 1st International Conference on Advances in Information Technology

Fake News Detection Using Deep Learning


Techniques
1 2
Chaitra K Hiramath Prof. G.C Deshpande
M.Tech in CSE Assistant Professor
KLS Gogte Institute of Technology, KLS Gogte Institute of Technology,
Belagavi,Karnataka,India Belagavi,Karnataka,India
chaitrakh1992@gmail.com gcdeshpande@git.edu

Bits of gossip or spam recognition are considered issue in


Abstract— News is crucial part of our life. In day to day life Natural Language Processing (NLP) & specialists discovered
current news are helpful to enhance knowledge what happen these issues broadly. Issue of recognizing or finding false
around the world. So most of peoples prefer watching news most information in regular day to day existence, although very
of the peoples generally prefer reading newspaper early in the
much similar to deception finding, but it is critical to detect
morning enjoying with cup of tea. If news is fake that will
mislead peoples sometimes fake news utilized to spread rumors due to the news body frequently comprise a less and small
about things or it will affect some political leader positions just proclamations.
because of fake news. So it’s crucial to find the fake news. So we
proposed system to detect fake news but now a day’s data on web Everyday access of news sources, for example, web based
or social media is increasing vastly and it is so hectic to detect life channels, news sites, and online papers have created
news is fake or not by looking all data and it is time consuming so demanding to verifying reliable news sources due to
we utilize classification techniques to classify huge data. Here we enhancement of misleading information. We center around the
proposed fake news detection system based on classification such ID of phony contents or articles in news sites. First, we present
as Logistic regression (LR), Naïve bayes (NB), Support vector
database for the phony news discovery task, using numerous
machine (SVM), Random forest (RF) and deep neural network
(DNN). We compare all machine learning techniques for news spaces and depict the accumulation, explanation, and
detecting fake news. approval process in detail and present a few exploratory
examinations on the acknowledgment of etymological
Keywords: Fake news, Logistic regression, Support vector machine, varieties in phony and real news content.
Naïve Bayes algorithm, Random Forest algorithm, Deep neural
network. Now a day’s fake news identification has gained a
developing enthusiasm from the overall population and
I. INTRODUCTION analysts as the spreading of deception online increments,
especially in news sources like internet based life channels,
T he term of false news is regularly connected with
news web journals, and online papers.

misdirection, gossip, fraud, deception and so on. Related work The section I gives introduction of fake news detection
may be for gossip, rumor, fraud and fabrication. Issues system utilizing RF, NB, SVM, DNN, and LR classification
identified with such subjects are frequently been seen techniques. Section II introduces the writing survey of present
depending on classification. Likewise, a large portion of frameworks and proposed framework design is given in
printed matters have seen false information location as a Section III subtleties area IV givess test examination, results
binary classification issue. Some organization uses numerous and talk of new framework. Area V finishes up our proposed
deep learning techniques on databases made out of false framework.
information articles and genuine information articles mined
from media news database and saw classifiers is great to II. LITERATURE SURVEY
classify huge data.

There are various classifiers existed. From LR, RF and NB to Rumor play a complex part in clashes of numerous types such
DNN. Classification categorized into types LR, NB, SVM, RF as intergroup and global [1]. Their vitality is considered about
and DNN. These are most commonly utilized classifiers. the way that in current day, the society ability of clashing
Classification is termed as “Supervised learning”. Linear gatherings to influence each other are very constrained with
Classifiers are LR, NB, SVM, Decision Trees, RF, Neural administrative cutoff points and famous assessment [2].
Networks (NN) are classifiers in Machine Learning (ML).
Features of society engage improvement of significance of
those methodologies for affection that are considered about
difference in greater part's musings in regards to battle toward

978-1-7281-3241-9/19/$31.00 © 2019 IEEE 411


2019 1st International Conference on Advances in Information Technology

In the other hand WhatsApp in Brazil [13] constrains the


a way that is gainful for one of conflicting get-togethers. Such number of messages with the same content that can be shared
changes are possible on account of exceptionally picked data by the same user, is using a AI to find abuses and harass
spreading first through the expansive communications and messages and like Facebook using third-party agencies (TPA)
through channels of easygoing correspondence. Bits of gossip to check and classify news. Also, the WhatsApp team trained
in this case raises genuine weapon in clashes [3]. and showed the capabilities of their app to the current
president candidates and their communication team in an
V. Subrahmanian et al [4] C. Shao et al. [5] says Because of attempt to avoid possible use of the app for fake news spread.
the promotion of artificial intelligence (AI) and related
territories of intellectual processing, the number of bots has III. SYSTEM ARCHITECTURE
detonated all through the system. In this segment, author will
investigate their job in the gossip rumors and misinformation A. System Architeure
spreading.
At first we get dataset as news or articles list form. At that
William Y Yang gives example, there can be sure point on that dataset processing is done for feature mining and
misrepresented or deceiving data joined to a genuine articles determination. Data should be pre-processing. It incorporates
or news. In this way, the whole news or article cannot be expulsion of punctuations, URL's, images, stemming and stop
accepted as totally obvious or cannot be disposed of as words. At that point NLP processing for extricating highlights
completely false. This issue was tended to in [6] where it is and dependent on that training file is made and afterward we
presented Liar dataset including a generous size of small classify that information utilizing classifiers, for example, LR,
political articles distinctive category comments deciding SVM, NB, RF and DNN. News Dataset: Dataset of news is
measure of false substance of every declaration. taken from online.

Aswini Thota et al [7] introduced automated detection system Processing: Content information needs processing to execute
for fake news detection he said is difficult work to complete as AI on them. There are kinds of methods generally utilized to
it needs representation to recognize nuances in natural change over content information into a structure that is
language. To address the gaps between related and unrelated prepared for demonstrating. The information processing steps
news, they introduce neural network architecture to exactly that are applied on headlines and articles.
guess the posture that is to be provided headline and article
body. Stemming and stopwords removal: Stemming technique
utilized to detach suffixes or prefixes from a word. And Stops
Wang gives the LIAR dataset [8], consisting of statements Words may be filtered and processed from content because are
made by public figures, annotated with its veracity, extracted increasingly normal and hold less significant data. Stop words
from the site polifact.com for research focused on rumors, generally utilized as a linking element of the sentences, for
there is the PHEME dataset, by Zubiaga et al. [10]. This instance, “and”, “or”, “but”, “of”, “in”, “from”, “to”, “a”,
dataset groups a number of tweets in rumor threads, and “an”, and “the” etc. This kind of stop words that are of not
associate them with news events. vital and it can utilize important processing time, since to take
away stop words is crucial task.
N. J. Conroy et al [9] enroll numerous methodologies that
appear to be encouraging close to goal of effectively arranging Natural Language Processing: there is huge amount of
deluding news. Additionally, noted basic substance related n information stored in file but it is inaccessible to computer
grams and shallow grammatical feature known as POS assisted analysis NLP allows to analysts to find crucial
labeling have shown not sufficient for the grouping task. information.

Veronica Perez-Rosas et al. [11] use dataset to detect fake Feature Extraction: Here important features are extracted to
news from different domains. Generally they obtained news generate training file.
from an assortment of standard news sites in the US, for
example, CNN, ABCNews, USAToday, FoxNews,
NewYorkTimes, Bloomberg, and CNET.

M. Granik et al. [12] present naïve bayes to detect fake news


here they utilizes Facebook news post dataset also he check
accuracy of the system and it gives 74% accuracy.

978-1-7281-3241-9/19/$31.00 © 2019 IEEE 412


2019 1st International Conference on Advances in Information Technology

Fig 1. System Architecture

B. Algorithm
6: Go to step 1

Deep Neural Network 7: The training cycle is completed.

Back propagation Method


{I1, I2,…., In} are given Inputs C. Experimental Setup

Where, I. Framework expands on Java system. For advancement, the


Ii is Input Layer input in, and i=1, 2, n. NetBeans (adaptation 8.1) instrument is used MySQL as
J is hidden layer where Sigmoid Transfer function is backend. Classification algorithms along with numerous
utilized to estimate output of every neuron in hidden layer. extraction techniques. Framework doesn't require a specific
O is output layer. equipment to run any standard machine can run the
and are weights for hidden and output layer. application.

Sigmoid transfer function is as follows: II. Database Description: News dataset is downloaded from
web it consists of news from different sites.
1
1 +

The steps are as follow:


1: Execute network forward with input records to obtain D. Comparison Results
network yield.
This part represents performance of LR, RF, SVM NB and
DNN algorithms. Figure 2 exhibit Memory Comparison of
2: Calculate error value.
five classification algorithms for numerous Threshold. X-axis
3: Error signal vectors and of both layers are calculated. represent Algorithms & Y-axis present Memory in bytes.
Vector is for output layer, is for hidden layer. According to graph DNN uses more memory that NB and NB
uses more than SVM also memory required for LR is less than
RF and SVM.
4: Adjust output layer weights. Table 1 shows memory comparison in tabular form LR
requires less memory than other algorithms.
5: Adjust hidden layer weights.

978-1-7281-3241-9/19/$31.00 © 2019 IEEE 413


2019 1st International Conference on Advances in Information Technology

TABLE I. MEMORY COMPARISON

SR No. Algorithm Memory in bytes


1 LR 920000000
2 RF 950000000
3 SVM 1100000000
4 NB 1100000000
5 DNN 1120000000

Fig. 3: Time Comparison Graph

Figure 3 depict Time comparison of LR, RF, SVM NB and


DNN classification algorithms for different time in ms. X-axis
present algorithms and Y- axis present Time in ms. DNN
requires less time than other classification algorithms. DNN
algorithm requires 400ms which is much less than all other
methods.

Fig. 2: Memory Comparison Graph Table 3 shows accuracy comparison of algorithms where
accuracy of DNN is greater than other four algorithms.
Figure 2 depicts comparison of LR, RF, SVM NB and DNN
TABLE III. ACCURACY COMPARISON
classification algorithms for different byte value. X-axis
present algorithms and Y- axis present memory in bytes. LR SR No. Algorithm Accuracy in %
algorithm utilizes less memory compared to all other methods. 1 LR 75
2 RF 77
3 SVM 79
Table 2 shows Time comparison table where DNN requires 4 NB 89
less time than other algorithms. 5 DNN 91

TABLE II. TIME COMPARISON

SR No. Algorithm Time in ms


1 LR 3750
2 RF 1800
3 SVM 2800
4 NB 1900
5 DNN 400

Fig. 4: Accuracy Comparison Graph

978-1-7281-3241-9/19/$31.00 © 2019 IEEE 414


2019 1st International Conference on Advances in Information Technology

Figure 4 represent the Accuracy comparison of LR, RF, SVM International Conference on Computational Linguistics, pages 3391–3401
NB and DNN classification algorithms for different Santa Fe, New Mexico, USA, August 20-26, 2018.
percentage. The X-axis display algorithms and Y- axis display [12] M. Granik, V. Mesyura, "Fake news detection using naive Bayes
accuracy %. Accuracy percentage of DNN is more than rest of classifier," 2017 IEEE First Ukraine Conference on Electrical and Computer
other classifiers. Engineering (UKRCON), Kiev, 2017, pp. 900-903.

[13] B. Capelas, “Whatsapp anuncia planos para tentarcombater ’fake news’


IV. CONCLUSION no brasil,”Estado
Here we, present different algorithms for classifying
[14] Cade Metz. (2016, Dec. 16). "The bittersweet sweepstakes to build an AI
statements made by public figures were implemented. In that destroys fake news". Available:
proposed system LR, RF, SVM NB and DNN classification https://www.wired.com/2016/12/bittersweet-sweepstakes-build aidestroys-
techniques are utilized that will help to detect fake news. fake-news/
Classification techniques like LR, RF, SVM NB and DNN for
[15] Fake news RAMP: classify statements of public figures. (n.d.) [Online].
feature selection and extraction utilized, DNN will work fine Available: https://www.ramp.studio/problems/fake_news
in execution time and accuracy cases but it needs large
memory than other. Then we compare NB, RF, SVM, LR and [16] The Principles of the Truth-O-Meter: PolitiFact’s methodology for
DNN on basis in terms of time and memory and accuracy, independent fact-checking. (2018, Feb. 12) http://www.politifact.com/truth-o
meter/article/2018/feb/12/principlestruth- o-meter-politifacts-methodology-i/.
according to comparison results it exhibit that DNN Algorithm Accessed Mar. 24, 2018.
is improved than rest algorithm in accuracy and time kind
because rest classifiers requires more time and gives less
accuracy hence DNN is more crucial to detect the fake news.

REFERENCES
[1] Arif, A., Robinson, J. J., Stanek, S. A., Fichet, E. S., Townsend, P.,
Worku, Z., &Starbird, K. (2017, February). A Closer Look at the Self-
Correcting Crowd: Examining Corrections in Online Rumors. In Proceedings
of the 2017 ACM Conference on Computer Supported Cooperative Work and
Social Computing (pp.155-168). ACM.

[2] Louni, A., Santhanakrishnan, A., &Subbalakshmi, K. P (2015).


Identification of source of rumors in social networks with incomplete
information. arXiv preprint arXiv:1509.00557.

[3] Pasquini, C., Brunetta, C., Vinci, A. F., Conotter, V., &Boato, G. (2015,
June). Towards the verification of image integrity in online news. In
Multimedia & Expo Workshops (ICMEW), 2015 IEEE International
Conference on (pp. 1-6). IEEE.

[4] V. Subrahmanian, A. Azaria, S. Durst, V. Kagan, A. Galstyan, K. Lerman,


L. Zhu, E. Ferrara, A. Flammini, and F. Menczer, “The DARPA Twitter Bot
Challenge,” Computer, vol. 49, pp. 38–46, June 2016.

[5] C. Shao, G. L. Ciampaglia, O. Varol, A. Flammini, and F. Menczer, “The


spread of fake news by social bots,” arXiv preprint arXiv:1707.07592, 2017.

[6] William Yang Wang.” Liar, Liar Pants on Fire”: A New Benchmark
Dataset for Fake News Detection. ACL 2017. arXiv:1705.00648

[7] Aswini Thota1, Priyanka Tilak1, Simeratjeet Ahluwalia1, Nibhrat Lohia1


,” Fake News Detection: A Deep Learning Approach ” SMU Data Science
Review, Vol. 1 [2018], No. 3, Art. 10

[8] W. Y. Wang, “liar, liar pants on fire”:A new benchmark dataset for fake
news detection,”ar Xivpreprint arXiv:1705.00648, 2017.

[9] N. J. Conroy, V. L. Rubin, and Y. Chen, “Automatic deception detection:


Methods for finding fake news,” Proceedings of the Association for
Information.

[10] A. Zubiaga, M. Liakata, R. Procter, G. W. S. Hoi, and P. Tolmie,


“Analysing how people orient to and spread rumours in social media by
looking at conversation althreads,”PloS one, vol. 11, no. 3, p. e0150989, 2016.

[11] Ver´onica P´erez-Rosas, Bennett Kleinberg, Alexandra Lefevre ,Rada


Mihalcea, “Automatic Detection of Fake News” Proceedings of the 27th

978-1-7281-3241-9/19/$31.00 © 2019 IEEE 415

You might also like