An Efficient Fake News Detection System Using Contextualized Embeddings and Recurrent Neural Network.

Junaid Ali Reshi; Rashid Ali

doi:10.9781/ijimai.2023.02.007

Authors

Junaid Ali Reshi Aligarh Muslim University
Rashid Ali Aligarh Muslim University

DOI:

https://doi.org/10.9781/ijimai.2023.02.007

Keywords:

Contextualized Embeddings, Deep Learning, Fake News Detection, Natural Language Processing

Abstract

Fake news is detrimental for society and individuals. Since the information dissipation through online media is too quick, an efficient system is needed to detect and counter the propagation of fake news on social media. Many studies have been performed in last few years to detect fake news on social media. This study focusses on the efficient detection of fake news on social media, through a Natural Language Processing based approach, using deep learning. For the detection of fake news, textual data have been analyzed in unidirectional way using sequential neural networks, or in bi-directional way using transformer architectures like Bidirectional Encoder Representations from Transformers (BERT). This paper proposes ConFaDe - a deep learning based fake news detection system that utilizes contextual embeddings generated from a transformer-based model. The model uses Masked Language Modelling and Replaced Token Detection in its pre-training to capture contextual and semantic information in the text. The proposed system outperforms the previously set benchmarks for fake news detection; including state-of-the-art approaches on a real-world fake news dataset, when evaluated using a set of standard performance metrics with an accuracy of 99.9 % and F1 macro of 99.9%. In contrast to the existing state-of-the-art model, the proposed system uses 90 percent less network parameters and is 75 percent lesser in size. Consequently, ConFaDe requires fewer hardware resources and less training time, and yet outperforms the existing fake news detection techniques, a step forward in the direction of Green Artificial Intelligence.

Downloads

Download data is not yet available.

References

X. Zhou and R. Zafarani, “A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities,” ACM Computing Surveys, vol. 53, no. 5, pp. 1–40, 2020, https://doi.org/10.1145/3395046

J. Golbeck et al., “Fake news vs satire: A dataset and analysis,” in WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science, pp. 17–21, 2018, https://doi.org/10.1145/3201064.3201100

X. Zhou, R. Zafarani, K. Shu, and H. Liu, “Fake News: Fundamental Theories, Detection Strategies and Challenges,” In Proceedings of the twelfth ACM international conference on web search and data mining, pp. 836-837, 2019, https://doi.org/10.1145/3289600.3291382

K. Shu, A. Sliva, S. Wang, J. Tang, and H. Liu, “Fake News Detection on Social Media: A Data Mining Perspective,” vol. 19, no.1, pp.22-36, 2017, https://doi.org/10.1145/3137597.3137600

X. Zhou and R. Zafarani, “Fake News: A Survey of Research, Detection Methods, and Opportunities,” ACM Computing Surveys, vol. 53, no.5, pp 1–40, 2018, https://doi.org/10.1145/3395046

S. Vosoughi, D. Roy, and S. Aral, “The spread of true and false news online,” Science, vol. 359, no. 6380, pp. 1146–1151, 2018, https://doi.org/10.1126/science.aap9559

H. Allcott and M. Gentzkow, “Social media and fake news in the 2016 election,” Journal of Economic Perspectives, vol. 31, no. 2, pp. 211–236, 2017, https://doi.org/10.1257/jep.31.2.211

M. H. Alkawaz and S. A. Khan, “Use of Fake News and Social Media by Main Stream News Channels of India,” in Proceedings - 2020 6th IEEE International Colloquium on Signal Processing & Its Applications, pp. 93–97, 2020, https://doi.org/10.1109/CSPA48992.2020.9068673

S. Lewandowsky, U. K. H. Ecker, C. M. Seifert, N. Schwarz, and J. Cook, “Misinformation and Its Correction: Continued Influence and Successful Debiasing,” Psychological Science in the Public Interest, Supplement, vol. 13, no. 3, pp. 106–131, 2012, https://doi.org/10.1177/15291006124510

N. Walter and R. Tukachinsky, “A Meta-Analytic Examination of the Continued Influence of Misinformation in the Face of Correction: How Powerful Is It, Why Does It Happen, and How to Stop It?,” Communication Research, vol. 47, no. 2, pp. 155–177, 2020, https://doi.org/10.1177/0093650219854600

Y. Tsfati, H. G. Boomgaarden, J. Strömbäck, R. Vliegenthart, A. Damstra, and E. Lindgren, “Causes and consequences of mainstream media dissemination of fake news: literature review and synthesis,” vol. 44, no. 2, pp. 157–173, 2020, https://doi.org/10.1080/23808985.2020.1759443

R. Zafarani, X. Zhou, K. Shu, and H. Liu, “Fake news research: Theories, detection strategies, and open problems,” in Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 3207–3208, 2019, https://doi.org/10.1145/3292500.3332287

T.A. Shaikh, W.A. Mir, I. Mohammad, and R. Ali, “An Intelligent Healthcare System for Automated Alzheimer’s Disease Prediction and Personalized Care,” International Journal of Next-Generation Computing, vol. 12, no. 2, pp.240-253, 2021, https://doi.org/10.47164/ijngc.v12i2.196

K. Shu, S. Wang, and H. Liu, “Beyond news contents: The role of social context for fake news detection,” in WSDM 2019 - Proceedings of the 12th ACM International Conference on Web Search and Data Mining, pp. 312–320, 2019, https://doi.org/10.1145/3289600.3290994

T. Young, D. Hazarika, S. Poria, and E. Cambria, “Recent trends in deep learning based natural language processing,” in IEEE Computational Intelligence Magazine, vol. 13, no. 3, pp. 55-75, 2018, https://doi.org/10.1109/MCI.2018.2840738

N. Ruchansky, S. Seo, and Y. Liu, “CSI: A hybrid deep model for fake news detection,” in International Conference on Information and Knowledge Management, Proceedings, Nov. 2017, pp. 797–806, 2017, https://doi.org/10.1145/3132847.3132877

C. Song, K. Shu, and B. Wu, “Temporally evolving graph neural network for fake news detection,” Information Processing & Management, vol. 58, no. 6, 2021, https://doi.org/10.1016/j.ipm.2021.102712

L. Weng, F. Menczer, and Y. Y. Ahn, “Virality prediction and community structure in social networks,” Scientific Reports, vol. 3, no. 1, pp. 1–6, 2013, https://doi.org/10.1038/srep02522

D. Centola, “The spread of behavior in an online social network experiment,” Science, vol. 329, no. 5996, pp. 1194–1197, 2010, https://doi.org/10.1126/science.1185231

R. Oshikawa, J. Qian, and W. Y. Wang, “A survey on natural language processing for fake news detection,” in LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings, European Language Resources Association (ELRA), 2020, pp. 6086–6093.

J. Pennington, R. Socher, and C. D. Manning, “GloVe: Global vectors for word representation,” in EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp. 1532–1543, 2014, http://dx.doi.org/10.3115/v1/D14-1162

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” in NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, pp. 4171–4186, 2019, https://doi.org/10.18653/v1/n19-1423

K. Clark, M.-T. Luong, Q. v. Le, and C. D. Manning, “ELECTRA: Pretraining Text Encoders as Discriminators Rather Than Generators,” arXiv preprint arXiv:2003.10555, 2020, https://doi.org/10.48550/arXiv.2003.10555

A. Bondielli, F. Marcelloni “A survey on fake news and rumour detection techniques,” Information Sciences, vol. 497, pp. 38–55, 2019, https://doi.org/10.1016/j.ins.2019.05.035

V. K. Singh, R. Dasgupta, D. Sonagra, K. Raman, and I. Ghosh, “Automated Fake News Detection Using Linguistic Analysis and Machine Learning,” In International conference on social computing, behavioral-cultural modeling, & prediction and behavior representation in modeling and simulation (SBP-BRiMS), pp. 1-3, 2017, https://doi.org/10.13140/RG.2.2.16825.67687

S. P. Yadav, “Emotion recognition model based on facial expressions,” Multimedia Tools and Applications, vol. 80, no. 17, pp. 26357–26379, 2021, https://doi.org/10.1007/s11042-021-10962-5

S. P. Yadav, “Vision-based detection, tracking, and classification of vehicles,” IEIE Transactions on Smart Processing and Computing, vol. 9, no. 6, pp. 427–434, 2020, https://doi.org/10.5573/IEIESPC.2020.9.6.427

V. L. Rubin, Y. Chen, and N. J. Conroy, “Deception Detection for News: Three Types of Fake News,” in Proceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community, pp.1–4, 2015, https://doi.org/10.1002/pra2.2015.145052010083

C. Castillo, M. Mendoza, and B. Poblete, “Predicting information credibility in time-sensitive social media,” Internet Research, vol. 23, no. 5, pp. 560–588, 2013, https://doi.org/10.1108/IntR-05-2012-0095

H. Ahmed, I. Traore, and S. Saad, “Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques,” in Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments. ISDDC 2017. Lecture Notes in Computer Science, vol. 10618, pp. 127–138, 2017, https://doi.org/10.1007/978-3-319-69155-8_9

A. Giachanou, G. Zhang, and P. Rosso, “Multimodal fake news detection with textual, visual and semantic information,” In Proceedings of Text, Speech, and Dialogue: 23rd International Conference, Springer-Verlag, Berlin, Heidelberg, pp. 30–38, 2020, https://doi.org/10.1007/978-3-030-58323-1_3

S. Singhal, R. R. Shah, T. Chakraborty, P. Kumaraguru and S. Satoh, “SpotFake: A Multimodal Framework for Fake News Detection,” 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), Singapore, pp. 39-47, 2019, https://doi.org/10.1109/BigMM.2019.00-44

T. Mikolov, K. Chen, G. Corrado, and J. Dean, “Efficient estimation of word representations in vector space,” 1st International Conference on Learning Representations, Workshop Track Proceedings, USA, 2013.

K. He, X. Zhang, S. Ren and J. Sun, “Deep Residual Learning for Image Recognition,” 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016, https://doi.org/10.1109/CVPR.2016.90

G. Huang, Z. Liu, L. Van Der Maaten and K. Q. Weinberger, “Densely Connected Convolutional Networks,” In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261-2269, 2017, https://doi.org/10.1109/CVPR.2017.243

R. K. Kaliyar, A. Goswami, and P. Narang, “FakeBERT: Fake news detection in social media with a BERT-based deep learning approach,” Multimedia Tools and Applications, vol. 80, no. 8, pp. 11765–11788, 2021, https://doi.org/10.1007

K. Cho, B.van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. “Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation,” In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 1724–1734, 2014, https://doi.org/10.3115/v1/d14-1179

K. Greff, R. K. Srivastava, J. Koutník, B. R. Steunebrink and J. Schmidhuber, “LSTM: A Search Space Odyssey,” in IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 10, pp. 2222-2232, Oct. 2017, https://doi.org/10.1109/TNNLS.2016.2582924

S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997, https://doi.org/10.1162/neco.1997.9.8.1735

M. Schuster and K. K. Paliwal, “Bidirectional recurrent neural networks,” in IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673-2681, 1997, https://doi.org/10.1109/78.650093

S. Girgis, E. Amer and M. Gadallah, “Deep Learning Algorithms for Detecting Fake News in Online Text,” 2018 13th International Conference on Computer Engineering and Systems (ICCES), Cairo, Egypt, 2018, pp. 93-97, https://doi.org/10.1109/ICCES.2018.8639198

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” In Proceedings of NAACL-HLT, pp. 4171-4186, 2019, https://doi.org/10.18653/v1/n19-1423

A. Vaswani et al., “Attention is all you need,” Advances in neural information processing systems, vol. 30, 2017, pp. 5999–6009.

S. A. Alkhodair, B. C. M. Fung, S. H. H. Ding, W. K. Cheung, and S. C. Huang, “Detecting High-Engaging Breaking News Rumors in Social Media,” ACM Transactions on Management Information Systems, vol. 12, no. 1, pp. 1-16, 2021, https://doi.org/10.1145/3416703

T. Bian et al., “Rumor detection on social media with bi-directional graph convolutional networks,” in Proceedings of the AAAI conference on artificial intelligence vol. 34, no. 01, pp. 549-556, 2020.

J. Shin, L. Jian, K. Driscoll, and F. Bar, “The diffusion of misinformation on social media: Temporal pattern, message, and source,” Computers in Human Behavior, vol. 83, pp. 278–287, 2018, https://doi.org/10.1016/j.chb.2018.02.008

X. Zhou and R. Zafarani, “Network-based Fake News Detection: A Pattern-driven Approach,” ACM SIGKDD Explorations Newsletter, vol. 21, no. 2, pp. 48–60, 2019, https://doi.org/10.1145/3373464.3373473

C. Song, C. Yang, H. Chen, C. Tu, Z. Liu and M. Sun, “CED: Credible Early Detection of Social Media Rumors,” in IEEE Transactions on Knowledge and Data Engineering, vol. 33, no. 8, pp. 3035-3047, 2021, https://doi.org/10.1109/TKDE.2019.2961675

H. G. Oliveira, T. Sousa, A. Alves, “Assessing Lexical-Semantic Regularities in Portuguese Word Embeddings,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 6, no. 5, pp. 34-46, 2021, https://doi.org/10.9781/ijimai.2021.02.006

S. P. Yadav, S. Zaidi, A. Mishra, and V. Yadav, “Survey on Machine Learning in Speech Emotion Recognition and Vision Systems Using a Recurrent Neural Network (RNN),” Archives of Computational Methods in Engineering, vol. 29, no. 3, pp. 1753–1770, 2021, https://doi.org/10.1007/s11831-021-09647-x

J. Ma et al., “Detecting rumors from microblogs with recurrent neural networks,” In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp. 3818–3824, 2016, https://dl.acm.org/doi/10.5555/3061053.3061153

Q. Le and T. Mikolov, “Distributed representations of sentences and documents,” in Proceedings of the 31st International Conference on International Conference on Machine Learning, 2014, pp. 1188–1196. https://dl.acm.org/doi/10.5555/3044805.3045025

S. Arora, Y. Liang, and T. Ma, “A simple but tough-to-beat baseline for sentence embeddings,” In Proceedings of International conference on learning representations, 2017.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, “Distributed Representations of Words and Phrases and their Compositionality,” In Proceedings of the 26th International Conference on Neural Information Processing Systems, vol. 2, pp. 3111–3119, 2013.

A. Agarwal, M. Mittal, A. Pathak, L. M. Goyal, “Fake News Detection Using a Blend of Neural Networks: An Application of Deep Learning,” S.N. Computer Science, vol. 1, no. 3, 2020, https://doi.org/10.1007/s42979-020-00165-4

G. K. W. Huang and J. C. Lee, “Hyperpartisan News and Articles Detection Using BERT and ELMo,” 2019 International Conference on Computer and Drone Applications (IConDA), Kuching, Malaysia, pp. 29-32, 2019, https://doi.org/10.1109/IConDA47345.2019.9034917

R. M. Silva, R. L. S. Santos, T. A. Almeida, and T. A. S. Pardo, “Towards automatically filtering fake news in Portuguese,” Expert Systems with Applications, vol. 146, pp. 113199, 2020, https://doi.org/10.1016/j.eswa.2020.113199

M. Samadi, M. Mousavian, and S. Momtazi, “Deep contextualized text representation and learning for fake news detection,” Information Processing & Management, vol. 58, no. 6, pp. 102723, 2021, https://doi.org/10.1016/j.ipm.2021.102723

M. Hoang, O. A. Bihorac, and J. Rouces, “Aspect-Based Sentiment Analysis using BERT,” In Proceedings of the 22nd Nordic conference on computational linguistics, pp. 187-196, 2019.

A. Alessa, M. Faezipour and Z. Alhassan, “Text Classification of FluRelated Tweets Using FastText with Sentiment and Keyword Features,” In Proceedings of IEEE International Conference on Healthcare Informatics, pp. 366-367, 2018, https://doi.org/10.1109/ICHI.2018.00058

B. Büyüköz, A. Hürriyetoğlu, and A. Özgür, “Analyzing ELMo and DistilBERT on Socio-political News Classification,” In Proceedings of the Workshop on Automated Extraction of Socio-political Events from News 2020, pp. 9-18, 2020.

F. Scarselli, M. Gori, A. C. Tsoi, M. Hagenbuchner and G. Monfardini, “The Graph Neural Network Model,” in IEEE Transactions on Neural Networks, vol. 20, no. 1, pp. 61-80, 2009, https://doi.org/10.1109/TNN.2008.2005605

M. F. Mridha, A. J. Keya, M. A. Hamid, M. M. Monowar and M. S. Rahman, “A Comprehensive Review on Fake News Detection with Deep Learning,” in IEEE Access, vol. 9, pp. 156151-156170, 2021, https://doi.org/10.1109/ACCESS.2021.3129329

B. Khoo, R. C. W. Phan, and C. H. Lim, “Deepfake attribution: On the source identification of artificially generated images,” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 12, no. 3, p. e1438, 2022, https://doi.org/10.1002/widm.1438

J. F. Low, B. C. M. Fung, F. Iqbal, and S. C. Huang, “Distinguishing between fake news and satire with transformers,” Expert Systems with Applications, vol. 187, pp. 115824, 2022, https://doi.org/10.1016/j.eswa.2021.115824

J. A. Reshi and R. Ali, “Rumor proliferation and detection in Social Media: A Review,” 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS), Coimbatore, India, 2019, pp. 1156- 1160, https://doi.org/10.1109/ICACCS.2019.8728321

J. Lies, “Marketing Intelligence: Boom or Bust of Service Marketing?”, International Journal of Interactive Multimedia and Artificial Intelligence, vol. 7, no. 7, pp. 115-124, 2022, https://doi.org/10.9781/ijimai.2022.10.001

S. A. Alameri and M. Mohd, “Comparison of Fake News Detection using Machine Learning and Deep Learning Techniques,” In Proceedings of 2021 3rd International Cyber Resilience Conference (CRC), pp. 1-6, 2021, https://doi.org/10.1109/CRC50527.2021.9392458

M. Z. Khan and O. H. Alhazmi, “Study and analysis of unreliable news based on content acquired using ensemble learning (prevalence of fake news on social media),” International Journal of Systems Assurance Engineering and Management, vol. 11, no. 2, pp. 145–153, 2020, https://doi.org/10.1007/s13198-020-01016-4

V.L. Rubin, “Artificially Intelligent Solutions: Detection, Debunking, and Fact-Checking,” In Misinformation and Disinformation, pp. 207-263, 2022, https://doi.org/10.1007/978-3-030-95656-1_7

D. P. Kingma and Jimmy Ba, “Adam: A Method for Stochastic Optimization,” arXiv preprint arXiv:1412.6980, 2014.

M. Sokolova and G. Lapalme, “A systematic analysis of performance measures for classification tasks,” Information Processing and Management: an International Journal, vol. 45, no. 4, pp. 427–437, 2009, https://doi.org/10.1016/j.ipm.2009.03.002

Z. C. Lipton, C. Elkan, and B. Naryanaswamy, “Optimal thresholding of classifiers to maximize F1 measure,” in Lecture Notes in Computer Science, vol. 8725, pp. 225–239, 2014, https://doi.org/10.1007/978-3-662-44851-9_15

B. Ghanem, P. Rosso, and F. Rangel, “Stance Detection in Fake News A Combined Feature Representation,” in Proceedings of the First Workshop on Fact Extraction and VERification (FEVER), pp. 66–71, 2018, https://doi.org/10.18653/v1/W18-5510

Y. Liu and Y.-F. B. Wu, “Early Detection of Fake News on Social Media Through Propagation Path Classification with Recurrent and Convolutional Networks,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, 2018, https://doi.org/10.1609/aaai.v32i1.11268

N. O’Brien, S. Latessa, G. Evangelopoulos, and X. Boix, “The Language of Fake News: Opening the Black-Box of Deep Learning Based Detectors,” In Proceedings of workshop on “A.I. for Social Good,” 32nd Conference on Neural Information Processing Systems, 2018.

E. M. Bender, T. Gebru, A. McMillan-Major, and S. Shmitchell, “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?” in Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pp. 610–623, 2021, https://doi.org/10.1145/3442188.3445922

E. Strubell, A. Ganesh, and A. McCallum, “Energy and Policy Considerations for Deep Learning in NLP,” in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 3645–3650, 2019, https://doi.org/10.18653/v1/P19-1355