Combining transformer, Bi-LSTM and CNN for efficient Vietnamese fake news detection
Abstract: 195
|
PDF: 105
##plugins.themes.academic_pro.article.main##
Author
-
Chi Khanh NinhThe University of Danang – Vietnam - Korea University of Information and Communication Technology, VietnamTrung Hung VoThe University of Danang – University of Technology and Education, VietnamDuy Khanh NinhThe University of Danang – University of Science and Technology, Vietnam
Keywords:
Abstract
Fake news spreading widely on digital platforms presents challenges where the rapidly evolving information landscape lacks effective detection. This paper introduces a hybrid deep learning model that integrates Transformer, Bidirectional Long Short-Term Memory (Bi-LSTM), and Convolutional Neural Network architectures to jointly capture both semantic and structural characteristics of Vietnamese text for the task of fake news detection. A high-quality dataset comprising 2,336 manually annotated Vietnamese news articles was developed, spanning three major domains: Politics, Healthcare, and Society. The proposed model was trained and evaluated on this dataset and benchmarked against two widely adopted baseline models: Gated Recurrent Unit and Bi-LSTM. Experimental results indicate that the proposed model achieves superior performance, attaining an overall accuracy of 95.3% and an F1-score of 0.951. These findings underscore the efficacy of combining multi-layered linguistic representations in enhancing Vietnamese fake news detection and contribute a valuable annotated resource for future studies in this domain.
References
-
[1] H. Allcott and M. Gentzkow, “Social media and fake news in the 2016 election”, Journal of Economic Perspectives, vol. 31, no. 2, pp. 211–236, 2017. doi: 10.1257/jep.31.2.211
[2] K. Shu, A. Sliva, S. Wang, J. Tang, and H. Liu, “Fake News Detection on Social Media: A Data Mining Perspective”, ACM SIGKDD Explorations Newsletter, vol. 19, no. 1, pp. 22–36. https://doi.org/10.1145/3137597.313760
[3] S. I. Manzoor, J. Singla and Nikita, “Fake News Detection Using Machine Learning Approaches: A Systematic Review,” in 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), Tirunelveli, India, 2019, pp. 230–234.
[4] V. Sivasangari, V. A. Pandian, and R. Santhya, “A modern approach to identify the fake news using machine learning”, International Journal of Pure and Applied Mathematics, vol. 118, no. 20, pp. 3787–3795, 2018.
[5] N. Ruchansky, S. Seo, and Y. Liu, “CSI: A hybrid deep model for fake news detection”, in Proceedings of the 2017 ACM Conference on Information and Knowledge Management (CIKM), New York, NY, USA, 2017, pp. 797–806.
[6] A. Vaswani et al., “Attention Is All You Need”, in Proceedings of 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 2017, pp. 6000–6010.
[7] J. Devlin, M. -W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”, in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, Minnesota, USA, 2019, pp. 4171–4186.
[8] K. Shu, D. Mahudeswaran, S. Wang, D. Lee, and H. Liu, “FakeNewsNet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media”, Big Data, vol. 8, no. 3, pp. 171–188, 2020. doi: 10.1089/big.2020.0062
[9] W. Y. Wang, “‘Liar, Liar Pants on Fire’: A New Benchmark Dataset for Fake News Detection”, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, Canada, 2017, pp. 422–426.
[10] D. V. Vo and P. Do, “Detecting Vietnamese fake news”, CTU Journal of Innovation and Sustainable Development, vol. 15 (Special issue on ISDS), pp. 39–46, 2023.
[11] Q. T. Ho and M. N. Pham, “VFND - Vietnamese fake news datasets”, github.com, February, 2019. [Online]. Available: https://github.com/WhySchools/VFND-vietnamese-fake-news-datasets [Accessed March 03, 2025].
[12] V. -D. Le, “Vietnamese stopwords”, github.com, 2015. [Online]. Available: https://github.com/stopwords/vietnamese-stopwords [Accessed March 13, 2025].
[13] V. D. K. Nguyen and P. Do, “Fake news detection using knowledge graph and graph convolutional network”, Journal of Intelligent and Fuzzy Systems, vol. 45, no. 6, pp. 11107–11119, 2023. doi:10.3233/JIFS-233260
[14] M. Schuster and K. K. Paliwal, “Bidirectional recurrent neural networks”, IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673–2681, 1997. doi: 10.1109/78.650093
[15] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks”, in Proceedings of Advances in Neural Information Processing Systems 25 (NIPS 2012), 2012, pp. 1097–1105.
[16] H. T. Duong, V. H. Ho, and P. Do, “Fact-checking Vietnamese Information Using Knowledge Graph, Datalog, and KG-BERT”, ACM Transactions on Asian and Low-Resource Language Information Processing, vol. 22, no. 10, 240, 2023. https://doi.org/10.1145/3624557
[17] N. -D. Pham, T. -H. Le, T. -D. Do, T. -T. Vuong, T. -H. Vuong, and Q. -T. Ha, “Vietnamese Fake News Detection Based on Hybrid Transfer Learning Model and TF-IDF”, in Proceedings of the 13th International Conference on Knowledge and Systems Engineering (KSE), Bangkok, Thailand, 2021, pp. 1–6.
[18] K. D. Pham, D. Van Thin, and N. L. T. Nguyen, “Improving Vietnamese Fake News Detection based on Contextual Language Model and Handcrafted Features”, Journal of Science and Technology Development, vol. 26, no. 2, pp. 2705–2712, 2023.

