Predatory Conversation Detection Using Transfer Learning Approach
Loading...
Files
Date
2022
Authors
Agarwal, Nancy
Unlu, Tugce
Wani, Mudasir Ahmad
Bours, Patrick
Journal Title
Journal ISSN
Volume Title
Publisher
Springer International Publishing Ag
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
Predatory conversation detection on social media can proactively prevent the netizens, including youngsters and children, from getting exploited by sexual predators. Earlier studies have majorly employed machine learning approaches such as Support Vector Machine (SVM) for detecting such conversations. Since deep learning frameworks have shown significant improvements in various text classification tasks, therefore, in this paper, we propose a deep learning-based classifier for detecting predatory conversations. Furthermore, instead of designing the system from the beginning, transfer learning has been proposed where the potential of the pre-trained BERT (Bidirectional Encoder Representations from Transformers) model is utilized to solve the predator detection problem. BERT is mostly used to encode the textual information of a document into its context-aware mathematical representation. The inclusion of this pre-trained model solves two major problems, i.e. feature extraction and Out of Vocabulary (OOV) terms. The proposed system comprises two components: a pre-trained BERT model and a feed-forward neural network. To design the classification system with a pretrained BERT model, two approaches (feature-based and fine-tuning) have been used. Based on these approaches two solutions are proposed, namely, BERT_frozen and BERT_tuned where the latter approach is seen performing better than the existing classifiers in terms of F-1 and F-0.5- scores.
Description
7th International Conference on Machine Learning, Optimization, and Data Science (LOD) / 1st Symposium on Artificial Intelligence and Neuroscience (ACAIN) -- OCT 04-08, 2021 -- ELECTR NETWORK
Keywords
Child grooming, Online sexual predators, Deep learning, Language modelling, BERT
Turkish CoHE Thesis Center URL
Fields of Science
Citation
3
WoS Q
N/A
Scopus Q
Q2
Source
Machine Learning, Optimization, and Data Science (Lod 2021), Pt I
Volume
13163
Issue
Start Page
488
End Page
499