key: cord-0160370-zsudca7t
authors: Dong, Xishuang; Qian, Lijun
title: Integrating Human-in-the-loop into Swarm Learning for Decentralized Fake News Detection
date: 2022-01-04
journal: nan
DOI: nan
sha: 3d0d35da40a265851969480569d7b06cdb412cb8
doc_id: 160370
cord_uid: zsudca7t

Social media has become an effective platform for generating and spreading fake news that can mislead people and even distort public opinion. Centralized methods for fake news detection, however, cannot effectively protect user privacy when data are collected centrally for training models. Moreover, they cannot fully involve user feedback in the loop of learning detection models to further enhance fake news detection. To overcome these challenges, this paper proposes a novel decentralized method, Human-in-the-loop Based Swarm Learning (HBSL), which integrates user feedback into the loop of learning and inference to recognize fake news in a decentralized manner without violating user privacy. It consists of distributed nodes that independently learn and detect fake news on local data. Furthermore, the detection models trained on these nodes can be enhanced through decentralized model merging. Experimental results demonstrate that the proposed method outperforms the state-of-the-art decentralized method at detecting fake news on a benchmark dataset.

The development of social media (e.g., Twitter) has significantly changed the way people collect information. Social media now dominates the generation and spreading of information, so that it is becoming more and more challenging for people to live without it [1], [2], [3]. Although it reduces the cost of information retrieval, the absence of systematic and effective management of information on social media platforms has made social media a hotbed for generating and spreading fake news [4], [5], where fake news refers to news that is intentionally and verifiably false [6], [7]. As a result, fake news causes confusion and severe damage to society. For example, during the COVID-19 pandemic, fake news has made it more difficult for people to find trustworthy and reliable information about the spread and treatment of the virus [8]. Thus, preventing the spreading of fake news is imperative to decrease political polarization, increase trust in public institutions, and improve decision-making in everyday life.

Machine learning techniques are effective at recognizing potential fake news by building models on news features, including content [9], [10], [11] and context [12], [13], [14]. Centralized detection methods have dominated this field by collecting big data in cloud storage to build high-performance detection models, where deep learning techniques such as convolutional neural networks (CNN), recurrent neural networks (RNN), and deep graph models outperform other techniques [11], [15], [16], [17], [18], [19]. Unfortunately, building these methods carries a potential risk: collecting large amounts of news data in a centralized manner may violate user privacy.

To reduce this risk, decentralized methods are required to preserve privacy, that is, to learn and infer on local data without uploading the data to a centralized data store for building detection models. Federated learning [20], [21], [22] is a distributed machine learning approach that enables training a high-quality centralized model while the training data remains distributed over a large number of users. However, it relies on a model center to control the process of updating user models, which increases the risk of attacks on the communication between the model center and the user models. In contrast, swarm learning [23] implements decentralized learning that maintains user privacy without the need for a central coordinator, thereby going beyond federated learning. It enables decentralized training over a set of nodes, where each node learns on its local training data and the models are enhanced collaboratively without sharing that data. The nodes share only the parameters (weights) derived from training the model on the local data. Thus, users at the nodes can maintain the confidentiality and privacy of the raw data. Nevertheless, swarm learning is not designed to leverage valuable user feedback on the inference results to further enhance learning performance, even though user feedback has been proven effective for improving data analysis in the loop of learning and inference [24].

In this paper, we propose human-in-the-loop based swarm learning (HBSL) to integrate user feedback into the learning and inference of swarm learning via human-in-the-loop (HITL) techniques [24]. HITL involves human activities in the process of building machine learning models to improve model performance via human knowledge [24]. We apply HITL to generate user feedback in the learning process to improve fake news detection. The detailed learning process is shown in Figure 1. It consists of three stages, namely (a) Local Learning, (b) Model Updating, and (c) Human Feedback, which form a loop that runs until the stop criteria are met. In stage (a), all nodes learn the detection models independently on their local data. In stage (b), the models in all nodes are updated by a master node through averaging model weights. In stage (c), all nodes apply the updated model to detect fake news on their local testing data. Afterwards, users provide feedback on the predictions for the testing data, which is used to extend the training data and improve detection. All nodes keep updating their models in a loop of these three stages until the stop criteria of learning are met. Experimental results demonstrate that the proposed method outperforms swarm learning on decentralized fake news detection.

[Figure 1 caption: ... for nodes i and j, respectively, are different as well, 1 ≤ i ≤ 4 and 1 ≤ j ≤ 4. In stage (a), all nodes learn the detection models on the local data. Then, in stage (b), these models are updated by a selected master node through averaging model weights. Finally, in stage (c), all nodes apply the updated model to detect fake news on the local testing data. Afterwards, users correct a portion of the predictions, selected by random sampling, as feedback to extend the training data. The updated models are enhanced by fine-tuning on the extended training sets. These three stages form a loop of learning and inference that updates the models until the stop criteria are met.]

In summary, the contributions of this study are:

• We propose a novel decentralized method that combines swarm learning and human-in-the-loop. In the learning and inference process, users at different nodes provide feedback on the inference results. The nodes then collect this feedback to extend their training sets and fine-tune the model, which enhances inference performance within a loop of learning and inference.
• We validate the proposed model on the LIAR benchmark [25]. Compared to swarm learning, the proposed model significantly improves the fake news detection performance of each node by learning on local training data together with user feedback.

The proposed decentralized method combines the advantages of swarm learning [23] and human-in-the-loop (HITL) [24], and accomplishes decentralized fake news detection via human feedback and model updates in swarm learning.

Swarm learning [23] is a decentralized machine-learning approach that unites edge computing with blockchain-based peer-to-peer networking and coordination while maintaining privacy without central control. It implements learning on distributed nodes (edges) without sharing data, which protects privacy in a local community. Compared to federated learning [20], [26], swarm learning is fully decentralized and has no central parameter control, as illustrated in Figure 2. In addition, swarm learning can update parameters to enhance the inference performance on nodes without a complex process, only through merging the parameters of models between nodes given by

(1)

where $P_M$ is the model parameter updated on a node, $P_k$ is the parameters from the $k$-th node, $w_k$ is the weight of the $k$-th node, and $n$ is the number of nodes participating in the merging process.
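The merging step can be illustrated with a minimal sketch. It assumes that the merge is a weighted average normalized by the sum of the node weights; the exact normalization used in Equation (1) is not reproduced here, and the function name `merge_parameters` and the flat-list parameter layout are hypothetical choices made only for illustration.

```python
from typing import List, Sequence


def merge_parameters(params: Sequence[Sequence[float]],
                     weights: Sequence[float]) -> List[float]:
    """Merge flat parameter vectors P_k with node weights w_k.

    Assumption: the merged vector P_M is the weighted average of the
    P_k, normalized by the total weight.
    """
    if not params or len(params) != len(weights):
        raise ValueError("need one weight per parameter vector")
    total = sum(weights)
    if total <= 0:
        raise ValueError("node weights must sum to a positive value")
    dim = len(params[0])
    if any(len(p) != dim for p in params):
        raise ValueError("all parameter vectors must have the same length")
    # Coordinate-wise weighted average over the n participating nodes.
    return [sum(w * p[j] for w, p in zip(weights, params)) / total
            for j in range(dim)]


# Two nodes, the second one weighted three times as heavily as the first.
merged = merge_parameters([[1.0, 2.0], [3.0, 4.0]], [1.0, 3.0])
```

With equal weights this reduces to the plain average of the node parameters.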

Human-in-the-loop can be applied to improve the performance of machine learning models by integrating human knowledge and experience into data analytics [24]. For example, human feedback can significantly reduce algorithmic bias in training and inference for various tasks in the field of natural language processing (NLP), such as text classification [27], syntactic and semantic parsing [28], topic modeling [29], text summarization [30], and sentiment analysis [31]. The general framework is shown in Figure 3. Humans can provide feedback on model training, data preprocessing, data collection for model inference, and even predictions, to improve model inference in a loop. The feedback can be associated with the inference results and their performance, the inference time cost, and the inference computation cost.

This paper proposes a model that combines swarm learning and HITL to implement decentralized fake news detection.

Local Learning: In this stage, each node independently learns the model on its local data. The local data is preprocessed by removing missing values, stemming, and building one-hot representations of sentences. These representations are the input to a model built with bidirectional recurrent neural networks (BRNN). Because of resource constraints such as limited memory, the model is kept simple and contains one embedding layer, one forward layer, and one backward layer. The embedding layer represents the input <$x_1$, $x_2$, $x_3$, ..., $x_t$, ..., $x_n$> as an embedding vector <$e_1$, $e_2$, $e_3$, ..., $e_t$, ..., $e_n$>, which resolves feature sparsity issues. Afterwards, the forward and backward layers generate two directional correlation features on the embedding vector. We then combine these two features as the output $z$ of the model, where $z$ is a sequence <$z_1$, $z_2$, $z_3$, ..., $z_t$, ..., $z_n$> and $z_t$ is given by

..., $x_t$, ..., $x_n$>. $a(\cdot)$ is the activation function for hidden layers. $w^f_z$, $w^f_h$, and $w^f_e$ are forward weights for three layers, namely, output layer, forward layer, and backward layer. $w^b_z$, $w^b_h$, and $w^b_e$ are backward weights for these three layers, respectively. $b_z$, $b^f_h$, and $b^b_h$ are the biases for these three layers.
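For orientation, a standard one-layer BRNN written with the symbols defined above would take the following form. This is a sketch under assumptions, not necessarily the exact formulation used in the paper: the hidden states $h^f_t$ and $h^b_t$ are names introduced here for illustration, and the pairing of each weight with a term follows the usual BRNN convention.

```latex
\begin{aligned}
h^f_t &= a\left(w^f_e\, e_t + w^f_h\, h^f_{t-1} + b^f_h\right), \\
h^b_t &= a\left(w^b_e\, e_t + w^b_h\, h^b_{t+1} + b^b_h\right), \\
z_t   &= w^f_z\, h^f_t + w^b_z\, h^b_t + b_z .
\end{aligned}
```

Under this reading, the forward state summarizes the inputs up to position $t$, the backward state summarizes the inputs from position $t$ onward, and $z_t$ combines both directions.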

We utilize the output z to calculate the cross entropy loss given by

where φ(z) is the sigmoid function. The four nodes learn models with this identical BRNN architecture on their local data.
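A cross-entropy loss computed on a sigmoid output can be sketched as follows. The sketch assumes one scalar output $z$ per statement, binary labels (1 for Fake, 0 for True), and a mean over the batch; the function names and the clipping constant `eps` are illustrative choices, not taken from the paper.

```python
import math
from typing import Sequence


def sigmoid(z: float) -> float:
    """Numerically stable sigmoid, phi(z) = 1 / (1 + exp(-z))."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def binary_cross_entropy(outputs: Sequence[float],
                         labels: Sequence[int],
                         eps: float = 1e-12) -> float:
    """Mean binary cross entropy between sigmoid(outputs) and labels."""
    if not outputs or len(outputs) != len(labels):
        raise ValueError("outputs and labels must be non-empty and aligned")
    total = 0.0
    for z, y in zip(outputs, labels):
        # Clip the probability so that log() never receives 0.
        p = min(max(sigmoid(z), eps), 1.0 - eps)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(outputs)


# An output of 0 means "undecided" (p = 0.5), so the loss is log(2).
loss = binary_cross_entropy([0.0], [1])
```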

Model Updating: In this stage, the master node is selected according to a predefined criterion: the node with the highest detection performance becomes the master node, which can be implemented by local communication between nodes. This node collects the models from all other nodes and updates the parameters $w_{master}$ as follows.

where $n = 4$ in the proposed method. Afterwards, the master node shares the updated model, which replaces the parameters of the models in the other nodes.
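The Model Updating stage can be sketched as below, assuming that the update is the unweighted average of the node parameters, as the text describes ("averaging model weights"). The function names, the node identifiers, and the use of flat parameter lists are hypothetical, and the accuracy values are made up for the example.

```python
from typing import Dict, List, Tuple


def select_master(accuracy: Dict[str, float]) -> str:
    """Pick the node with the highest detection accuracy as master."""
    return max(accuracy, key=accuracy.get)


def average_weights(node_weights: Dict[str, List[float]]) -> List[float]:
    """Coordinate-wise average of the parameter vectors of all n nodes."""
    vectors = list(node_weights.values())
    n, dim = len(vectors), len(vectors[0])
    return [sum(v[j] for v in vectors) / n for j in range(dim)]


def model_updating(node_weights: Dict[str, List[float]],
                   accuracy: Dict[str, float]
                   ) -> Tuple[str, Dict[str, List[float]]]:
    """Return the master node and the parameters every node holds afterwards."""
    master = select_master(accuracy)
    w_master = average_weights(node_weights)
    # The master shares w_master, replacing the parameters on every node.
    return master, {node: list(w_master) for node in node_weights}


weights = {"node1": [0.0, 4.0], "node2": [2.0, 0.0],
           "node3": [4.0, 4.0], "node4": [2.0, 0.0]}
acc = {"node1": 0.60, "node2": 0.70, "node3": 0.65, "node4": 0.62}
master, updated = model_updating(weights, acc)
```

In a real deployment the parameters would be exchanged over the network; here everything lives in one process to keep the example self-contained.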

Human Feedback: In the third stage, the nodes perform inference on their local testing data in parallel, where the testing dataset differs from node to node. For example, in the diagram in Figure 1, the dataset $d^{Test}_i$ for node $i$ is different from $d^{Test}_j$ for node $j$, where $i \neq j$, $1 \le i \le 4$, and $1 \le j \le 4$. Then, each user $i$ (Human) generates feedback on the prediction results. In detail, the user randomly selects a portion of the predictions and corrects them as feedback. This feedback is integrated into the corresponding training data to extend the training sets. For instance, feedback on the predictions for $d^{Test}_2$ generates a set of corrected predictions $d_2$. Then $d_2$ is integrated into $d^{Train}_2$ to extend the training set for the next round of local learning in the loop.
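The feedback step described above can be sketched as follows. The sketch assumes that a user reviewing a prediction always returns the correct label, which is modeled by the hypothetical callable `correct_label`; the label encoding (1 for Fake, 0 for True), the statement identifiers, and the 50% portion are likewise illustrative assumptions.

```python
import random
from typing import Callable, List, Sequence, Tuple

Example = Tuple[str, int]  # (statement, label); 1 = Fake, 0 = True (assumed)


def collect_feedback(statements: Sequence[str],
                     predictions: Sequence[int],
                     correct_label: Callable[[str, int], int],
                     portion: float,
                     rng: random.Random) -> List[Example]:
    """Randomly sample a portion of the predictions and have the user correct them."""
    if not 0.0 <= portion <= 1.0:
        raise ValueError("portion must lie in [0, 1]")
    k = int(portion * len(statements))
    chosen = rng.sample(range(len(statements)), k)
    # The user sees the statement and the model's prediction and returns
    # the label that should be used for training.
    return [(statements[i], correct_label(statements[i], predictions[i]))
            for i in chosen]


truth = {"s1": 1, "s2": 0, "s3": 1, "s4": 0}   # stands in for the human user
test_statements = list(truth)
model_predictions = [1, 1, 0, 0]               # two of these are wrong
train_set: List[Example] = [("s0", 1)]

feedback = collect_feedback(test_statements, model_predictions,
                            lambda text, predicted: truth[text],
                            portion=0.5, rng=random.Random(0))
train_set = train_set + feedback               # extended training set
```

The extended `train_set` is what the next round of local learning would fine-tune on.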

Stages (a) to (c) form the loop that updates each node's model parameters to enhance detection performance. The detailed learning process is shown in Algorithm 1, where the portion of predictions is predefined.

[Algorithm 1, steps recovered from the source:
  ...
  Updating the parameter $w_{t,i}$ of the model of the master node $i$ with equation (6)
  Replacing the model parameters of the other nodes with $w_{t,i}$
  Performing inference on each node's data $d^{Test}_{t,i}$, $1 \le i \le 4$
  Generating feedback $d_{t,i}$ by correcting a portion of the predictions on each node, $1 \le i \le 4$
  Extending $d^{Train}$ ...]
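The overall loop of stages (a) to (c) can be outlined with a small skeleton. It is a sketch under several assumptions: the stop criterion is a fixed number of rounds, the master is chosen by scoring each node's model on that node's own test set, and the callables `local_fit`, `score`, and `feedback` are hypothetical stand-ins for local training, evaluation, and user correction.

```python
from typing import Callable, Dict, List, Tuple

Weights = List[float]


def hbsl_loop(weights: Dict[str, Weights],
              train_sets: Dict[str, list],
              test_sets: Dict[str, list],
              local_fit: Callable[[Weights, list], Weights],
              score: Callable[[Weights, list], float],
              feedback: Callable[[Weights, list], list],
              rounds: int) -> Tuple[Dict[str, Weights], Dict[str, list]]:
    """Run local learning, model updating, and human feedback for `rounds` rounds."""
    train_sets = {n: list(d) for n, d in train_sets.items()}  # do not mutate input
    for _ in range(rounds):
        # (a) Local Learning: every node trains on its own data only.
        weights = {n: local_fit(weights[n], train_sets[n]) for n in weights}
        # (b) Model Updating: the best-scoring node acts as master and
        #     averages the parameters of all nodes.
        master = max(weights, key=lambda n: score(weights[n], test_sets[n]))
        dim = len(weights[master])
        merged = [sum(w[j] for w in weights.values()) / len(weights)
                  for j in range(dim)]
        weights = {n: list(merged) for n in weights}
        # (c) Human Feedback: corrected predictions extend each training set.
        for n in weights:
            train_sets[n] = train_sets[n] + feedback(merged, test_sets[n])
    return weights, train_sets


# Toy stand-ins: "training" adds the training-set size to the single parameter.
final_w, final_train = hbsl_loop(
    weights={"a": [0.0], "b": [0.0]},
    train_sets={"a": [1], "b": [1, 2]},
    test_sets={"a": [9], "b": [8]},
    local_fit=lambda w, data: [w[0] + len(data)],
    score=lambda w, data: w[0],
    feedback=lambda w, data: data[:1],
    rounds=2,
)
```

Passing the three stages in as callables keeps the skeleton independent of any particular model or feedback interface.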

We validate the effectiveness of the proposed model by detecting fake news on LIAR [25], a standard benchmark dataset for fake news detection. It includes 12,836 real-world short statements collected from a variety of settings such as debates, campaigns, Facebook, Twitter, interviews, and ads. Each statement is labeled with one of six grades of truthfulness, namely true, false, half-true, pants-fire, barely-true, and mostly-true. We reorganize the data into two classes by treating the five classes false, half-true, pants-fire, barely-true, and mostly-true as the Fake class and true as the True class. Fake news detection on this benchmark is therefore converted into a binary classification task.
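The relabeling described above can be sketched as a simple mapping. The label strings below follow the spelling of the public LIAR release (for example `pants-fire`), which is an assumption about the input format; the function name `to_binary` is hypothetical.

```python
LIAR_LABELS = ("true", "mostly-true", "half-true",
               "barely-true", "false", "pants-fire")


def to_binary(label: str) -> str:
    """Map a six-grade LIAR label to the two classes used in this setup."""
    label = label.strip().lower()
    if label not in LIAR_LABELS:
        raise ValueError(f"unknown LIAR label: {label!r}")
    # Only statements rated fully true form the True class; the other
    # five grades are all treated as Fake.
    return "True" if label == "true" else "Fake"


binary = [to_binary(label) for label in LIAR_LABELS]
```

Because five of the six grades map to Fake, the resulting binary task is imbalanced toward the Fake class.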

The key hyper-parameters for training the proposed model are shown in Table I, and we employ the Adam optimizer for training.

We use accuracy to evaluate fake news detection performance on the benchmark, where accuracy is the number of news items detected correctly divided by the total number of news items.
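As a small worked example of this metric, the following sketch counts matching predictions and divides by the total. The function name and the label encoding are illustrative.

```python
from typing import Sequence


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Fraction of news items whose predicted class equals the true class."""
    if not labels or len(predictions) != len(labels):
        raise ValueError("predictions and labels must be non-empty and aligned")
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return correct / len(labels)


# Three of the four predictions match the labels.
acc = accuracy([1, 0, 1, 1], [1, 0, 0, 1])
```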

We validate the proposed method with two different configurations. The first implements decentralized learning with human feedback on four nodes, while the second implements it on eight nodes.

1) Learning on four nodes: Given the limited data available to an individual user in a real application, we distribute a small number of samples (e.g., fewer than 1,000) for inference to the different nodes, as shown in Figure 4 with two classes: Fake and True. Specifically, we assume that all nodes share the same class distribution to simplify the task. Table II details the performance of 5 runs along two dimensions. The first dimension is the accuracy across runs: HBSL consistently performs better than SL in every run. The second dimension is the performance across nodes: HBSL improves detection performance on individual nodes by up to 5%. Taken together, human feedback consistently improves the detection performance of swarm learning over 5 runs on 4 nodes. However, the performance of these nodes varies across runs by up to 7% for SL and 10% for HBSL. This instability is caused by learning on small local data with shallow RNNs on these nodes. Moreover, Figure 5 presents the inference performance of the different nodes during the learning process. With human feedback, the detection performance of HBSL is higher than that of SL, which indicates that human feedback contributes to improved detection performance. In addition, compared to SL, HBSL converges to higher performance for most nodes in the different runs. Finally, although human feedback enlarges the training sets by introducing more samples, it does not slow learning by increasing the number of learning rounds.

2) Learning on eight nodes: We also examine whether HBSL performs better than SL when learning on 8 nodes. As before, we distribute a small number of samples for learning to the different nodes, as shown in Figure 6. In addition to involving more datasets, this experiment has larger differences in sample size between datasets than the 4-node case. For instance, the size difference between node 6 and node 3 is 1,700 (2,000 − 300), while in the 4-node case it is 700 (1,000 − 300) between node 2 and node 3. Moreover, the dataset for node 8 is not balanced. These changes introduce new challenges for fake news detection, such as increased convergence time. Table III compares the testing accuracy for the case with 8 nodes. As in the 4-node case, HBSL outperforms SL on all nodes. However, the improvement is only up to about 2%, which is less than the 5% observed when learning on 4 nodes. In other words, HBSL seems more suitable for learning on a local network with fewer nodes. Moreover, it enhances detection performance in all runs in terms of average accuracy. Figure 7 shows the testing accuracy of the 8 nodes during the learning process for 5 runs. The test accuracy curves vary because the sizes of the data used for fine-tuning the models vary in the loop of learning and inference. In addition, although some nodes perform worse during HBSL learning, they still converge to performance higher than or similar to that of SL. Finally, HBSL needs more learning rounds to reach model convergence because more nodes are involved in updating the models, which reduces the convergence speed.

Fake news detection has attracted a lot of attention as a way to effectively prevent the dissemination of fake news. Traditional fake news detection based on machine learning is classified into three categories, namely content feature based [32], [11], [33], [34], propagation feature based [14], and context feature based [35]. Recently, more dimensions have been explored. The first dimension is to combine these features to enhance detection performance. Monti et al. represented news as a graph containing these three features and utilized graph convolutional networks (GCN) to classify the graph as fake or true [18]. In addition, this approach is resilient to adversarial attacks, since generating adversarial samples on all three features is extremely challenging. Wang et al. used multimodal features to detect fake news by developing event adversarial neural networks, which consist of three components: a multimodal feature extractor, a fake news detector, and an event classifier. The feature extractor recognizes multimodal features in the text and pictures related to the news and passes them as input to the fake news detector and the event classifier. The fake news detector then competes against the event classifier in the adversarial learning process, which extracts the multimodal features that are more useful for fake news detection. The second dimension is to fight against adversarial "fake news" generated by AI techniques. For instance, Fung et al. proposed InfoSurgeon to detect adversarial "fake news" that is represented as a fine-grained knowledge graph of news with external knowledge bases [36]. It utilizes graph neural networks to determine whether the knowledge graph of the news is true or fake. Shu et al. developed FactGen to generate high-quality news by leveraging external facts to enrich the news and improve the consistency between input and output. Moreover, they proposed $FactGen_{def}$ to detect such synthetic fake news with high performance [37].

The third dimension is to reduce the effort of labeling big data for training detection models. For example, Wang et al. proposed WeFEND, a reinforcement learning method that leverages users' reports as weak supervision to enlarge training sets for fake news detection [38]. It is composed of three components: an annotator, a reinforced selector, and a detector. The annotator assigns weak labels to the data, and the reinforced selector chooses weakly labeled data to extend the training sets for fine-tuning the detector.

The fourth dimension is to detect fake news from the perspective of psychology. For example, Karami et al. explored fake news detection by combining psychology and data science. They extracted five features about the motivation for spreading fake news that reflect users' psychology, namely uncertainty, emotions, lack of control, relationship enhancement, and rank, where the first three features are extracted with Linguistic Inquiry and Word Count (LIWC) [39], and the other two are obtained from user behaviors on social media such as retweeting and following. These features differ significantly between users who spread fake news and users who spread true news. Moreover, these features can be combined with content features to improve detection performance [40].

Although these dimensions have broadened and enhanced fake news detection, little work has explored how to introduce human feedback into decentralized fake news detection while preserving user privacy. This paper combines human-in-the-loop techniques with swarm learning to explore decentralized fake news detection.

In this paper, a novel decentralized model is proposed for detecting fake news by combining swarm learning and human-in-the-loop. Learning and inference on local data form a loop that runs until the stop criteria are met. Specifically, human feedback is generated in this loop to extend the training sets and improve detection performance. The proposed model is validated on the LIAR benchmark dataset. Experimental results indicate that the proposed model outperforms swarm learning on fake news detection in a decentralized manner. In the future, we plan to extend this work by designing detection models according to node features.

[1] Processing social media messages in mass emergency: A survey
[2] Fighting misinformation on social media using crowdsourced judgments of news source quality
[3] Breaking news detection and tracking in Twitter
[4] Digital wildfires: Propagation, verification, regulation, and responsible innovation
[5] Misinformation in social media: Definition, manipulation, and detection
[6] Social media and fake news in the 2016 election
[7] A survey of fake news: Fundamental theories, detection methods, and opportunities
[8] Observational study of hydroxychloroquine in hospitalized patients with COVID-19
[9] Automatic deception detection: Methods for finding fake news
[10] Fake news or truth? Using satirical cues to detect potentially misleading news
[11] CSI: A hybrid deep model for fake news detection
[12] Some like it hoax: Automated fake news detection in social networks
[13] Fake news propagation and detection: A sequential model
[14] Hierarchical propagation networks for fake news detection: Investigation and exploitation
[15] Fake news identification on Twitter with hybrid CNN and RNN models
[16] Temporally evolving graph neural network for fake news detection
[17] Adversarial active learning based heterogeneous graph neural network for fake news detection
[18] Fake news detection on social media using geometric deep learning
[19] Semi-supervised learning and graph neural networks for fake news detection
[20] Federated optimization: Distributed machine learning for on-device intelligence
[21] Federated learning
[22] Federated learning: Challenges, methods, and future directions
[23] Swarm learning for decentralized and confidential clinical machine learning
[24] A survey of human-in-the-loop for machine learning
[25] "Liar, liar pants on fire": A new benchmark dataset for fake news detection
[26] Federated learning: Strategies for improving communication efficiency
[27] Journalist-in-the-loop: Continuous learning as a service for rumour analysis
[28] Building natural language interfaces to web APIs
[29] Interactive topic modeling
[30] Learning to summarize from human feedback
[31] When and why does a model fail? A human-in-the-loop error detection framework for sentiment analysis
[32] Fake news detection through multi-perspective speaker profiles
[33] A survey on natural language processing for fake news detection
[34] Fake news detection as natural language inference
[35] Beyond news contents: The role of social context for fake news detection
[36] InfoSurgeon: Cross-media fine-grained information consistency checking for fake news detection
[37] Fact-enhanced synthetic news generation
[38] Weak supervision for fake news detection via reinforcement learning
[39] The psychological meaning of words: LIWC and computerized text analysis methods
[40] Profiling fake news spreaders on social media through psychological and motivational factors