key: cord-0219686-huugpyem
authors: Kong, Quyu; Ram, Rohit; Rizoiu, Marian-Andrei
title: A Toolkit for Analyzing and Visualizing Online Users via Reshare Cascade Modeling
date: 2020-06-11
journal: nan
DOI: nan
sha: 2b40a59106b9b5cbfb848d2e7d8c99ed6f2ad9f8
doc_id: 219686
cord_uid: huugpyem

Modeling online discourse dynamics is a core activity in understanding the spread of information, both offline and online, and emergent online behavior. There is currently a disconnect between the practitioners of online social media analysis - usually social, political and communication scientists - and the accessibility to tools capable of handling large quantities of online data, and examining online users and their behavior. We present two tools,birdspotter and evently, for analyzing online users based on their involvement in retweet cascades. birdspotter provides a toolkit to measure social influence and botnets of Twitter users. While it leverages the multimodal information of tweets, such as text contents, evently augments the user measurement by modeling the temporal dynamics of information diffusions using self-exciting processes. Both tools are designed for users with a wide range of computer expertise and include tutorials and detailed documentation. We illustrate a case study of a topical dataset relating to COVID-19, using both tools for end-to-end analysis of online user behavior.

The dissemination of information and opinion through social media, drives change in our societies today. The existence of "viral" diffusion of information suggests that some users can exert a disproportionate influence on discourse [5, 10] , and that "bad actors" can exploit misinformation campaigns causing societal divisiveness [26] . Consequently, there is a clear need for tools to analyze the dynamics and weaknesses of online discourse systems, and to identify important users based on their activity.

There seems to currently exist a disconnect between the practitioners of online social media analysis (who are most often social and political scientists, journalists or communication scientists) and the existing tools facilitating this analysis. The latter -when they exist -either require extensive programming experience, or make particular unrealistic assumptions about the usage flow. The result is that practitioners carefully curate large social media datasets, which remain underutilized due to the lack of accessible tools. This work aims to fill this gap by proposing a suite of tools, aimed at non-computing experts, to analyze online discussions and users from the view of information reshare cascades.

This work addresses three specific open questions concerning the tools to model reshare cascades and analyze online users. The first open question relates to profiling user botness and influence on previously collected data. The state-of-the-art bot detector, Botometer [15] , can only be accessed through its web APIs and cannot * Both authors contributed equally to this research. produce predictions for users that are no longer accessible, such as suspended accounts. Since bots have a high tendency of being suspended by Twitter, measuring botness a while after collecting data risks missing a large proportion of the bots involved in discussions. Likewise, while there are plenty of research on examining the influence and reputation of online users in the literature [6, 41] , few of these have converted into accessible tools for this task, and even existing tools often require the knowledge of the social graph which is sometimes impossible to capture retrospectively. The question is: can we have a tool that determines users' botness and influence, locally, on existing curated datasets. The second open question relates to modeling reshare cascades. Only individual scripts or packages of proposed models are provided by recent works on information diffusion modeling [36, 44, 55] with disconnected API designs and (potentially complex) environmental setups. A question, therefore, remains open: is there an easily accessible tool that allows comparing multiple self-existing models on real data, while remaining easily accessible to the nonexperts? The third open question relates to describing users based both on their activity dynamics, and how other users react to their content. Informative temporal features of reshare cascades have been explored in prior research [33, 36] , but no existing software can extract such features at the user-level. The question is can we extract reshare cascade features easily with a tool and show their effectiveness in online user analysis.

In this work we address the above-mentioned open questions by introducing an integrated suite of tools: the two packages (evently 1 and birdspotter 2 ) and one visualizer (birdspotter.ml 3 ). Starting from an already collected dataset containing one day-worth of Twitter discussion around COVID-19, we showcase the usage of the tools to analyze the reshare cascades and the online users. Both tools are open-source and available on GitHub 12 , and they both feature extensive documentation and usage tutorials 45 .

We address the first open question by introducing birdspotter, a python3 package designed to measure retrospectively the botness and social influence of Twitter users. For influence estimation, birdspotter implements a recent influence estimation algorithm [42] which relies only on the reshare cascades that users are involved in, and does not require the users' social graph which is prohibitive to obtain. For bot detection, birdspotter extracts a large number of user and activity features from the dataset only - Figure 1 : (a) The birdspotter.ml visualization system: Twitter users are plotted based on their user influence and botness (left panel), and a selected user's profiles and cascade history are shown (right panel). (b) A schematic view of the software suite to analyze reshare cascades and online users, and the functionality and data dependencies between its components birdspotter and evently, designed to examine the static user attributes and dynamic user attributes respectively. The arrows depict the incoming features for birdspotter to compute botness and user influence, and for evently to model reshare cascades.

i.e., it does not require accessing the user's current profile -and deploys a pre-trained XGboost classifier to produce the likelihood of a user being a bot. The tool features easy-to-use Python and R interfaces, and it exposes a simple interface which allows researchers to annotate their own Twitter user collection. This allows fine-tuning predictions to a particular problem or dataset, and even predicting entirely different user attributes.

We address the second question by introducing evently, a R package dedicated to modeling online information reshare cascades using self-exciting point processes. It currently supports fitting and sampling realizations of the Hawkes process and its finite population version HawkesN [44] , using the exponential and power-law decaying kernels, both unmarked and with a continuous event mark. evently exposes a number of functionalities around reshare cascades and online users. For online cascades, it can fit any of its supported models to observed data, and it can sample synthetic cascades from fitted models. It can be used to continue likely unfoldings of partially observed cascades, and compute their expected final popularity. For online users, evently can jointly fit all cascades initiated by the same users and obtains a single model which is descriptive for the user. It also allows to build a large number of dynamic user descriptors, such as the viral score (i.e., the expected size of a cascades posted by the user), and summaries of cascade size, reshare timing and event magnitude.

To answer the third question, we first introduce birdspotter.ml shown in Fig. 1a -a visualizer designed to analyze Twitter users engaged in online discussions, and aim to provide both broad and specific views of the data. Second, we show how the functionalities of birdspotter and evently can be combined to extract user and reshare cascade information from aTwitter dataset surrounding COVID-19 discussions, fit cascades and predict final popularity, and build user features. Finally, we show that two clusters of active users emerge, and that bots are not among the influential users.

The main contributions of this paper are as follows: • evently -a software package dedicated to modeling reshare cascades, and capable of characterizing online users based on the reshare dynamics of the cascades they generate. • birdspotter -a software package designed to detect online bots from retrospective (already collected) data, and to estimate online user influence based on the reshare cascades. • birdspotter.ml -an online visualizer designed to perform exploratory analysis of online Twitter users. • A set of online tutorials showcasing how these tools can be used as an integrated toolkit aimed at non-computer science experts, and an example analysis of discussions around COVID-19.

In this section, we briefly review the theoretical prerequisites concerning modeling reshare cascades using point processes , and estimating reshare influence. Reshare cascades. Both evently and birdspotter analyze the spread of online information in the form of online reshare cascades. A reshare cascade consists of an initial user post and some reshare events of the post by other users. In Twitter, for example, this can happen when users use the retweet functionality. We denote a cascade observed up to time T as H (T ) = {t 0 , t 1 , . . . } where t i ∈ H (T ) are the event times relative to the first event (t 0 = 0). We denote cascades with additional information about eventsdubbed here as event marks -as marked cascades. We use the notation H m (T ) = {(t 0 , m 0 ), (t 1 , m 1 ), . . . }, where each event is a tuple of the event time and the event mark. For example, for retweet cascades, the numbers of followers of a Twitter user are commonly adopted as event marks [36, 55] .

The Hawkes processes. Both our proposed tools model reshare cascades using Hawkes processes [21] -a type of point processes with the self-exciting property, i.e., the occurrence of past events increases the likelihood of future events. The occurrence of events in a Hawkes process is controlled by the event intensity function:

where µ(t) is the background intensity function and ϕ : R + → R + is a kernel function capturing the decaying influence from a historical event. We note that, for reshare cascades, all events are considered to be offspring of the initial event, i.e. there is no background event rate µ(t) = 0. Two widely adopted parametric forms for the kernel function ϕ include the exponential function ϕ EX P (t) = κθe −θ t and the power-law function ϕ P L (t) = κ(t + c) −(1+θ ) . The HawkesN process [44] is a finite-population variant of the Hawkes processes, stemming from the connection of Hawkes with the classical SIR epidemic model [25] . HawkesN assumes a finite N -the maximum number of events in the process -, and modulates the likelihood of future event by the remaining proportion of total population. Its intensity function is defined as:

where N t is the counting process, i.e., the number of events up to time t. Note that N 0 = 1 due to the initial event t 0 = 0. Marked Models. Both evently and birdspotter implement marked versions of the point processes, where the mark is the number of followers that the user emitting the tweet has. This is because the mark of each event governs the number of future events, e.g., a tweet from a largely followed user is likely to attract more retweets. The marked versions of both Hawkes [36] and HawkesN [29] processes are then derived by rescaling the kernel functions with the marks, i.e., ϕ(m, t) = m β ϕ(t); β controls the warping effect of the mark.

Sampling Hawkes realizations are of use when one wants to generate a synthetic reshare cascade given model parameters, or when one wants to continue a partially observed cascade. We apply the rejection-sampling algorithm [37] to simulate events from Hawkes and HawkesN processes. We refer to [43] for the detailed simulation algorithm. The branching factor n * is an important quantity for the Hawkes and HawkesN processes (as discussed in Section 3.1). It is defined as the expected number of events directly spawned by a single event, i.e., n * = ∫ ∞ 0 ϕ(τ )dτ . For the marked variants, an expectation is also taken over the distribution of marks,

is a doubly stochastic formulation of Hawkes processes where the branching factor (dubbed as infectiousness by Zhao et al. [55] ) is a stochastic time-varying function n * (t) estimated from the observed events H m (t). Parameter estimation. We estimate the parameters of Hawkes and HawkesN using the log-likelihood function for point processes [14] :

SEISMIC is fitted using the R package by Zhao et al. [55] . Cascades joint modeling. When analyzing the reshare dynamics of online items (like Youtube videos and news articles) or users, it is desirable to account for the multiple cascades relating to them. Kong et al. [28] proposed to jointly model a group of cascades with a shared Hawkes model by summing the log-likelihood functions of individual cascades. In Section 4, we jointly model the cascades initiated by the same user, and we show that the learned models can be used to separate active Twitter users from bots. Final popularity prediction. The final popularity of a reshare cascade is the total number of events which occurred until the cascade has ended. Predicting the final popularity of a cascade that is still active (i.e., before its end) has been extensively explored in prior works [36, 44, 47, 55] . Given a Hawkes process observed until time T , its expected final size has an analytical solution:

where N T is the number of events observed until time T . While SEISMIC predicts final popularities with Eq. (4), Mishra et al. [36] and Kong et al. [29] apply an additional regression layer over the model parameters to train a final popularity regressor. Viral score v describes a user or an online item, and it is defined as the expected popularity of a newly started cascade relating to the given user or item. It is obtained using the model jointly trained on all observed cascades of the user (item) [45] , and it is defined as

for marked models). User influence estimation. Birdspotter adopts the following definition for user influence, widely used in literature [16, 42, 52] :

is defined as the mean number of reshares generated directly and indirectly by a message posted by u, irrespective if it is an original message or a reshare.

Estimating influence from retweet cascades has the additional difficulty of not observing the branching structure of the diffusion -i.e., the Twitter API attributes all retweets to the original tweet. birdspotter estimates Twitter user influence using only the observed retweet cascade H m (T )

where marks correspond to users' number of followers.

Rizoiu et al. [42] propose a method to estimate user influence in the absence of the branching structure by assuming that retweets arrive following a Hawkes point process [43] . We can quantify the probability that an event v j is generated by an previous event v i as the ratio of the event intensity generated by v i and the total intensity at time t j . Formally, the probability v j retweets v i is

Rizoiu et al. [42] also introduce the pairwise influence score m i j , intuitively defined as the amount of influence that v i exerts over v j either directly (when v j is a direct retweet of v i ) or indirectly (when v j is a retweet of a descendant of v i ): Finally, the influence of v i is φ(v i ) = n k =i m ik , and the influence of a user u is the average of the influences of all of their tweets:

where T (u) is the set of all the tweets emitted by user u.

In this section, we give an overview of the two packages and the visualizer, and we describe their usage, functionalities, and design. Fig. 1b schematically shows the usages and data dependencies between the two packages.

evently is a R package for modeling online reshare cascades -and retweet cascades in particular -using Hawkes processes and their variants. By design, it provides an integrated set of functionalities to enable one to conduct cascade-level or user-level analysis of reshare diffusion. Design. evently is designed around the interactions among three components: data (i.e., reshare cascades), models and diffusion measures. In applications, models can be used to simulate new cascades, and diffusion measures are analyzed with off-the-shelf supervised and unsupervised tools. Table 1 shows available diffusion measures and their corresponding R function calls at cascade and user levels.

For cascade-level analysis, a reshare cascade is usually observed until a certain time T . A chosen model is then fitted on the cascade capturing its temporal dynamics. From the learned model, evently characterizes the cascade with its branching factor (in Section 2). It can also simulate possible future developments of the cascade after time T and, in addition, derive the expectation of all future unfolding (i.e., the final popularity in Eq. (4)).

When performing user-level analysis, cascades are grouped based on the user that initiates them. evently models these cascades jointly, and the resulting fitted model encodes the reshare patterns at the user level. Similarly, new reshare cascades can be simulated from this model, and the viral score denoting the expected popularity of a new cascade from the same user can be derived (Section 2). Other temporal features for the user that can be derived from the group of cascades include 6-point summaries (mean, first/third quarters, median, minimum and maximum values) of cascade sizes, reshare event time intervals and event magnitudes [36] . Implementation. evently contains two core functions in terms of data and models: fit_series fits a model on given cascades; generate_series simulates cascades from a provided model. A model can be indicated by passing an model_type argument to these functions where we use abbreviated strings to denote models. For example, EXP and PL stands for Hawkes processes with an exponential kernel and a power-law kernel respectively, while mEXP and mPL are their marked variants. We refer to the package documentation 4 for a complete table of model abbreviations. Data structure. Cascades are structured as tables (or data.frames in R) where a time column stores event timestamps relative to the first event t 0 and an optional magnitude column holds the corresponding event mark information. The APIs of evently also work with an R list of cascade data.frames assuming these cascades share a same model. Optimization. As mentioned in Section 2, the model parameter estimation is done via AMPL, a modeling language designed to describe and solve large-scale optimization problems. Compared to other optimization tools such as nlopt [23] , which require precomputed or numerical gradients, AMPL provides automatic differentiation of functions leading to model implementation efficiency. Moreover, it is also compatible with a wide range of solvers including two standard non-linear solvers IPOPT [46] and LGO [39] . Installation. Evently can be installed in R directly from Github 1 : remotes::install_github('behavioral-ds/evently'). Upon the first load, it automatically downloads and configures its dependencies AMPL and IPOPT, which if performed manually would involve considerable effort.

birdspotter is a python3 package providing a toolkit to measure the botness and social influence of Twitter users retrospectivelyi.e. on previously collected tweets provided in the standard jsonl or json format. The package is aimed at practitioners who analyze discourse and user activity on social media -such as social scientists and journalists -, while requiring only basic R or python experience and no statistical modeling knowledge. Measure influence and botness. birdspotter measures user influence as outlined in Section 2, using by default a marked Hawkes exponential kernel with parameters β = 1, κ = 1 θ and θ = 6.8×10 −4 . These were tuned on a large collection of real cascades [42] , and can be customized using the function getInfluenceScores().

Internally, birdspotter leverages an XGboost classifier [8] trained on a large tweet dataset of labeled bots. We construct user features in three categories: classic, semantic, and topic-based features. The classic features include user features (such as the follower-tofollowee ratio, friends count and years on twitter), activity features (such as retweet count, statuses rate, and favorite count) and lexical features (such as number of characters in tweets, number of punctuation marks, and number of mentions). The semantic features are constructed from word2vec [35] embeddings of users' tweets and their user descriptions. By default, birdspotter automatically downloads and uses the GloVe 300d Twitter embeddings [38] , however customized embeddings can easily be used. Finally, the topic-based features are constructed from the term frequencyinverse document frequency (TF-IDF) [24] of the hashtags of the tweets/retweets which users participate in. Usage and functionalities. Given a dataset of tweets collected externally, leveraging the Twitter API, birdspotter's core functionality revolves around two steps. First, it loads the Twitter dataset, 1 birdspotter <import('birdspotter') 2 bs <-birdspotter$BirdSpotter('corona_2020_01_31.jsonl') 3 cascades <-bs$getCascadesDataFrame() 4 # user information can be obtained by 5 labeled_users_botness <-bs$getBotness() 6 labeled_users_influence <-bs$getInfluenceScores() extracts retweet cascades, and compiles the user-level information.

In the second step, it performs the influence and botness analysis. The former is achieved by simply invoking the BirdSpotter constructor, while the latter is achieved by calling the function getLabeledUsers() which returns a table with the user features detailed above. For every observed cascade, birdspotter computes the most likely branching structure (see discussion in Section 2 about how Twitter attributes all retweets to the original tweet). This can be achieved using the function getCascadesDataFrame(), which returns the reshare cascades in the same format as evently with the additional column expected_parent which indicates for a tweet its most likely parent.

For power users, birdspotter provides a number of robust configurations -such as changing the parameters of the Hawkes kernel in Eq. (5) or using user-defined word embeddings -documented using its readthedocs 5 documentation. A usage tutorial is available on birdspotter's repository 2 . For users who prefer to analyze the results outside python, birdspotter can dump the user table and the reshare cascades in Comma Separated Values (CSV) files, that can be loaded in outside tools. All birdspotter functionalities can be accessed in R via reticulate [1] .

Birdspotter's pre-trained XGBoost classifier can be customized to a particular application or dataset through labeling and re-training. The function getBotAnnotationTemplate() outputs a CSV that can be annotated by the user, and trainClassifierModel() retrains the classifier with this annotated data. An option controls whether the detection parameter is further tuned starting from the existing model (useful for adapting both detection to a particular dataset) or retrained from scratch. We exemplify this in Section 4. Data Structures. birdspotter's main class, called BirdSpotter, is used to access methods and attributes. birdspotter uses pandas dataframes to store extracted data and perform analysis. In particular, three dataframes are made accessible, through the main object after tweets are extracted: featuresDataframe (users and their extracted features), cascadeDataframe (tweets and cascade information), and hashtagDataframe (TD-IDF of hashtags used by users in tweets text). Installation. birdspotter installs in the canonical python way: pip install birdspotter.

birdspotter.ml 3 is a visualizer built on top of birdspotter, and designed to analyze Twitter users engaged in online discussions.

The visualisation provides both broad and specific views of the data, via the three components shown in Fig. 1a: a scatter 

In this section, we apply birdspotter and evently to a dataset related to online discussions about the COVID-19 [7] and demonstrate individual functionalities with code snippets and the outputs. All steps and results presented here are reproducible which can be accessed via an online Rmarkdown notebook 6 . We have also pre-loaded a sample of the dataset into the birdspotter.ml public installation 3 . Dataset. We use the dataset of tweets concerning the novel coronavirus COVID-19 pandemic in 2020. Chen et al. [7] collected the tweet IDs by subscribing to TwitterâĂŹs streaming API with a set of manually selected accounts and keywords (refer to their online repository 7 for a full list). We limit our dataset to tweets posted on January the 31st. The dataset is provided as a list of tweet IDs which require re-hydration with tools like twarc. As deleted tweets cannot be recovered, we obtain 68.8% of the original dataset. Import from raw data. Importing the COVID-19 dataset, and extracting user and cascade information can be achieved using birdspotter in just five lines of code (see Fig. 2a, lines 1-6) . including "ghost" tweets, i.e., tweets posted before Jan 31st, which spawned retweet cascades during the studied period. Our dataset contains 1,566,328 unique tweets (71,940 of which are "ghost" tweets) posted by 919,176 unique users. Birdspotter extracts 423,443 cascades, started by 280,336 users. Dataset profiling. We perform an exploratory data analysis of the dataset. In line with prior literature, we observe that cascade size is long-tail distributed, with a mean of 3.7 events per cascade. The cascade duration shows a more complex distribution, and an average cascade duration of 18.7 hours. Note that due to space constraints, the cascade size and duration distributions are only shown in the accompanying online tutorial. Fig. 2b shows the empirical distributions of user botness, influence and activity levels -the latter being defined as the number of retweet cascades that a user participates in. Botness has a mean of 0.27, and its distribution shows two maxima: a larger one around 0.17 showing the most participants in the discussions are likely human, and a smaller one towards high botness values, indicating that a non-trivial number of bots participated in the COVID-19 discussions. In line with prior work [42] , both the influence and the activity level are long-tailed distributed -they appear as lines on a log-log CCDF plot -, which is expected as most popularity measures follow the "rick-get-richer" paradigm. While most users participate in only one cascade, the maximum participation is 1,348 by a self-proclaimed bot @viriyabot (with botness 0.634).

(Re-)Labeling Users. Upon closer examination, we observe that the users @DumplingSays, @eddfuentess, and @marat_dospolovwith botness scores of 0.873, 0.83, and 0.925 respectively-are in fact likely humans. Birdspotter allows to correct such false positives (and false negatives) by fine-tuning the model on additional labeled data. We use the getAnnotationTemplate function to obtain a template for manual annotation of all users in the data set (see Fig. 2a, line 8) . We label the three users above as human, we remove the other rows, and we use trainClassifier (with parameter update set to True, see Fig. 2a , line 10) to continue training the bot detection classifier. In the end, the new botness scores for the above users are 0.375, 0.296, and 0.559, respectively.

In theory, practitioners can use birdspotter as a general purpose user classifier by retraining the classifier with any latent user attribute (of a type supported by XGboost). Fit observed reshare cascades efficiently. evently fits Hawkes processes efficiently by leveraging the AMPL interface with a range of model choices. Fig. 3a depicts an example where we apply marked Hawkes processes with the power-law kernel function to jointly fit the cascades of two randomly selected Twitter users: @BobOngHugots (account posting quotes from a Filipino author) and @Jaefans_Global (account of a K-pop singer), respectively. We employ the function fit_series from evently and obtain the fitted models. The learned kernel functions for the two users are plotted at lines 7-8, and shown in the lower panel. We observe that, on average, tweets posted by @BobOngHugots have an initial higher intensity but demonstrate a faster decay trend in followers' memory compared to @Jaefans_Global. On the other hand, tweets from @Jaefans_Global tend to influence followers for a longer period. Simulate processes with a range of models. With a given model, evently allows to sample entire synthetic new cascades, or continue partially observed cascades. For instance, in line 1-3 in Fig. 3b we use evently to simulate a hypothetical cascade started by @BobOngHugots using the model obtained in Fig. 3a . The lower panel plots the simulated cascade which contains 21 reshare events which is a pretty large cascade given that @BobOngHugots's viral score is 7.40). In another example at Fig. 3c, line 1-3 , we partially observe a real cascade from @BobOngHugots, and we use evently to continue the cascade unfolding via simulation. 25 new events are spawned following the observed history (line 2-6). Compute popularity measures. The above-mentioned procedure outputs just one possible ending for a given cascade. Using evently we can compute the cascade's popularity, i.e. the expected cascade size over all possible unfolding. At line 7-13, we obtain the expected final popularity with two methods: a marked powerlaw Hawkes process and the SEISMIC model, which output final sizes values of 458.3028 and 729.923, respectively. We note that the true final popularity of the cascade is 472 obtained by checking the retweets within following 10 days (1st Feb to 10th Feb). Another two diffusion measures are computed in the example: @BobOngHugots's branching factor, and their viral score. Visualize users in a latent space. The aforementioned applications provide methods to study individual user, however it might be desirable to analyze users in relation to each other. We leverage the temporal features from evently and user metrics from birdspotter to create a visualization of the users in the dataset. We select the top 300 users who initiated the most number of cascades in the dataset. For these users, using birdspotter we obtain their influence and botness as shown in Fig. 2a , and using evently we build their temporal features using the function generate_features. In Fig. 4 , we apply the state-of-the-art dimension reduction tool t-SNE [32] to build a two-dimensional space from the higher dimensional space of the temporal features. Finally, we label as bots the users with a botness score higher than 0.6 [42] , and we color them based on their user influence scores.

From Fig. 4 , we observe two obvious clusters that divide less influential users (top-right corner) from high influence users (bottomleft corner). Noticeably, most users who are classified as bots group at the top-right corner, i.e., the less influential side. On the contrary, users with high influence scores are less likely to be bots.

We structure the discussion about the related work into two sections: tools for modeling reshare cascades using Hawkes processes, and tools for detecting Twitter bots and quantifying user influence. Tools for cascade modeling with Hawkes processes. Most prior works provide the proposed models as scripts mainly for reproducing experimental results [4, 29, 36, 44] . Zhao et al. [55] ship their model in an R package as a accessible tool for predicting the final popularity of retweet cascade. Unlike the above which are mostly developed for demonstration purposes, evently is designed a extendible, multi-purpose, unified set of APIs for modeling with different Hawkes process variants.

There exist other tools that implement multiple models, and which emphasize specific aspects such as language-specific implementations (e.g., THAP [50] in matlab, PoPPy [49] in PyTorch), or network Hawkes models (pyhawkes [31] ). Among these, a Python package, tick [2] , has the most active community and supplies a comprehensive set of models and helper functions for general timedependent modeling including Hawkes processes. evently differs from these toolboxes in two ways: it is a Hawkes process toolbox in the native R language with limited dependencies; it is designed with an emphasis on online information diffusion modeling. Tools for detecting Twitter bots. Bot detection is known to be a hard problem. Human annotated bot datasets are often noisy, as even humans struggle to detect bots. Solutions have been proposed to detect bot activity in real-time [9] , or to identify potential bot campaigns on Twitter [22] . However, these methods are designed to analyze a discussion landscape as a whole, and can not determine if a particular user is a bot. Bot detectors are usually trained on human-labeled datasets, and they usually rely on NLP approaches [12, 27] , deep-learning approaches [30] , feature-engineering [11, 15, 51] and other methods [ 19, 34] . The de-facto bot detection tool in the social science community has become Botometer (formerly BotOrNot) [15] , which uses more than 1000 user-and recent activity-related features to train a classifier. Botometer performance is known to deteriorate over time [40] , and performs poorer non-English Twitter accounts. Furthermore, Botometer is provided as an online API and requires access to the Twitter API, which enforces rate limits and precludes the detection of bots in large Twitter datasets. But most importantly, Botometer cannot output predictions on Twitter dumps collected in the past (like used in many work [3, 18, 48] ).

Birdspotter addresses the above-stated shortcomings by producing bot predictions on already collected Twitter dumps, and by exposing a simple interface to allows researchers to annotate their own Twitter user collection.

Tools for quantifying online influence. There are many features used to score the influence, reputation or popularity of online users. We delineate these into three areas: those using static user attributes (including lexical features and information on a user's profile) [13] , those that analyze the online social graph (e.g. degree, PageRank, HITs, etc.) [6, 41] , and those modeling information diffusion [54] . However, few of these have translated into accessible tools for the non-experts in the field. For instance, Cossu et al. [13] provide a set of scripts to perform their influence measurement method. Other tools, like ConTinEst [16, 20] , require knowledge of the social graph (which is often prohibitively expensive to obtain) on which it performs random walks (which are very slow on large social graphs). Birdspotter estimates user influence from reshare dynamics, in the absence of knowledge about the social graph, and provides an end-to-end tool to analyze Twitter users.

In this work, we present a toolkit for analyzing Twitter users with an emphasis on their involvement in online information diffusions. The toolkit consists in two packages, birdspotter and evently, and a visualizer birdspotter.ml. First, we provide the theoretical background information. Then we give an overview of the toolkit. While birdspotter examines users regarding their bot-like behavior and social influence to followers, evently models the reshare cascades initiated by users. Lastly, given a dataset of tweets around COVID-19, we demonstrate the applications of the proposed tools. Future work. The future development plan of evently mainly involves the incorporation of more models and optimization algorithms, such as a non-parametric Hawkes process and a variational inference algorithm proposed in recent literature [53] . Moreover, we plan to integrate evently with additional optimization tools. A project [17] established recently brings the automatic differentiation to R language. As AMPL requires a dedicated model language, this project enables evently to define models in native R language and maintain the optimization efficiency. birdspotter will be extended to incorporate further diffusion-based influence measures and allow extensible features for bot detection. Furthermore, birdspotter.ml will increase the datasets that it hosts, as more social media topics are discovered.

reticulate: R Interface to Python

Tick: a Python library for statistical learning, with an emphasis on hawkes processes and time-dependent models

Social bots distort the 2016 US Presidential election online discussion

Deephawkes: Bridging the gap between prediction and understanding of information cascades

Sandy" Pentland. 2016. Beyond viral

Measuring user influence in twitter: The million follower fallacy

Tracking Social Media Discourse About the COVID-19 Pandemic: Development of a Public Coronavirus Twitter Data Set

Xgboost: A scalable tree boosting system

An unsupervised approach to detect spam campaigns that use botnets on Twitter

Do Cascades Recur?

Blog or block: Detecting blog bots through behavioral biometrics

Sifting robotic from organic text: a natural language approach for detecting automation on Twitter

A review of features for the discrimination of twitter users: Application to the prediction of offline influence

Conditional Intensities and Likelihoods

Botornot: A system to evaluate social bots

Scalable Influence Estimation in Continuous-Time Diffusion Networks

Tensors and Neural Networks with 'GPU' Acceleration

2020. # covid-19 on twitter: Bots, conspiracies, and social media activism. arXiv

The rise of social bots

Influence Estimation and Maximization in Continuous-Time Diffusion Networks

Spectra of some self-exciting and mutually exciting point processes

Bot-Slayer: real-time detection of bot amplification on Twitter

The NLopt nonlinear-optimization package

A statistical interpretation of term specificity and its application in retrieval

A Contribution to the Mathematical Theory of Epidemics

Analysing user identity via time-sensitive semantic edit distance (t-SED): a case study of Russian trolls on Twitter

Language-Agnostic Twitter-Bot Detection

Exploiting Uncertainty in Popularity Prediction of Information Diffusion Cascades Using Self-exciting Point Processes

Modeling Information Cascades with Self-exciting Processes via Generalized Epidemic Models

Deep neural networks for bot detection

Discovering latent network structure in point process data

Visualizing data using t-SNE

Exploring limits to prediction in complex social systems

RTbust: exploiting temporal patterns for botnet detection on twitter

Efficient estimation of word representations in vector space. arXiv

Feature Driven and Point Process Approaches for Popularity Prediction

On Lewis' simulation method for point processes

Glove: Global vectors for word representation

Nonlinear optimization with GAMS/LGO

The False positive problem of automatic bot detection in social science research

Measuring user influence on Twitter: A survey. Information processing & management

# DebateNight: The Role and Influence of Socialbots on Twitter During the 1st 2016 US Presidential Debate

A Tutorial on Hawkes Processes for Events in Social Media

SIR-Hawkes: on the Relationship Between Epidemic Models and Hawkes Point Processes

Online Popularity Under Promotion: Viral Potential, Forecasting, and the Economics of Time

On the Implementation of a Primal-Dual Interior Point Filter Line Search Algorithm for Large-Scale Nonlinear Programming

Retweet wars: Tweet popularity prediction via dynamic multimodal regression

Bots in the Twittersphere

PoPPy: A Point Process Toolbox Based on PyTorch. arXiv

THAP: A matlab toolkit for learning with Hawkes processes

Uncovering social network sybils in the wild

RedQueen: An Online Algorithm for Smart Broadcasting in Social Networks

Variational Inference for Sparse Gaussian Process Modulated Hawkes Process

Learning Influence Probabilities and Modelling Influence Diffusion in Twitter

SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity