key: cord-0329249-mr9bd8ny
authors: Mokhberian, Negar; Abeliuk, Andrés; Cummings, Patrick; Lerman, Kristina
title: Moral Framing and Ideological Bias of News
date: 2020-09-27
DOI: 10.1007/978-3-030-60975-7_16
sha: de196985c8284d54ef9db4cbfbbe830d3a028cc2
doc_id: 329249
cord_uid: mr9bd8ny

News outlets are a primary source for many people to learn what is going on in the world. However, when covering the same news story, outlets with different political slants often emphasize different aspects and frame their language differently. This framing implicitly reveals their biases and also affects the reader's opinion and understanding. Understanding the framing of news stories is therefore fundamental to recognizing the view a writer conveys with each story. In this paper, we describe methods for characterizing moral frames in the news. We capture the frames based on Moral Foundations Theory, a psychological theory which proposes that morality and moral opinions can be summarized along five main dimensions. We propose an unsupervised method that extracts the framing Bias and the framing Intensity without requiring any external framing annotations. We validate its performance on an annotated Twitter dataset and then use it to quantify the framing bias and partisanship of news.

A growing share of Americans receives vital information and news from online sources [8]. In recent years, however, the online information environment has grown increasingly polarized along political and ideological lines [21]. Anecdotal evidence suggests that the shift to online news consumption has accelerated during the uncertainty and the 'fog of war' created by the Covid-19 pandemic 1, accompanied by growing ideological polarization, which colors how information about the pandemic is produced and consumed [24]. These converging developments suggest a need for accurate methods to quantify the ideological bias of news sources. The resulting metrics could help the public become more responsible consumers of news by cutting through polarization.

Measuring ideological biases from text is a problem that has only recently attracted attention from the Natural Language Processing (NLP) community. Beyond identifying topics in text, this research attempts to uncover how implicit biases manifest themselves through language and how to infer them from text. Research has shown that language captures implicit associations that shape how people perceive the world and the relationships within it, including gender and career representations that lead people to associate women with artistic careers and men with scientific careers [3]. In cognitive linguistics and communication theory, the different perspectives and meanings that texts carry are captured by "semantic frames". When applied to news media, framing captures how news stories express meaning and cultural values and how they evoke an emotional reaction from readers. Media framing has been studied as a tool for influencing the opinions of news readers [18]. Political scientists have characterized the framing of text using Moral Foundations Theory (MFT) [11]. MFT defines five moral foundations which categorize intuitive ethics and automatic emotional reactions to various social situations.
These foundations concern dislike for the suffering of others (care/harm), dislike of cheating (fairness/cheating), group loyalty (loyalty/betrayal), respect for authority and tradition (authority/subversion), and concerns with purity and contamination (sanctity/degradation). Across cultures, assessments of moral foundations reveal correlations between political ideology and the five foundations [10], as well as interesting relationships between gender, culture, religion, and personality. For example, consistently and across studies, liberals and conservatives draw upon these foundations to different degrees, as measured via questionnaires [9]: liberals assign more weight to the Harm and Fairness foundations, whereas conservatives are more sensitive to the Ingroup, Authority, and Purity foundations.

We use MFT as a generalizable framework to quantify the framing Bias in the news. The five moral foundations (care/harm, fairness/cheating, loyalty/betrayal, authority/subversion, sanctity/degradation) give a high-level understanding of the values promoted by news sources, which we use to quantify the biases inherent in how the news is framed by different sources.

In this paper, we propose a framework to quantify the moral framing of texts. The framework leverages a large corpus of tweets annotated with their moral foundations [12]. We use sequence embeddings as features to train a classifier that predicts the scores of a text along the moral dimensions. Additionally, instead of using the embeddings directly, we evaluate a low-dimensional feature representation of the text based on the Frame Axis approach [16]. The Frame Axis method computes the Bias and Intensity of each moral frame based on the alignment of the text with specific target words. We show that these simple features capture the moral frames implicit in the text at least as well as much more complex (high-dimensional) representations. Finally, we show that moral frames help predict the partisanship of news based on headlines. Our work demonstrates the feasibility of automatically classifying the moral framing and political partisanship of news.

Moral Foundations Theory. Moral Foundations Theory (MFT) was first introduced by Haidt and Joseph to explain moral differences across cultures [11]. The theory introduces five basic moral foundations which underlie many intuitive and cultural human psychological values: Care/Harm, Fairness/Cheating, Loyalty/Betrayal, Authority/Subversion, and Purity/Degradation. Later, Graham and Haidt showed that liberals and conservatives vary substantially in their moral concerns across these five Moral Foundations [9]. This finding triggered subsequent work analyzing political rhetoric based on MFT and characterizing political ideologies using the five Moral Foundations. Beyond the psychology domain, other works have considered MFT through computational linguistic approaches, mostly using the lexical resource Moral Foundation Dictionary (MFD) [9]. This dictionary consists of words describing the virtues and vices of the five Moral Foundation dimensions, plus a sixth dimension of general morality terms. These studies analyze texts based on their usage of MFD terms.
For example, MFD terms have been used to analyze temporal changes in the frequency of MFT terms in English books for the years 1900 to 2007 [25], and to demonstrate changes in Moral Foundation language around the word 'gay' in US Senate speeches from 1988 to 2012 [7]. The latter study showed that Republicans used significantly more Purity words than Democrats. Recently, novel methods have been developed to recognize whether a text is relevant to any of the MFT dimensions. This problem has been formulated as a classification problem on tweets in [7, 14] and on news in [6].

NLP. Some of the previous work on analyzing MFT in text corpora has relied on word counts [6]. Other studies [7, 15] have used features based on word embeddings [20] or sequence embeddings [5] to obtain more robust models and better performance. Recent work has proposed methods for analyzing rhetorical frames in text. In SemAxis [1], the authors introduce semantic axes: word-level semantic dimensions structured on word antonym pairs. The similarity of a word to different predefined antonymous axes can capture the semantics of the word in various contexts (e.g., the word 'soft' can be negative in the context of sports and positive in the context of toys). The word and the semantic axis are represented in the same vector space, trained on a corpus. The authors of SemAxis later introduced the concept of "Frame Axis", a method for characterizing the framing of a text by identifying its most relevant semantic axes [16]. Similarly, other studies have explored using groups of words with opposite meanings to define semantic dimensions that improve the interpretability of text representations [19].

In this section, we describe our computational framework for measuring the framing bias of news sources based on the text of their headlines. The framework is composed of two tasks: (1) an unsupervised method that learns a low-dimensional representation characterizing the text with respect to a set of target words; (2) a supervised classification task that uses contextualized language representation models and Twitter data with human-annotated MF framing. In the first task, we represent the text with scores along the moral dimensions, computing framing Bias and Intensity scores for each MF using the Frame Axis method [16]. This approach projects the words onto micro-frame dimensions characterized by two sets of opposing words. These MF framing scores capture the ideological slant of the news source and can be used as features for predicting partisanship. Details are given in Section 3.2. In the second task, we leverage the annotated Twitter dataset [12] to develop a supervised model that classifies the MFT-related micro-frames in news headlines (see Section 3.3). In Section 4.1, we validate the accuracy of different latent representations of the text on the annotated Twitter dataset. Finally, in Section 4.3 we present a case study applying our framework to analyze the moral framing differences between liberal and conservative media in US news sources.

News Articles. We use the "All the News" dataset from Kaggle 2. The data primarily spans 2016 through July 2017. The news sources include the New York Times, Breitbart, CNN, Business Insider, the Atlantic, Fox News, Buzzfeed News, National Review, New York Post, the Guardian, NPR, Reuters, Vox, and the Washington Post. For each news story, we work with its headline and publication (the news outlet). We also looked up the political leaning of each source on the AllSides 3 website and added a column indicating the political side of each article. We eliminated news stories from centrist sources, keeping only liberal- and conservative-leaning outlets. We then narrowed the headlines down to those related to the immigration and elections topics by checking whether each headline contained hand-picked words for that topic (a minimal sketch of this filter is given below). Among several topics, we chose these two because they contained the largest number of articles. After the above steps, the data consisted of 3,242 news articles about immigration and 29,345 about elections.
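To make the topic filter concrete, here is a minimal sketch of the keyword-based selection. The keyword lists, file name, and column names below are illustrative placeholders, since the paper does not publish its hand-picked word lists.

```python
# Sketch of the keyword-based topic filter; keyword lists are illustrative
# placeholders, not the paper's actual hand-picked lists.
import pandas as pd

TOPIC_KEYWORDS = {
    "immigration": ["immigration", "immigrant", "border", "refugee", "visa"],
    "elections":   ["election", "ballot", "voter", "campaign", "primary"],
}

def tag_topic(headline):
    """Return the first topic whose keywords occur in the headline, else None."""
    text = headline.lower()
    for topic, words in TOPIC_KEYWORDS.items():
        if any(w in text for w in words):
            return topic
    return None

articles = pd.read_csv("all-the-news.csv")              # assumed file name
articles["topic"] = articles["title"].fillna("").map(tag_topic)
articles = articles.dropna(subset=["topic"])            # keep the two topics only
```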
Annotated MF Data. We use the annotated Twitter dataset of [12], in which annotators labeled the MF dimensions relevant to each tweet (the virtues and vices of the five MFT dimensions, plus a non-moral dimension). Each annotator can assign more than one MF dimension to a single tweet. To aggregate the votes for a single tweet, we assign 1 to the dimensions having at least two votes and 0 to the dimensions with fewer than two votes. The dataset provides 35k tweet ids, with at least three annotators per tweet.

For the unsupervised method, we quantify the strength of the moral framing of text along the dimensions of MFT using the Frame Axis approach [16]. Frame Axis proposes two measures, Intensity and Bias, to capture document-level framing based on word contributions [16]. Intensity and Bias for a text are calculated as the weighted average of the mappings of its words onto the desired semantic axis. Each semantic axis (also called a micro-frame) is built on a set of antonyms, i.e., words with opposite meanings [1]. In our case, we choose the vices and virtues of the Moral Foundations as the opposite poles of a word axis, e.g., Care terms vs. Harm terms from the MFT. Some of these words are shown in Table 1. For each moral foundation (MF) dimension, the axis is calculated by subtracting the average embedding of the negative words (vices) from the average embedding of the positive words (virtues) of that MF dimension. Formally, let m be one of the MF dimensions (e.g., Care), let $V^+_m$ denote the set of embedding vectors of virtue words (e.g., vectors of Care words), and let $V^-_m$ denote the set of vectors of vice words (e.g., vectors of Harm words); the semantic axis corresponding to this MF dimension is:

$$A_m = \frac{1}{|V^+_m|}\sum_{v \in V^+_m} v \;-\; \frac{1}{|V^-_m|}\sum_{v \in V^-_m} v \quad (1)$$

For these computations, the word embeddings are obtained from the pre-trained GloVe model [20] called "Common Crawl", which includes a 2.2M-word vocabulary 4. Following [16], we define the framing Bias $B^D_m$ of a document $D$ along a semantic axis $m$ as the frequency-weighted average similarity of its words to that axis:

$$B^D_m = \frac{\sum_{w \in D} n_w \cos(v_w, A_m)}{\sum_{w \in D} n_w} \quad (2)$$

where $n_w$ is the number of occurrences of word $w$ in $D$ and $v_w$ is its embedding vector. Second, we use the framing Intensity on a moral dimension to capture how heavily that moral dimension appears in the document with respect to the background distribution:

$$I^D_m = \frac{\sum_{w \in D} n_w \left(\cos(v_w, A_m) - B^T_m\right)^2}{\sum_{w \in D} n_w} \quad (3)$$

where $B^T_m$ is the baseline framing Bias of the entire text corpus $T$ on moral dimension $m$. Intensity does not reveal the polarity of the framing. However, when both poles of an axis appear actively in a text, the positive and negative terms cancel each other out: the document will not have a significant Bias toward either pole of that axis, but the Intensity will still show its relevance to that axis. As a result, each document can be represented by 12 features: the Bias and Intensity scores for each of the 6 MF dimensions in Table 1.
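As a concrete reference, the following is a minimal sketch of Equations (1)-(3), assuming a dictionary `glove` that maps words to pre-trained GloVe vectors and per-foundation virtue/vice word lists (e.g., drawn from the MFD); all function and variable names are our own.

```python
# Minimal sketch of the Frame Axis scores, under the assumptions above.
import numpy as np

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def semantic_axis(glove, virtues, vices):
    """Eq. (1): mean virtue embedding minus mean vice embedding."""
    v_plus = np.mean([glove[w] for w in virtues if w in glove], axis=0)
    v_minus = np.mean([glove[w] for w in vices if w in glove], axis=0)
    return v_plus - v_minus

def framing_bias(tokens, glove, axis):
    """Eq. (2): frequency-weighted mean word-axis similarity.

    Passing the document's full token list weights each word by its
    number of occurrences n_w automatically.
    """
    sims = [cosine(glove[w], axis) for w in tokens if w in glove]
    return float(np.mean(sims)) if sims else 0.0

def framing_intensity(tokens, glove, axis, corpus_bias):
    """Eq. (3): second moment of word-axis similarity around the corpus baseline."""
    devs = [(cosine(glove[w], axis) - corpus_bias) ** 2
            for w in tokens if w in glove]
    return float(np.mean(devs)) if devs else 0.0
```

Concatenating the Bias and Intensity scores over the six axes yields the 12-dimensional document representation described above.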
In the supervised approach, we leverage the corpus of tweets manually annotated with their moral foundations as training data to learn a classifier of MF frames from text. We train a binary classifier on the Twitter dataset to learn the relevance of each Moral Foundation. Specifically, each MF is treated as a label that can take the value 0 or 1, and for each MF we train a separate binary Logistic Regression classifier. To create text features, we use a contextualized sequence encoding method, Bidirectional Encoder Representations from Transformers (BERT) [5], to obtain an embedding for each tweet: each tweet is encoded as a 768-dimensional vector using a pre-trained BERT model. In the inference phase, we convert the text in the test dataset into the same feature space, and each classifier outputs a likelihood indicating how strongly the given text relates to the corresponding moral foundation.

First, we evaluate the ability of the two latent representations to measure MF frames from text on the annotated Twitter dataset. We compare the classification performance of the tweet BERT embeddings to that of the Frame Axis features on the annotated tweets, running the classification task repeatedly on random train/test splits. The results in Table 2 show that both approaches dramatically outperform the baseline, which predicts each moral foundation according to its frequency distribution in the training set. They also outperform the method of [12], which reported F1 < 0.5 on a subset of the Twitter dataset. Remarkably, Frame Axis achieves performance comparable to the embedding-based approach using only two features (Bias and Intensity) per moral dimension.
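A minimal sketch of this supervised pipeline follows. The `bert-base-uncased` checkpoint, [CLS]-token pooling, and all names are our assumptions; the paper only specifies a 768-dimensional pre-trained BERT encoding and a Logistic Regression classifier per foundation.

```python
# Sketch: frozen pre-trained BERT features plus one binary
# Logistic Regression per moral foundation (assumptions noted above).
import torch
from sklearn.linear_model import LogisticRegression
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")

def embed(texts):
    """Encode each text as the 768-dim [CLS] vector of a frozen BERT."""
    with torch.no_grad():
        batch = tokenizer(list(texts), padding=True, truncation=True,
                          return_tensors="pt")
        return encoder(**batch).last_hidden_state[:, 0, :].numpy()

# Illustrative label names for the five foundations plus non-moral.
MF_LABELS = ["care", "fairness", "loyalty", "authority", "purity", "non-moral"]

def train_mf_classifiers(tweets, labels):
    """One independent 0/1 classifier per MF; `labels` maps MF -> binary array."""
    X = embed(tweets)
    return {mf: LogisticRegression(max_iter=1000).fit(X, labels[mf])
            for mf in MF_LABELS}

def mf_likelihoods(classifiers, texts):
    """Per-foundation relevance likelihoods for unseen texts (e.g., headlines)."""
    X = embed(texts)
    return {mf: clf.predict_proba(X)[:, 1] for mf, clf in classifiers.items()}
```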
In this section, we explore the empirical correlations among the different MF frames learned from the data. We take inspiration from previous work studying such correlations using surveys based on the Moral Foundations Questionnaire [9]. We compute the correlation matrix in two ways: (1) based on the original hand-annotated MF frames in the Twitter dataset; and (2) based on the frames inferred by our supervised method. With the latter, we can infer the MF frame correlation matrix both for the Twitter data and for the news articles corpus. To evaluate the method, we compare the inferred and hand-annotated correlations of MF frames. Finally, we explore the correlations learned from the news headlines.

The annotated Twitter dataset contains the number of human annotators who selected each tweet as relevant to an MF frame. We can compare the correlation matrix of these annotations to that of the likelihoods produced by the classification task for each MF frame. Figure 1 shows that the correlations between moral foundation dimensions in the raw annotations (Fig. 1a) are very similar to the correlations between the MF likelihoods predicted by the supervised classifier on the same Twitter data (Fig. 1b). Even though we use a distinct classifier for each MF, and no classifier sees the other labels, the correlations in (Fig. 1b) remain comparable to those in (Fig. 1a), showing that the automatically detected frames are consistent with human judgment. For example, as in the original Twitter dataset, the correlations between the non-moral dimension and the moral foundations stay negative, which makes sense: if a text shows no MF frame, its highest score should be the non-moral label, and all the other frame scores should be low. However, when we use the supervised classifier to compute MF frame scores on the news headlines, we see different correlations between the moral dimensions (Fig. 1c). Here, Purity is more correlated with the Care/Harm dimension, and Ingroup is more correlated with Fairness and Authority than in the Twitter data.

Predicting Partisanship of News. Using moral frames as features, we classify the partisanship of news headlines. We chose immigration and elections as the two categories of news for our experiments. Our dependent variable is the partisanship of the news source (Liberal or Conservative), and we use the Bias and Intensity scores for each moral foundation, along with the likelihoods obtained from the classifier trained on the tweets, as features to test for systematic differences in the moral framing of news. We test whether moral frame scores can distinguish the ideologies of news sources from different political sides. Bias and Intensity are unsupervised measures, since no annotations are needed to calculate them. The second feature set for representing MF framing is obtained by providing the news headlines as test data to the classifier previously trained on the annotated Twitter data; this supervised classifier outputs likelihoods for each of the MF dimensions, which we use to classify partisanship.

Table 3: Results on classifying the partisanship of headlines with a Logistic Regression classifier. The classifier was trained with three different input feature sets: (1) the MF likelihoods from the classifier trained on tweet BERT embeddings; (2) the Bias and Intensity scores from the Frame Axis approach; (3) the combination of the two feature sets. The baseline predicts the result based on the training-set distributions.

Table 3 shows the F1 and accuracy of classifying partisanship using the supervised MF likelihoods and the unsupervised Frame Axis scores as features. The row 'combine' denotes the concatenation of these two feature sets. The baseline is a simple classifier that learns the distribution of partisanship from the training data and uses it to make predictions on the test data. Features based on moral frames outperform the baseline by a wide margin.
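For reference, here is a minimal sketch of the partisanship experiment behind Table 3, under our assumptions about variable names: `frame_axis_scores` is the (n x 12) Bias/Intensity matrix, `mf_likelihoods` the (n x 6) likelihood matrix from the tweet-trained classifier, and `leaning` the per-article source label.

```python
# Sketch of the 'combine' row of Table 3 (assumed names and split; the
# paper does not specify its train/test protocol for this experiment).
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import train_test_split

def partisanship_eval(frame_axis_scores, mf_likelihoods, leaning):
    X = np.hstack([frame_axis_scores, mf_likelihoods])    # 'combine' features
    y = (np.asarray(leaning) == "liberal").astype(int)    # Liberal=1, Conservative=0
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2,
                                              random_state=0, stratify=y)
    clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    pred = clf.predict(X_te)
    return f1_score(y_te, pred), accuracy_score(y_te, pred)
```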
Fig. 2: Coefficients of the Logistic Regression classifier using moral frames to predict liberal partisanship. Coefficients whose 95% confidence intervals exclude 0 are significant. The binary target label is partisanship, with Liberal coded as 1 and Conservative as 0.

Moral Framing of News. Next, to quantify systematic differences in moral framing between liberal and conservative news articles, we inspect the model coefficients learned by the partisanship classifier. One key advantage of the Frame Axis approach is that the model coefficients are straightforward to interpret (see Section 3.2) while achieving competitive performance. Figure 2 reports the Frame Axis coefficients and their 95% confidence intervals for the Intensity and Bias features of each moral foundation. Since we coded Conservative partisanship as 0 and Liberal partisanship as 1, a positive coefficient means that, all else being equal, liberal news articles are more likely than conservative articles to exhibit the corresponding attribute. We highlight the following findings (a sketch of the significance check follows this list):

- The signs of the coefficients are consistent across the two topics analyzed and show significant differences across partisanship for most moral foundations.
- Purity/Degradation foundation. The high, positive Intensity coefficient indicates that liberal media stress the Purity/Degradation foundation more strongly than conservative media when reporting news about immigration and elections. The negative Bias coefficient implies that liberal media put more emphasis on the vice pole, highlighting violations such as dirtiness, unholiness, and impurity, whereas conservative media tend to emphasize virtues such as austerity, sacredness, and purity.
- Authority/Subversion foundation. The Intensity of Authority is insignificant for the immigration topic but substantial for the elections topic, showing that liberal and conservative media hold significantly different views on Authority in elections coverage. Liberal media frame their articles with more attention to the Authority/Subversion foundation, contrary to the previously reported consensus [9]. The negative Bias suggests a stronger framing by liberal media on vice words describing rebellion.
- Care/Harm foundation. The negative Intensity coefficient suggests that conservative media endorse this foundation more strongly, especially in election articles. The positive Bias shows that the framing of conservative media, compared to liberal media, places a higher emphasis on the vice side of the Care/Harm dimension, which condemns malice, abuse, and the infliction of suffering.
- The large, positive Bias coefficient for General Morality suggests that liberal media tend to frame their news about immigration with a more positive moralistic view, indicative of normative judgments (e.g., good, moral, noble), while conservative media use more negative moral judgments based on words like bad, incorrect, or offensive.
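The paper does not state how the confidence intervals in Fig. 2 were computed; as one plausible reconstruction, here is a sketch using statsmodels, where a coefficient is flagged as significant when its 95% CI excludes zero. `X`, `y`, and `feature_names` are assumed to come from the experiment above.

```python
# Sketch of the significance check behind Fig. 2 (statsmodels is our
# assumed tool; the paper does not name one).
import numpy as np
import statsmodels.api as sm

def significant_coefficients(X, y, feature_names):
    model = sm.Logit(y, sm.add_constant(X)).fit(disp=0)
    ci = np.asarray(model.conf_int(alpha=0.05))       # rows: [lower, upper]
    for name, (lo, hi) in zip(["const"] + list(feature_names), ci):
        flag = "significant" if lo * hi > 0 else "not significant"
        print(f"{name}: [{lo:.3f}, {hi:.3f}] -> {flag}")
```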
Recent research documents a change in partisanship among the general electorate, with the number of issues marked by partisan conflict increasing [2]. Studies suggest a link between partisan media exposure and polarization [22], with motivated reasoning proposed as an explanation for why partisan media polarize viewers [17]. An alternative approach emphasizes the impact of how the news media frame political discourse. In their seminal study, Kahneman and Tversky demonstrated the impact of framing on human decisions [23]. Since then, vast experimental evidence has shown how moral framing can influence attitudes toward polarizing issues like climate change [13].

In this paper, we focused on developing computational methods to detect moral framing Biases in news media using NLP techniques. Inspired by the work on Frame Axis [16], which identifies meaningful micro-frames from antonym word pairs, coupled with the principled Moral Foundations Theory, we chose the vices and virtues of the five Moral Foundations as the polar opposites of micro-frame axes. We first validated our approaches on an annotated Twitter dataset, and then studied the moral framing Bias in partisan news articles related to the immigration and elections topics. Our findings reveal systematic differences between liberal and conservative media, supporting the correlations between political ideology and the five foundations shown in survey-based empirical evidence [9]. In particular, our results suggest that rhetoric on Purity differs between liberal and conservative media, with liberals using more rhetoric about violations of the Purity moral foundation. This observation supports the growing evidence that Purity is among the most differentiating moral dimensions between conservatives and liberals [4].

Our work has some limitations that should be noted. First, our methods depend on the dataset, and changing the dataset might change the results; we have demonstrated our methods on two different topics, immigration and elections. Second, the calculation of Bias, as defined in equation (2), does not account for negations in a sentence, which can flip the polarity: e.g., "This is not fair" would have a positive Bias on the Fairness micro-frame because the presence of the word "not" is not considered in the definition of Bias. This problem does not affect Intensity, however, because Intensity measures the frequency of words from both poles of each axis. Lastly, although there is a Twitter dataset annotated with moral foundations, there is no comparable annotated dataset of news articles. In the supervised method described in Section 3.3, we therefore trained the classifier on the labeled tweets and then used that model to predict the moral foundations of news headlines.

As future work, we plan to study different topics and compare framing across different news sources. Most studies have focused on liberal and conservative news sources; an interesting question is how the MF framing of centrist news sources differs from that of polarized ones. Another direction is to leverage the BERT text encodings by fine-tuning them on our MF frame classification task.

References
1. SemAxis: A lightweight framework to characterize domain-specific word semantics beyond sentiment
2. The rise of partisanship and the expansion of partisan conflict within the American electorate
3. Semantics derived automatically from language corpora contain human-like biases
4. Purity homophily in social networks
5. BERT: Pre-training of deep bidirectional transformers for language understanding
6. An empirical exploration of moral foundations theory in partisan news sources
7. Morality between the lines: Detecting moral sentiment in text
8. News use across social media platforms
9. Liberals and conservatives rely on different sets of moral foundations
10. Mapping the moral domain
11. Intuitive ethics: How innately prepared intuitions generate culturally variable virtues
12. Moral foundations Twitter corpus: A collection of 35k tweets annotated for moral sentiment
13. Messaging for environmental action: The role of moral framing and message source
14. Classification of moral foundations in microblog political discourse
15. Moral concerns are differentially observable in language
16. FrameAxis: Characterizing framing bias and intensity with word embedding
17. Why do partisan media polarize viewers?
18. Detecting frames in news headlines and its application to analyzing news framing trends surrounding U.S. gun violence
19. The POLAR framework: Polar opposites enable interpretability of pre-trained word embeddings
20. GloVe: Global vectors for word representation
21. Media and political polarization
22. Going to extremes: How like minds unite and divide
23. Rational choice and the framing of decisions. In: Multiple criteria decision making and risk analysis using microcomputers
24. Using social and behavioural science to support COVID-19 pandemic response
25. Twentieth century morality: The rise and fall of moral concepts from 1900 to 2007

This work was conducted in connection with Contracts #W56KGU-19-C-0004 with the U.S. Army Combat Capabilities Development Command C5ISR Center.
The views, opinions, and findings contained in this document are those of the authors and should not be construed as an official position of the United States Army.