Abstract
Life satisfaction is generally regarded as the cognitive component of subjective well-being, as opposed to positive and negative affect, which are regarded as the affective components. This topic has been extensively studied worldwide and has been linked to a variety of outcomes related to the work context as well as psychological well-being. In this study, we examine the psychometric properties of the Satisfaction with Life Scale (SWLS), one of the most widely used measures of life satisfaction, using three different approaches: classical test theory, Rasch analysis and Mokken analysis. Combining these three approaches provides a more comprehensive validation of an instrument. In this study, schoolteachers (n = 355) completed the SWLS, the trait scale of the State-Trait Anxiety Inventory, the Center for Epidemiological Studies Depression Scale, the Beck Hopelessness Scale and the University of California, Los Angeles Loneliness Scale. The three approaches confirmed the reliability, validity and unidimensional nature of the SWLS, thus supporting its continued use as a measure of life satisfaction in the South African context.
Keywords: Mokken analysis; Rasch analysis; classical test theory; satisfaction with life scale; reliability; validity.
Introduction
Positive psychology is a movement based on the seminal work of Seligman and others (e.g. Seligman, 2002; Seligman & Csikszentmihalyi, 2014), which sparked renewed interest in focusing on what can go right rather than what can go wrong, also known as psychological strength. During the Tanner Lecture Series at the University of Michigan, Seligman (2010) bemoaned psychology’s obsession with ‘what is wrong with life: suicide, depression, schizophrenia, and all the brick walls that can fall on you’ (p. 232) and ‘[we] tried to create a field in which we asked the question, “what makes life worth living, and how can we build it?”’ (p. 232) This sparked interest in research on positive psychological variables such as happiness, life satisfaction and subjective well-being. These terms are often used interchangeably. For example, in a multination study of subjective well-being worldwide, life satisfaction and happiness were used as variables with which to compare different types of economic systems in terms of subjective well-being (Tsai, 2009).
Whilst life satisfaction and happiness are often used as equivalents to subjective well-being, the latter concept is generally regarded as a multidimensional concept that consists of both affective and cognitive dimensions (Pavot & Diener, 2008). In this context, positive and negative affect are typically regarded as the affective components, whereas life satisfaction is regarded as the cognitive dimension (Prasoon & Chaturvedi, 2016). In this regard, Diener, Emmons, Larsen and Griffin (1985) described life satisfaction as a cognitive judgement of people’s level of satisfaction based on a comparison with a standard. Moreover, a meta-analytic review provided some evidence for the hierarchical conceptualisation of subjective well-being and showed that positive affect, negative affect and life satisfaction load on a latent subjective well-being construct (Busseri, 2018). Busseri (2018) used meta-analytic correlations to estimate a latent subjective well-being factor, which had moderate to strong loadings on positive and negative affect as well as life satisfaction.
Life satisfaction is linked to a wide variety of outcomes related to the work context and psychological well-being. It has also been found to be negatively associated with depression, anxiety and stress in migrant and nonmigrant samples in the United States of America and Russia (Brailovskaia, Schönfeld, Kochetkov, & Margraf, 2019), as well as in university students in Brazil (Lopes & Nihei, 2021) and school students in China (Tang, Xiang, Cheung, & Xiang, 2021). In a study on work–family conflict and its correlates, it was found that respondents with high levels of work–family conflict have decreased levels of life satisfaction (Cazan, Truţă, & Pavalache-Ilie, 2019). The same study also revealed positive correlations between life satisfaction and psychological well-being and positive affectivity, as well as a negative correlation with negative affectivity. Amongst the work-related variables that have been found to be associated with life satisfaction are job satisfaction and workplace attachment (Cazan et al., 2019), turnover intentions (Lin, Hu, Danaee, Alias, & Wong, 2021; Ohunakin, Adeniji, Oludayo, Osibanjo, & Oduyoye, 2019) and innovative work behaviour and job performance (Chughtai, 2021). Overall, life satisfaction has been found to be related to a vast range of organisational and health-related variables.
The Satisfaction with Life Scale (SWLS) is arguably the most extensively used measure of life satisfaction and has, in fact, been described as the gold standard for measuring life satisfaction (Kaczmarek, Bujacz, & Eid, 2015). This scale has been used in various countries and is available in several languages (e.g. Dutch: Van Loon, Tijhuis, Surtees, & Ormel, 2001; Spanish: Extremera & Fernandez-Berrocal, 2005; Japanese: Oishi & Sullivan, 2005; Korean: Cha, 2003; and Chinese: Liang & Zhu, 2015). Notably, the psychometric properties of the SWLS are largely based on classical test theory (CTT). For example, in the original study of the scale development, the authors reported a test–retest reliability and an alpha coefficient of 0.82 and 0.87, respectively. Exploratory factor analysis (EFA) resulted in a single factor extracted that explained 66% of the variance, and validity was established through positive correlations between the SWLS and a range of other measures of subjective well-being (Diener et al., 1985). Similarly, other studies have reported satisfactory reliability, validity and evidence for the unidimensionality of the SWLS based on the CTT in other cultures (e.g. Mexico: López-Ortega, Torres-Castro, & Rosas-Carrasco, 2016; Iran: Maroufizadeh, Ghaheri, Samani, & Ezabadi, 2016; and Pakistan: Barki, Choudhry, & Munawar, 2020).
Relatively few studies have employed item response theory (IRT), either parametric or nonparametric. Oishi (2006) examined the cross-cultural equivalence of the SWLS between American and Chinese samples, using the two-parameter logistic model of the IRT. They found that the item difficulty estimates of two of the items of the SWLS were different between the two groups. They also found that the mean satisfaction score of the American sample was substantially higher than that of the Chinese sample. Several studies have also examined the psychometric properties of the SWLS using Rasch analysis (e.g. Akif, 2021; Løvereide & Hagell, 2016; Schutte, Negri, Delle Fave, & Wissing, 2021) but with mixed results. Schutte et al. (2021) found that the results supported the unidimensional structure of the SWLS, that one item did not fit the model and that there was no differential item functioning (DIF) between South African and Italian samples. However, Akif (2021) and Løvereide and Hagell (2016) found that the IRT supported the reliability of the scale and that all items fitted the model well. Both Schutte et al. (2021) and Løvereide and Hagell (2016) suggested the use of fewer response categories, rather than the seven-point Likert scale, which is the current format of the SWLS. To a certain extent, Oishi (2006) and Akif (2021) used a combination of both CTT and parametric IRT to examine the psychometric properties of the SWLS. In both studies, the authors used EFA and confirmatory factor analysis (CFA) to confirm the unidimensionality of the scale, in addition to an IRT analysis. Only one study was found that used a combination of CTT, parametric IRT and nonparametric IRT to examine the SWLS (Avşar, 2021). However, this was not, strictly speaking, an examination of the psychometric properties of the SWLS but rather an examination of the impact of excluding participants who provided aberrant responses to items of the SWLS. The results demonstrated that after the aberrant individuals were excluded, a better fit was obtained for the CFA, Mokken model and graded response model.
In this study, the reliability, validity and dimensionality of the SWLS are examined from three perspectives: CTT, parametric IRT (Rasch analysis) and nonparametric IRT (Mokken analysis). Generally, combining the IRT and CTT provides a comprehensive picture of the psychometric properties of an instrument (Akif, 2021; Oishi, 2006). For example, Oishi (2006) found that if only structural equation modelling was used in the analysis of the SWLS, they would have erroneously concluded that only one item of the SWLS had DIF, whereas IRT revealed that four items had DIF. In CTT, the instrument is the unit of analysis, whereas in IRT, the item is the unit of analysis. In this regard, CTT focuses more on instrument-level indices such as reliability or standard error of a scale, whilst IRT, as its name indicates, focuses more on item-level indices such as item difficulty and DIF (Abedalaziz & Leng, 2018). Moreover, IRT indices are less sample dependent than CTT indices. In this regard, Magno (2009) empirically demonstrated that unlike CTT, item difficulty indices and estimates of reliability in IRT were more stable across different samples. In addition, IRT provides information regarding person–item interactions which is not provided by CTT (Akif, 2021; Oishi, 2006). Mokken analysis is a non-parametric alternative to Rasch analysis; thus, it has fewer assumptions than Rasch analysis. In addition, the Rasch model assumes that all items have the same response function. Item response function refers to the probability that respondents with a high level of the latent trait will endorse an item whilst respondents with a lower level of the latent trait will not endorse the item. In Mokken analysis, no such assumption is made, and the item response function could differ for different items. Hence, using more than one approach provides a comprehensive picture of the instrument under investigation.
Method
Participants
Both primary and secondary schoolteachers (n = 355) from across South Africa participated in this study. Most of them were based in the province of the Western Cape (82.3%) and taught primary school students (61.1%). The sample was largely urban in nature (61.7%) and women (76.6%), and the mean age of the sample was 41.9 years (± 12.4). The mean number of years that the participants have worked in the field of teaching was 15.7 years (± 11.8).
Instruments
All participants completed the following instruments: a brief demographic survey, the SWLS (Diener et al., 1985), the trait scale of the State-Trait Anxiety Inventory (STAI-T; Spielberger, 1988), the Center for Epidemiological Studies Depression Scale (CES-D; Radloff, 1977), the Beck Hopelessness Scale (BHS; Beck, Weissman, Lester, &Trexler, 1974) and the University of California, Los Angeles Loneliness Scale (UCLA-LS; Russell, 1996). As indicated, the SWLS measures the cognitive component of subjective well-being and consists of five items scored on a 7-point Likert scale, ranging from 1 (strongly disagree) to 7 (strongly agree). The SWLS has generally demonstrated satisfactory psychometric properties, as previously indicated. Pavot and Diener (2008) provided an extensive review of the psychometric properties of the SWLS up until 2008. They also confirmed that factor analytic studies have replicated the one-factor structure of the scale. However, they highlighted that Item 5 (‘If I could live my life over, I would change almost nothing’) typically has lower factor loadings and item-total correlations than those of the rest of the items. However, they argue for the retention of Item 5 based on its high correlation with the other items. More recent studies have also confirmed the satisfactory psychometric properties of the SWLS. For example, López-Ortega et al. (2016) reported a reliability coefficient of 0.74. Moreover, in terms of validity, they found that the SWLS significantly correlates with depression and perceived health, amongst other factors. They also confirmed the single-factor structure through an EFA. Similarly, Barki et al. (2020) reported a reliability estimate of 0.89 and found that the SWLS significantly correlates with anxiety and depression. They also confirmed the unidimensional structure with a CFA. Other researchers have also used the SWLS in South Africa and reported satisfactory internal consistency reliability (Padmanabhanunni & Pretorius, 2021a).
The STAI-T is a measure of trait anxiety that consists of 20 items scored on a 4-point Likert scale, ranging from 1(almost never) to 4 (almost always). A favourable estimate of reliability has been reported both in the original study (α = 0.86–0.92; Spielberger, 1988) and in more recent studies (e.g. Bee Seok, Abd Hamid, Mutang, & Ismail, 2018; Hallit et al., 2020; Stojanović et al., 2020). In South Africa, the STAI-T has also demonstrated satisfactory reliability when used with students (Pretorius & Padmanabhanunni, 2021) as well as with a manganese-exposed community when translated into several South African languages (Racette et al., 2021).
The CES-D is a widely used 20-item measure of depressive symptomology relying on a 4-point scale with response options ranging from zero (rarely or none of the time) to three (most of or all the time). This measure has generally demonstrated satisfactory internal consistency (e.g. Ilic, Babic, Dimitrijevic, Ilic, & Grujicic, 2019; Nia et al., 2019; Singh, Zaki, Farid, & Kaur, 2021). In South Africa, Hassem (2021) developed a 19-item adapted version of the CES-D to be used as an online screening tool for depression. The CES-D has also been used to assess depression in students (Padmanabhanunni & Pretorius, 2020; Pretorius, 1991) and teachers (Padmanabhanunni, Pretorius, Stiegler, & Bouchard, 2022), and it has generally demonstrated satisfactory internal consistency.
The BHS is a 20-item self-report measure of hopelessness scored on a dichotomous true-or-false response format. Satisfactory reliability estimates have been reported for the BHS both in the original study (Kuder-Richardson -20 = 0.93; Beck et al., 1974) and in several different contexts (e.g. Colombia: Kocalevent et al., 2017; Nigeria: Aloba, Olabisi, Ajao, & Aloba, 2017; and Japan: Sueki, 2020). Similar satisfactory estimates of reliability have also been reported for student samples in South Africa (Heppner, Pretorius, Wei, Lee, & Wang, 2002; Padmanabhanunni & Pretorius, 2021b).
The UCLA-LS is a 20-item self-report measure of loneliness scored on a 4-point Likert scale ranging from 1 (I never feel this way) to 4 (I often feel this way). The author of the UCLA-LS reported alpha coefficients ranging between 0.89 and 0.94 for different samples of students, nurses, teachers and older individuals. In more recent studies, the UCLA-LS has consistently demonstrated satisfactory reliability (e.g. Arimoto & Tadaka, 2019; Zeas-Sigüenza, Oliveira, Ferreira, Ganho, & Ruisoto, 2021). The UCLA-LS has also demonstrated acceptable reliability in South Africa (Padmanabhanunni & Pretorius, 2021c; Pretorius, 1993). In a study in South Africa, Pretorius (2022) examined the dimensionality of the UCLA-LS using CFA and bifactor indices and concluded that it is best used as a total scale with three subscales.
Procedure
An electronic version of the above-mentioned instruments was first constructed using Google Forms. Then, a Google link was posted on teacher Facebook groups after permission was obtained from the administrators of these sites. The school liaison officers of the university also sent the link to schools with which they had a working relationship.
Data analysis
IBM SPSS Statistics version 27 for Windows (IBM Corp., Armonk, NY, USA) was used to perform the CTT analyses, and IBM SPSS Amos version 27 (IBM Corp.) was used to conduct CFA. In addition, Winsteps version 5.1.4 (Linacre, 2021a) was used to perform the Rasch analysis, and R (R Core Team, 2017) was used to conduct the Mokken analysis with the ‘Mokken’ package (Van Der Ark, 2012).
The reliability of the SWLS was assessed in terms of Cronbach’s alpha (α), composite reliability (CR) and Mokken scale reliability (MSrho). Conventionally, a reliability coefficient greater than 0.70 is considered evidence of satisfactory reliability (Taber, 2018).
To determine the construct validity of the SWLS, the item-total correlations (CTT), item and person separation indices (Rasch analysis) and scalability coefficients for each item (Hi, Mokken analysis) were evaluated. In general, item-total correlations greater than 0.50 (DeVon et al., 2007, Hajjar, 2018) indicate that all items contribute to the measurement of the latent construct (i.e. life satisfaction). The Hi coefficient serves the same function as the item-total correlations in the sense that it indicates the extent to which each item contributes to the total scale. According to Mokken (1971), Hi coefficients greater than 0.30 indicate well-fitting items that contribute to the measurement of the latent construct. With regard to person and item separation indices, Linacre (2021b) recommends that a person separation index of > 2 together with person reliability of > 0.80 and an item separation index of > 3 together with item separation reliability of > 0.80 are acceptable. If these criteria are met, this would indicate that the scale can distinguish between different levels of ‘performers’ (i.e. those with high and low scores on the latent construct–person separation) and that an item difficulty hierarchy exists (item separation). For each item, the Rasch analysis also provides fit statistics called the infit and outfit mean square (MnSq), which is used to determine the extent to which each item fits the Rasch model. Linacre (2021b) suggested that mean square values below 0.50 and above 1.5 are indicative of misfitting items. Mokken analysis also provides an indication of whether items discriminate between participants who have high or low levels of life satisfaction (monotonicity) and whether there are items that respondents with the same level of life satisfaction may have endorsed in significantly different ways (invariant item ordering [IIO]; Sijtsma & Van Der Ark, 2017). For these two assumptions, monotonicity and IIO, Mokken analysis provides a Crit value, which is used to assess potential violations. According to Sijtsma and Van Der Ark (2017), a Crit value greater than 80 indicates serious violations, whereas a Crit value between 40 and 80 indicates minor but acceptable violations. To assess the measurement invariance between men and women, DIF was calculated using Rasch analysis. In this context, a DIF value smaller than 0.50 would indicate that the items measure the same construct across different groups (Linacre, 2021b). Given reported findings that women generally reported higher levels of life satisfaction (e.g. Joshanloo & Jovanović, 2020), it is important to demonstrate that the SWLS measures the same construct in the two groups.
Other types of construct validity include convergent, discriminant and concurrent validity. Firstly, to demonstrate convergent validity, the average variance extracted (AVE), CR and factor loadings were used. In general, significant factor loadings (Posch et al., 2019), an AVE value greater than 0.50 and an AVE value smaller than the CR value are evidence of convergent validity. Secondly, with regard to discriminant validity, an AVE value greater than the maximum shared variance (MSV) and average shared variance (ASV) is indicative of discriminant validity. This is because it indicates that the latent construct explains a greater proportion of the variance in the items that contribute to its measurement compared with the proportion of variance shared with other related constructs (Almén, Lundberg, Sundin, & Jansson, 2018). Finally, concurrent validity was established through the associations between life satisfaction and the indices of psychological distress which have been consistently linked to life satisfaction in the literature, namely anxiety, hopelessness, loneliness and depression (e.g. Brailovskaia et al., 2019; Tang et al., 2021).
The dimensionality of the SWLS was evaluated using all three approaches. To perform a factor analysis, both EFA (principal axis) and CFA were conducted. However, before the EFA was conducted, the suitability of the data for factor analysis was examined using the Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy and Bartlett’s test of sphericity. In general, a KMO value above 0.5, both at scale level and individual item level, and a significance level for Bartlett’s test below 0.05 suggests a substantial correlation in the data. Thus, it would be appropriate to proceed with factor analysis (Hadi, Abdullah, & Sentosa, 2016). The following fit indices were used in the CFA to measure the model fit (Kline, 2005): chi-square (χ2, best if p > 0.05), comparative fit index (CFI, best if above 0.90), root-mean-square error of approximation (RMSEA, best if below 0.08), the Tucker–Lewis index (TLI, best if above 0.90) and the goodness-of-fit index (GFI, best if above 0.95). Mokken analysis provides an algorithm, an automated item selection procedure (AISP) that partitions items into scales. Items that are not selected through the AISP are regarded as unscalable (Sijtsma & Van Der Ark, 2017). In addition to providing an H-coefficient for each item (Hi), Mokken analysis also provides a scalability coefficient (H) for the entire scale to reflect the strength of the scale. The following rule of thumb is typically used to evaluate H: H ≥ 50 reflects a strong scale, 0.40 ≤ H < 0.50 reflects an intermediate scale and H < 0.40 reflects a weak scale (Wind, 2017). After the presumed latent trait is removed, a principal component analysis (PCA) of the residuals is used in the Rasch analysis to detect multidimensionality. If a possible additional dimension, as indicated by the PCA (called the ‘first contrast’), has an eigenvalue of > 2, then this suggests two or more items loading on a possible second factor, thus indicating multidimensionality (Linacre, 2021b).
Ethical considerations
Ethical approval for this study was obtained from the Humanities and Social Sciences Ethics Committee of the University of the Western Cape (reference number: HS21/3/8). Participation was voluntary, and all participants provided informed consent before they were allowed to proceed with the electronic survey.
Results
Table 1 shows the reliabilities, descriptive statistics and intercorrelations between the study variables. Overall, the reliabilities of all scales can be considered to be satisfactory (α > 0.70). Notably, the mean life satisfaction score in the current study was 21.9 (± 7.3). This is significantly lower than the value reported by Diener et al. in the original research of the scale (M = 23.5, standard deviation [SD] = 6.43, t = −4.15, p < 0.001). It is also significantly lower than the value reported more recently by Jovanović and Brdar (2018)1 for three different countries: Austria (M = 25.6, SD = 5.95, t = −9.58, p < 0.001), Croatia (M = 24.1, SD = 5.1, t = −5.71, p < 0.001) and Serbia (M = 23.3, SD = 5.7, t = −3.64, p < 0.001). However, it is comparable to the mean life satisfaction score reported in the same study for two countries: Bosnia and Herzegovina (M = 21.6, SD = 6.7, t = 0.76, p = 0.449) and Montenegro (M = 22.4, SD = 6.5, t = −1.31, p = 0.191).
TABLE 1: Intercorrelations, reliabilities and descriptive statistics for variables. |
Table 1 also indicates a significant negative relationship between life satisfaction and the indices of psychological distress: anxiety (r = −0.52, p < 0.001, 95% confidence interval [CI] [−0.59, −0.44]), depression (r = −0.55, p < 0.001, 95% CI [−0.62, − 0.47]), hopelessness (r = −0.62, p < 0.001, 95% CI [−0.68, −0.55]) and loneliness (r = −0.53, p < 0.001, 95% CI [−0.61, −0.45]). This indicates that high levels of life satisfaction are associated with low levels of anxiety, depression, hopelessness and loneliness, thus providing evidence of concurrent validity.
Table 2 shows the CTT, Rasch and Mokken indices for the items of the SWLS. It can be observed that the inter-item correlations were all significant and above 0.50. Moreover, the item-total correlations ranged between 0.63 and 0.84. Item 5 (‘If I could live my life over, I would change almost nothing’) exhibited the lowest correlation with the latent variable, but it was higher than 0.50. The Hi coefficients ranged between 0.58 and 0.73, thus exceeding the suggested cut-off of > 0.30. The factor loadings were all above 0.70 (Hair, Ringle, & Sarstedt, 2011), ranged between 0.74 and 0.91 and were all significant. In addition, there were no significant violations of monotonicity and only one minor violation of IIO (Item 5, Crit = 61). The infit and outfit MnSq values were all within the range of > 0.50 – < 1.5, indicating the absence of misfitting items. The DIF value for all items across gender was < 0.50, indicating measurement invariance across men and women.
TABLE 2: Classical test theory and item response theory indices for the Satisfaction with Life Scale at the item level. |
The results of the KMO and Bartlett’s tests indicated that the data set was adequately sampled and that there was substantial correlation in the data set (KMO = 0.863, KMO for individual items = 0.82–0.91, Bartlett’s test = 1125.88, p < 0.001). They therefore indicated that factor analysis of the data is appropriate. The EFA extracted one factor, which accounted for 71.15% of the variance, hence demonstrating a dominant factor underlying the SWLS. As shown in Figure 1, a one-factor model of the SWLS was examined using CFA.
|
FIGURE 1: One-factor model of the Satisfaction with Life Scale with the indices of psychological well-being as outcome variables. |
|
The results of CFA are reported in Table 3. The fit indices were all above the suggested best-fit indicator (χ2 = ns, GFI = 0.99, TLI = 0.99, CFI = 0.99 and RMSEA = 0.05), indicating that a one-factor representation of the SWLS is a favourable fit for the data.
TABLE 3: Confirmatory factor analysis fit indices for the Satisfaction with Life Scale: one-factor model. |
Table 4 shows the CTT, Rasch and Mokken indices for the SWLS at the scale level, together with the suggested cut-off values. With regard to reliability, the Cronbach alpha (0.90), CR (0.93) and MSrho (0.90) values were all above 0.70, and the AVE value (0.73) was larger than 0.50 and also larger than the MSV value (0.38) and the ASV (0.31). In this regard, the highest correlation coefficient between life satisfaction and the indices of psychological well-being was with hopelessness (r = −0.62). Thus, the squared correlation (MSV) was 0.38. Average shared variance is the mean of the squared correlations between life satisfaction, anxiety, depression, hopelessness and loneliness. The separation indices were at an acceptable level (item separation index = 5.91, person separation index = 2.37). Similarly, the separation reliabilities also exceeded the suggested cut-off values (item separation reliability = 0.97, person separation reliability = 0.85). The eigenvalue associated with a possible additional factor in the Rasch analysis was found to be 1.82, indicating that the scale is essentially unidimensional. However, at least two items (items four and five) were reflected on the first contrast. The disattenuated correlation between the Rasch dimension and the first contrast was 0.73, indicating that the two clusters of items have more than half of their variance in common, which would support a possible interpretation that the two clusters of items measure the same latent variable (Linacre, 2021b). The scalability coefficient in the Mokken analysis also indicated the existence of a very strong scale (H = 0.66).
TABLE 4: Classical test theory and item response theory indices for the Satisfaction with Life Scale at the scale level. |
Discussion
In this study, we used CTT and IRT to examine the replicability of the psychometric properties of the SWLS in a sample of South African teachers. The results obtained support the findings in the literature regarding the psychometric properties of the SWLS as examined through CTT (e.g. Barki et al., 2020; Diener et al., 1985; Maroufizadeh et al., 2016) and IRT (e.g. Akif, 2021; Oishi, 2006).
Firstly, the mean life satisfaction score for the current sample of teachers in South Africa was found to be significantly lower than the values reported in other countries. Pavot and Diener (2009) asserted that it is typical for citizens in Western countries to score highly on a range of measures of well-being, including the SWLS. This assertion seems to be corroborated by South African studies that have also reported low life satisfaction scores prior to the pandemic (e.g. Westaway, Maritz, & Golele, 2003: M = 21.7, SD = 8.8; Field & Buitendach, 2011: M = 17.47, SD = 6.33). However, there have also been South African studies that reported very high life satisfaction scores amongst South African samples (Le Roux, Kagee, Van Der Merwe, & Parker, 2008: M = 28.7, SD = 7.8; Roothman, Kirsten, & Wissing, 2003: M = 24.9 and SD = 5.4 for men and M = 24.8 and SD = 6.0 for women). The available evidence therefore does not allow for a definitive statement of the impact of the pandemic on the life satisfaction of teachers. Rather, it merely reflects the fact that they have low levels of life satisfaction, which may have been the case even prior to the pandemic.
Secondly, all the indices of reliability (Cronbach’s alpha, CR and MSrho) exceeded the conventional cut-off (> 0.70), thus demonstrating that the SWLS has very satisfactory reliability that supports its continued use as a research instrument. Thirdly, both the CTT and the IRT confirmed that the SWLS demonstrates sufficient validity. With regard to construct validity, all items were found to highly correlate with the total scale. In addition, the scalability coefficient of the individual items (Hi) indicated that all items contributed to the measurement of life satisfaction. Moreover, the person and item separation indices confirmed that the SWLS can distinguish between respondents with low and high scores on life satisfaction (person separation – Rasch; monotonicity – Mokken) and that an item difficulty hierarchy exists (item separation). The Mokken analysis also confirmed that there was no violation of the assumption of IIO, and thus that there was consistency in the way respondents with the same level of satisfaction responded to items. Differential item functioning also demonstrated that there were no gender differences in the measurement of the construct.
Notably, the convergent, discriminant and concurrent validity of the SWLS were also confirmed. The significant factor loadings of the five items and the fact that the AVE value was above 0.50 and below the CR value confirmed the convergent validity of the scale. The total life satisfaction score accounted for a greater proportion of the variance in the five items (AVE) as opposed to the variance it shared with the indices of psychological well-being (MSV, ASV), thus demonstrating discriminant validity. Finally, the significant associations between life satisfaction and the indices of psychological distress provided evidence of concurrent validity. The CTT, Rasch and Mokken analyses provided complementary evidence of the unidimensional nature of the scale through EFA, CFA (CTT), the scalability coefficient (Mokken) and PCA of the residuals after the Rasch factor was extracted.
Some concerns have been expressed regarding Item 5 (‘If I could live my life over, I would change almost nothing’; Oishi, 2006; Pavot & Diener, 2008; Schutte et al., 2021). This item seems to be conceptually different from the other four items in that it focuses on the past, whereas the other four focus on the present. As in these other studies, Item 5 has also been found to have the lowest item-total correlation and the lowest factor loading in the current study. However, both CTT and IRT seem to suggest that, whilst this item is conceptually different, its inclusion in the scale is probably warranted. In CFA, the factor loading of Item 5 was lower than the other four items but still significant, whilst in Rasch analysis the disattenauted correlation coefficient suggested that the two clusters of items measure the same underlying construct. This requires further and more detailed investigation in future studies.
In summary, the three approaches mentioned in this study provide complementary evidence of the reliability, validity and unidimensional nature of the SWLS. The SWLS has largely been used as a research rather than a diagnostic instrument, and the evidence from three different perspectives supports its continued use in research on subjective well-being.
Limitations
This study has some limitations. For example, as we have largely used self-report measures, it is important to acknowledge the potential self-report bias. The study results, however, are comparable to previously reported results. In addition, most of the teachers were from one province only, thereby limiting the generalisability of the study. Therefore, in future studies, researchers should attempt to select more representative samples.
Conclusion
The SWLS is a popular and widely used measure of life satisfaction that is extensively used in South Africa. To our knowledge, this is the first study in which CTT, Rasch and Mokken analyses are used in a complementary manner to evaluate the psychometric properties of the SWLS. The results indicate that the SWLS is a reliable, valid and unidimensional measure of the cognitive component of subjective well-being.
Acknowledgements
Competing interests
The authors declare that they have no financial or personal relationships that may have inappropriately influenced them in writing this research article.
Authors’ contributions
A.P. and T.B.P. contributed equally to the conceptualisation, data collection, writing, review and editing of this article. T.B.P. was responsible for the data analysis.
Funding information
This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.
Data availability
The data sets generated and/or analysed during the current study are available from the corresponding author upon reasonable request.
Disclaimer
The views and opinions expressed in this article are those of the authors and do not necessarily reflect the official policy or position of any affiliated agency of the authors.
References
Abedalaziz, N., & Leng, C.H. (2018). The relationship between CTT and IRT approaches in analyzing item characteristics. MOJES: Malaysian Online Journal of Educational Sciences, 1(1), 64–70. Retrieved from https://www.researchgate.net/publication/316172903
Akif, A.V.C.U. (2021). Item response theory-based psychometric investigation of SWLS for university students. International Journal of Psychology and Educational Studies, 8(2), 27–37. https://doi.org/10.52380/ijpes.2021.8.2.265
Almén, N., Lundberg, H., Sundin, Ö., & Jansson, B. (2018). The reliability and factorial validity of the Swedish version of the recovery experience questionnaire. Nordic Psychology, 70(4), 324–333. https://doi.org/10.1080/19012276.2018.1443280
Aloba, O., Olabisi, O., Ajao, O., & Aloba, T. (2017). The beck hopelessness scale: Factor structure, validity, and reliability in a non-clinical sample of student nurses in South-western Nigeria. Journal of Behavioral Health, 6(1), 58–65. https://doi.org/10.5455/jbh.20161022032400
Arimoto, A., & Tadaka, E. (2019). Reliability and validity of Japanese versions of the UCLA loneliness scale version 3 for use among mothers with infants and toddlers: A cross-sectional study. BMC Women’s Health, 19(1), 1–9. https://doi.org/10.1186/s12905-019-0792-4
Avşar, A.Ş. (2021). Aberrant individuals’ effects on fit indices both of confirmatory factor analysis and polytomous IRT models. Current Psychology, 1–10. https://doi.org/10.1007/s12144-021-01563-4
Barki, N., Choudhry, F.R., & Munawar, K. (2020). The satisfaction with life scale: Psychometric properties in Pakistani population. Medical Journal of the Islamic Republic of Iran, 34, 159. https://doi.org/10.47176/mjiri.34.159
Beck, A.T., Weissman, A., Lester, D., &Trexler, L. (1974). The measurement of pessimism: The hopelessness scale. Journal of Consulting and Clinical Psychology, 42(6), 861–865. https://doi.org/10.1037/h0037562
Bee Seok, C., Abd Hamid, H.S., Mutang, J.A., & Ismail, R. (2018). Psychometric properties of the state-trait anxiety inventory (form Y) among Malaysian university students. Sustainability, 10(9), 3311. https://doi.org/10.3390/su10093311
Brailovskaia, J., Schönfeld, P., Kochetkov, Y., & Margraf, J. (2019). What does migration mean to us? USA and Russia: Relationship between migration, resilience, social support, happiness, life satisfaction, depression, anxiety and stress. Current Psychology, 38(2), 421–431. https://doi.org/10.1007/s12144-017-9627-3
Busseri, M.A. (2018). Examining the structure of subjective well-being through meta-analysis of the associations among positive affect, negative affect, and life satisfaction. Personality and Individual Differences, 122(1), 68–71. https://doi.org/10.1016/j.paid.2017.10.003
Cazan, A.M., Truţă, C., & Pavalache-Ilie, M. (2019). The work-life conflict and satisfaction with life: Correlates and the mediating role of the work-family conflict. Romanian Journal of Psychology, 21(1), 3–10. https://doi.org/10.24913/rjap.21.1.02
Cha, K.H. (2003). Subjective well-being among college students. Social Indicators Research, 62(1), 455–477. https://doi.org/10.1023/A:1022669906470
Chughtai, A.A. (2021). A closer look at the relationship between life satisfaction and job performance. Applied Research in Quality of Life, 16(2), 805–825. https://doi.org/10.1007/s11482-019-09793-2
DeVon, H.A., Block, M.E., Moyle-Wright, P., Ernst, D.M., Hayden, S.J., Lazzara, D.J., … Kostas-Polston, E. (2007). A psychometric toolbox for testing validity and reliability. Journal of Nursing Scholarship, 39(2), 155–164. https://doi.org/10.1111/j.1547-5069.2007.00161.x
Diener, E., Emmons, R.A., Larsen, R.J., & Griffin, S. (1985). The satisfaction with life scale. Journal of Personality Assessment, 49(1), 71–75. https://doi.org/10.1207/s15327752jpa4901_13
Extremera, N., & Fernandez-Berrocal, P. (2005). Perceived emotional intelligence and life satisfaction: Predictive and incremental validity using the trait meta-mood scale. Personality and Individual Differences, 39(5), 937–948. https://doi.org/10.1016/j.paid.2005.03.012
Field, L.K., & Buitendach, J.H. (2011). Happiness, work engagement and organisational commitment of support staff at a tertiary education institution in South Africa. SA Journal of Industrial Psychology, 37(1), 1–10. https://doi.org/10.4102/sajip.v37i1.946
Hadi, N.U., Abdullah, N., & Sentosa, I. (2016). An easy approach to exploratory factor analysis: Marketing perspective. Journal of Educational and Social Research, 6(1), 215. Retrieved from https://www.mcser.org/journal/index.php/jesr/article/view/8799
Hair, J.F., Ringle, C.M., & Sarstedt, M. (2011). PLS-SEM: Indeed a silver bullet. Journal of Marketing theory and Practice, 19(2), 139–152. https://doi.org/10.2753/MTP1069-6679190202
Hajjar, S.T. (2018). Statistical analysis: Internal-consistency reliability and construct validity. International Journal of Quantitative and Qualitative Research Methods, 6(1), 46–57. Retrieved from https://www.eajournals.org/wp-content/uploads/Statistical-Analysis-Internal-Consistency-Reliability-and-Construct-Validity-1.pdf
Hallit, S., Haddad, C., Hallit, R., Akel, M., Obeid, S., Haddad, G., … Salameh, P. (2020). Validation of the Hamilton anxiety rating scale and state trait anxiety inventory A and B in Arabic among the Lebanese population. Clinical Epidemiology and Global Health, 8(4), 1104–1109. https://doi.org/10.1016/j.cegh.2020.03.028
Hassem, T. (2021). Establishing the content validity of an online depression screening tool for South Africa. African Journal of Psychological Assessment, 3, a62. https://doi.org/10.4102/ajopa.v3i0.62
Heppner, P.P., Pretorius, T.B., Wei, M., Lee, D.G., & Wang, Y.W. (2002). Examining the generalizability of problem-solving appraisal in black South Africans. Journal of Counseling Psychology, 49(4), 484–498. https://doi.org/10.1037/0022-0167.49.4.484
Ilic, I., Babic, G., Dimitrijevic, A., Ilic, M., & Grujicic, S.S. (2019). Reliability and validity of the center for epidemiologic studies depression (CES-D) scale in Serbian women with abnormal papanicolaou smear results. International Journal of Gynecologic Cancer, 29(6), 996–1002. Retrieved from https://ijgc.bmj.com/content/29/6/996.abstract
Joshanloo, M., & Jovanović, V. (2020). The relationship between gender and life satisfaction: Analysis across demographic groups and global regions. Archives of Women’s Mental Health, 23(3), 331–338. https://doi.org/10.1007/s00737-019-00998-w
Jovanović, V., & Brdar, I. (2018). The cross-national measurement invariance of the satisfaction with life scale in a sample of undergraduate students. Personality and Individual Differences, 128(1), 7–9. https://doi.org/10.1016/j.paid.2018.02.010
Kaczmarek, L.D., Bujacz, A., & Eid, M. (2015). Comparative latent state – Trait analysis of satisfaction with life measures: The steen happiness index and the satisfaction with life scale. Journal of Happiness Studies, 16(2), 443–453. https://doi.org/10.1007/s10902-014-9517-4
Kline, R.B. (2005). Principles and practice of structural equation modeling (2nd ed.). New York: Guilford.
Kocalevent, R.D., Finck, C., Pérez-Trujillo, M., Sautier, L., Zill, J., & Hinz, A. (2017). Standardization of the beck hopelessness scale in the general population. Journal of Mental Health, 26(6), 516–522. https://doi.org/10.1080/09638237.2016.1244717
Le Roux, M.C., Kagee, A., Van Der Merwe, M., & Parker, F. (2008). Subjective well-being of primary health care patients in the Western Cape, South Africa. South African Family Practice, 50(3), 68–68. https://doi.org/10.1080/20786204.2008.10873723
Liang, Y., & Zhu, D. (2015). Subjective well- of Chinese landless peasants in relatively developed regions: Measurement using PANAS and SWLS. Social Indicators Research, 123(3), 817–835. https://doi.org/10.1007/s11205-014-0762-z
Lin, Y., Hu, Z., Danaee, M., Alias, H., & Wong, L.P. (2021). The impact of the COVID-19 pandemic on future nursing career turnover intention among nursing students. Risk Management and Healthcare Policy, 14, 3605–3615. https://doi.org/10.2147/RMHP.S322764
Linacre, J.M. (2021a). Winsteps® Rasch measurement computer program (Version 5.1.4). Portland, OR: Winsteps.com.
Linacre, J.M. (2021b). Winsteps® Rasch measurement computer program user’s guide. Version 5.1.1. Portland, OR: Winsteps.com.
Lopes, A.R., & Nihei, O.K. (2021). Depression, anxiety and stress symptoms in Brazilian university students during the COVID-19 pandemic: Predictors and association with life satisfaction, psychological well-being and coping strategies. PLoS One, 16(10), e0258493. https://doi.org/10.1371/journal.pone.0258493
López-Ortega, M., Torres-Castro, S., & Rosas-Carrasco, O. (2016). Psychometric properties of the satisfaction with life scale (SWLS): Secondary analysis of the mexican health and aging study. Health and Quality of Life Outcomes, 14(1), 1–7. https://doi.org/10.1186/s12955-016-0573-9
Løvereide, L., & Hagell, P. (2016). Measuring life satisfaction in Parkinson’s disease and healthy controls using the satisfaction with life scale. PLoS One, 11(10), e0163931. https://doi.org/10.1371/journal.pone.0163931
Magno, C. (2009). Demonstrating the difference between classical test theory and item response theory using derived test data. The International Journal of Educational and Psychological Assessment, 1(1), 1–11. Retrieved from https://ssrn.com/abstract=1426043
Maroufizadeh, S., Ghaheri, A., Samani, R.O., & Ezabadi, Z. (2016). Psychometric properties of the satisfaction with life scale (SWLS) in Iranian infertile women. International Journal of Reproductive BioMedicine, 14(1), 57–62. Retrieved from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4837918/
Mokken, R.J. (1971). A theory and procedure of scale analysis. The Hague: Mouton.
Nia, H.S., Rezapour, M., Allen, K.A., Sharif, S.P., Jafari, A., Torkmandi, H., … Goudarzian, A.H. (2019). The psychometric properties of the center for epidemiological studies depression scale (CES-D) for Iranian cancer patients. Asian Pacific Journal of Cancer Prevention: APJCP, 20(9), 2803–2809. https://doi.org/10.31557/APJCP.2019.20.9.2803
Ohunakin, F., Adeniji, A.A., Oludayo, O.A., Osibanjo, A.O., & Oduyoye, O.O. (2019). Employees’ retention in Nigeria’s hospitality industry: The role of transformational leadership style and job satisfaction. Journal of Human Resources in Hospitality & Tourism, 18(4), 441–470. https://doi.org/10.1080/15332845.2019.1626795
Oishi, S. (2006). The concept of life satisfaction across cultures: An IRT analysis. Journal of Research in Personality, 40(4), 411–423. https://doi.org/10.1016/j.jrp.2005.02.002
Oishi, S., & Sullivan, H.W. (2005). The mediating role of parental expectations in culture and well-being. Journal of Personality, 73(5), 1267–1294. https://doi.org/10.1111/j.1467-6494.2005.00349.x
Padmanabhanunni, A., & Pretorius, T. (2021a). The loneliness – Life satisfaction relationship: The parallel and serial mediating role of hopelessness, depression and ego-resilience among young adults in south africa during covid-19. International Journal of Environmental Research and Public Health, 18(7), 3613. https://doi.org/10.3390/ijerph18073613
Padmanabhanunni, A., & Pretorius, T. (2021b). Behaviour is the key in a pandemic: The direct and indirect effects of COVID-19-related variables on psychological wellbeing. Psychological Reports. https://doi.org/10.1177/00332941211025269
Padmanabhanunni, A., & Pretorius, T.B. (2020). When coping resources fail: The health-sustaining and moderating role of fortitude in the relationship between COVID-19-related worries and psychological distress. African Safety Promotion: A Journal of Injury and Violence Prevention, 18(2), 28–47. Retrieved from https://www.ajol.info/index.php/asp/article/download/211538/199456
Padmanabhanunni, A., & Pretorius, T.B. (2021c). The unbearable loneliness of COVID-19: COVID-19-related correlates of loneliness in South Africa in young adults. Psychiatry Research, 296, 113658. https://doi.org/10.1016/j.psychres.2020.113658
Padmanabhanunni, A., Pretorius, T.B., Stiegler, N., & Bouchard, J.P. (2022). A serial model of the interrelationship between perceived vulnerability to disease, fear of COVID-19, and psychological distress among teachers in South Africa. Annales Médico-Psychologiques, Revue Psychiatrique, 180(1), 23–28. https://doi.org/10.1016/j.amp.2021.11.007
Pavot, W., & Diener, E. (2008). The satisfaction with life scale and the emerging construct of life satisfaction. The Journal of Positive Psychology, 3(2), 137–152. https://doi.org/10.1080/17439760701756946
Pavot, W., & Diener, E. (2009). Review of the satisfaction with life scale. In E. Diener (Ed.), Assessing wellbeing: The collected works of ed Diener (pp. 101–117). Dordrecht: Springer.
Posch, L., Bleier, A., Lechner, C.M., Danner, D., Flöck, F., & Strohmaier, M. (2019). Measuring motivations of crowdworkers: The multidimensional crowdworker motivation scale. ACM Transactions on Social Computing, 2(2), 1–34. https://doi.org/10.1145/3335081
Prasoon, R., & Chaturvedi, K.R. (2016). Life satisfaction: A literature review. The Researcher: International Journal of Management, Humanities and Social Sciences, 1(02), 24–31. Retrieved from https://researcher.galgotiapublications.com/researcher/article/download/9/9
Pretorius, T.B. (1991). Cross-cultural application of the center for epidemiological studies depression scale: A study of black South African students. Psychological Reports, 69(Suppl 3), 1179–1185. https://doi.org/10.2466/pr0.1991.69.3f.1179
Pretorius, T.B. (1993). The metric equivalence of the UCLA loneliness scale for a sample of South African students. Educational and Psychological Measurement, 53(1), 233–239. https://doi.org/10.1177/0013164493053001026
Pretorius, T.B. (2022). The applicability of the UCLA loneliness scale in South Africa: Factor structure and dimensionality. African Journal of Psychological Assessment, 4, 8. https://doi.org/10.4102/ajopa.v4i0.63
Pretorius, T., & Padmanabhanunni, A. (2021). A looming mental health pandemic in the time of COVID-19? Role of fortitude in the interrelationship between loneliness, anxiety, and life satisfaction among young adults. South African Journal of Psychology, 51(2), 256–268. https://doi.org/10.1177/0081246321991030
Racette, B.A., Nelson, G., Dlamini, W.W., Hershey, T., Prathibha, P., Turner, J.R., … Nielsen, S.S. (2021). Depression and anxiety in a manganese-exposed community. Neurotoxicology, 85, 222–233. https://doi.org/10.1016/j.neuro.2021.05.017
Radloff, L.S. (1977). The CES-D scale: A self-report depression scale for research in the general population. Applied Psychological Measurement, 1(3), 385–401. https://doi.org/10.1177/014662167700100306
R Core Team. (2017). R: A language and environment for statistical computing. R foundation for statistical computing. Vienna. Retrieved from https://www.R-project.org/
Roothman, B., Kirsten, D.K., & Wissing, M.P. (2003). Gender differences in aspects of psychological well-being. South African Journal of Psychology, 33(4), 212–218. https://doi.org/10.1177/008124630303300403
Russell, D.W. (1996). UCLA loneliness scale (version 3): Reliability, validity, and factor structure. Journal of Personality Assessment, 66(1), 20–40. https://doi.org/10.1207/s15327752jpa6601_2
Schutte, L., Negri, L., Delle Fave, A., & Wissing, M.P. (2021). Rasch analysis of the satisfaction with life scale across countries: Findings from South Africa and Italy. Current Psychology, 40(10), 4908–4917. https://doi.org/10.1007/s12144-019-00424-5
Seligman, M. (2010). Flourish: Positive psychology and positive interventions. The Tanner Lectures on Human Values, 31(4), 1–56. Retrieved from http://www.isbm.at/pics/Flourish_Seligman.pdf
Seligman, M.E. (2002). Authentic happiness: Using the new positive psychology to realize your potential for lasting fulfillment. New York: Simon and Schuster.
Seligman, M.E., & Csikszentmihalyi, M. (2014). Positive psychology: An introduction. in Csikszentmihalyi, M., & Larson, R. (Eds.), Flow and the foundations of positive psychology (pp. 279–298). Dordrecht: Springer.
Sijtsma, K., & Van Der Ark, L.A. (2017). A tutorial on how to do a Mokken scale analysis on your test and questionnaire data. British Journal of Mathematical and Statistical Psychology, 70(1), 137–158. https://doi.org/10.1111/bmsp.12078
Singh, S., Zaki, R.A., Farid, N.D.N., & Kaur, K. (2021). Reliability analysis of the malay version of the center for epidemiologic studies-depression scale (CESD) among adolescents in Malaysia. Preventive Medicine Reports, 24, 101585. https://doi.org/10.1016/j.pmedr.2021.101585
Spielberger, C.D. (1988). State-trait anger expression inventory: Revised research manual. Odessa, FL: Psychological Assessment Resources.
Stojanović, N.M., Ranđelović, P.J., Nikolić, G., Stojiljković, N., Ilić, S., Stoiljković, B., … Radulović, N.S. (2020). Reliability and validity of the Spielberger’s state-trait anxiety inventory (STAI) in serbian university student and psychiatric non-psychotic outpatient populations. Acta Facultatis Medicae Naissensis, 37(2), 149–159. https://doi.org/10.5937/afmnai37-25011
Sueki, H. (2020). Relationship between beck hopelessness scale and suicidal ideation: A short-term longitudinal study. Death Studies, 46(2), 1–6. https://doi.org/10.1080/07481187.2020.1740833
Taber, K.S. (2018). The use of cronbach’s alpha when developing and reporting research instruments in science education. Research in Science Education, 48, 1273–1296. https://doi.org/10.1007/s11165-016-9602-2
Tang, S., Xiang, M., Cheung, T., & Xiang, Y.T. (2021). Mental health and its correlates among children and adolescents during COVID-19 school closure: The importance of parent-child discussion. Journal of Affective Disorders, 279, 353–360. https://doi.org/10.1016/j.jad.2020.10.016
Tsai, M.C. (2009). Market openness, transition economies and subjective wellbeing. Journal of Happiness Studies, 10(5), 523–539. https://doi.org/10.1007/s10902-008-9107-4
Van Der Ark, L.A. (2012). ‘New developments in Mokken scale analysis in R’. Journal of Statistical Software, 48(5), 1–27. https://doi.org/10.18637/jss.v048.i05
Van Loon, A.J.M., Tijhuis, M., Surtees, P.G., & Ormel, J. (2001). Personality and coping: Their relationship with lifestyle risk factors for cancer. Personality and Individual Differences, 31(4), 541–553. https://doi.org/10.1016/S0191-8869(00)00158-6
Westaway, M.S., Maritz, C., & Golele, N.J. (2003). Empirical testing of the satisfaction with life scale: A South African pilot study. Psychological Reports, 92(2), 551–554. https://doi.org/10.2466/pr0.2003.92.2.551
Wind, S.A. (2017). An instructional module on Mokken scale analysis. Educational Measurement: Issues and Practice, 36(2), 50–66. https://doi.org/10.1111/emip.12153
Zeas-Sigüenza, A., Oliveira, S., Ferreira, C., Ganho, A., & Ruisoto, P. (2021). Psychometric properties and factor structure of the UCLA loneliness scale v3: The European portuguese version. PsyArXiv. Retrieved from https://psyarxiv.com/eyxvn/download/?format=pdf
Footnote
1. The Jovanović and Brdar (2018) study reported scaled mean scores.
|