Introduction
Women's health is different from men's health. Women have poorer overall health: they have a higher number of chronic conditions, higher levels of cognitive impairment, and a higher prevalence of severe pain and physical disability (Case & Paxson, 2005; Chiasson & Hirsch, 2005; Crimmins et al., 2010; Ministerio de Sanidad, Consumo y Bienestar Social (Spanish Ministry of Health, Consumer Affairs, and Social Welfare), 2018a; Oksuzyan et al., 2019; Sánchez-López et al., 2012). However, if there is one area of health where gender-based differences in the prevalence of disorders is particularly significant, that is mental health, where the prevalence of mental health problems is twice as high in women as in men (Ministerio de Sanidad, Consumo y Bienestar Social (Spanish Ministry of Health, Consumer Affairs, and Social Welfare), 2018b; World Health Organisation (WHO), 2018a; Velasco et al., 2007). In fact, even though women have a longer life expectancy, engage in a greater number of preventive behaviours, and have fewer addictions, they paradoxically have poorer health, wellbeing, and quality of life than men. They also use more health services and psychotropic drugs and have a higher prevalence of numerous psychiatric disorders (Hartung & Lefler, 2019; Leal, 2006). This has been termed “the mortality/morbidity paradox”: women live longer than men but are in poorer health (Case & Paxson, 2005; Sánchez-López & Limiñana, 2017).
Epidemiological data show that there are a number of diagnoses for which there are no sex-based differences, e.g. the prevalence of schizophrenia is very similar in men and women (Jiménez & Vázquez, 2012; Sáenz-Herrero, 2015). The prevalence of other diagnoses, such as antisocial, narcissistic, obsessive-compulsive, paranoid, schizotypal, and schizoid personality disorders, is higher in men (Garnica de Cos, 2015). Men also commit suicide at a higher rate than women (Mackenzie et al., 2019) and tend to use more violent means to do so (Bahamón et al., 2019; Kohen, 2010).
However, there are many other disorders that are much more prevalent in women than in men. The most significant example is eating disorders, with the highest female-to-male ratio of all psychiatric disorders (Fletcher-Janzen, 2009; Keel & Forney, 2013): 90% of all people diagnosed are women (Kohen, 2010; Ruiz et al., 2016). A similar situation applies to affective disorders. Depression is two to three times more common in women than in men (Ferrari et al., 2013; Pérez & Serra, 1997; Pérez & Gaviña, 2015; Sáenz-Herrero, 2015; Salk et al., 2017); rates of bipolar disorder are similar in both sexes, but women's cycles are faster and experience a greater number of depressive episodes and mixed phases. Anxiety disorders (such as panic disorder, specific phobias, and generalised anxiety disorder) and dysthymia affect women more than men (Bekker & van Mens-Verhulst, 2007; Leal, 2006; Pérez & Gaviña, 2015; Sánchez-López & Cuéllar, 2013). With regard to personality disorders, epidemiological data show that women have a higher prevalence rate of borderline personality disorder than men (3:1) (Escribano, 2006; García et al., 2010; Kienast et al., 2014; Ortiz-Tallo et al., 2011; Skodol & Bernder, 2003; Tomko et al., 2014; Trull et al., 2010).
It also follows from the above that women have, in general, a higher prevalence of internalising disorders (anxiety, depression) while men tend towards externalising disorders (antisocial personality disorder, addictions) (Eaton et al., 2012).
The sex differential in morbidity may be explained by two hypotheses. The first hypothesis holds that constitutional, genetic, and/or endocrine factors are determinants of sex-based differences in morbidity. However, it seems that most studies in this area fall short in explaining this phenomenon (Fine, 2018; Montero et al., 2004; Pérez & Serra, 1997). As pointed out by Sánchez-López and Cuéllar (2013) echoing Ellis et al. (2008), mental disorders in childhood-e.g., conduct disorders such as attention deficit hyperactivity disorder (Arnett et al., 2015), communication and language disorders such as stuttering, autism (Baron-Cohen et al., 2011), and Asperger syndrome-are more commonly diagnosed in boys versus girls. The trend reverses in adolescence, where females start to have poorer health. Data suggest that these are not (only) genetic disorders but may also be influenced by environmental factors.
The second hypothesis is based on environmental theories, which argue that sociocultural variables acting through socially imposed roles and patterns of behaviour are what ultimately condition the way in which men and women manifest their psychological distress (Montero et al., 2004; Sánchez-López et al., 2013). This is in line with Kohen (2010), who, quoting Dohrenwend and Dohrenwend (1977), Gove (1984), and Nazroo (2001), point out that women's mental health being poorer than men's may be related to women being affected differently by social stress: they usually are poorer than men-which has been termed the feminisation of poverty (Belle & Doucet, 2003)-and experience more abuse (sexual abuse and sexist violence) than men during both childhood and adulthood (WHO, 2018a). Gender-based violence, the sexual division of labour, women's double or triple shifts (work, housework/caregiving, and affective networks), and beauty ideals (social pressure on their bodies) are other significant contributors to women's poorer health (Sánchez, 2018).
As Wood and Eagly (2013) pointed out, it is true that neither biology nor culture alone can explain sex-based differences or similarities. This is where the biopsychosocial model (Engel, 1980) becomes particularly relevant as an explanatory theory, as it is widely accepted and used for conceptualising and treating a wide range of physical and mental health problems (Meyer & Melchert, 2011) and is still very popular today (Bott et al., 2016; Yagahmaian & Miller-Smedema, 2019).
Sex and gender are two complex and distinct entities, although they may overlap and relate to each other to some extent (Fernández, 2004): gender interacts with biological sex, but it is a different concept (WHO, 2019). However, in the scientific literature, the two terms are often used interchangeably and confusingly (Sánchez-López & Limiñana, 2017). The term sex refers to biologically determined characteristics (chromosomes, genes, gonads, hormones, morphology, etc.), while the term gender refers to the socially defined roles, characteristics, and opportunities that are considered appropriate for men, women, boys, girls, and individuals with non-binary identities (WHO, 2019).
These roles and characteristics are learnt through the process of differential socialisation, which begins at birth and continues throughout life, whereby individuals, through interaction, learn and internalise the values, attitudes, expectations, and behaviours characteristic of the society to which they belong, enabling them to function in it (Giddens, 2014). The World Health Organisation (2019) states that if people do not conform to norms or roles (including masculinity and femininity), they are often subject to stigmatisation, social exclusion, and discrimination, which may eventually have a negative impact on their health.
Based on the above, we set out to analyse to what extent gender (sociocultural aspects) contributes to women seeking psychological care. Our objective was to study the classification ability of a number of clinical variables-taking as a reference the classification of the Diagnostic and Statistical Manual of Mental Disorders or DSM-5 by the American Psychiatric Association (2013)-and sociocultural factors (conformity to gender norms) in order to ascertain whether or not women were receiving psychological care and determine the importance of each of the variables when predicting which women were receiving therapy.
Method
Design
This study used a retrospective, cross-sectional, ex post facto group comparison design. In ex post facto designs, a sample of participants with already existing qualities associated with the study variables is selected. The potential independent variables are provided, and the dependent variable can be observed before, after, or at the same time as the independent variable (as it is the case of the present study). This is a retrospective group comparison because the study compares a group of women selected for having a certain characteristic (i.e. the clinical cases) with another group of women lacking that characteristic (the non-clinical cases). The groups are compared on a number of potential independent variables (conformity to feminine/masculine gender norms and clinical variables), which are considered relevant for the occurrence of that characteristic.
Participants
The total sample consisted of 368 women, but 10 of the assessments were discarded due to missing data. The final sample was made up of 358 women (M = 38.41, SD = 12.96), 168 (M = 35.75, SD = 11.49) of whom were recruited from private psychological care centres and were receiving psychological and/or psychiatric care, and 190 (M = 40.73, SD = 13.73) were controls, as they were not receiving psychological/psychiatric care.
The inclusion criteria were as follows: (a) being a cisgender woman, (b) being over 18 years old, (c) being proficient and fluent in Spanish, and (d) not being institutionalised. The exclusion criteria were (a) being a man, (b) being a transgender/transsexual woman, (c) being under 18 years of age, (d) having language comprehension problems for whatever reason (difficulties in understanding questionnaires, intellectual disability, etc.), and (e) being institutionalised.
Table 1 shows the relevant sociodemographic information.
The main reasons for consultation in the clinical group are related to anxiety problems (39.29%), followed by couple relationship problems (27.98%) and depression (27.38%). None of the women in the clinical group reported alcohol or drug problems as a reason for consultation.
Table 2 shows the reasons for consultation in the clinical group.
Instruments
The assessment instruments collected sociodemographic information and data on clinical variables and conformity to gender norms.
An ad hoc sociodemographic questionnaire was included to collect information on age, sex, level of education, employment status, and marital status. The therapists of the women who participated in the study completed a screening questionnaire indicating the type of centre they worked in (public practice, private practice, specialised centre), reasons for consultation-up to three reasons for consultation ranging from 1 (major) to 3 (minor)-, and type of diagnosis (clinical syndromes based on the DSM-5).
The validated Spanish version of the Personality Assessment Inventory (PAI) was used (Morey, 2007). The Spanish adaptation of the shortened scale (165 items) was validated by Ortiz-Tallo et al. (2015). Each item had four response options: False, not true at all (F), Slightly true (ST), Mainly true (MT), and Very true (VT). It comprehensively assesses adult psychopathology using 22 scales (Ortiz-Tallo et al., 2011): 4 validity scales (Inconsistency, Infrequency, Negative Impression, and Positive Impression), 11 clinical scales (Somatic Concerns, Anxiety, Anxiety-Related Disorders, Depression, Mania, Paranoia, Schizophrenia, Borderline Features, Antisocial Features, Alcohol Problems, and Drug Problems), 5 treatment consideration scales (Aggression, Suicidal ideation, Stress, Nonsupport, and Treatment rejection), and 2 interpersonal scales (Dominance and Warmth). Regarding the level of reliability of the short form, its mean internal consistency/alpha coefficient was 0.74 in normal samples and 0.81 in clinical samples. The mean test-retest reliability, understood as temporal consistency, was 0.82. The mean correlation value between the short and full forms was 0.90 in normal samples and 0.93 in clinical samples, suggesting that the short form yields scores reasonably close to the scores individuals would obtain using the full form of the questionnaire (Ortiz-Tallo et al., 2011).
Since the PAI does not measure eating disorders, the Eating Attitudes Test (EAT-26) was included. This 26-item version was developed by Garner et al. (1982) from the full form EAT-40 (Garner & Garfinkel, 1979). Each item uses a Likert scale with 6 response options: never (0), rarely (0), sometimes (0), often (1), usually (2), and always (3). Total scores range from 0 to 78. The EAT-26 consists of three scales or factors (Garandillas et al., 2003): the Diet Scale (attention to calories ingested and burnt doing physical exercise, desire to be thin, sense of guilt after eating), Bulimia Factor (scale of bulimia and concern about food), and Oral Control (items describing food intake mode and its control). As pointed out by Garandillas et al. (2003), the EAT-26 and the EAT-40 are highly correlated (r = .98), suggesting that the shortened version retains adequate reliability and validity properties for detecting eating disorders (Pike et al., 2008; Garfinkel & Newman, 2001). The level of reliability and internal consistency of the test as measured with Cronbach's alpha was excellent (> .90) (Rivas et al., 2010).
The Conformity to Feminine Norms Inventory (CFNI) was also included (Mahalik et al., 2005). It assesses women's conformity to a set of dominant feminine norms in American culture. Femininity is understood as the degree of conformity (emotional, cognitive, and/or behavioural) to a set of gender norms that are considered socially appropriate for women in terms of behaviours, attitudes, feelings, and thoughts transmitted by each culture with which members of each society can identify to a greater or lesser extent (Sánchez-López & Limiñana, 2017). The Spanish adaptation of the test was carried out by Sánchez-López et al. (2009). Reliability data (an alpha coefficient of .87 for the total scale) confirmed the validity of the CFNI for use in the Spanish population. The CFNI consists of 84 items rated on a 4-point scale (0 = Strongly disagree, 1 = Disagree, 2 = Agree, 3 = Strongly agree). It can be administered both individually and collectively to individuals over 18 years of age. The duration of the questionnaire ranges between 15 and 20 minutes. Total scores range from 0 to 252 points. Higher scores indicate higher satisfaction with the traditional role of women. The factors that make up the CFNI are the following: Nice in relationships (developing friendly and supportive relationships with others), Care for Children (taking care of and being with children), Thinness (pursuing a thin body ideal), Sexual Fidelity (keeping sexual intimacy contained within one committed relationship), Modesty (refraining from calling attention to one's talents or abilities), Romantic relationship (investing self in romantic relationship), Domestic (maintaining the home), and Invest in appearance (committing resources to maintaining and improving physical appearance) (Mahalik et al., 2005).
The Conformity to Masculine Norms Inventory (CMNI) was also included (Mahalik et al., 2003). It assesses conformity to a set of dominant masculine norms in US culture. Its Spanish adaptation was conducted by Cuéllar-Flores et al. (2011). Reliability data (an alpha coefficient of .90 for the total scale) confirmed the validity of the CMNI for use in the Spanish population. It consists of 94 items rated on a 4-point scale (0 = Strongly disagree, 1 = Disagree, 2 = Agree, 3 = Strongly agree). The factors making up the CMNI are related to attitudes, beliefs, and behaviours reflecting conformity or non-conformity to eleven normative messages associated with masculine gender roles: Winning, Emotional Control, Risk-Taking, Violence, Dominance, Playboy, Self-Reliance, Primacy of Work, Power Over Women, Disdain for Homosexuals, Physical Toughness, and Pursuit of Status (Cuéllar-Flores et al., 2011).
Reliability data for the scores of each scale obtained in the present study are provided in the Results section.
Procedure
Twenty-seven psychotherapists from different provinces of Spain (Madrid, Valencia, and Barcelona) and three psychotherapy centres participated in the recruitment of the sample. Participants in the control group (women who were not receiving psychological/psychiatric care at the time) were recruited using a snowball sampling procedure. Participation in the study was voluntary. Informed consent was included on the first page of the booklet where the different instruments used were compiled. The booklets were hand-delivered and did not need to be filled in on the spot. In most cases, the contact persons gave the questionnaires to the women, who returned them to the same contact person. In one of the collaborating centres, the questionnaires were filled in collectively.
Data analysis
After excluding the participants who failed to meet the aforementioned inclusion criteria, all gender variables and EAT scores were standardised using T-scores, the same scoring system used by the PAI. T-scores use a scale derived from Z-scores and are calculated by transforming the Z-score for each participant on each scale, multiplying it by 10, and adding 50 points. This results in variables whose mean would be equal to 50 and their standard deviation would be equal to 10. After standardisation of the variables, a descriptive study of each independent variable for each study group was carried out. Statistics of central tendency, variability, and shape of the distribution were included.
The aim of this study was to explore the classification ability of a number of clinical variables and variables related to conformity to gender norms when determining whether or not women were receiving psychological care. Therefore, in order to make a classification model, two approaches were taken: a binary logistic regression model and a random forest model. After constructing the binary logistic regression model, the collinearity analysis concluded that more than 20% of the variables had a high degree of collinearity. The results of the logistic regression analysis and the adjustment of assumptions, including the analysis of collinearity between predictor variables, is available upon request from the lead author.
In view of their collinearity, a random forest classification model was considered (Breiman, 2001). This type of model makes it possible to work with variables exhibiting collinearity problems, among others. The 10-fold cross-validation procedure was followed (Kuhn & Johnson, 2013). It consists of a resampling method based on cross-validation in which the total sample is divided into 10 parts with approximately the same number of participants. A model is built using all but the first sample. The remaining sample is used to estimate the performance measures of the model. Hyperparameters of a random forest model include the number of trees (in this case, this value was set to 500) and the number of predictor variables in each tree (in this case, the optimal model allowed this value to be set to 18). The Gini indicator (Mean Decrease Gini) was used to assess the significance of the variables. The error rate was studied using out-of-bag (OOB) procedures. Model accuracy and concordance between trees were also studied. As this is a classification problem, this algorithm provides a confusion matrix from which the indicators of prevalence, sensitivity, specificity, diagnostic accuracy, diagnostic odds ratio, Youden index, positive predictive value, and negative predictive value can be derived. All these indicators are reported with a 95% confidence interval.
Statistical analyses were performed using R software (R Core Team, 2019), and the caret (Kuhn, 2020), psych (Revelle, 2019), ggplot2 (Wickham, 2016) and epiR (Stevenson, 2020) packages.
Results
The descriptive statistics for each of the predictor variables in each group, as well as the internal consistency coefficients for each of the scales, are shown in Table 3. The internal consistency of most of the scales is adequate, with the exception of the Antisocial scale of the PAI.
Random forest was the classifier algorithm used, constructed with 35 predictors and hyperparameters described in the Data Analysis section. The accuracy of the model obtained was 89.40% and the agreement between the trees was κ=0.787. The confusion matrix is shown in Table 4. The OOB error was 12.01%.
The importance of the variables is shown in Table 5. The most important variables belong to the PAI, with Suicidal Ideation being the most relevant scale, followed by Borderline Features, Somatic Concerns, Anxiety, Stress, Depression, and Anxiety-Related Disorders. Regarding gender, the
Sexual Fidelity, Care for Children, and Nice in Relationships scales had the highest classification ability, although far from the classification ability exhibited by the clinical scales. In the case of variables related to Conformity to Masculine Norms, they have little importance within the model, with the Power over Women and Self-Reliance scales attaining the highest values.
Since the random forest algorithm has been used for classification, it was possible to obtain additional indicators of the quality of the model (see Table 6). The model has shown very good classification ability, as all diagnostic indicators have very high values.
Discussion
The aim of this study was to explore the classification ability of a number of clinical variables and sociocultural factors related to conformity to gender norms in order to ascertain whether or not women were receiving psychological care and determine the importance of each of the variables when predicting which women were receiving therapy.
In this sample, the main reasons for consultation in the clinical group were related to anxiety problems (39.29%) and depression (27.38%), which are among the most prevalent diagnoses in women (Bekker & van Mens-Verhulst, 2007; Ferrari et al., 2013; Leal, 2006; Pérez & Serra, 1997; Pérez & Gaviña, 2015; Sáenz-Herrero, 2015; Salk et al., 2017; Sánchez-López & Cuéllar, 2013), as well as couple relationship problems (27.98%). The second most prevalent reason for consultation in the sample was couple relationship problems, which is not a clinical diagnosis in itself, but rather a concern frequently voiced by women in the clinical sample (women who were receiving psychotherapy). This is consistent with Velasco (2009), who reports that couple's conflicts were the psychosocial process most frequently associated with common mental disorders, somatisation, and pain among men and young women up to 44 years of age (p. 181).
The data obtained indicate that women scoring higher on the clinical variables Suicidal Ideation (the variable with the highest weighting) and Borderline Features are more likely to be receiving psychological care. The present sample did not report significantly severe symptoms (women in the clinical sample were seen in private practices and not in hospitals), so a high score on this scale was indicative of severity, which fits with seeking therapy. Something similar could be said with respect to the Borderline Features variable, which is related to severe symptoms and an indicator of a higher likelihood of receiving psychological care. This is consistent with the fact that borderline personality disorder has a higher prevalence rate in women than in men (3:1) (Escribano, 2006; García et al., 2010; Kienast et al., 2014; Ortiz-Tallo et al., 2011; Skodol & Bernder, 2003; Tomko et al., 2014; Trull et al., 2010).
The variables with the highest classification ability in determining whether a woman is receiving psychological care are typically associated with conformity to feminine gender roles: Care for Children, Nice in Relationships, and Sexual Fidelity. This is consistent with a number of studies suggesting that having feminine but also masculine traits (androgyny) was associated with less severe eating disorders and with the fact that femininity and anxiety are positively correlated, while masculinity and anxiety are negatively correlated (Hartung & Lefler, 2019). In fact, one of the possible reasons why women have a higher prevalence of mental disorders than men is that social roles are different for each sex: moving away from gender ideals can be stressful for both women and men (if gender ideals are not met, it is more difficult to have a good external representation of self-worth). However, this has more negative consequences for women than for men (Matud & Aguilera, 2009), as social norms and standards are known to guide and constrain behaviours (Cialdini & Trost, 1999).
The clinical variables Stress, Warmth, Aggression, Anxiety, Anxiety-Related Disorders, and Somatic Concerns have the highest classification ability. These variables are related to the fact that women have a higher prevalence of internalising disorders (Eaton et al., 2012) and higher levels of social stress: they have a lower economic status and experience more abuse (sexual abuse and sexist violence) than men during both childhood and adulthood (WHO, 2018b). This is also consistent with some of the results found: women with higher femininity scores also had higher levels of anxiety and, overall, feminine traits hinder women's psychological wellbeing, as they are related to passivity, dependence, lack of assertiveness, and low self-esteem, which in turn are linked to anxious and depressive symptoms (Pérez & Serra, 1997).
In terms of conformity to masculine gender norms, the variable Pursuit of Status is ranked first, followed closely by the clinical variable Eating Disorders. Being female and scoring high on a “typically masculine” (gender) variable makes that woman almost as likely to seek psychological help as when scoring high on the clinical variable Eating Disorders. Eating disorders are the most commonly diagnosed disorders among the young population (Sáenz-Herrero et al., 2015). However, in this study the Eating Disorders variable does not seem to be the most important variable when classifying the clinical population, which may be due to the fact that the mean age of the participants was 40 years old. Therefore, it may be argued that gender determinants are at the root of attitudes and lifestyles related to vulnerability and the tendency to become ill. For instance, the ideal of femininity based on a thin, inert body (the contemporary gender model) increases the vulnerability of young women whose identity is developing, leading to eating disorders (Velasco, 2009, p. 148). As Sánchez-López and Dresch (2012) state, “conformity to some feminine norms could be related to greater reporting of chronic illnesses, which is a health cost” (p. 192) and so “identifying the association between people's constructions of femininity and masculinity and their health may contribute to changing problematic constructions of gender norms using cognitive techniques” (p. 193).
Conclusions
We can conclude that gender variables have been found to be relevant in determining whether women are receiving psychological care, i.e. greater conformity to gender roles leads women to seek more psychological care. When studying health, it is essential to include not only sex as a variable, but also gender as a sociocultural dimension mediating illness. Our results show the importance of further exploring sex and gender as two different dimensions that need to be considered in research; otherwise, there is a risk of introducing bias into the study of health.
The inclusion of a gender perspective in research will help to achieve a broader understanding of the aetiology of disorders, i.e. the underlying reasons why men and women become ill differently, a phenomenon that is not sufficiently explained by biology. Being able to analyse the weight of sociocultural variables in the development of a given mental illness becomes crucial in order to identify the relevant factors of an illness. Integrating the gender perspective into educational and health-related programmes, clinical treatments (including individual psychotherapy), and prevention programmes may thus increase their effectiveness and efficiency.
Moving away from gender ideals can be stressful for both women and men (Matud & Aguilera, 2009). However, not fitting into these ideals might not be so problematic if women were more aware of them and if they were to change. Taking a gender perspective approach and pointing to sociocultural factors as determinants, with the implications that this entails, can help to ‘depsychopathologise' disorders experienced by women and to design interventions promoting healthier behaviours.
It is our belief that our findings provide an avenue worth exploring. However, our study has a number of limitations that should be considered. This study is the first part of a larger study that is projected to include men in the sample. A comparison of the scales between the sample of men and women could thus be made to explore models such as this one and study the health of women, men, and non-binary people. It would also have been interesting to include in the sample women with severe mental disorders, in institutional care, or with a non-university level of education in order to make the results more generalisable.
It is essential that future lines of research further explore the different forms of illness that men and women (as well as transgender people) experience by taking a feminist, biopsychosocial perspective and by defining more clearly the sex/gender constructs. This will result in better psychological care and better mental and overall health for women, men, and non-binary people.