Skip to main content

Gender score development in the Berlin Aging Study II: a retrospective approach


In addition to biological sex, gender, defined as the sociocultural dimension of being a woman or a man, plays a central role in health. However, there are so far few approaches to quantify gender in a retrospective manner in existing study datasets. We therefore aimed to develop a methodology that can be retrospectively applied to assess gender in existing cohorts. We used baseline data from the Berlin Aging Study II (BASE-II), obtained in 2009–2014 from 1869 participants aged 60 years and older. We identified 13 gender-related variables and used them to construct a gender score by using primary component and logistic regression analyses. Of these, nine variables contributed to a gender score: chronic stress, marital status, risk-taking behaviour, personality attributes: agreeableness, neuroticism, extraversion, loneliness, conscientiousness, and level of education. Females and males differed significantly in the distribution of the gender score, but a significant overlap was also found. Thus, we were able to develop a gender score in a retrospective manner from already collected data that characterized participants in addition to biological sex. This approach will allow researchers to introduce the notion of gender retrospectively into a large number of studies.


Women and men differ in disease manifestations and outcomes though the underlying mechanisms are still poorly understood [1, 2]. Yet, it is well known that sex and gender play a central role in health [3,4,5,6,7]. Sex is the biological condition of being male or female and includes differences in genetic and hormonal levels. Its role in major diseases is well documented. In contrast, gender incorporates the psychological, behavioural, social, and cultural aspects, i.e. the sociocultural dimension of being a woman or a man in a given society [8,9,10]. Gender is composed of four dimensions, reflecting different aspects, e.g. gender roles, gender relations, institutionalized gender, and gender identity [11, 12]. Gender roles represent the behavioural norms applied to men and women in society, which influence individuals’ everyday actions, expectations, and experiences. Gender identity describes how we see ourselves as women or men (or as a third gender) and affects our feelings and behaviours. Gender relations refer to how we interact with or are treated by people in the world around us, based on our ascribed gender. Finally, institutionalized gender reflects the distribution of power between women and men in the political, educational, and social institutions in society.

The definition of gender has changed over time, but most researchers in the present times follow the definitions above that also underlie the definitions at the homepage of Canadian Institutes of Health Research and the National Institutes of Health, US and Horizon 2020, that promote the largest research programs in gender medicine [9]. Since biological sex and gender overlap, but are not identical, gender is likely to influence health differently from biological sex [3, 4].

Gender encompasses awareness of disease, risk taking and help-seeking behaviour, interaction of patients with doctors, the health care system [13, 14], and access to care. Gender, for example, is associated with a later arrival of women with myocardial infarction to the emergency departments and delayed access to treatments, as well as different treatment results by female and male doctors in their female and male patients [14].

To separate the effects of both sex and gender, variables that represent both in multivariate models are needed. Biological sex is usually assessed by a single binary variable on a one-dimensional scale (female, male) assuming that the sex chromosomes are unique identifiers of female and male individuals, even though this concept has been revised—resulting in the German legal system allowing for coding the sex of new-born babies to be undetermined, which might be a better reflection of biology. However, so far, the binary coding of sex has proved itself as a useful construct in clinical studies to describe differences between female and male individuals.

Unfortunately, there is no generally accepted method to assess the sociocultural dimension that is gender. Thus, researchers need to identify a number of gender-related variables and include them in multiple regression models. Depending on the number of events predicted, it may be impossible to include all aspects of gender in model. Compared to this approach, the use of a composite score measuring gender could offer advantages. Such a score can be used for adjustment in multiple regression models where a distinction from sex is the primary goal, for matching, and subgroup stratification in order to better control confounding variables [15, 16]. As compared with the use of a set of gender-related variables, a single score provides greater statistical power by reducing the number of covariates included in multiple regression models, offer the possibility to test interaction terms, and reduce multiple comparisons [15, 16].

In the present paper, we therefore searched to develop a method to operationalize gender in the context of clinical studies by developing a gender score, following previously published concepts [3, 17, 18].

In 2015, the Canadian members of our group constructed a gender score that was obtained by a questionnaire in a prospective manner and showed that this gender score was associated with health outcomes, in particular survival after acute coronary syndrome. This gender score was based on a variety of carefully selected psychosocial and sociocultural variables [3, 4]. In a prospective study in patients with acute coronary syndrome, the gender score was more strongly associated with several important cardiovascular risk factors, such as diabetes, hypertension, and family history for cardiovascular diseases and 1-year recurrent events than biological sex [3, 4]. It was concluded that a gender score can be used to evaluate the effect of gender beyond the effects of sex on disease risk, presentation, processes of care, and relevant outcomes. Including such a gender dimension into clinical studies may thus add new explanatory value to understand differences between women and men in disease manifestations and outcomes.

However, the majority of existing studies and databases did not include a gender questionnaire and did not measure gender in a prospective manner. Consequently, it is necessary to develop a methodology to assess gender in a retrospective approach in existing datasets. Since most study databases include a large number of psychosocial and/or socioeconomic variables that are potentially gender related, it is possible to construct a score to measure gender. Two recent studies have done this [17, 18]. In one study, a large population-based database was used, the Canadian Labour Force survey, to extract the most suitable variables related to the gender dimensions described above and to derive a gender score from these variables [18]. Most variables available in this dataset were related to work and living conditions, with a focus on gender roles and institutionalized gender [17, 18].

However, the fact that two of the four dimensions of gender, gender relations and gender identity, were only poorly covered in these scores represents a limitation. Furthermore, a longitudinal study to test the impact of gender on health outcomes was not possible due to their cross-sectional design, representing another limitation. The fact that both scores are only reflecting the Canadian workforce and society is another limitation to their use in Europe.

To overcome these limitations in the previous work and to develop a gender score for a European population, one that was not solely focused on work conditions and that offered the opportunity to assess the impact of gender on clinical outcomes, we decided to use, in a paradigmatic approach, the Berlin Aging Study II (BASE-II) to construct a gender score in a retrospective manner. We selected potentially gender-related variables from the initial investigation 6–10 years ago that were closely representing the gender-related variables used in the GENESIS-PRAXY study and in the studies by Smith and Lacasse and that covered the above mentioned four different dimensions of gender (Table 1) [17, 18]. Gender identity was operationally approached using chronic stress and perceived stress, as done also by Lacasse [17], by the Big Five personality traits [19], and risk-taking behaviour [20]. Gender roles were operationally defined using employment status. Gender relations were represented by loneliness and institutionalized gender was represented by education and family status. Due to the fact that data assessment was done from 2009 to 2014, i.e. 6–10 years ago, and the study participants have been medically reinvestigated from 2018 to 2020, we are able to test the association between gender measured 6–10 years ago with more recent clinical status and outcomes in a longitudinal approach.

Table 1 Selected gender-related variables from the BASE-II cohort

We hypothesized that it is possible to construct a gender score with this approach in a retrospective manner that distinguishes women and men based only on sociocultural factors and that integrates a number of gender-related variables. We hypothesized that the gender score overlaps with sex but differs from biological sex and gives additional information. We assumed that a number of males would have feminine gender characteristics and vice versa and that a significant overlap would exist, representing individuals with characteristics of both genders. We also assumed that the distribution of this gender score between females and males would be similar but not identical to the distribution of a gender score developed by our partners in Canada in the GENESIS-PRAXY study, and we assumed that sex and gender would be differently associated with clinical parameters. Associations with 6–10 years follow-up data are planned for the near future.


Data source

We used baseline data from the Berlin Aging Study II (BASE-II). Clinical, psychosocial, and socioeconomic variables were obtained in 2009–2014 from 1869 participants aged 60 years and older [21]. Selection of gender-related variables was done in 2018 (see below). Sex was determined by both self-reports and records in the official registry offices. For all but one of the participants used in the current data set, self-reports and official registry information converged. For one person, no official registry information could be obtained. Self-reports and registry information were further corroborated based on genetic data available for 98.2% of our sample (data not shown).

Identification of gender-related variables

For identification of gender-related variables, we, as previous investigators have done the following: (a) referred to the systematics and gender dimensions proposed by Johnson and (b) included variables that have been proposed by other researchers in the field [3, 12, 16, 17]. First, we selected potentially gender-related variables from the study database related to the four dimensions of gender (Table 1 and supplement S1) that reflect different aspects of the gender construct in which women and men traditionally differ and that have been used by other investigators in the field [11, 12]. To the extent possible, the four gender dimensions were measured using validated self-report questionnaires [22,23,24,25]. Chronic stress was assessed using the Trier Chronic Stress Inventory; perceived stress with the Perceived Stress Scale [22]; the personality dimensions openness to experience, conscientiousness, extraversion, agreeableness, and neuroticism with a short version of the Big-Five Inventory [19]; loneliness using the UCLA Loneliness Scale [23]; and single-item questions for risk-taking behaviour [20], employment status, family status, and education. When possible, we calculated scores according to questionnaire standard. For the Big-Five Inventory, Trier Chronic Stress Inventory, and UCLA-Loneliness, mean scores across items were used, as used in previous publications [22, 24, 26]. The item “The good things in my life are determined by other people” is a standard item assessing external control beliefs in powerful others [27,28,29].

All participants gave written informed consent. The Ethics Committee of the Charité – Universitätsmedizin Berlin approved the study (approval number EA2/029/09) [30].

Overall strategy and statistical approach

Our approach was divided into three steps (Fig. 1): (1) gender score development, (2) gender score calculation in individual cases and distribution in females and males, and (3) correlation with clinical variables.

Fig. 1
figure 1

Stratification of data analysis. Stratification of data analysis was divided into three major steps. In step 1, the algorithm for the gender score was developed; in step 2, gender score was calculated for individual cases; and in step 3, the implication of the gender score was estimated

Gender score development

In step 1, correlated variables were eliminated using a bivariate analysis, and for each correlated pair of variables with a correlation coefficient equal or greater than 0.80, one of the two variables was randomly removed (Table S1). Next, to reduce the dimensionality of the dataset, increase interpretability, and minimize information loss which is done by creating new uncorrelated variables and maximize variance, principal component analysis (PCA) was conducted with a varimax rotation method to further confirm it, where component represents the linear combinations of the entered variables and each component represents a group of related variables [31]. An item was said to load on a given component if the factor loading was 0.40 or greater for that component and was less for other components. We identified 5 major components (Table 2) and retained all 13 variables as the factor loading for the given two components of the same variable did not exceed 0.40. Variables having low communalities (< 0.40) do not contribute much to measuring the underlying factors [32].

Table 2 Gender-related variables and corresponding factor loading in PCA

To determine how the remaining sociocultural variables differentiated between females and males, logistic regression models for the association with sex were calculated one by one and all non-significant variables were removed in descending order of their significance. Coefficients of this logistic regression analyses were used to construct a propensity score [33] which was called gender score, in the same approach as used by previous investigators [3, 17, 18]. For step 1, the development of the algorithm for the gender score, the selected 13 variables were used in the dataset of 1869 cases.

Gender score calculation in individual cases and distribution in female and male

In step 2, the coefficient estimates of the variables from logistic regression analyses were used for gender score calculation in individual cases, i.e. a score from 0 to 100 was obtained for all individual cases. For this step, the presence of all variables was required in each study subject. This reduced the number of cases available for calculation of individual gender score to 1089 (Fig. 1). Next, we analysed the distribution of this gender score with regards to biological sex (Fig. 2).

Fig. 2
figure 2

Gender score distribution in females and males. Distribution of calculated gender score in females (n = 543) and males (n = 502). Zero is identical with completely masculine characteristics whereas 100 represents completely feminine characteristics

Association of gender score with clinical variables

In the third step, the association of gender score with independent clinical and psychosocial variables that were generally attributed to women and men in our societies was tested using linear regression analyses. This step required the presence of gender score and the clinical/psychosocial investigation in all cases. This reduced the number of cases to 1045.

Statistical tests

SPSS version 25 was used to perform statistical analysis. Logistic models were used to model the probability of associated variables. To look for an index of sensitivity and specificity, the curve statistic was performed for gender score (0–100) as the test variable and sex (0 = males, 1 = females). Significance level was determined for p < 0.05 for the group differences and logistic regression models.


Development of the gender score

Mean age, sex, and basic clinical coordinates of our cohort are presented in Table 1. To calculate the gender score in our cohort, we considered a total of 13 variables (Table 1). In the first step of gender score development, correlation and principal component analysis correlation matrix for the variables was low, and therefore, all 13 variables were retained for the next steps (Table 2, PCA, factor loadings).

In the second step, the 13 identified sociocultural variables were used to perform a logistic regression analysis with sex as a dependent variable. Non-significant variables were removed one by one in a descending order of their p value (0.05). In total, five models were explored. The final model (Table 3) identified nine statistically significant sociocultural variables that were assumed to characterize women and men in our model. They were chronic stress, marital status, risk taking behaviour, Big-Five personality traits scale (agreeableness, neuroticism, extraversion), loneliness (UCLA-Loneliness mean), Big-Five personality traits scale (Conscientiousness), and level of education.

Table 3 Gender-related variables for the calculation of the gender score

Calculation of gender score in individual cases and distribution between females and males

The coefficients of the final logistic regression analysis were used to calculate the gender score with a range from 0, extremely masculine, to 100, extremely feminine (Fig. 2). The gender score distribution in females and males differed: 55% of males were in the first tertile with more masculine gender scores and 51% of females were in the third tertile with the more feminine scores. At the same time, the distributions showed considerable overlap with a 36% and 31% of females and males in the middle tertiles and 13% of females and 14% of males having score values of the opposite expected gender (Fig. 2, Table S2).

The area under the curve statistic for 1089 participants who had a gender score available was .795, p < .001, 95% CI [.769, .821] (Fig S1) which indicates fair-to-good sensitivity/specificity to separate women from men.

Association of gender score with clinical and psychosocial variables

In the third step, we performed linear regressions to further investigate associations between sex, the gender score, and biological and well-being variables that were not used to build the score. A linear regression model was performed to determine the impact of sex and gender in combination (Table 4). These models revealed that sex was significantly associated with LDL-cholesterol and total cholesterol, with the gender score not showing any additional association in the models. In contrast, the gender score was significantly associated with cortisol levels, CES-depression, negative affect, and life satisfaction, with sex not associated with these parameters. Interestingly, hand grip strength was associated with both sex and gender score. The associations with hand grip strength, cortisol, and life satisfaction were negative, suggesting that a more feminine score is associated with lower levels of these parameters, whereas all other parameters were positively associated.

Table 4 Prediction of clinical and psychological variables by sex and gender score in combined linear regression models


Our study constructed a gender score in a retrospective manner from available study variables that characterized women and men based on only sociocultural variables, covering the four dimensions of gender. Biological sex and gender score characterized participants differently and had different predictive power for a number of clinical and psychological variables. The results will allow us to explain a greater part of variability among women and men during aging, and they provide researchers with clinical databases with a template to include gender in a retrospective manner in their analysis.

We used the BASE-II cohort that provided data on older, predominantly healthy adults in and around Berlin, Germany [21], to construct a gender score in analogy to the score that was developed in a mainly Canadian cohort of younger patients with acute coronary syndrome and by others in the Canadian health database [17, 18]. The novelty of our approach was the fact that we developed the gender score in a completely retrospective manner in a non-working cohort that undergoes systematic follow-up investigation. BASE-II was not primarily designed to analyse the impact of gender on study data [12]. This retrospective design was limited by the fact that not all the desired gender-related variables were available and not all dimensions of gender could be covered with similar strength. However, it has the advantage that the variables are measured prior to the outcome, meaning that the health status at the time of follow-up does not influence the components of the gender score.

As expected, more than half of females and males clustered in the respective feminine and masculine tertiles. However, the distribution showed a considerable overlap with almost a third of females and males in the middle tertiles and 13% of females and 14% of males having score values of the opposite gender. Thus, the gender score is able to separate women from men with a good sensitivity but clearly differs from biological sex, contributing additional explanatory power and is a major advantage of this score. The distribution is similar but somewhat different from the distribution found by Pilote in their younger cohort with myocardial infarction. They found a more asymmetrical distribution with a stronger clustering of males in the masculine area and a broader distribution of females over the whole scale, indicating that males kept their masculine characteristics whereas females had acquired more masculine characteristics. In contrast, more of our males were found in the middle tertile. This may be an effect of age or retirement—it is likely that the work-related criteria in our older cohort were less important.

When constructing the gender score in our study, all four dimensions of gender were covered. This was a much broader coverage of gender dimensions than in the only directly comparable retrospective studies [17, 18]. They are based on larger cohorts from Canada, therefore not directly comparable with Europe, include a younger working population and are heavily based on work-related variables, therefore not suitable for our retired/aged population, and are both purely cross-sectional, whereas the gender score in our longitudinal cohort study will allow us to develop risk prediction models. The variables family status and education, reflecting gender relations and institutionalized gender, had a high impact in gender score calculation, underscoring the significance of this area.

Thus, in the aged cohort of BASE-II, with mainly retired persons, other variables had a strong contribution to gender than in younger working cohorts. Psychological parameters, such as perceived stress, loneliness, agreeableness, and neuroticism, contributed significantly to the psychosocial differences between women and men. We conclude that the construction of a retrospective gender score should be based on the data of the study itself. In addition, it may be difficult to transfer a retrospective gender score from one study to another. This means that a gender score should not be taken from the literature and used as a fixed algorithm for a study population, but rather be constructed based on as many suitable study parameters as possible and then applied only to the probands of that study. Comparison of the main variables in gender scores from different studies will allow the assessment of the dependency of gender-related parameters and gender constructs from external variables such as age, ethnicity, cultural background, and working conditions and hopefully identify the most robust ones.

Use of gender score in prevention

The association of the gender score with clinical and psychosocial variables and risk factors was already tested by Pilote and Lacasse. Lacasse claimed that associations between gender index scores and presumed gender-related variables identified a priori and not included in the gender index supported the validity of the construct [17]. We followed these approaches and tested the association of the gender score with clinical and psychosocial variables. In combined linear regression models, gender score, but not sex, was significantly associated with cortisol levels, CES-depression, negative affect, and life satisfaction. Following the arguments of Lacasse, this would support the validity of our construct. Nevertheless, this must be confirmed by external validation in future studies.

Pelletier et al. found in their prospective approach that a higher gender score, but not female sex, was associated with an increased risk of hypertension, diabetes, family history of cardiovascular diseases, and increased depressive and anxiety symptoms [3]. They concluded that traditional sex differences in risk factors may partly be explained by patients’ personality traits, social roles, and life context. If this is true, tackling these risk factors and traits in gender sensitive prevention projects could reduce the risk for cardiovascular diseases. Previous investigations have shown that prevention programs need to be designed in gender sensitive manner to reach their respective audiences. Therefore, when focusing on target groups in prevention programs, not only the biological characteristics should be used but also the elements of the gender score as identified in our project [34].

Limitations of the study

In this paper, we aimed to generate a gender score that allows us to assess whether gender, as an assembly of sociocultural parameters taken together, affects disease risk differently in women and men. Undoubtedly, there are also limitations associated with this approach. Using individual variables related to gender, such as living alone or anxiety, would enable a study to identify specific factors of importance which, independent of sex, affects health outcomes. A compositional variable such a gender score cannot in itself identify such key factors. However, as compared with the use of a set of gender-related variables, a single score provides greater statistical power by reducing the number of covariates included in multiple regression models, offer the possibility to test interaction terms, and reduce multiple comparisons [15, 16, 35]. Therefore, the gender score makes it possible to capture better the interaction of gender with other variables.

When constructing a gender score, we needed to consider that gender, as a social construct, shifts over time and between generations, differs between places and cultures, and depends on socioeconomic conditions. Ideally, the selection of variables for a gender score would be contextual, both in terms of time and place. Therefore, our approach to develop a gender score specifically for the cohort in which it should be used has its strength. At the same time, this leads to significant limitations: the limited availability of variables may explain differences in the variables that constitute the gender score between our study and previously published prospective data [3]. We must accept the available variables and cannot, in a prospective manner, select the best ones. This is, however, an inherent component of the retrospective approach which is justified by the aim to develop a method that enables researchers to measure gender in a retrospective manner. Such an approach is valuable since many studies exist in which an estimate of gender, even if retrospective, could uncover new aspects and reveal important insights.

Conclusions and future aspects

With our strategy, we were able to develop a gender score in a retrospective manner in an elderly, non-working European cohort. This means that the notion of gender can be retrospectively introduced into a large number of studies and contribute to explain differences between women and men in health outcomes. In calculating gender scores in a number of different cohorts from different cultural background, ages, and ethnicities, we will learn which variables are most robust predictors of gender throughout cultures. Furthermore, if in future studies, when longitudinal data will become available, gender turns out to be correlated with risk factors or outcomes, this will provide new aspects for focused prevention to the health care systems.

Availability of data and materials

Due to concerns for participant privacy, data are available only upon request. External scientists may apply to the Steering Committee of BASE-II for data access. Please refer to the BASE-II website ( for additional information.



Berlin Aging Study


Berlin Aging Study II


Bundesministerium für Bildung und Forschung


Center for Epidemiologic Studies Depression Scale (CES-D)


Canadian Institutes of Health Research


Gender and sex determinants of cardiovascular disease: from bench to beyond-premature acute coronary syndrome


Haemoglobin A1C


Low-density lipoproteins


  1. Humphries KH, Izadnegahdar M, Sedlak T, Saw J, Johnston N, Schenck-Gustafsson K, Shah RU, Regitz-Zagrosek V, Grewal J, Vaccarino V, Wei J, Bairey Merz CN. Sex differences in cardiovascular disease - impact on care and outcomes. Front Neuroendocrinol. 2017;46:46–70.

    Article  CAS  Google Scholar 

  2. Mauvais-Jarvis F, Bairey Merz N, Barnes PJ, Brinton RD, Carrero JJ, DeMeo DL, De Vries GJ, Epperson CN, Govindan R, Klein SL, Lonardo A, Maki PM, McCullough LD, Regitz-Zagrosek V, Regensteiner JG, Rubin JB, Sandberg K, Suzuki A. Sex and gender: modifiers of health, disease, and medicine. Lancet. 2020;396:565–82.

    Article  Google Scholar 

  3. Pelletier R, Ditto B, Pilote L. A composite measure of gender and its association with risk factors in patients with premature acute coronary syndrome. Psychosom Med. 2015;77:517–26.

    Article  Google Scholar 

  4. Pelletier R, Khan NA, Cox J, Daskalopoulou SS, Eisenberg MJ, Bacon SL, Lavoie KL, Daskupta K, Rabi D, Humphries KH, Norris CM, Thanassoulis G, Behlouli H, Pilote L, Investigators G-P. Sex versus gender-related characteristics: which predicts outcome after acute coronary syndrome in the young? J Am Coll Cardiol. 2016;67:127–35.

    Article  Google Scholar 

  5. Regitz-Zagrosek V. Sex and gender differences in health. Science & Society Series on Sex and Science. EMBO reports. 2012;13:596–603.

    Article  CAS  Google Scholar 

  6. Bairey Merz CN, Regitz-Zagrosek V. The case for sex- and gender-specific medicine. JAMA internal medicine. 2014;174:1348–9.

    Article  Google Scholar 

  7. Doyal L. Sex, gender, and health: the need for a new approach. BMJ. 2001;323:1061–3.

    Article  CAS  Google Scholar 

  8. Research CIoH. How to integrate sex and gender into research; 2018. p. 2020.

    Google Scholar 

  9. Research CIoH. Online training modules:integrating sex & gender in health research - sex and gender in theanalysis of data from human participants. 2017.

    Google Scholar 

  10. Association AP. APA Dictionary of Psychology, Second Edition; 2015.

    Google Scholar 

  11. Phillips SP. Defining and measuring gender: a social determinant of health whose time has come. Int J Equity Health. 2005;4:11.

    Article  Google Scholar 

  12. Johnson JL, Greaves L, Repta R. Better science with sex and gender: facilitating the use of a sex and gender-based analysis in health research. Int J Equity Health. 2009;8:14.

    Article  Google Scholar 

  13. Brecht A, Oertelt-Prigione S, Seeland U, Rucke M, Hattasch R, Wagelohner T, Regitz-Zagrosek V, Baumann G, Knebel F, Stangl V. Left atrial function in preclinical diastolic dysfunction: two-dimensional speckle-tracking echocardiography-derived results from the BEFRI Trial. J Am Soc Echocardiogr. 2016;29:750–8.

    Article  Google Scholar 

  14. Greenwood BN, Carnahan S, Huang L. Patient-physician gender concordance and increased mortality among female heart attack patients. Proc Natl Acad Sci U S A. 2018;115:8569–74.

    Article  CAS  Google Scholar 

  15. Song MK, Lin FC, Ward SE, Fine JP. Composite variables: when and how. Nurs Res. 2013;62:45–9.

    Article  Google Scholar 

  16. Glynn RJ, Schneeweiss S, Stürmer T. Indications for propensity scores and review of their use in pharmacoepidemiology. Basic & Clinical Pharmacology & Toxicology. 2006;98:253–9.

    Article  CAS  Google Scholar 

  17. Lacasse A, Page MG, Choiniere M, Dorais M, Vissandjee B, Nguefack HLN, Katz J, Samb OM, Vanasse A, Group TCW. Conducting gender-based analysis of existing databases when self-reported gender data are unavailable: the GENDER Index in a working population. Can J Public Health. 2020;111:155–68.

    Article  Google Scholar 

  18. Smith PM, Koehoorn M. Measuring gender when you don’t have a gender measure: constructing a gender index using survey data. Int J Equity Health. 2016;15:82.

    Article  Google Scholar 

  19. John OP, Srivastava S. The Big-Five trait taxonomy: history, measurement, and theoretical perspectives. Handbook of personality - Theory and research (second edition). Vol. 2. New York/London: The Guilford Press; 2009. p. 102–38.

  20. Josef AK, Richter D, Samanez-Larkin GR, Wagner GG, Hertwig R, Mata R. Stability and change in risk-taking propensity across the adult life span. Journal of personality and social psychology. 2016;111:430–50.

    Article  Google Scholar 

  21. Gerstorf D, Hülür G, Drewelies J, Eibich P, Duezel S, Demuth I, Ghisletta P, Steinhagen-Thiessen E, Wagner GG, Lindenberger U. Secular changes in late-life cognition and well-being: towards a long bright future with a short brisk ending? 2015. p. 301–10.

    Google Scholar 

  22. Schulz P, Schlotz W, Becker P. Trierer Inventar zum Chronischen Stress (TICS) [Trier Inventory for Chronic Stress (TICS)]; 2004.

    Google Scholar 

  23. Russell D, Cutrona CE, Rose J, Yurko K. Social and emotional loneliness: an examination of Weiss’s typology of loneliness. Journal of personality and social psychology. 1984;46:1313–21.

    Article  CAS  Google Scholar 

  24. Gerstorf D, Bertram L Fau - Lindenberger U, Lindenberger U Fau - Pawelec G, Pawelec G Fau - Demuth I, Demuth I Fau - Stseinhagen-Thiessen E, Steinhagen-Thiessen E Fau - Wagner GG and Wagner GG. Editorial.

  25. Kühn S, Düzel S, Drewelies J, Gerstorf D, Lindenberger U, Gallinat J. Psychological and neural correlates of embitterment in old age. Psychological Trauma: Theory, Research, Practice, and Policy. 2018;10:51–7.

    Article  Google Scholar 

  26. Kühn S, Düzel S, Drewelies J, Gerstorf D, Lindenberger U, Gallinat J. Psychological and neural correlates of embitterment in old age. Psychol Trauma. 2018;10(1):51–57. PMID: 29323526.

  27. Kunzmann U, Little T, Smith J. Perceiving control: a double-edged sword in old age. J Gerontol: Series B. 2002;57:P484–91.

    Article  Google Scholar 

  28. Levenson H. 2 - Differentiating among internality, powerful others, and chance. In: Lefcourt HM, editor. Research with the Locus of Control Construct: Academic Press; 1981. p. 15–63.

    Chapter  Google Scholar 

  29. Gerstorf D, Drewelies J, Duezel S, Smith J, Wahl HW, Schilling OK, Kunzmann U, Siebert JS, Katzorreck M, Eibich P, Demuth I, Steinhagen-Thiessen E, Wagner GG, Lindenberger U, Heckhausen J, Ram N. Cohort differences in adult-life trajectories of internal and external control beliefs: A tale of more and better maintained internal control and fewer external constraints. Psychol Aging. 2019;34(8):1090–1108. PMID: 31804114.

  30. Konig M, Drewelies J, Norman K, Spira D, Buchmann N, Hulur G, Eibich P, Wagner GG, Lindenberger U, Steinhagen-Thiessen E, Gerstorf D, Demuth I. Historical trends in modifiable indicators of cardiovascular health and self-rated health among older adults: cohort differences over 20 years between the Berlin Aging Study (BASE) and the Berlin Aging Study II (BASE-II). PLoS One. 2018;13:e0191699.

    Article  Google Scholar 

  31. Jolliffe IT, Cadima J. Principal component analysis: a review and recent developments. Philos Trans A Math Phys Eng Sci. 2016;374:20150202.

    PubMed  PubMed Central  Google Scholar 

  32. Berg RG. SPSS Factor Analysis – Beginners Tutorial; 2020. p. 2020.

    Google Scholar 

  33. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70:41–55.

    Article  Google Scholar 

  34. Robertson C, Archibald D, Avenell A, Douglas F, Hoddinott P, van Teijlingen E, Boyers D, Stewart F, Boachie C, Fioratou E, Wilkins D, Street T, Carroll P, Fowler C. Systematic reviews of and integrated report on the quantitative, qualitative and economic evidence base for the management of obesity in men. Health Technol Assess. 2014;18:v-vi, xxiii-xxix:1–424.

    Article  Google Scholar 

  35. Tadiri CP, Raparelli V, Abrahamowicz M, Kautzy-Willer A, Kublickiene K, Herrero MT, Norris CM, Pilote L; GOINGFWD Consortium. Methods for prospectively incorporating gender into health sciences research. J Clin Epidemiol. 2021;129:191–197. Epub 2020 Sep 24. PMID: 32980428.

Download references


Not applicable


The current work was part of the study GendAge ( supported by grants from the German Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung, BMBF) to Ilja Demuth and Vera Regitz-Zagrosek (grant number #01GL1716A) and Denis Gerstorf (#01GL1716B). Dr Pilote holds a James McGill chair at McGill University and was the recipient of the Berlin Institute of Health’s Excellence Award for Sex and Gender Aspects in Health Research that supported in part this work. This article is also based on data from the Berlin Aging Study II (BASE-II, Co-PIs are Lars Bertram, Ilja Demuth, Denis Gerstorf, Ulman Lindenberger, Graham Pawelec, Elisabeth Steinhagen-Thiessen, and Gert G. Wagner). BASE-II was supported by the German Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung, BMBF) under grant numbers #01UW0808; #16SV5536K, #16SV5537, #16SV5538, and #16SV5837. Another significant source of funding was the Max Planck Institute for Human Development, Berlin, Germany. Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations



Ilja Demuth, Vera Regitz-Zagrosek, and Denis Gerstorf planned and supervised the work. Tauseef Nauman, Ilja Demuth, Denis Gerstorf, Konstantinos Mantantzis, and Hassan Behlouli did the statistical analyses. Vera Regitz-Zagrosek and Tauseef Nauman wrote the manuscript. Ilja Demuth, Denis Gerstorf, Louise Pilote, and Friederike Kendel gave advice for manuscript writing. Nora Berger and Gert G. Wagner provided data from BASE-II. Johanna Drewelies helped organize collaboration. Nicholas Alexander and all other authors reviewed the manuscript. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ahmad Tauseef Nauman or Vera Regitz-Zagrosek.

Ethics declarations

Ethics approval and consent to participate

All participants gave written informed consent. The Ethics Committee of the Charité –Universitätsmedizin Berlin approved the BASE-II study (approval number EA2/029/09).

Consent for publication

All authors consented the publication to the Journal (Biology of sex differences).

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1

. ROC analysis with 1089 participants for gender score and sex variable. Table S1. Correlation between gender variables in PCA. Table S2. Gender scores in the different tertiles of our cohort.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nauman, A.T., Behlouli, H., Alexander, N. et al. Gender score development in the Berlin Aging Study II: a retrospective approach. Biol Sex Differ 12, 15 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Gender
  • Sex
  • Aging
  • Berlin Aging Study II
  • Gender score