Article Text

Download PDFPDF

Importance of collecting data on socioeconomic determinants from the early stage of the COVID-19 outbreak onwards
  1. Saman Khalatbari-Soltani1,2,
  2. Robert C Cumming3,4,
  3. Cyrille Delpierre5,6,
  4. Michelle Kelly-Irving5,6,7
  1. 1 Faculty of Medicine and Health, The University of Sydney School of Public Health, Sydney, Australia
  2. 2 ARC Centre of Excellence in Population Ageing Research (CEPAR), The University of Sydney, Sydney, Australia
  3. 3 The University of Sydney School of Public Health, Sydney, Australia
  4. 4 ARC Centre of Excellence in Population Ageing Research (CEPAR), The University of Sydney Faculty of Medicine and Health, Sydney, Australia
  5. 5 LEASP, Université Toulouse III Paul Sabatier, Toulouse, France
  6. 6 Umr 1027, Inserm, Toulouse, France
  7. 7 IFERISS, Université de Toulouse, Toulouse, France
  1. Correspondence to Dr Saman Khalatbari-Soltani, Faculty of Medicine and Health, The University of Sydney School of Public Health, Sydney, New South Wales 2006, Australia; saman.khalatbarisoltani{at}


Disadvantaged socioeconomic position (SEP) is widely associated with disease and mortality, and there is no reason to think this will not be the case for the newly emerged coronavirus disease 2019 (COVID-19) that has reached a pandemic level. Individuals with a more disadvantaged SEP are more likely to be affected by most of the known risk factors of COVID-19. SEP has been previously established as a potential determinant of infectious diseases in general. We hypothesise that SEP plays an important role in the COVID-19 pandemic either directly or indirectly via occupation, living conditions, health-related behaviours, presence of comorbidities and immune functioning. However, the influence of socioeconomic factors on COVID-19 transmission, severity and outcomes is not yet known and is subject to scrutiny and investigation. Here we briefly review the extent to which SEP has been considered as one of the potential risk factors of COVID-19. From 29 eligible studies that reported the characteristics of patients with COVID-19 and their potential risk factors, only one study reported the occupational position of patients with mild or severe disease. This brief overview of the literature highlights that important socioeconomic characteristics are being overlooked when data are collected. As COVID-19 spreads worldwide, it is crucial to collect and report data on socioeconomic determinants as well as race/ethnicity to identify high-risk populations. A systematic recording of socioeconomic characteristics of patients with COVID-19 will be beneficial to identify most vulnerable groups, to identify how SEP relates to COVID-19 and to develop equitable public health prevention measures, guidelines and interventions.

  • Ageing
  • Epidemiology
  • Social inequalities
  • Social epidemiology
  • Health inequalities
  • Life course epidemiology
  • Cancer
  • Inequalities
  • Stress
  • Longitudinal studies

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See:

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.


The socioeconomic gradient in health is ubiquitous,1 and has been described across pathologies, in life expectancy and mortality. It is characterised by a social patterning of health, where the most socially disadvantaged are more likely to be exposed, to get sick and to die compared to their advantaged counterparts. The most commonly used individual measures of socioeconomic position (SEP) are occupation, income and education,2 each one measuring different components and aspects of the social environment. All have well-established associations with numerous health outcomes and mortality, through various pathways such as behaviours, chemical and physical exposures or chronic stress exposures.3 Moreover, evidence has suggested a link between disadvantaged SEP and increased risk of infectious disease in general4 5 and immune response. People with disadvantaged SEP should be considered as high-risk populations at the time of any infectious disease outbreak6 since their social context could affect the occurrence and severity of an infectious disease via several pathways.4

The outbreak of coronavirus disease 2019 (COVID-19) caused by the novel coronavirus SARS-CoV-2 was first reported on 31 December 2019, in Wuhan, China, and has since spread to 184 countries and regions globally. As of 9 April 2020, there were more than one million and half confirmed cases (n=1 511 104) and 88 338 deaths. So far, advanced age, being a man and having chronic conditions, such as obesity, diabetes, respiratory disease, kidney disease and cardiovascular diseases (CVD), have been linked with more severe COVID-19 symptoms often leading to the development of acute respiratory distress syndrome (ARDS) and progression from ARDS to death.2 7 Other potential factors such as race/ethnicity and socioeconomic factors may also play an important role in the COVID-19 epidemic. While the economic impact of the COVID-19 outbreak and the resulting measures has raised as an important issue, we should not neglect individuals’ own SEP and its effect on COVID-19 incidence, transmission, severity and mortality.

Based on our previous knowledge of social and structural determinants of health, there is no reason to think that these factors are any less important in affecting who is exposed to COVID-19, who gets sick, who is likely to need intensive care and who will die from it. Without collecting data on these social factors and making them available, we are blind to how these factors affect the epidemic and how they may be used to inform public health prevention measures. Here we attempt to briefly assess the extent to which the current data available on COVID-19 take into account socioeconomic factors. We then make the case for why social factors are likely to be involved, both directly and indirectly, in the epidemiology of COVID-19, and therefore why they need to be collected.

We identified published studies through a rapid review of PubMed from inception to 03 April 2020, with the following search terms: (‘COVID-19’, ‘SARS-CoV-2’) and (‘characteristic’, ‘risk factors’, ‘socio’, ‘socioeconomic’, ‘occupation’, ‘education’, ‘income’, ‘wealth’). We also screened the reference lists of the eligible publications. We included studies that were written in English and reported descriptive characteristics of COVID-19 cases or reported risk factors of COVID-19 incidence, severity and mortality. The search identified 161 articles; after title, abstract and full-text screening, 16 publications were eligible for inclusion. We identified another 13 eligible publications through reference screening. Most of the studies were from China (n=26; mostly in Wuhan), two from Singapore, one from Europe and one from the Diamond Princess Cruise, Japan. All studies included cases of COVID-19, and the sample sizes range from 18 to 72 314. Of note, in the rush to publish COVID-19 papers, it is highly possible that the same sample of patients have been reported in different articles.8 While all included studies reported data on age, gender and comorbidities, only one study reported any indicator of SEP. In this study, Shi et al, using data on occupational position from 484 COVID-19 patients in Zhejiang Province of China, reported that severe cases were more likely to be agricultural workers and less likely to be self-employed than mild cases.9 To our knowledge, no study thus far has reported data on other SEP indicators such as educational level, income and housing conditions.

One of the reasons for this is that data on individual-level SEP are not being collected. This could be due to the suggested World Health Organization standard COVID-19 case report form1 which only asks for each patient’s age, sex/gender, place where the case was diagnosed and usual place of residency. This could also be due to the fact that social measures are not considered as data of clinical interest by most clinicians. The consequence is that these data are absent from medical records, limiting the possibility of studying the evolution of diseases with regards to these determinants. However, it is critical to also consider socioeconomic factors at each phase of the epidemic to effectively interrupt human-to-human transmission chains, prevent further spread through appropriate equitable interventions, as well as to identify and better treat individuals who have greater susceptibility to becoming severe or even critically ill upon infection. The reasons why individual SEP must become ‘clinical’ data in the same way as age are numerous:

  • A person’s occupation may expose them to risk by the nature of their job. Work involving constant human contact, interaction with others or caring for people means that risk of infection spread through droplets of aerosol is higher.10 With regards to COVID-19, occupation is likely to be a direct determinant of infection and an indirect determinant of disease severity and mortality through the relationship between occupational social class and comorbidities. For example, the direct impact of occupation on the COVID-19 incidence could be seen among workers such as retail staff, cleaners, teachers, healthcare workers or crew on board cruise ships.11 People in disadvantaged SEP categories are more likely to be exposed to job stress including high job strain, burnout and unemployment, which may lead to disrupted immune and inflammatory system responses12 13 as well as an increased risk for CVD.14 So far, both reduced immune function and the presence of CVD are known risk factors of COVID-19 severity.

  • Low income might affect living conditions in many ways, such as residence in more deprived neighbourhoods and housing conditions, especially cramped or overcrowded housing, which has been associated with an increased risk of infection from numerous pathogens, such as tuberculosis15 Helicobacter pylori 16 or Epstein–Barr virus.5

  • A lower education level is indirectly associated with a number of factors that may increase the risk of developing severe forms of COVID-19, such as increased prevalence of smoking and poor nutrition, which could suppress the immune system. A recent systematic review of five retrospective or prospective studies reported that smoking is most likely associated with adverse outcomes of COVID-19.17 Furthermore, known comorbidities for COVID-19, such as CVD, diabetes, respiratory disease and kidney disease are more prevalent among socially disadvantaged individuals than in populations with higher SEP.18 Health literacy, which is associated with education, may play an important role in COVID-19 incidence and severity. Effective public health communication to act appropriately during an infectious disease outbreak is contingent upon people being able to access and understand the information. Individuals with limited health literacy may be more easily misguided by incorrect sources of information.6 Furthermore, we also know that screening measures are often overlooked by disadvantaged populations with low health literacy levels, which may lead to delay in seeking care late when their sickness is worse.19 Thus, a whole-of-society approach must include appropriate language and accessible healthcare strategies.

To be able to understand the complex and interrelated influence of socioeconomic factors on COVID-19 transmission, incidence and its health outcomes, data sources with comprehensive socioeconomic measures are needed. Some might argue that we can link people’s addresses or postcodes to area-based SEP through geolocalisation, which may offer some insight into the likelihood of exposure to certain health risks, including pollution or public transport. Indeed, these variables are often used as proxies for individual SEP; however, they are not an accurate reflection of individual circumstances, could underestimate the extent of social inequalities compared to individual social measures20 and are best used in parallel with individual-level variables to reflect geographical or aggregate-level exposures.

Other dimensions that interact with SEP need also to be considered


Ethnic minorities including indigenous peoples are subject to systematic and structural discrimination, which leads to them being more socioeconomically disadvantaged and less likely to be able to seek healthcare when they need it.21 Recent reports from health authorities and journalists suggest that COVID-19 hospitalisation and mortality rates may be higher in geographical areas or neighbourhoods which are more socially disadvantaged or where ethnic minorities are more likely to live. The poorest department in mainland France, Seine St. Denis, with a high proportion of people from ethnic minorities living there, reported a 63% increase in mortality during the week of the 21st of March, a bigger increase than any other French department. Reports emerging from the United States also suggest that African Americans and possibly people from other ethnic minorities are over-represented among the COVID-19 deaths. These are anecdotal and unscientific reports for the moment; however, they deserve to be taken seriously and met with scientific methods and analyses to examine them thoroughly. Furthermore, within each country, attention should be paid to the specific cultural practices and needs of different populations; they may need support in sending appropriate prevention messages to their community about festivals, rituals and habits. Thus, as previously suggested by the Sendai Framework, governments should engage with migrants, people from ethnic minorities and indigenous peoples in the design and implementation of policies during a pandemic.22

Gender and intersectionality

Gender is another important aspect that deserves serious attention in the COVID-19 pandemic,23 especially as it intersects with occupation, education and race/ethnicity. Women are more likely to have jobs or roles where they are in contact with others, as teachers, nurses, carers or retail workers. When schools closed across different countries, women are most likely to have had to take on childcare and homeschooling responsibilities. On the other hand, men are more likely than women to present with severe forms of COVID-19 and have a higher mortality rate.7 24 The reasons for this remain to be understood and may vary between contexts; however, men tend to have a higher prevalence of the main COVID-19 risk factors. When it comes to the mitigation policies put in place to curtail the spread of disease, such as confinement or quarantine, these intersecting factors including gender, class and race/ethnicity will affect how people cope with the social, economic and psychological consequences of such measures.25 26

National and local contexts

The impact of socioeconomic determinants on the COVID-19 outbreak could vary in high-income-countries versus low- and middle-income countries (LMICs). As observed across the world, once COVID-19 cases appear initially from abroad, the community spread of disease will depend upon the specific local infrastructures and social inequalities in each context. Current public health measures of social distancing including lockdowns, quarantine and self-isolation cannot be implemented or may carry consequences in LMICs. In low-income settings where absolute poverty is a major problem, access to basic needs such as water, sanitation and food will have a big impact on how easily people can practise any physical distancing measures. Moreover, in LMICs where people are more likely to live in overcrowded households or neighbourhoods, it would be impossible to isolate older adults or vulnerable people. In many parts of the world, health is not free at the point of need, and healthcare systems, which are stretched at the best of times, will become quickly saturated. Furthermore, while children appear to be less severely affected by COVID-19, they may be more vulnerable to the social, economic and psychological consequences of the disease and ensuing public health containment measures, especially in low-income settings.27


In order to identify groups who are most likely to have poor outcomes, high-quality data on socioeconomic factors are urgently needed, which will have important implications in the development of public health measures. Social measures should be considered as clinical variables, in the same way as age or gender, and should therefore be routinely recorded in medical records. Pandemic recommendations and guidelines provided by international and national agencies need to recognise the collective contribution of the social determinants of health and their intersectionality to pandemic risk mitigation.

Key messages

  • Prior evidence on the social and socioeconomic determinants of disease suggests that Covid-19 is likely to have a socially patterned population distribution.

  • Social and socioeconomic data are not being collected and made available to understand and address Covid-19 health inequalities.

  • Pandemic recommendations and guidelines need to reflect the contribution of the social determinants of health and their intersectional nature.



  • 1

  • Contributors SKS and MKI drafted all versions of the manuscript, SKS carried out the literature review, and RC and CD drafted the manuscript and provided edits.

  • Funding SKS is supported by the Australian Research Council Centre of Excellence in Population Ageing Research (Project number CE170100005). The funder had no role in study design, data collection, decision to publish or preparation of the manuscript. This work was also part of a project funded by the Agence National de Recherche Flash-Covid grant and Région Occitanie.

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Data sharing statement Data are available upon reasonable request.

  • Provenance and peer review Commissioned; internally peer reviewed.