Relation between number of siblings and adult mortality and stroke risk: 25 year follow up of men in the Collaborative study
C L Hart (1), G Davey Smith (2)

1. Department of Public Health, University of Glasgow, Glasgow, UK
2. Department of Social Medicine, University of Bristol, Bristol, UK

Correspondence to: Dr C L Hart, Department of Public Health, University of Glasgow, 1 Lilybank Gardens, Glasgow G12 8RZ, UK


Study objective: To investigate the relation between number of siblings, mortality risk, and stroke risk.

Design: Prospective cohort study.

Setting: 27 workplaces in Scotland.

Participants: 5765 employed men aged 35–64 from a variety of different workplaces, screened between 1970 and 1973.

Main results: There were strong relationships between number of siblings and socioeconomic variables and also with adult behavioural measures. Men with greater numbers of siblings had an increased risk of dying of all causes, coronary heart disease, lung cancer, stomach cancer, and respiratory disease over a 25 year follow up period. Adjustment for risk factors could explain these associations, excepting stomach cancer mortality. With the definition of stroke as either a hospital admission for stroke or death from stroke, there was a strong relation between number of siblings and haemorrhagic stroke, but not ischaemic stroke.

Conclusions: Number of siblings is strongly related to mortality risk, but as it is also related to many risk factors, adjustment for these can generally explain the relation with mortality. The exceptions are stomach cancer mortality and haemorrhagic stroke, which are known to be related to deprivation in childhood, and, in the case of stomach cancer, to childhood infection.

  • cohort studies
  • mortality
  • siblings

It was reported a century ago that people from small families live longer than people from large families.1 However, number of siblings has not generally been analysed in large epidemiological studies, although it can be considered an indicator of material resources in the childhood home. People who have more siblings will, on average, have grown up in more overcrowded accommodation, with greater exposures to early infections, and with access to a less adequate diet.2 These factors could contribute to health in adulthood either through influencing childhood health (which in turn influences adult health), through an influence on the establishment of behavioural patterns in childhood, or through the latent effects of factors influenced by family size, such as chronic Helicobacter pylori infection or poorer infant and childhood nutrition, on adult disease risk.3 We have investigated the effect of number of siblings on adult health and mortality using a large cohort study that recorded number of siblings, in addition to several other socioeconomic variables and health measures, and that has 25 years of mortality follow up.


This analysis was based on part of a cohort of employed people from 27 workplaces in Glasgow, Clydebank, and Grangemouth, who were screened between 1970 and 1973. The full sample consisted of 6022 men and 1006 women. Participants completed a questionnaire and attended a physical examination. Women have been excluded from this study because of their small numbers and because they were not representative of the socioeconomic spectrum, as most were from only two workplaces. Full details have been described elsewhere.4

The physical examination included measurement of blood pressure, height, weight, plasma total cholesterol, forced vital capacity (FVC), forced expiratory volume in one second (FEV1), and a six lead electrocardiogram (ECG). The questionnaire collected information about smoking, alcohol consumption, angina from the Rose questionnaire,5 bronchitis, age leaving full time education, number of siblings, regular car driving, home address, main occupation of the participant’s father, and the participant’s own occupation.

Blood pressure was measured with the subject seated, and diastolic pressure was recorded at the disappearance of the fifth Korotkoff sound. Adjusted FVC was defined as the actual FVC as a percentage of the expected FVC. This was derived from a linear regression of age and height from a healthy subset (n=841) of the study population who had never smoked and did not report suffering from phlegm, breathlessness, wheezy or whistling chest or weather affecting breathing. The derived regression equation was


where height was in centimetres and age was the age at screening in years.
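The percent-of-expected definition above can be illustrated with a minimal sketch. The data and coefficients below are synthetic and purely hypothetical (they are not the study's regression equation), and the function names are assumptions introduced for illustration only.

```python
import numpy as np

def fit_expected(age, height_cm, value):
    """Least-squares fit of value ~ intercept + b_age*age + b_height*height."""
    X = np.column_stack([np.ones(len(age)), age, height_cm])
    coef, *_ = np.linalg.lstsq(X, value, rcond=None)
    return coef  # (intercept, b_age, b_height)

def percent_of_expected(actual, age, height_cm, coef):
    """Actual measurement as a percentage of the value the fit predicts."""
    expected = coef[0] + coef[1] * np.asarray(age) + coef[2] * np.asarray(height_cm)
    return 100.0 * np.asarray(actual) / expected

# Synthetic stand-in for the healthy never-smoking subset (hypothetical values).
rng = np.random.default_rng(0)
age = rng.uniform(35, 64, 200)
height = rng.uniform(155, 190, 200)
fvc = -3.0 - 0.03 * age + 0.05 * height  # noise-free, for illustration only

coef = fit_expected(age, height, fvc)
# A 50 year old man, 170 cm tall, with a measured FVC of 3.2 litres.
adjusted = percent_of_expected([3.2], [50], [170], coef)
```

Fitting on the healthy subset and then applying the equation to every participant means that a value of 100 corresponds to the lung function expected for a healthy man of the same age and height.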

The adjusted FEV1 was similarly defined as the actual FEV1 as a percentage of the expected FEV1.4 A six lead ECG was made with the subject seated. The ECG was coded according to the Minnesota system with any of codes 1.1–1.3, 4.1–4.4, 5.1–5.3, and 7.1 being considered as evidence of ischaemia, encompassing diagnoses of definite myocardial infarction, myocardial ischaemia, and left bundle branch block.6, 7 Angina was defined as definite grades I and II from the Rose Angina Questionnaire.8 Bronchitis was defined as having persistent and infective phlegm and being breathless.4, 9 Body mass index in kg/m2 was calculated from the weight and height. Obesity was defined as having a body mass index of 30kg/m2 or above. A blood sample was taken for the measurement of whole plasma cholesterol. Units of alcohol consumed per week were calculated from responses to the questionnaire about usual weekly consumption of beer, spirits and wine.10
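As a small worked example of the body mass index and obesity definitions above, the following sketch assumes weight in kilograms and height in centimetres; the function names are hypothetical.

```python
def body_mass_index(weight_kg, height_cm):
    """Body mass index in kg/m2: weight divided by height (in metres) squared."""
    height_m = height_cm / 100.0
    return weight_kg / (height_m ** 2)

def is_obese(weight_kg, height_cm):
    """Obesity as defined in the text: body mass index of 30 kg/m2 or above."""
    return body_mass_index(weight_kg, height_cm) >= 30.0
```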

The home address at the time of screening was retrospectively postcoded, enabling deprivation category as defined by Carstairs and Morris to be ascertained.11 This measure is an area based measure of deprivation, obtained from four census variables, male unemployment, overcrowding, car ownership, and the proportion of heads of households in social classes IV and V. A deprivation score for each postcode sector is obtained, which is converted to seven categories ranging from 1 (least deprived) to 7 (most deprived).

The questionnaire asked for the main occupation of the participant’s father and the occupation at the time of screening. Social class was coded according to the contemporary Registrar General’s Classification12 for each occupation. Upward social mobility was defined as moving upwards from father’s to own adulthood social class. Downward social mobility was defined similarly.
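The mobility definition can be sketched as a comparison of ordered social classes. The six-level coding below (I, II, IIIN, IIIM, IV, V) is an assumption about how the Registrar General's classes might be ranked for this purpose; the text does not state the exact coding used.

```python
# Hypothetical ordinal ranks: 1 is the highest social class.
RANK = {"I": 1, "II": 2, "IIIN": 3, "IIIM": 4, "IV": 5, "V": 6}

def social_mobility(father_class, own_class):
    """Classify movement from father's social class to own adult social class."""
    father, own = RANK[father_class], RANK[own_class]
    if own < father:
        return "upward"
    if own > father:
        return "downward"
    return "stable"
```

For example, a man in class II whose father was in class IV would be counted as upwardly mobile under this coding.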

Number of siblings was derived from answers to the two questions “How many brothers were there in your family? (Excluding yourself)” and “How many sisters were there in your family? (Excluding yourself)”. The answers to these questions were summed to give the number of siblings for each participant.

The analysis was based on 5765 men aged between 35 and 64 years at screening, who had not embarked from Britain during the follow up period.

Study participants were flagged at the National Health Service Central Register in Edinburgh. Dates of death up to the end of 1998 and their cause were provided. Causes of death were defined as coronary heart disease (CHD) (ICD9 codes 410–414 and 429.2), stroke (ICD9 430–438), lung cancer (ICD9 162), other smoking related cancer (ICD9 140, 141, 143–149, 150, 157, 160, 161, 163, 188 and 189), stomach cancer (ICD9 151), other cancer (remainder of ICD9 codes 140–208), respiratory disease (ICD9 460–519), and accidents and violent deaths (ICD9 800–998 and E800-E999).
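The cause of death groupings listed above can be written as a lookup on the ICD9 code. This is a minimal sketch that assumes codes arrive as dotted strings such as "429.2" or "E812"; the function name and the "other" fallback label are hypothetical.

```python
# Three-digit ICD9 categories for smoking related cancers other than lung.
OTHER_SMOKING_CANCER = {140, 141, 143, 144, 145, 146, 147, 148, 149,
                        150, 157, 160, 161, 163, 188, 189}

def cause_group(icd9):
    """Map an ICD9 code string to the cause of death groups used in the text."""
    code = icd9.strip().upper()
    if code.startswith("E"):  # external cause codes E800-E999
        return "accidents and violence" if 800 <= int(code[1:4]) <= 999 else "other"
    three = int(code.split(".")[0])  # three-digit category
    if 410 <= three <= 414 or code == "429.2":
        return "CHD"
    if 430 <= three <= 438:
        return "stroke"
    if three == 162:
        return "lung cancer"
    if three in OTHER_SMOKING_CANCER:
        return "other smoking related cancer"
    if three == 151:
        return "stomach cancer"
    if 140 <= three <= 208:  # remainder of the cancer range
        return "other cancer"
    if 460 <= three <= 519:
        return "respiratory disease"
    if 800 <= three <= 998:
        return "accidents and violence"
    return "other"
```

The specific cancer groups are tested before the general 140–208 range so that "other cancer" really is the remainder.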

In addition, a computerised linkage with acute hospital discharges in Scotland provided records of all main diagnoses of stroke between 1972 and 1998.13 These data were obtained for a project on stroke mortality and morbidity. Stroke was defined as ICD8 or ICD9 codes 430 – 438, and as ICD10 codes I60 – I69 and G45. Haemorrhagic stroke was defined as ICD8 codes 430 and 431, ICD9 codes 430–432 and ICD10 codes I60-I62. Ischaemic stroke was defined as ICD8 codes 432–435 and 437, ICD9 codes 433–435 and ICD10 codes I63, I65, I66 and G45. We have previously shown that risk factor associations with stroke admissions are very similar to risk factor associations with stroke deaths.14
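Because the subtype definitions differ between ICD revisions (code 432, for instance, counts as ischaemic under ICD8 but haemorrhagic under ICD9), the classification is easiest to follow as a function of both the code and the revision. The sketch below assumes string codes and an integer revision; the function name and the "unspecified" label for strokes falling in neither subtype are hypothetical.

```python
def stroke_subtype(code, revision):
    """Return 'haemorrhagic', 'ischaemic', 'unspecified', or None if not a stroke."""
    c = code.strip().upper()
    if revision == 10:
        category = c[:3]
        if category in {"I60", "I61", "I62"}:
            return "haemorrhagic"
        if category in {"I63", "I65", "I66", "G45"}:
            return "ischaemic"
        if category.startswith("I6"):  # remaining codes in I60-I69
            return "unspecified"
        return None
    three = int(c.split(".")[0])
    if not 430 <= three <= 438:
        return None
    if revision == 8:
        haemorrhagic, ischaemic = {430, 431}, {432, 433, 434, 435, 437}
    else:  # revision 9
        haemorrhagic, ischaemic = {430, 431, 432}, {433, 434, 435}
    if three in haemorrhagic:
        return "haemorrhagic"
    if three in ischaemic:
        return "ischaemic"
    return "unspecified"
```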

Five groupings were constructed from number of siblings: no siblings, 1–2, 3–4, 5–6, and 7 or more. Analyses used either these groupings or the actual number of siblings. Cox’s models15 were used to calculate proportional hazards regression coefficients for sibling groups. The exponentiated proportional hazards regression coefficients are referred to as relative rates. Adjustments were made for risk factors by including the variables in the models. Adjustment for smoking was entered as number of cigarettes smoked per day for current and ex-smokers, with an extra term for ex-smokers (1 if ex-smokers, 0 otherwise). Social class, father’s social class, deprivation category, height, adjusted FEV1, systolic blood pressure, cholesterol, and units of alcohol were entered as continuous variables. Car user and bronchitis were added as discrete variables and education was entered as an ordinal variable representing four groups of age leaving full time education (12–14, 15–16, 17–18, and 19 or over). Cox’s models were used to calculate tests for trend and the relative rate per sibling, using actual number of siblings as a continuous variable. Survival time was taken from the date of screening until the date of death or 25 years from the date of screening. For analyses of stroke mortality or events, survival time was taken from the date of screening until either the date of hospital admission for stroke, or the date of death from stroke if no hospital admission for stroke was found. Means of continuous variables were standardised for age using PROC GLM in the SAS system, with tests for trend and regression coefficients being obtained by regression analysis. Proportions of categorical variables were age standardised by the direct method, using the study population as the standard, with tests for trend and odds ratios obtained by logistic regression.
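Several of the derived quantities described above can be sketched in a few lines. This is an illustrative sketch only, not the study's SAS code: the function names, group labels, and the use of a normal approximation (1.96 standard errors) for the confidence interval are assumptions.

```python
import math
from datetime import date

def sibling_group(brothers, sisters):
    """Sum brothers and sisters and assign one of the five sibling groups."""
    n = brothers + sisters
    if n == 0:
        return "0"
    if n <= 2:
        return "1-2"
    if n <= 4:
        return "3-4"
    if n <= 6:
        return "5-6"
    return "7+"

def smoking_terms(status, cigarettes_per_day):
    """Return (cigarettes per day, ex-smoker indicator) for model adjustment."""
    if status == "never":
        return (0, 0)
    return (cigarettes_per_day, 1 if status == "ex" else 0)

def follow_up(screened, died=None, years=25):
    """Return (days at risk, death observed), censoring `years` after screening."""
    try:
        end = screened.replace(year=screened.year + years)
    except ValueError:  # screened on 29 February, target year is not a leap year
        end = screened.replace(year=screened.year + years, day=28)
    if died is not None and died <= end:
        return ((died - screened).days, True)
    return ((end - screened).days, False)

def relative_rate(beta, se):
    """Exponentiate a Cox coefficient and its approximate 95% confidence limits."""
    return (math.exp(beta), math.exp(beta - 1.96 * se), math.exp(beta + 1.96 * se))
```

A coefficient of zero corresponds to a relative rate of 1, and because the per-sibling model is log-linear, a per-sibling relative rate r implies a rate ratio of r to the power k between men who differ by k siblings.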


Figure 1 shows the number of siblings reported. It was most common to have one or two siblings, although the largest number reported was 17. Table 1 presents characteristics measured at baseline in terms of sibling groups. Men with more siblings were older at baseline. There were strong relations between number of siblings and the socioeconomic variables. Men with more siblings were less likely to be in social classes I and II, their fathers were also less likely to have been in social classes I and II, they were less likely to be regular car drivers, more likely to live in deprived areas, and to have left full time education at age 14 or below. Despite having more opportunity for upward social mobility, men with more siblings were less likely to be upwardly mobile. There were no meaningful associations of number of siblings with angina, ECG ischaemia or body mass index, weak positive associations with blood pressure, strong positive associations with bronchitis, and strong inverse associations with cholesterol and both lung function measures. There was an apparent U shaped relation between number of siblings and obesity, although there was no statistical evidence of a quadratic trend (p=0.51). There was a graded relation with height, with only children being the tallest on average. The percentage of men who had never smoked was inversely related to number of siblings. The percentage of current cigarette smokers and the amount of alcohol consumed were positively associated with number of siblings, although fewer of the men with one or two siblings smoked than only children. The age when smoking began for both current and former smokers was inversely related to number of siblings.

Table 1

Baseline characteristics of 5765 men aged 35–64 from the Collaborative study according to number of siblings. Values are age adjusted means or percentages

Figure 1

Number of siblings reported by men in the Collaborative study.

Table 2 presents, for the baseline characteristics, the regression coefficients (continuous variables) and odds ratios (discrete variables) per sibling, adjusted for age and also for adult (adult social class, deprivation category and regular car driver) and childhood (father’s social class and age leaving full time education) socioeconomic measures. Compared with age adjusted results, additional adjustment for adult and childhood socioeconomic variables generally attenuated the relation between siblings and the risk factors at baseline. Lung function measures saw particularly large decreases and additional adjustment for smoking decreased the regression coefficients further to −0.23 (95% confidence interval −0.44 to −0.01) for FEV1 and −0.14 (95% confidence interval −0.30 to 0.02) for FVC. Additional adjustment for smoking did not change the odds ratios for bronchitis.

Table 2

Regression coefficients or odds ratios and 95% confidence intervals of baseline characteristics per sibling adjusted for age and socioeconomic variables

Mortality in 25 years of follow up was related to number of siblings (table 3). The group with 1–2 siblings was taken as the baseline. Men with seven or more siblings had a 30% higher risk of dying than men with 1–2 siblings. Similar results were seen for deaths from CHD. Only children had a lower risk of stroke mortality than men with any number of siblings. Men with seven or more siblings had the highest relative rate of lung cancer mortality, and only children had the lowest. Similar results were seen for mortality from other smoking related cancers. The strongest positive association between number of siblings and any examined cause of death was with stomach cancer, with a greater than threefold risk difference between men with seven or more siblings and men with 0–2 siblings. There was no clear relation with other non-smoking related cancers as a combined end point, nor with accidental and violent deaths. Respiratory disease deaths were strongly associated with number of siblings, with men in the highest group having a 50% higher risk than men with 1–2 siblings.

Table 3

Age adjusted relative rates of mortality and 95% confidence intervals by number of siblings in 5765 men aged 35–64 from the Collaborative study in 25 years of follow up

Table 4 presents the relative rates of mortality adjusted for other risk factors. Adjustment for smoking could explain some of the relation between number of siblings and all cause, CHD, stroke, lung cancer, other smoking related cancer, stomach cancer, and respiratory disease mortality. Adjustment for socioeconomic variables (social class, father’s social class, car user, deprivation category, and education) attenuated the relationships between number of siblings and all the above causes of death. Adjustment for smoking, the socioeconomic variables, bronchitis, height, adjusted FEV1, systolic blood pressure, cholesterol and units of alcohol consumed further attenuated the relative rates. Number of siblings only remained substantially associated with stomach cancer mortality. Only children retained a lower risk of stroke and lung cancer mortality than all men with siblings, but there was no consistent gradient according to number of siblings. Rerunning the models in table 4 using only variables which were significant at the p<0.1 level did not affect the results.

Table 4

Adjusted relative rates of mortality and 95% confidence intervals by number of siblings in 5765 men aged 35–64 from the Collaborative study in 25 years of follow up

With the definition of stroke as either a hospital admission for stroke or a death from stroke, 416 men had a stroke in the follow up period (table 5). Of these, 47 were classified as haemorrhagic strokes and 100 as ischaemic. Because of the small numbers, the first two categories of siblings were combined to form a category of none to two siblings and this was used as the baseline. There was a strong positive trend with haemorrhagic stroke, but not with ischaemic stroke. Men with seven or more siblings were over twice as likely to have a haemorrhagic stroke as men with two or fewer siblings. Adjustment for other risk factors made little difference to the relative rates for haemorrhagic stroke, although the significance levels were lessened by adjustment. Adjusting only for risk factors significant at the p<0.1 level did not affect the results. In a similar analysis of stroke subtype by father’s social class, the age adjusted relative rate of haemorrhagic stroke was 2.84 (95% confidence interval 1.12 to 7.20) for men with fathers in manual social classes compared with men with fathers in non-manual social classes. Adjustment for other risk factors and adult socioeconomic measures increased the relative rate to 3.22 (1.15 to 9.03). Similar results for ischaemic stroke were 1.25 (0.77 to 2.03) when adjusted for age and 0.92 (0.53 to 1.61) when fully adjusted.

Table 5

Relative rates of haemorrhagic and ischaemic stroke and 95% confidence intervals by number of siblings in 5765 men aged 35–64 from the Collaborative study in 25 years of follow up. Stroke defined as either stroke hospital admission or death

Key points

  • Number of siblings is related to adverse behavioural, socioeconomic, and health measures.

  • There are strong relationships between number of siblings and risk of different causes of death, but with the exception of stomach cancer mortality, these relationships can be explained by the adverse risk factors.

  • Number of siblings is related to haemorrhagic, but not ischaemic, stroke.


We have shown that number of siblings was strongly associated with other socioeconomic variables, measures of health in adulthood, and mortality in 25 years of follow up. Adjustment for the other variables could account for many of the observed associations, although not with stomach cancer or haemorrhagic stroke. There are few mortality and adult health studies in the epidemiological literature using number of siblings, although some studies of birth order also include number of siblings.16, 17 A study of Swedish men and women found that having a large family (defined as having four or more brothers and/or sisters) was related to poorer health in adulthood.18 Other studies have found that patients with diabetes, peptic and duodenal ulcers, arthritis, contagious diseases, and increased sensitivity to physical pain are more likely to come from large families.19 Our study found a strong relationship between siblings and bronchitis in adulthood and also lung function which was attenuated after further adjustment for socioeconomic variables. Our findings provided weak evidence of higher blood pressure in men with over five siblings. Other studies have produced different results for blood pressure. An inverse relation with siblings was found in a study of British children aged between 5 and 7.5 years20 and in male Glasgow University students.21 Higher blood pressure was found in adults without siblings compared with those with siblings in the Buffalo Blood Pressure Study.22 Obesity in young men has been found to be greater in those with fewer siblings23 although we found a suggestion of a U shaped relation.
Many studies have found number of siblings to be inversely related to height, in agreement with the current study.16, 24, 25 Achieved height in adulthood is related to early life social circumstances, nutrition and prenatal growth.26 For men born around the time of the present cohort, number of siblings was not related to birth weight, but was related to poorer post-natal growth.27

Policy implications

Number of siblings is related to risk of childhood infection, which is known to be related to stomach cancer risk in adulthood through H pylori infection. The paper contributes to the growing body of evidence linking processes generating stomach cancer and haemorrhagic stroke risk within populations.

Of particular note were the strong relations between adult behavioural factors and siblings: as number of siblings increased, the amount of alcohol consumed per week increased and men were more likely to be smokers and to have started smoking at a younger age. A risk factor acting in the opposite way was cholesterol: men with more siblings had lower (that is, healthier) cholesterol levels. In studies of this era, before health messages on reduction of fat consumption were common, higher cholesterol was seen in the more affluent groups.7

Studies relating number of siblings to mortality are sparse. In a study of Swedish men and women, having a large family was associated with an increased although not statistically robust mortality risk.18 Conversely, in a study of intellectually gifted children in California begun in the 1920s, number of siblings was inversely associated with all cause mortality in women, and non-cardiovascular, non-cancer mortality in both men and women.17 However, this was a highly selected cohort and may not be generally representative. We found positive associations between number of siblings and all cause, CHD, stroke, lung cancer, other smoking related cancers, stomach cancer, and respiratory disease mortality. Adjusting for several risk factors could explain the associations with all of these causes of death, excepting stomach cancer. Adjusting for several socioeconomic variables could be over-adjustment, as siblings could “explain” or at least contribute to childhood socioeconomic effects.

CHD has been the most investigated cause of death in previous studies, which have found inconsistent results, and have generally not been able to adequately adjust for risk factors in adulthood.28–32 Our study found no robust association. For respiratory mortality the direction of association was reversed on adjustment for other risk factors. These included some factors that could be influenced by family size in childhood, however, including lung function and bronchitis.3 Although adjustment for socioeconomic position and smoking attenuated associations of sibling number with these measures, some relation remained. Thus chronic obstructive pulmonary disease and susceptibility to respiratory infections in adulthood could be influenced by family size through these pathways. Conversely, the number of siblings is protective against asthma, at least in children,33 and it is possible, therefore, that number of siblings may have opposite effects on different components of respiratory disease in adulthood.

Stomach cancer mortality was strongly associated with number of siblings. This could reflect Helicobacter pylori infection, which is known to be a risk factor for cancer of the body of the stomach,34 is generally acquired in childhood,35 and, in another Scottish survey, shows a strong graded association with number of siblings, similar in magnitude to the association we show with stomach cancer.36 Limited previous evidence links sibling number to stomach cancer risk,37 although a recent study of comparatively young Swedish adults failed to confirm this,38 perhaps because of the changing nature of stomach cancer in wealthy countries, where a substantial decline in overall stomach cancer mortality has been accompanied by a switch of the predominant component from cancer of the body of the stomach—which is related to Helicobacter pylori infection—to cancer of the gastric cardia, which is unrelated to Helicobacter pylori infection.34

There was only weak evidence of a graded association between number of siblings and stroke mortality, although men who were only children had about half the risk of men with any number of siblings. The findings for stroke (hospital admission or death) by subtype were of particular interest and confirmed previous findings of different relationships with haemorrhagic and ischaemic stroke. The strong relationships seen between siblings and haemorrhagic stroke, and between father’s social class and haemorrhagic stroke are consistent with findings in another Scottish cohort study, the Renfrew/Paisley general population study, in which there was a strong inverse relation between height and haemorrhagic stroke, each 10 cm increment in height resulting in a 30% decrease in risk, but little association with ischaemic stroke.39 In a Finnish study, haemorrhagic stroke was inversely associated with birth weight adjusted for head circumference.40 The authors suggested that reduced fetal growth could increase the risk of haemorrhagic stroke by permanently changing the cerebral arterial structure. A recent study showed birth weight to be inversely associated with haemorrhagic stroke in Swedish men and women, which was strengthened when adjusting for birth length and head circumference.41 These authors suggested that the risk of haemorrhagic stroke was related to impaired growth of soft tissue mass relative to bone growth. It is possible that number of siblings is related to haemorrhagic stroke because it is associated with poorer fetal growth; however, the association between number of siblings and birth weight has not been well described.
Indeed birth weight tends to increase with parity from the first birth on, although at high birth orders there is some evidence of a decline in birth weight.42 Short birth intervals, which will occur in families that end up with a high number of offspring, are associated with lower birth weight, however.42 Thus the exact prediction regarding how number of siblings would relate to birth weight is unclear. Furthermore, as CHD is also related to poor fetal growth, it would be expected that a similar association with CHD should be observed if fetal growth mediated the sibling-stroke association.

We have previously reported on the influence of other socioeconomic variables on stroke risk (hospital admission or death) in the Collaborative study cohort.13 The most striking results were for father’s social class (that is, socioeconomic circumstances in childhood), with men growing up in households with fathers in manual occupations having a 70% higher risk of stroke than men with fathers in non-manual occupations. Men who were upwardly socially mobile (had a father with a manual occupation and their own occupation in adulthood was non-manual) had the same risk of stroke as stable manual men (father manual and own occupation manual). This suggests the importance of early life conditions for stroke risk. The findings for poor early life conditions, as indexed by number of siblings in this study, support this suggestion.

The association of number of siblings with two specific causes, stomach cancer and haemorrhagic stroke, after adjustment for childhood and adulthood socioeconomic circumstances and other risk factors suggests a link between these conditions. This link is supported by time trends in these conditions, which both decreased markedly in Britain during the 20th century,43 and by an ecological analysis that found that the infant mortality rate in the 1920s was associated with stomach cancer and overall stroke mortality in the 1990s across 27 countries, suggesting that early life circumstances, particularly those that would influence infectious diseases in early childhood, are important in the aetiology of these conditions.44 The potential contribution of a childhood acquired infection in haemorrhagic stroke deserves further investigation. Current evidence suggests that Helicobacter pylori infection is not robustly associated with stroke,45 but there are no data specifically on haemorrhagic stroke, and most strokes in previous studies would have been ischaemic. The shared epidemiological patterns of stomach cancer and haemorrhagic stroke could be attributable to childhood infections contributing to both diseases, and declining family size may have contributed to the secular decreases across the 20th century.

Number of siblings is related to the cause of death to which a priori reasoning suggests it should be related: stomach cancer, presumably through increased risk of Helicobacter pylori infection among those with more siblings. Furthermore, number of siblings is also strongly related to peptic ulcer risk in this cohort (C Metcalfe, personal communication, 2001). Therefore number of siblings seems to be serving as a marker of risk of at least some infections in childhood. The lack of association of number of siblings with CHD, ischaemic stroke, and other cancers suggests, within the limits of the power of this study, that childhood infections that would be indexed by number of siblings are not important contributors to the aetiology of these conditions.


Victor M Hawthorne, Charles R Gillis, and David J Hole were responsible for the original study of the cohort and Pauline L MacKinnon is responsible for updating mortality.



  • Funding: the research received grants from Chest, Heart and Stroke Scotland, and the Stroke Association.

  • Conflicts of interest: none.
