Article Text

other Versions


Informing the ‘early years’ agenda in Scotland: understanding infant feeding patterns using linked datasets
  1. Omotomilola Ajetunmobi1,
  2. Bruce Whyte2,
  3. James Chalmers1,
  4. Michael Fleming1,
  5. Diane Stockton1,
  6. Rachel Wood1
  1. 1Information Services Division, NHS National Services Scotland, Edinburgh, Scotland, UK
  2. 2Glasgow Centre for Population Health, Glasgow, Scotland, UK
  1. Correspondence to Omotomilola Ajetunmobi, Area 151A, Information Services Division, NHS National Services Scotland, 1 South Gyle Crescent, South Gyle, Edinburgh, Scotland EH12 9EB, UK; o.ajetunmobi{at}


Background Providing infants with the ‘best possible start in life’ is a priority for the Scottish Government. This is reflected in policy and health promotion strategies to increase breast feeding, which gives the best source of nutrients for healthy infant growth and development. However, the rate of breast feeding in Scotland remains one of the lowest in Europe. Information is needed to provide a better understanding of infant feeding and its impact on child health. This paper describes the development of a unique population-wide resource created to explore infant feeding and child health in Scotland.

Methods Descriptive and multivariate analyses of linked routine/administrative maternal and infant health records for 731 595 infants born in Scotland between 1997 and 2009.

Results A linked dataset was created containing a wide range of background, parental, maternal, birth and health service characteristics for a representative sample of infants born in Scotland over the study period. There was high coverage and completeness of infant feeding and other demographic, maternal and infant records. The results confirmed the importance of an enabling environment—cultural, family, health service and other maternal and infant health-related factors—in increasing the likelihood to breast feed.

Conclusions Using the linked dataset, it was possible to investigate the determinants of breast feeding for a representative sample of Scottish infants born between 1997 and 2009. The linked dataset is an important resource that has potential uses in research, policy design and targeting intervention programmes.

Statistics from


The importance of linked administrative datasets in epidemiological research is gaining increasing prominence.1 ,2 Linked datasets are a cost effective resource for designing population-wide interventions, monitoring population health, evaluating health outcomes and identifying best clinical practice. Such data provide intelligence that could influence a wide range of policy issues including infant nutrition.

Infant feeding policies that are informed by relevant contextual data spanning demographic, psychosocial, healthcare, community and public policy attributes can potentially provide a foundation for developing effective intervention programmes.3 ,4 This is particularly pertinent to the Scottish Government's early years agenda, which aims to provide ‘every child with the best possible start in life’ by delivering integrated services for early intervention that secure positive health outcomes and address health inequalities.5 ,6

Breastfeeding rates in Scotland remain among the lowest in Europe and have been relatively unchanged since 1990.7–9 About half of the infants born annually are exclusively breast fed at birth and a quarter continue to breast feed exclusively up to their 6–8-week review.10

Different sources of breastfeeding data in Scotland currently provide some of the picture for the whole population, for example, the Guthrie test data9 ,11 and the Child Health Systems Programme–Pre School—CHSP-PS,10 or all of the picture for some of the population, for example, the Millennium Cohort Study,12 the 5 yearly Infant Feeding Survey13 and the Growing Up in Scotland—GUS study.14

Although there is extensive evidence of the protective effects of breast feeding,15 ,16 there remains a paucity of information on the patterns of breast feeding in Scotland,3 ,12 ,17 in particular how family background and health service-related factors influence the likelihood to breast feed.

This paper summarises a linkage study set up to investigate factors that influence the likelihood and patterns of infant feeding in Scotland using population-level administrative data.


Development of the linked dataset


The creation of the linked dataset was proposed as part of a research project jointly funded by the Scottish Collaboration for Public Health Research and Policy and the Glasgow Centre for Population Health (GCPH). The project linkage was set up under the guidance of GCPH and a project advisory group. Information Services Division (ISD) Scotland created the linked dataset, which comprised anonymised extracts of birth records linked to maternal, infant and child health records (see box 1) for all infants born in Scotland over a 13- year period, 1997–2009. Approval for the project design and confidentiality of patient data was obtained from the Privacy Advisory Committee of NHS National Services Scotland—a body set up to ensure appropriate use of patient identifiable information.18 Further ethical approval was not required.

Box 1 Components of the linked dataset
  • National Records of Scotland Birth Records: comprising all births in Scotland, which are registered by law within 21 days. The registration includes information on the country of birth of the mother and father, occupation, socioeconomic status, marital status, maternal parity, an indicator of multiple births, infant sex, live or stillbirth.

  • The Maternity and Neonatal Linked Database: a permanently linked scheme developed by ISD, containing maternal obstetric discharge records (SMR02), neonatal discharge records (SMR11), Scottish birth records (SBR—replaced SMR11 in 2003), and vital events of births and deaths held by the National Records of Scotland (formerly the General Register Office for Scotland) since 1975.

  • The Child Health Systems Programme Pre-School data: introduced in 1991 and collates information on child health from birth until shortly after school entry. Information on breast feeding is collected in two parts of the core programme (at the first visit—after discharge following child birth and 6–8-week review) via a recall interview with the mother/primary carer by a health visitor or public health nurse. The information collected includes parental background (eg, maternal age, father/partner's age), measures of growth/development and health behaviour (height, weight, maternal smoking status, exposure to passive smoking, type of infant feeding at birth, hospital discharge, first visit and the 6–8 week review) (details on ISD website—

Linkage process

All births registered in Scotland between 1997 and 2009 were linked to the CHSP-PS records using probability matching techniques applied to personal identifiers within each dataset such as surname (transformed to Soundex code), first initial, date of birth, sex and postcode of residence.19 Using this approach, pairs of records are compared and a ‘score’ or ‘weight’ given to the paired identifiers reflecting the likelihood of a true match. Weights from individual identifier comparisons were added to provide a cumulative weighted score and a threshold set to accept or reject linkage pairs based on the weighted scores.20

There were two main stages to the linkage process. A probability matching process allowed the Community Health Index (CHI) number —a unique patient identifier used on Scottish health records—to be allocated to birth registration records held within ISD's Maternity and Neonatal Linked Database (MNLD). Following this, each child health record was probability matched against records within the MNLD. The addition of the CHI number from the first linkage improved the matching process. The linked dataset released for analysis was anonymised and contained one record per child with variables coming from several different sources, namely, the birth and death registration records, maternal and child health records (figure 1). Markers for infants who migrated (based on CHI database) were also included in the dataset.

Figure 1

Description of the linkage process.

Additional derived information

Geographical information, including output area, data zone, council area, intermediate zone and area characteristics based on the Scottish Index for Multiple Deprivation (SIMD—2006 version), were derived from the postcode on the birth registration address and added to the dataset. The dataset also included a marker for siblings of the same mother.

In addition, the ethnic and religious backgrounds of the parents were derived from the mother's forename and maiden name and the father's forename and surname using Onomap software. Onomap is a package designed by the University of London to classify names into groups of cultural, ethnic and linguistic origin21 and has been validated in Scotland.22 This was included to provide additional information on ‘latent’ cultural factors that may influence infant feeding patterns. For example, cultural affiliations of second or third generation immigrants could be derived using the mother's country of origin.


Using the linked dataset, descriptive and multivariate (logistic regression) analyses were conducted to show demographic trends, describe patterns of infant feeding and explore the independent associations between a wide range of predictive variables (ie, parental, maternal, infant health and delivery characteristics) and infant feeding outcomes (SPSS V.17). Infant deaths, non-Scottish residents and invalid review records were excluded from the analysis.


Description of the linked dataset

The dataset consisted of 731 595 records of infants born between 1997 and 2009, 613 900 of whom had corresponding child health surveillance (CHSP-PS) records, 84% of the birth cohort. The coverage of CHSP-PS increased progressively over the study period with the phased roll out of the system within Scotland. Child records that linked to more than one infant in the cohort (ie, ‘bad links’) were minimal and were estimated to make up less than 1% of the population (0.3%). A total of 722 180 of the births were registered in Scotland; an additional 9415 (1%) had child health surveillance records but no information collected by the national birth registry. The latter may refer to infants born outside Scotland (hence not recorded in vital events) or errors in the linkage and/or recording systems; in the analyses, these records were excluded.

The linked dataset contained a wide range of variables associated with infant feeding. Birth registration records provided the most comprehensive recording and coverage of demographic variables and details not available in other recording schemes, for example, information on mother's (and father's) country of birth, socioeconomic status and marital status. The child health surveillance scheme was the main source of breastfeeding information with over 90% of the records having complete and valid infant feeding fields. Data were collected for infant feeding at 10 days after birth and the 6–8 week review; feeding was defined as the ‘predominant mode of infant feeding in the previous 24 h’, that is, exclusive breast feeding, bottle (or formula) feeding and mixed ‘breast and bottle/formula’ feeding. From 2001, data were also collated on feeding at birth and hospital discharge. Unlike the birth registration and CHSP-PS schemes, completeness of the Scottish Morbidity records was dependent on the type of data field, that is, mandatory or optional. Variables such as the ‘mode of delivery’, a mandatory field, had a higher rate of completion than ‘ethnicity’, an optional field, which was poorly recorded.

Description of cohort, maternal characteristics and birth delivery details

Overall, the 1997—2009 cohort was made up of 3% multiple births, 51% male and 48% female. About a third of the infants (31%) were sibling pairs (infants of the same mother within the same cohort). Over the study period, births to mothers born outside the British Isles increased from 5% in 1997 to 13% in 2009. Similarly, there was an increase in births to older mothers; the proportion of first time mothers aged 35 years or older doubled over the study period from 7% in 1997 to 14% in 2009. Overall, 60% of the infants were born via normal/spontaneous delivery, 98% of which took place within a hospital setting. There was an increasing trend in caesarean sections from 18% in 1997 to 26% in 2009. More than a half (55%) of the births took place in a fully accredited baby friendly institution (table 1). Overall, 4% of the cohort had migrated by the age of 2 years.

Table 1

Descriptive characteristics and unadjusted infant feeding trends in the birth cohort

Description of the derived characteristics (OnoMAP, SIMD)

In the 1997—2009 birth cohort, a quarter of the infants were resident in the 20% most deprived areas of Scotland (quintile) and 18% in the 20% least deprived areas at the time of birth (derived from the postcode recorded at birth registration). Most of the infants had parents of British ethnic origin (84%), mothers of a British birth and origin (89%) and mothers of a Christian religious background (95%). The trends however were toward increasing ethnic and religious diversity. For example, there was an increase in mothers of a non-British birth and non-British origin, that is, ‘first generation immigrants’ from 2% in 1997 to 9% in 2009 (table 1).

Description of characteristics associated with infant feeding

Breastfeeding rates over the period 2001–2009 showed that about a half of infants were exclusively breast fed at birth, but this decreased steadily with increasing time from birth, to 44% by hospital discharge, to 37% by the first visit (10 days after birth) and to 25% by the 6–8-week review. Exclusive breastfeeding trends have been relatively unchanged over the study period while, in contrast, mixed ‘bottle and breastfeeding’ trends increased steadily over the same period (figure 2).

Figure 2

Trends in mixed ‘bottle and breast feeding’ 2001–2009.

There were however varying patterns across the population. Table 1 shows the characteristics of the population overall and changes in the characteristics of the population over the survey period. It also outlines changes in crude (unadjusted) rates for exclusive and mixed feeding at the first review. Greater rates of exclusive breast feeding (and mixed feeding) were observed among infants of older mothers, of mothers of non-British birth, of mothers of a higher socioeconomic status, of married parents, of non-smoking mothers, of multiparous mothers and those resident in less deprived areas. The rising trend in mixed feeding was observed among all categories of infants. Further multivariate analyses were based on ‘any’ breast feeding because of the similar profile between mixed and exclusively breastfed infants.

Univariate descriptive analysis highlighted clear associations among a range of parental, maternal health/delivery, infant and hospital characteristics and infant feeding. For example, higher breastfeeding rates were noted among infants of first generation immigrants (mothers of non-British birth and non-British ethnic origin) compared with ‘second generation’ immigrants (mothers of British birth and non-British origin). Mothers of British birth and origin, representing 89% of the cohort, consistently had the lowest level of breast feeding at each review (figure 3).

Figure 3

Exclusive breastfeeding trends by mother's country of birth and origin 2001–2009.

Multivariate analysis identified a range of parental and hospital-related factors that independently increased the relative likelihood to establish and continue any breast feeding (at the first visit and 6–8-week review). These included having an older mother, one or both parents being of non-British birth or origin, having married parents, being a female infant, infants with longer postnatal stay in hospital, being born in a baby friendly unit, infants born post-term, infants of first-time mothers and those resident in non-urban settings or one of the less deprived areas. In contrast, there was relatively less likelihood of breast feeding among infants of multiple births, infants of single or cohabiting parents, of mothers who smoked, of mothers or fathers of a lower socioeconomic status, among preterm infants, those admitted to a neonatal unit and infants born via instrumental and caesarean section (table 2).

Table 2

Factors that influence the likelihood of any (exclusive or mixed) breast feeding 1997–2009


Our findings emphasise the important influence of cultural, familial, socioeconomic and health service factors on infant feeding patterns and trends in Scotland. The creation of a child and maternal dataset was based on the linkage of a wide range of characteristics at the individual level for a large representative sample of the Scottish infant population.

Record linkage has been described as ‘bringing together in one file, records from different sources that relate to the same individual or event’.23 ISD Scotland, the organisation that is the main repository for Scottish health services data, has over 30 years experience in developing and implementing linkage methods and has been a key contributor to probabilistic matching techniques developed in Oxford and Canada.20 Such data can play a vital role in identifying and targeting scarce resources to vulnerable groups, informing policy, changing clinical practice and supporting local efforts to improve child health.24 ,25

The cohort data over time (1997–2009) enabled monitoring of social and demographic trends. The breadth of information highlights demographic and societal changes—such as increasing ethnic diversity, increasing numbers of older mothers, changes in family structure and rising caesarean rates—that are key determinants of breastfeeding trends. It also provided scope to explore the impact of these determinants on infant feeding in Scotland. For instance, although the crude rates indicated an increase in exclusive breastfeeding trends among infants resident in deprived areas (table 1), this increase was not sustained after adjustment for other social, cultural and demographic factors.26

The societal and healthcare associations found with breast feeding in our study are consistent with findings reported in other studies showing a greater chance of breast feeding with increasing age of mother, residence in less deprived areas and less urban settlements among first time mothers,16 ,27 ,28 parents with a non-British birth or origin,29 ,30 female infants, those with a longer postnatal stay in hospital27 and among infants born in a baby friendly hospital.16 ,31 Conversely, there was less chance of any breast feeding among infants of cohabiting or single/separated parents,28 ,32 of a father or mother of a lower socioeconomic status, of mothers who smoked,16 ,27 multiple births, among infants born via instrumental or caesarean section, preterm infants and infants admitted to a neonatal unit.27 ,28 The true strength of these relationships with breast feeding may be underestimated, given that our study tracks feeding up to 6–8 weeks and breast feeding beyond this point cannot be measured from these data, for instance up to 6 months as recommended by current policy.16

This study highlights changing patterns of infant feeding in Scotland over the study period. Specifically, the increasing trends in mixed feeding, whereas exclusive breastfeeding rates have remained static. This suggests the need for additional support to mothers in the first few weeks after birth as many mothers who stopped breast feeding before 6 months report that they would have liked to have continued.13 In Scotland, 80% of the births now take place in baby friendly institutions;33 an increase in the proportion of skilled staff (and lay persons) trained in managing common lactation problems may help mothers establish the practice of breast feeding after discharge from hospital.

Linkage over time of nationally representative administrative records has advantages over cross-sectional surveys, providing a cost effective way of conducting research with better population coverage and completeness.20 ,34 ,35 Indepth reviews of infant feeding that provide trends across Scotland in large surveys such as the Infant Feeding Survey or the UK Millennium Cohort Study are often limited by the relatively small sample size for Scotland.12 Moreover, the unique patient identifier (CHI) used in Scotland enabled efficient pairing of records across different datasets20 and tagging of the migrant status of infants in the cohort (although some infants who travelled abroad may be ‘lost’ to follow-up—Personal communication, NHS Central Register Scotland, National Records of Scotland, 2012). Thus, infants who had emigrated could be censored providing potential for prospective time series research or longitudinal analyses.

In addition, it allowed subgroup analysis of those often described as ‘hard to reach’, for example, ethnic minorities and young mothers in deprived areas, addressing an important requirement of the health services35 while also maintaining patient confidentiality of the individuals involved. For example, ethnicity, often poorly recorded on administrative data sources,36 was addressed in our study by using derived ethnic/cultural background. Findings from the derived variables appear to confirm published reports showing higher breastfeeding rates among mothers of non-British background compared with mothers of British background; reflecting possible acculturation among second generation immigrants.29 ,30 Furthermore, the (derived) mother's religious background confirmed a tendency to bottle feed observed among certain ethnic minority groups in the UK, for example, mothers of a Muslim background—mainly of Pakistani origin.37

In this study, the utility of some of the individual datasets was limited by coverage, discontinuities in recording schemes and revisions to questions or the timing of data collection. Overall, there was still a relatively high coverage and completeness of variables associated with infant feeding at the first visit and the 6–8-week reviews. A slight trend in ‘selective’ coverage at subsequent child health review visits has been reported by others.38 The relatively short duration of infant feeding captured on the child health surveillance schemes may restrict the potential of exploiting the linked records in research; data schemes that provided more information on the duration of infant feeding have been discontinued. Moreover, the definition of infant feeding as the predominant mode of feeding the day before data collection (dependent on the interpretation of health worker who collects the information) was not consistent with the definition used in the Infant Feeding Survey or recommended by WHO and may bias the results.

The current findings build a ‘population profile’ comprising a range of factors that independently influence the chances of establishing infant feeding in Scotland. The need for an ‘enabling environment’ to increase initiation and duration of breast feeding in Scotland is emphasised. This environment will be influenced by cultural background otherwise described as ‘embodied knowledge’,39 family and other social circumstances and health service factors—such as the mode of delivery and implementation of baby friendly practices.


Breast feeding is an effective intervention for reducing the risk of childhood diseases and addressing health inequalities through to adulthood.15 Several recommendations have been made to improve breastfeeding rates; however, there is little evidence of changing trends in Scotland overall, although trends in some local areas have changed significantly, due to demographic, cultural and socioeconomic impacts.62 40

In Scotland, the predominantly bottle feeding culture is yet to give way to a breastfeeding culture, although some may argue that rising rates of mixed feeding may be the transition between both extremes. Nevertheless, it highlights the need for a more supportive environment and multifaceted interventions across the population in order to improve breastfeeding trends in Scotland.

This project has demonstrated an effective framework for using linked data collated from surveillance and administrative records in child health research. This approach provides clear benefits for the Scottish population, without imposing additional risk or burdens to individuals within it. It provides a resource for understanding Scotland's changing demography and potential for subgroup analysis, which could be used to better inform policies and programmes. Moreover, the results, which are consistent with other findings, provide a ‘Scottish context’ that could be further exploited to improve child health outcomes and facilitate a broader, ‘joined-up’ perspective for addressing feeding in the early years.

There is strong argument for using linked datasets to provide indepth analysis of child health trends in Scotland prospectively in order to guide both qualitative and quantitative research, inform policy, design health promotion initiatives and monitor population health.

What is already known on this subject

  • Although breast feeding is regarded as an important public health intervention for safeguarding child health, there has been little change in the breast feeding trends in Scotland.

What the study adds

  • This study has confirmed the strength of association between a range of cultural, family, health service, infant and maternal health characteristics and the likelihood to breast feed in a Scottish context.

Policy implications

  • A wide range of factors influence the likelihood to establish or sustain breast feeding in Scotland.

  • Interventions to increase breastfeeding rates in Scotland should extend beyond the health service, engage the entire population and consider the wider context of changing demographic and cultural influences.

  • The potential of administrative datasets to provide vital intelligence on population health and ‘hard to reach’ subgroups can be improved and exploited further in order to inform, influence and monitor child health policy.


The authors are grateful to Pauline Craig who was involved in the initial stages of the study design/project set up and to James Boyd for his contribution to early drafts of the manuscript. We also gratefully acknowledge funding provided by the Scottish Collaboration of Public Health Research and Policy (SCPHRP) and the Glasgow Centre for Population Health (GPCH).

View Abstract


  • Collaborators GCPH Steering Group Members: Bruce Whyte (GCPH), Pauline Craig (NHS Health Scotland, formerly GCPH), Jim Chalmers (ISD Scotland), Linda Wolfson (Infant Feeding Advisor, NHS GG&C), Rachel Wood (ISD Scotland), David Tappin (Paediatric Epidemiology and Child Health Unit, Royal Hospital for Sick Children, Glasgow), Ali McDonald (NHS Health Scotland), Judith Tait (Child Health Information Team, ISD Scotland), Jill Muirie (NHS Health Scotland), Kate Woodman (NHS Health Scotland), Ruth Campbell (Consultant Dietician, NHS Ayrshire & Arran), Helen Yewdall (Scottish Government), James Egan (GCPH) and Omotomilola Ajetunmobi (ISD, Scotland).

  • Contributors OA cleaned, analysed and interpreted the data and prepared the first draft/revisions of the final manuscript; BW designed and managed the study, and also interpreted the results, drafted and revised the manuscript; JC was involved in the study design and reviewed drafts of the manuscript; MF linked the datasets and reviewed drafts of the manuscript; DS provided guidance in the analysis and interpretation of the data; RW was involved in data interpretation and review of the draft manuscript. All authors approved the final draft. DS and BW are the guarantors. All members of the GCPH Breastfeeding Steering Group provided guidance in the implementation of the project and interpretation of the data.

  • Funding The project was jointly funded by the Scottish Collaboration of Public Health Research and Policy (SCPHRP-SCPH/08) and the Glasgow Centre for Population Health. The SCPHRP had no role in the study design, collation, analysis, interpretation and decision to publish.

  • Competing interests None.

  • Ethics approval Privacy Advisory Committee (PAC), NHS National Services Scotland.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement The routine data analysed for this study are held by NHS National Services Scotland, Information Services Division.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.