

A shared data approach more accurately represents the rates and patterns of violence with injury assaults
  1. Benjamin J Gray1,
  2. Emma R Barton1,
  3. Alisha R Davies1,
  4. Sara J Long2,
  5. Janine Roderick1,
  6. Mark A Bellis1
  1. 1 Policy, Research and International Development, Public Health Wales, Cardiff, UK
  2. 2 Centre for the Development and Evaluation of Complex Interventions for Public Health Improvement, Cardiff University, Cardiff, UK
  1. Correspondence to Dr Benjamin J Gray, Policy, Research and International Development, Public Health Wales, Cardiff, CF10 4BZ, UK; benjamin.gray{at}


Background To investigate whether sharing and linking routinely collected violence data across health and criminal justice systems can provide a more comprehensive understanding of violence, establish patterns of under-reporting and better inform the development, implementation and evaluation of violence prevention initiatives.

Methods Police violence with injury (VWI) crimed data and emergency department (ED) assault attendee data for South Wales were collected between 1 April 2014 and 31 March 2016 to examine the rates and patterns of VWI. Person identifiable data (PID) were cross-referenced to establish if certain victims or events were less likely to be reported to criminal justice services.

Results A total of 18 316 police crimed VWI victims and 10 260 individual ED attendances with an assault-related injury were considered. The majority of ED assault attendances (59.0%) were unknown to police. The key demographic identified as under-reporting to police was young males aged 18–34 years, while a substantial number of non-reported assaults involved a stranger. The combined monthly age-standardised rates were recalculated and on average were 74.7 (95% CI 72.1 to 77.2) and 66.1 (95% CI 64.0 to 68.2) per 100 000 population for males and females, respectively. Consideration of the additional ED cases resulted in a 35.3% and 18.1% increase on the original police totals for male and female VWI victims, respectively.

Conclusions This study identified that violence is currently undermeasured, demonstrated the importance of continued sharing of routinely collected ED data and highlighted the benefits of using PID from a number of services in a linked way to provide a more comprehensive picture of violence.

  • violence
  • injury
  • prevention
  • record linkage

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See:



Since 1996, violence has been recognised as a global public health challenge by WHO.1 Worldwide, around 6 million people have been killed as a result of interpersonal violence since 2000 and many more are non-fatally injured.2 It is these non-fatal injuries that have the most significant economic burden on public services and social and health resources.2 Although the full costs of violence are difficult to quantify, in 2015, the cost of violence to the global economy was estimated to be US$13.6 trillion.3 4 In 2012, in the UK alone, the impact of violence cost society an estimated £124 billion through direct and indirect costs and lost productivity.5

Evidence suggests that males are five times more likely than females to require emergency admission to hospital for violence,6 7 while the age range most frequently admitted, irrespective of gender, is 18 to 30 years.7 8 Previous research has also demonstrated that violence is strongly correlated with deprivation6 7 9 10 and individuals from the most deprived areas are at least three times more likely to be hospitalised from violence-related assaults than individuals resident in the most affluent areas.7 9 Excessive alcohol consumption also doubles the likelihood of an individual being involved in a physical confrontation.11 12 Furthermore, better weather conditions (higher temperatures, low rainfall) provide greater criminal opportunity, with potential victims and perpetrators likely to be interacting together for a prolonged period of time13; it is therefore unsurprising that, in many countries, peak levels of violence are observed in the summer months.8 13–15

A multidisciplinary approach to violence prevention that reaches across organisational boundaries of social, health and policing, underpinned by comprehensive data on patterns of violence (both victims and perpetrators), can reduce the impact on population health.1 Data sharing and linking across health and criminal justice systems has been recognised as an important tool to help inform the development, implementation and evaluation of violence prevention initiatives.16–19 Previous efforts in the UK and Canada have used only single sources of health data20 21 or ambulance pick-up location data10 to complement police operations rather than using data from multiple sources. In South Wales, a novel approach sharing routinely collected data on victims of violence from the police, emergency department (ED) assault attendees from local health boards and violence-related call-outs from the ambulance service was implemented in 2014. This method was introduced with the overarching aim to improve prevention through better identification of areas at high risk of violence to target efficient use of criminal justice and health resources. Using the police and ED datasets, we examine variations in under-reporting of violence-related events, to better understand if certain victims or events are less likely to be reported to criminal justice services. These are important considerations for delivering a comprehensive assessment of violence at a local level to inform action.


Data sources

The data in this study were collected over a 24-month period between 1 April 2014 and 31 March 2016.

Police crimed dataset

This study examined violence with injury (VWI) incidents crimed by South Wales Police (SWP) as one of the following: (1) assault with injury, (2) assault with intent to cause serious injury, (3) murder, (4) manslaughter or (5) racially or religiously aggravated assault with injury. Variables collected included crimed date, offence group, victim name (full first name and surname initial), victim age at time of assault, victim gender and victim residence postcode. ‘Crimed date’ refers to the date the police deemed the incident a crime for investigatory purposes; this has to take place within 72 hours of the occurrence (in the majority of cases, it is the same day as the occurrence).

ED dataset

ED assault attendance data were collected from all three South Wales local health boards plus a local alcohol treatment centre established in an inner city area to respond to incidents of assault that occur specifically within the night time economy. Records of assaults were recorded by ED reception staff on patient arrival. Variables collected included ED attended, date of arrival, patient name (first name and surname initial), patient age at time of attendance, patient gender, patient residence postcode, assault location (‘site description’ such as own home, street, licensed premise and ‘site text’-specific named location) and assailant relationship to patient.

Data sharing agreements were established across police, local health boards and Public Health Wales (PHW) to allow PHW to act as the central partner to collate and analyse the data. Services provided person identifiable data (PID) on assault victims to PHW via National Health Service (NHS) secure file-sharing platforms. All data were incorporated into a violence surveillance database held on a secure NHS file server developed by PHW.

Population and deprivation data

The population of the SWP region was calculated using The Office for National Statistics 2014 mid-year population estimates22 for Cardiff, Swansea, Neath Port Talbot, Bridgend, The Vale of Glamorgan, Merthyr Tydfil and Rhondda Cynon Taf. The 2013 European Standard Population23 was used to subsequently age-standardise the population data. Lower Super Output Areas (LSOAs) were determined from victim and patient postcodes using the 2014 version of the Welsh Index of Multiple Deprivation.24 There are 1909 LSOAs in Wales, each with a population of approximately 1600 individuals, and these LSOAs are categorised equally into deprivation quintiles (or fifths).24
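As an illustration of the age-standardisation step, the following is a minimal sketch of direct standardisation: each age-specific rate is weighted by the share of a standard population in that age band. The function name, the three age bands and all figures are hypothetical and chosen only to keep the arithmetic visible; they are not the study data, nor the actual 2013 European Standard Population weights.

```python
def age_standardised_rate(events, population, standard_weights, per=100_000):
    """Directly age-standardised rate: the weighted mean of age-specific
    rates, with weights taken from a standard population."""
    if not (events.keys() == population.keys() == standard_weights.keys()):
        raise ValueError("all inputs must use the same age bands")
    total_weight = sum(standard_weights.values())
    weighted_sum = sum(
        (events[band] / population[band]) * standard_weights[band]
        for band in events
    )
    return per * weighted_sum / total_weight


# Hypothetical illustrative figures (NOT study data, NOT the real ESP weights)
events = {"0-17": 10, "18-34": 60, "35+": 30}                  # victims in a month
population = {"0-17": 50_000, "18-34": 60_000, "35+": 150_000}  # local residents
weights = {"0-17": 20_000, "18-34": 25_000, "35+": 55_000}      # standard population

crude_rate = 100_000 * sum(events.values()) / sum(population.values())
standardised_rate = age_standardised_rate(events, population, weights)
print(f"crude: {crude_rate:.1f}, age-standardised: {standardised_rate:.1f} per 100 000")
```

In this toy example the standardised rate differs from the crude rate because the local population has a larger share of 18–34 year olds than the standard population does.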

Data linkage

Victim data for assault-related attendances at ED were linked with police data by patient name, age and either date of ED arrival or VWI ‘crimed date’. Internal validity was assessed through manual checks of a sample of records from 3 months, which identified inconsistencies in name spelling across the two datasets. Therefore, a matching algorithm using first two letters of the first name+surname initial+age (±1 year)+date of ED visit/reported crimed date (±1 day) was adopted. The ±1 day tolerance also allowed for late-night assaults, in which an individual may have reported the crime to the police and then arrived at the ED in the early hours of the following morning, or vice versa. Data quality was monitored with quarterly data audits performed by PHW researchers from September 2014.
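The matching rule described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the `Record` class, the function names and the example records are hypothetical, and the handling of case and whitespace is an assumption. The paper does not state whether matching was one-to-one, so this sketch simply flags an ED record as known if any police record satisfies the rule.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Record:
    first_name: str
    surname_initial: str
    age: int
    event_date: date  # ED arrival date or police 'crimed date'


def name_key(rec):
    """First two letters of the first name plus the surname initial."""
    return (rec.first_name.strip().lower()[:2],
            rec.surname_initial.strip().lower()[:1])


def is_match(ed, police, age_tol=1, day_tol=1):
    """Same name key, age within +/-1 year, dates within +/-1 day."""
    return (
        name_key(ed) == name_key(police)
        and abs(ed.age - police.age) <= age_tol
        and abs((ed.event_date - police.event_date).days) <= day_tol
    )


def unknown_to_police(ed_records, police_records):
    """Return ED records with no matching police record."""
    index = {}
    for p in police_records:  # group police records by name key for fast lookup
        index.setdefault(name_key(p), []).append(p)
    return [
        e for e in ed_records
        if not any(is_match(e, p) for p in index.get(name_key(e), []))
    ]


# Hypothetical example records (not study data)
ed = [
    Record("Jonathan", "S", 24, date(2015, 6, 14)),  # ED arrival after midnight
    Record("Megan", "T", 31, date(2015, 6, 20)),
]
police = [Record("Jon", "S", 25, date(2015, 6, 13))]  # spelling and age differ slightly

unmatched = unknown_to_police(ed, police)
print([r.first_name for r in unmatched])
```

The tolerances are what make the rule robust to the spelling inconsistencies and late-night date shifts noted above; the trade-off is that a short name key can produce false matches between different people of similar age attending on adjacent days.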

Data analysis

Data represented in figures are expressed as either age-standardised or age-specific rates (per 100 000 population) and 95% CIs. Logistic regression models were applied to investigate the patterns between assault location and gender, age and deprivation (SPSS V.23).
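To show the kind of quantity such a model reports, the sketch below computes an unadjusted odds ratio with a Wald 95% CI from a 2×2 table, which is what a logistic regression with a single binary predictor would return. It is only an illustration: the study models were fitted in SPSS and adjusted for age, gender and deprivation, and the function name and all counts here are hypothetical.

```python
import math


def odds_ratio(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald 95% CI from a 2x2 table.

    a: group 1 with outcome      b: group 1 without outcome
    c: group 2 with outcome      d: group 2 without outcome
    """
    estimate = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper


# Hypothetical counts (NOT study data): assault location by gender
females_own_home, females_elsewhere = 40, 60
males_own_home, males_elsewhere = 20, 80

or_est, ci_low, ci_high = odds_ratio(
    females_own_home, females_elsewhere, males_own_home, males_elsewhere
)
print(f"OR = {or_est:.2f} (95% CI {ci_low:.2f} to {ci_high:.2f})")
```

An interval that excludes 1 indicates that the odds differ between the two groups at the 5% level; adjusted models extend the same idea to several predictors at once.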


In total, there were 18 316 victims (9128 males) recorded on the police database for VWI crimes during the 24-month period. In males, the monthly average victim rates were 55.2 (95% CI 53.0 to 57.3) per 100 000 population, while the monthly average victim rates for females were slightly higher at 55.9 (95% CI 54.0 to 57.9) per 100 000 population. Seasonality was evident with increases in victims during June to August and again in December, especially among males. Age-standardised victim rates were threefold and fourfold greater in the most compared with the least deprived areas in males (97.7 (95% CI 91.3 to 104.0) vs 33.2 (95% CI 31.8 to 34.7) per 100 000 population) and females (106.8 (95% CI 100.7 to 113.0) vs 27.0 (95% CI 25.4 to 28.6) per 100 000 population), respectively. In males, the victim rate was highest among those aged 18–24 years (124.0 (95% CI 117.7 to 130.3) per 100 000 population; figure 1A) and declined in each of the subsequent older age groups. Among females, victim rates were highest in the 18–24 (131.0 (95% CI 123.5 to 138.5) per 100 000 population) and 25–34 years (132.1 (95% CI 125.8 to 138.4) per 100 000 population) age groups, and comparable with the males, rates declined with increasing age (figure 1A).

Figure 1

Average age-specific rates (per 100 000 population) over the 24-month period between April 2014 and March 2016 for (A) police data, (B) unknown to police emergency department data and (C) combination of police and unknown emergency department attendance data. ED, emergency department.

A total of 10 260 attendees (6954 males) visited EDs as a result of being a victim of a violence-related assault. Of these ED attendees, two-fifths (41.0%) were matched between databases using our algorithm, which suggests that the majority of ED attendances (59.0%) for a violence-related injury are potentially unknown to police. Overall, the proportion of unknown ED assault attendances was similar between genders, although differences across age groups were evident. The proportion unknown to police was significantly higher among young males (aged 18–34 years) (figure 1B). There were also a considerable number of females between 18 and 44 years who attended EDs without their injuries being reported to the police (figure 1B). In every age group, there were some ED attendances that were unreported to police (figure 1B).

The addition of the unknown ED attendances to the police-recorded victims (figure 1C) changed the relationship previously observed (figure 1A). The combined totals revealed that, rather than the female rates being greater than males in the ages between 25 and 44 years, the gender rates were now similar (figure 1C). On a month-by-month basis, there was an average of 135 (95% CI 127 to 144) male and 69 (95% CI 64 to 74) female ED attendees unknown to police. These new cases resulted in a 35.3% and 18.1% increase on the original totals for males and females, respectively. The combined totals’ monthly age-standardised rates were recalculated and on average were 74.7 (95% CI 72.1 to 77.2) and 66.1 (95% CI 64.0 to 68.2) per 100 000 population for males and females, respectively. These combined rates were compared against the original police data age-standardised rates (figure 2). The seasonal peaks in violence-related assaults around the summer months and the festive period are more apparent in the combined monthly totals.
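The percentage increases can be approximately reproduced from the figures reported in this section. The sketch below assumes the increase is the number of unknown ED attendees over the 24 months divided by the police-recorded total for that gender; the paper does not spell out the calculation, and the monthly averages used here are rounded, so the result is expected to agree only to within a few tenths of a percentage point.

```python
MONTHS = 24  # study period, 1 April 2014 to 31 March 2016

# Police-recorded VWI victims over the whole period (from the Results)
police_total = 18_316
police_males = 9_128
police_females = police_total - police_males

# Average monthly ED attendees unknown to police (rounded, from the Results)
unknown_males_per_month = 135
unknown_females_per_month = 69


def percent_increase(extra_cases, baseline):
    """Extra cases expressed as a percentage of the baseline total."""
    return 100 * extra_cases / baseline


male_increase = percent_increase(unknown_males_per_month * MONTHS, police_males)
female_increase = percent_increase(unknown_females_per_month * MONTHS, police_females)

print(f"males: {male_increase:.1f}%, females: {female_increase:.1f}%")
```

Under these assumptions the calculation gives roughly 35.5% for males and 18.0% for females, close to the reported 35.3% and 18.1%.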

Figure 2

Month-by-month age-standardised rates (per 100 000 population) and 95% CIs for police-recorded data only and the combination of police-recorded data and unknown ED attendances for violence-related attendances. ED, emergency department.

When considering ED data, males were most likely to be a victim of an assault requiring medical treatment where the reported assault location was either the street or a licensed premise and usually involving a stranger (table 1). The majority of these cases were unknown to police; 945 (57.4%) street assaults and 60.3% of assaults in licensed premises involving male victims recorded in the ED database were not recorded in the police records (data not shown). In contrast, females were most frequently a victim in their own home, and in these cases the majority of the alleged assailants were either current or ex-partners (66.7%; table 1). A substantial number of these female ‘domestic’ cases (244; 42.0%) were not known to police, and a high proportion (47.5%) of male victims assaulted in their own home by current or ex-partners were also unknown to police (data not shown). At a population level, the highest rates observed for females were for assaults occurring in their own home (table 2). These rates were 0.5 times greater than those for males in this location, while the rates for males were three and four times higher than females for assaults occurring either in a licensed premise or on the street, respectively (table 2). Generally, the rates were observed to decline for each location with an increase in deprivation. Applying logistic regression models and adjusting these location relationships for age, gender and deprivation revealed that females were four times more likely than males to be a victim in their own home (table 3). The odds that the individual was a victim in their own home increased with age, whereas the reverse relationship was observed for licensed premises or the street, with the likelihood declining with increasing age (table 3).

Table 1

Emergency department attendees cross-tabulated by proportion of alleged assailant relationship to victim and assault location

Table 2

Age-standardised rates (per 100 000 population) of assault gender by assault location compared by all ages, adults and deprivation quintiles (adults only)

Table 3

Logistic regression model for assault location and age, gender and deprivation (all adults aged ≥18 years)


To our knowledge, this is the first time a study has used linked (person identifiable) data in an attempt to create a more accurate picture of local patterns of violence with an additional emphasis on those individuals not reporting violence. This approach will enable violence interventions to be evidence based and tailored to suit the demographics of those presenting to services within specific communities. Many studies in the literature examining trends in violence use anonymised health data and/or crime statistics.8 9 20 21 One previous UK-based study reported using violence-related injury patient data derived from EDs combined with police intelligence to generate areas of ‘violence hotspots’ to inform the deployment of targeted violence prevention resources.21 However, as this current study demonstrates, the real picture of violence across communities is only revealed when PID from both health services and police are included, allowing cross-referencing to identify those individuals who attend EDs with a violence-related injury from a serious assault that is not known or reported to police. This helps us to identify, more precisely, violence hotspot areas and to profile communities more accurately to target violence prevention initiatives and also to address the potential factors behind under-reporting. For example, adding the ‘unknown to police’ violence-related ED assault attendances to the police-recorded VWI victims data (figure 1C) reveals that, rather than female rates being greater than males between the ages of 25 and 44 years, the gender rates were actually similar. Additionally, these added cases resulted in a 35.3% and 18.1% increase on the original police totals for males and females, respectively (figure 2). These findings are of particular interest, especially given that the rate of violent crime across the UK can often be up to 60% higher than official statistics suggest.25 This is an important consideration given that trends in violence from one source alone may not accurately reflect the true level of violence.26

One of the further benefits to using ED data alongside police data is the extra information it can provide on violence-related assaults. ED data detail both the reported assault location type and the assailant relationship to the victim, allowing for a more complete picture of violence to emerge. When considering ED linked data and the ‘police unknowns’, ED data can provide a more complete profile of those who do not report to police, including assault location types and potential relationship to assailant, and can highlight potential areas that would benefit from more targeted violence interventions. The key demographic identified as under-reported to police was young males, and a substantial number (~60%) of male incidents unknown to police were assaults in streets or licensed premises involving a stranger. A Norwegian study looking at ED registrations of violence-related assaults also showed that victims were mostly males experiencing street attacks.27 Females, on the other hand, are four times more likely than males to be a victim of assault in their own homes, a traditional proxy measure for domestic violence,28 and the majority of these females know their alleged attacker (partner or ex-partner). Nearly half of ED-recorded own home incidents involving current or ex-partners as the assailant, for both male and female victims (47.6% and 42.0% respectively), were not known to the police, an important finding that could have strategic importance in addressing incidents of domestic violence. Accounting for assailant relationship gives a more accurate picture of patterns around assault location and the potential rates of domestic violence across communities. In addition, the proportion of cases that took place in an individual’s own home rose with age across all age categories; when compared with those aged 18–24, those aged ≥65 years were six times more likely to report being assaulted in their own home. Interestingly, a recent study undertaken in China revealed similar findings, where 28.5% of all hospital-recorded domestic violence incidents over an 8-year period (2006–2013) were in those individuals aged 65 years and older.29 These observations suggest that domestic abuse among the elderly population is an emerging global public health challenge.

Our study also showed results comparable with existing literature. In agreement with previous studies,7 8 30 a greater number of males were observed to attend EDs for violence-related assaults than females. Both the ED and police data illustrated a higher incidence of victims in the most deprived areas, and the relationship between deprivation and an increase in violence has been widely reported.6 7 9 10 However, a novel aspect to our study is that the results also demonstrated that relationships between violence and deprivation were consistent among a range of assault locations for both males and females. The majority of alcohol-related incidents in Australia requiring ED attendance were recorded to have taken place in a location described as ‘other’, that is, a location not ‘own home’, ‘street’ or ‘licensed premises’.31 This is consistent with our findings, where a substantial number of males and females reported the location of their incident as ‘Other’, which tentatively suggests that reporting patterns by ‘victims’ are similar across countries. Comparable trends in seasonality8 13–15 and peak levels of violence in those aged 18–34 years7 8 were also observed in our results.

The strengths of this study have been explained in detail; however, we acknowledge the following limitations. When using ED data to explore patterns of violence, data will not include violence where medical treatment was not sought; furthermore, individuals may be reluctant to report that the injury occurred as the result of a violence-related assault. Additionally, there is currently no standardised data collection system across local health boards in South Wales; this means that some field responses can have more categories than others. For example, for the purpose of this study, the assault location for one health board had to be narrowed to fall into the same categories as the remaining two health boards (own home, someone else’s home, licensed premises, street and other). This could therefore contribute to the underestimation of what is happening within the named categories. One further limitation to the study is that our matching algorithm would not be able to identify an individual who deliberately provided false details to either the police or EDs. However, although this may happen in some cases, it is likely to be infrequent and is unlikely to have a substantial impact on our findings.

In conclusion, and in agreement with a systematic review16 looking at the effectiveness of community-level interventions to reduce alcohol-related violence based on ED data sharing, this study demonstrates the importance of continued sharing of routinely collected ED data. This shared approach enhances the current picture or even changes existing profiles of violence, thus allowing services to develop and target violence prevention activities more specifically by providing additional information on patient demographics and types of violence to inform broader violence prevention work. Our study demonstrates that using PID from both police and EDs in a linked way can change the landscape of violence in terms of ‘what is known’ at a local level, providing a more comprehensive picture of violence, and therefore can more accurately inform targeted violence prevention approaches. Furthermore, it raises the question of policy and data sharing and highlights a need for more comprehensive datasets within services and the development of secure but accessible mechanisms to allow data sharing across agencies.

What is already known on this subject

Evidence demonstrates that a multidisciplinary approach to violence prevention that reaches across organisational boundaries of social, health and policing, underpinned by comprehensive data on patterns of violence (both victims and perpetrators), can reduce the impact on population health. This study introduces a novel approach sharing routinely collected data on victims of violence from the police, emergency department assault attendees from local health boards and violence-related call-outs from the ambulance service, which was implemented in the South Wales region in 2014.

What this study adds

From this study, we now know that sharing data between agencies such as the police and emergency departments enhances the current picture or even changes existing profiles of violence, especially when considering the characteristics of those individuals who do not report their injuries to police. From a practical perspective, this shared data approach would then allow services to develop and target violence prevention activities more specifically by providing additional information on patient demographics and types of violence to inform broader violence prevention work.


The authors wish to extend their gratitude to the South Wales Police and Crime Commissioner, Cwm Taf, Cardiff and Vale, and Abertawe Bro Morgannwg University Health Boards for their support with this study.




  • Contributors BJG designed the study, performed data analysis and prepared the first draft of the manuscript. ERB and ARD designed the study and prepared the first draft of the manuscript. SJL, JR and MAB were involved in the concept of the study and provided comments to the first draft. All authors contributed to and approved the final draft for submission.

  • Funding This study was funded by the Home Office Grant Reference Numbers 2013-094 and 2015-097.

  • Competing interests None declared.

  • Ethics approval The data presented in this manuscript are part of a routine surveillance dataset; the relevant permissions to use these data were sought through data disclosure agreements. As an additional security measure, files that contain PID were password protected and accessed only by named project researchers.

  • Provenance and peer review Not commissioned; externally peer reviewed.
